In this paper we present an analysis on the usage of Deep Neural Networks for extreme multi-label and multiclass text classification. We will consider two network models: the first one is formed by a word embeddings (WEs) stage followed by two dense layers, hereinafter Dense, and a second model with a convolution stage between the WEs and the dense layers, hereinafter CNN-Dense. We will take into account classification problems characterized by different number of labels, ranging from an order of 10 to an order of 30,000, showing the different performances of the neural networks varying the total label number and the average number of labels for sample, exploiting the hierarchical structure of the label space of the dataset used for experimental assessment. It is worth noting that multi-label classification is an harder problem if compared to multi-class, due to the variable number of labels associated to each sample. We will even investigate on the behaviour of the neural networks as function of the training hyperparameters, analysing the link between them and the dataset complexity. All the result will be evaluated using the PubMed scientific articles collection as test case.

Deep convolution neural network for extreme multi-label text classification

Francesco Gargiulo;Stefano Silvestri
;
Mario Ciampi
2018

Abstract

In this paper we present an analysis on the usage of Deep Neural Networks for extreme multi-label and multiclass text classification. We will consider two network models: the first one is formed by a word embeddings (WEs) stage followed by two dense layers, hereinafter Dense, and a second model with a convolution stage between the WEs and the dense layers, hereinafter CNN-Dense. We will take into account classification problems characterized by different number of labels, ranging from an order of 10 to an order of 30,000, showing the different performances of the neural networks varying the total label number and the average number of labels for sample, exploiting the hierarchical structure of the label space of the dataset used for experimental assessment. It is worth noting that multi-label classification is an harder problem if compared to multi-class, due to the variable number of labels associated to each sample. We will even investigate on the behaviour of the neural networks as function of the training hyperparameters, analysing the link between them and the dataset complexity. All the result will be evaluated using the PubMed scientific articles collection as test case.
2018
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Inglese
Proceedings of the 11th International Joint Conference on Biomedical Engineering Systems and Technologies (BIOSTEC 2018) - Volume 5: HEALTHINF
AI4Health 2018 : International Workshop on Artificial Intelligence for Health
5
641
650
10
978-989-758-281-3
http://www.scitepress.org/PublicationsDetail.aspx?ID=K7bL9g8CqNk=&t=1
SciTePress
Lisbona
PORTOGALLO
Sì, ma tipo non specificato
19-21/01/2018
Funchal, Madeira, Portugal
Extreme Multi-label Text Classification
Deep Learning
Deep Convolutional Neural Networks
Word Embeddings
3
open
Gargiulo, Francesco; Silvestri, Stefano; Ciampi, Mario
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
File in questo prodotto:
File Dimensione Formato  
prod_384195-doc_131160.pdf

accesso aperto

Descrizione: Deep Convolution Neural Network for Extreme Multi-label Text Classification
Tipologia: Versione Editoriale (PDF)
Licenza: Creative commons
Dimensione 218.21 kB
Formato Adobe PDF
218.21 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/346155
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 33
  • ???jsp.display-item.citation.isi??? ND
social impact