In this paper we propose a methodology based on a complex deep learning network topology, named Hierarchical Deep Neural Network (HDNN), applied to eXtreme Multi-label Text Classification (XMTC) problem. The HDNN topology reproduces the label hierarchy. The main idea arises directly from the assumption that, if the label-set structure is defined, forcing this information into the network topology could improve classification performances and results interpretation. In this way, we define a method to force prior knowledge into the DNN. We perform the experimental assessment on a XMTC task related to a real application domain problem, namely the automatic labelling of biomedical scientific literature extracted from the PubMed. The obtained preliminary results show that, despite the very high computational time needed to update the network weights, a slight performance improvement is obtained, with respect to a classical approach based on Convolution Neural Network (CNN). Some considerations will be drawn out to figure out possible key readings.

Exploit Hierarchical Label Knowledge for Deep Learning

Francesco Gargiulo;Stefano Silvestri
;
Mario Ciampi
2019

Abstract

In this paper we propose a methodology based on a complex deep learning network topology, named Hierarchical Deep Neural Network (HDNN), applied to eXtreme Multi-label Text Classification (XMTC) problem. The HDNN topology reproduces the label hierarchy. The main idea arises directly from the assumption that, if the label-set structure is defined, forcing this information into the network topology could improve classification performances and results interpretation. In this way, we define a method to force prior knowledge into the DNN. We perform the experimental assessment on a XMTC task related to a real application domain problem, namely the automatic labelling of biomedical scientific literature extracted from the PubMed. The obtained preliminary results show that, despite the very high computational time needed to update the network weights, a slight performance improvement is obtained, with respect to a classical approach based on Convolution Neural Network (CNN). Some considerations will be drawn out to figure out possible key readings.
2019
Istituto di Calcolo e Reti ad Alte Prestazioni - ICAR
Deep Learning
Hierarchical Deep Neural Network
Extreme Multi-label Text Classification
NLP
File in questo prodotto:
File Dimensione Formato  
Exploit_hierarchical_label.pdf

non disponibili

Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 125.5 kB
Formato Adobe PDF
125.5 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/361252
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 5
  • ???jsp.display-item.citation.isi??? ND
social impact