TreeBoost.MH : a boosting algorithm for multi-label hierarchical text categorization

Esuli A.; Fagni T.; Sebastiani F.
2006

Abstract

In this paper we propose TreeBoost.MH, an algorithm for multi-label Hierarchical Text Categorization (HTC) that is a hierarchical variant of AdaBoost.MH. TreeBoost.MH embodies several intuitions that had previously arisen within HTC, e.g. that both feature selection and the selection of negative training examples should be performed 'locally', i.e. by paying attention to the topology of the classification scheme. It also embodies the novel intuition that the weight distribution that boosting algorithms update at every boosting round should likewise be updated 'locally'. We present the results of experiments with TreeBoost.MH on two HTC benchmarks, and analytically discuss its computational cost.
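
As an illustration of what selecting negative training examples 'locally' could mean, the sketch below assumes one common reading, which is not confirmed by the abstract itself: for a non-root category, candidate negatives are drawn only from documents that belong to its parent category. The toy hierarchy, the document labels, and all function names are hypothetical.

```python
# Hypothetical toy hierarchy: child -> parent (None marks a top-level category).
PARENT = {"sports": None, "science": None, "soccer": "sports", "tennis": "sports"}

# Hypothetical multi-label training set: document id -> set of assigned categories.
LABELS = {
    "d1": {"soccer"},
    "d2": {"tennis"},
    "d3": {"soccer", "tennis"},
    "d4": {"science"},
}

def ancestors_and_self(cat):
    """Return cat together with all of its ancestors in the toy hierarchy."""
    out = set()
    while cat is not None:
        out.add(cat)
        cat = PARENT[cat]
    return out

def belongs_to(doc, cat):
    """A document belongs to cat if one of its labels is cat or a descendant of cat."""
    return any(cat in ancestors_and_self(label) for label in LABELS[doc])

def local_training_sets(cat):
    """Assumed 'local' policy: restrict the pool to documents of the parent category."""
    parent = PARENT[cat]
    pool = [d for d in sorted(LABELS) if parent is None or belongs_to(d, parent)]
    positives = [d for d in pool if belongs_to(d, cat)]
    negatives = [d for d in pool if not belongs_to(d, cat)]
    return positives, negatives

print(local_training_sets("soccer"))  # (['d1', 'd3'], ['d2'])
```

Under this assumed policy the classifier for "soccer" never sees "d4" as a negative example, since "d4" lies outside the "sports" subtree; a 'global' policy would instead use every non-soccer document.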
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
I.2.6 Learning
Text categorization
Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/43507
Citations
  • Scopus 189