TreeBoost.MH: a boosting algorithm for multi-label hierarchical text categorization

Esuli A; Fagni T; Sebastiani F
2006

Abstract

In this paper we propose TreeBoost.MH, an algorithm for multi-label Hierarchical Text Categorization (HTC) that is a hierarchical variant of AdaBoost.MH. TreeBoost.MH embodies several intuitions that had arisen before within HTC, e.g. that both feature selection and the selection of negative training examples should be performed 'locally', i.e. by paying attention to the topology of the classification scheme. It also embodies the novel intuition that the weight distribution that boosting algorithms update at every boosting round should likewise be updated 'locally'. We present the results of experiments with TreeBoost.MH on two HTC benchmarks, and analytically discuss its computational cost.
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Istituto di informatica e telematica - IIT
English
Crestani F., Ferragina P., Sanderson M.
String Processing and Information Retrieval
SPIRE 2006: 13th International Conference on String Processing and Information Retrieval
Volume: 4209
First page: 13
Last page: 24
Number of pages: 12
ISBN: 978-3-540-45774-9
https://link.springer.com/chapter/10.1007/11880561_2
Yes, but type not specified
11-13 October 2006
Glasgow, UK
I.2.6 Learning
Text categorization
PUMA code: cnr.isti/2006-A2-45
Electronic
3
restricted
Esuli, A; Fagni, T; Sebastiani, F
273
info:eu-repo/semantics/conferenceObject
04 Conference contribution::04.01 Contribution in conference proceedings
Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/43507