A hierarchical classification strategy for digital documents

Brambilla, C.
2002

Abstract

In this work we report our experience in the use of CART classifiers in the difficult problem of distinguishing among photographs, graphics, texts and compound digital documents. To cope with the great variety of compound documents we have designed a hierarchical strategy which first classifies documents as compound or non-compound by verifying their homogeneity. Non-compound documents are then classified as photographs, graphics or texts. Documents are indexed only by low-level perceptual features such as color, texture and shape.
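The two-stage routing described in the abstract can be sketched as below. This is a minimal illustration only, not the author's implementation: in the paper both stages are CART classifiers trained on color, texture and shape features, whereas here the two decision functions are hand-written placeholders. The names `classify_document`, `toy_is_homogeneous` and `toy_classify_simple`, the two features (`colour_richness`, `edge_density`), and all thresholds are assumptions made up for the example.

```python
from typing import Callable, Sequence

Features = Sequence[float]


def classify_document(
    region_features: Sequence[Features],
    is_homogeneous: Callable[[Sequence[Features]], bool],
    classify_simple: Callable[[Features], str],
) -> str:
    """Two-stage routing: compound check first, then a three-way label."""
    # Stage 1: a document whose regions disagree is labelled compound.
    if not is_homogeneous(region_features):
        return "compound"
    # Stage 2: summarise a homogeneous document by its per-feature mean
    # and pass that single vector to the three-way classifier.
    n = len(region_features)
    mean = [sum(col) / n for col in zip(*region_features)]
    return classify_simple(mean)


def toy_is_homogeneous(regions: Sequence[Features], tol: float = 0.25) -> bool:
    # Hypothetical rule (stand-in for a trained classifier): every feature
    # varies by at most `tol` across the regions of the document.
    return all(max(col) - min(col) <= tol for col in zip(*regions))


def toy_classify_simple(f: Features) -> str:
    # Hypothetical hand-written tree over (colour_richness, edge_density),
    # standing in for a trained CART classifier.
    colour_richness, edge_density = f
    if colour_richness > 0.6:
        return "photograph"
    return "text" if edge_density > 0.5 else "graphic"


if __name__ == "__main__":
    mixed_page = [(0.9, 0.2), (0.1, 0.9)]  # one photo-like, one text-like region
    print(classify_document(mixed_page, toy_is_homogeneous, toy_classify_simple))
```

Passing the two stages in as callables keeps the routing logic independent of how each classifier is built, so trained decision trees could replace the toy rules without changing `classify_document`.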
Istituto di Matematica Applicata e Tecnologie Informatiche (IMATI)
Keywords: classification; digital images

Use this identifier to cite or link to this document: https://hdl.handle.net/20.500.14243/51496