We propose a technique for automatic recognition of content in images. Our technique uses machine learning methods to build classifiers which are able to decide about the presence of semantic concepts in images. Our classifiers exploit a representation of images in terms of vectors of visual terms. A visual term represents a set of visually similar regions that can be found in images. Various types of visual terms are used at the same time to take into account various similarity criteria and region representations that are available to compare regions. Specifically, we compare regions using the 5 MPEG-7 visual descriptors. An image is indexed by first using a segmentation algorithm to extract its regions, and then the image is associated with the visual terms that are more similar to the extracted regions. The proposed technique offers very good performance as demonstrated by the experiments that we performed.

Use of weighted visual terms for machine learning techniques for image content recognition relying on MPEG-7 visual descriptors

Amato G;Savino P;
2008

Abstract

We propose a technique for automatic recognition of content in images. Our technique uses machine learning methods to build classifiers which are able to decide about the presence of semantic concepts in images. Our classifiers exploit a representation of images in terms of vectors of visual terms. A visual term represents a set of visually similar regions that can be found in images. Various types of visual terms are used at the same time to take into account various similarity criteria and region representations that are available to compare regions. Specifically, we compare regions using the 5 MPEG-7 visual descriptors. An image is indexed by first using a segmentation algorithm to extract its regions, and then the image is associated with the visual terms that are more similar to the extracted regions. The proposed technique offers very good performance as demonstrated by the experiments that we performed.
2008
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
978-1-60558-316-7
Information Retrieval
File in questo prodotto:
File Dimensione Formato  
prod_91903-doc_23809.pdf

solo utenti autorizzati

Descrizione: Use of weighted visual terms for machine learning techniques for image content recognition relying on MPEG-7 visual descriptors
Tipologia: Versione Editoriale (PDF)
Dimensione 111.56 kB
Formato Adobe PDF
111.56 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/58560
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact