In this report we define a representation formalism for describing multimedia documents containing any combination of video, still images, music, speech, and text. A document description in this formalism includes metadata (author, title, etc.), as well as the results of automatic feature extraction for use in indexing, search, and browsing. By defining a single representation format that covers all media, we intend to support cross-media search; for example, an image similarity search might retrieve both videos and still images; and a keyword search on titles might receive documents of all media types. The representation is based on the MPEG-7 standard, with extensions to cover media, features, and metadata not covered by the standard. MPEG-7 provides a rich vocabulary for describing document structure and content, and its status as a standard means that SAPIR will be interoperable with other multimedia management systems. The SAPIR-specific extensions are defined in such a way as to preserve this interoperability. The report describes project activities undertaken as part of task T3.1

SAPIR - D3.1 - Common Schema for Feature Extraction

Falchi F;
2007

Abstract

In this report we define a representation formalism for describing multimedia documents containing any combination of video, still images, music, speech, and text. A document description in this formalism includes metadata (author, title, etc.), as well as the results of automatic feature extraction for use in indexing, search, and browsing. By defining a single representation format that covers all media, we intend to support cross-media search; for example, an image similarity search might retrieve both videos and still images; and a keyword search on titles might receive documents of all media types. The representation is based on the MPEG-7 standard, with extensions to cover media, features, and metadata not covered by the standard. MPEG-7 provides a rich vocabulary for describing document structure and content, and its status as a standard means that SAPIR will be interoperable with other multimedia management systems. The SAPIR-specific extensions are defined in such a way as to preserve this interoperability. The report describes project activities undertaken as part of task T3.1
2007
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Rapporto intermedio di progetto
MPEG-7
Image
Video
Speech
Music
File in questo prodotto:
File Dimensione Formato  
prod_160800-doc_122829.pdf

solo utenti autorizzati

Descrizione: SAPIR - Common Schema for Feature Extraction
Dimensione 243.1 kB
Formato Adobe PDF
243.1 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/152966
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact