In this paper, we introduce the fifth release of VISIONE, an advanced video retrieval system offering diverse search functionalities. The user can search for a target video using textual prompts, drawing objects and colors appearing in the target scenes in a canvas, or images as query examples to search for video keyframes with similar content. Compared to the previous version of our system, which was runner-up at VBS 2023, the forthcoming release, set to participate in VBS 2024, showcases a refined user interface that enhances its usability and updated AI models for more effective video content analysis.

VISIONE 5.0: enhanced user interface and AI models for VBS2024

Giuseppe Amato
Co-ultimo
;
Paolo Bolettieri
Co-primo
;
Fabio Carrara
Co-primo
;
Fabrizio Falchi
Co-ultimo
;
Claudio Gennaro
Co-ultimo
;
Nicola Messina
Co-primo
;
Lucia Vadicamo
Co-primo
;
Claudio Vairo
Co-primo
2024

Abstract

In this paper, we introduce the fifth release of VISIONE, an advanced video retrieval system offering diverse search functionalities. The user can search for a target video using textual prompts, drawing objects and colors appearing in the target scenes in a canvas, or images as query examples to search for video keyframes with similar content. Compared to the previous version of our system, which was runner-up at VBS 2023, the forthcoming release, set to participate in VBS 2024, showcases a refined user interface that enhances its usability and updated AI models for more effective video content analysis.
2024
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
978-3-031-53302-0
Information Search and Retrieval
Content-based video retrieval
Video search
Surrogate Text Representation
Multi-modal Retrieval
Cross-modal retrieval
File in questo prodotto:
File Dimensione Formato  
2024_VBS2024_VISIONE_V_431.pdf

embargo fino al 28/01/2025

Descrizione: This is the Author Accepted Manuscript (postprint)  version of the following paper: Amato G. et al., “VISIONE 5.0: Enhanced User Interface and AI Models for VBS2024”, 2024, peer-reviewed and accepted for publication in “MultiMedia Modeling 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part IV”. DOI: 110.1007/978-3-031-53302-0_29
Tipologia: Documento in Post-print
Licenza: Altro tipo di licenza
Dimensione 581.45 kB
Formato Adobe PDF
581.45 kB Adobe PDF   Visualizza/Apri   Richiedi una copia
978-3-031-53302-0_29.pdf

solo utenti autorizzati

Descrizione: VISIONE 5.0: Enhanced User Interface and AI Models for VBS2024
Tipologia: Versione Editoriale (PDF)
Licenza: NON PUBBLICO - Accesso privato/ristretto
Dimensione 1.24 MB
Formato Adobe PDF
1.24 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/485001
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? ND
social impact