In this paper, we introduce the fifth release of VISIONE, an advanced video retrieval system offering diverse search functionalities. The user can search for a target video using textual prompts, drawing objects and colors appearing in the target scenes in a canvas, or images as query examples to search for video keyframes with similar content. Compared to the previous version of our system, which was runner-up at VBS 2023, the forthcoming release, set to participate in VBS 2024, showcases a refined user interface that enhances its usability and updated AI models for more effective video content analysis.
VISIONE 5.0: enhanced user interface and AI models for VBS2024
Giuseppe AmatoCo-ultimo
;Paolo BolettieriCo-primo
;Fabio CarraraCo-primo
;Fabrizio FalchiCo-ultimo
;Claudio GennaroCo-ultimo
;Nicola MessinaCo-primo
;Lucia Vadicamo
Co-primo
;Claudio VairoCo-primo
2024
Abstract
In this paper, we introduce the fifth release of VISIONE, an advanced video retrieval system offering diverse search functionalities. The user can search for a target video using textual prompts, drawing objects and colors appearing in the target scenes in a canvas, or images as query examples to search for video keyframes with similar content. Compared to the previous version of our system, which was runner-up at VBS 2023, the forthcoming release, set to participate in VBS 2024, showcases a refined user interface that enhances its usability and updated AI models for more effective video content analysis.File | Dimensione | Formato | |
---|---|---|---|
2024_VBS2024_VISIONE_V_431.pdf
Open Access dal 29/01/2025
Descrizione: This is the Author Accepted Manuscript (postprint) version of the following paper: Amato G. et al., “VISIONE 5.0: Enhanced User Interface and AI Models for VBS2024”, 2024, peer-reviewed and accepted for publication in “MultiMedia Modeling 30th International Conference, MMM 2024, Amsterdam, The Netherlands, January 29 – February 2, 2024, Proceedings, Part IV”. DOI: 110.1007/978-3-031-53302-0_29
Tipologia:
Documento in Post-print
Licenza:
Altro tipo di licenza
Dimensione
581.45 kB
Formato
Adobe PDF
|
581.45 kB | Adobe PDF | Visualizza/Apri |
978-3-031-53302-0_29.pdf
solo utenti autorizzati
Descrizione: VISIONE 5.0: Enhanced User Interface and AI Models for VBS2024
Tipologia:
Versione Editoriale (PDF)
Licenza:
NON PUBBLICO - Accesso privato/ristretto
Dimensione
1.24 MB
Formato
Adobe PDF
|
1.24 MB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.