Automatic people counting from images has recently drawn attention for urban monitoring in modern Smart Cities due to the ubiquity of surveillance camera networks. Current computer vision techniques rely on deep learning-based algorithms that estimate pedestrian densities in still, individual images. Only a bunch of works take advantage of temporal consistency in video sequences. In this work, we propose a spatio-temporal attentive neural network to estimate the number of pedestrians from surveillance videos. By taking advantage of the temporal correlation between consecutive frames, we lowered state-of-the-art count error by 5% and localization error by 7.5% on the widely-used FDST benchmark.

A spatio-temporal attentive network for video-based crowd counting

Ciampi L;Falchi F;Gennaro C;Messina N
2022

Abstract

Automatic people counting from images has recently drawn attention for urban monitoring in modern Smart Cities due to the ubiquity of surveillance camera networks. Current computer vision techniques rely on deep learning-based algorithms that estimate pedestrian densities in still, individual images. Only a bunch of works take advantage of temporal consistency in video sequences. In this work, we propose a spatio-temporal attentive neural network to estimate the number of pedestrians from surveillance videos. By taking advantage of the temporal correlation between consecutive frames, we lowered state-of-the-art count error by 5% and localization error by 7.5% on the widely-used FDST benchmark.
2022
Istituto di Scienza e Tecnologie dell'Informazione "Alessandro Faedo" - ISTI
Inglese
2022 IEEE Symposium on Computers and Communications (ISCC)
ISCC 2022 - 27th IEEE Symposium on Computers and Communications
6
978-1-6654-9792-3
https://ieeexplore.ieee.org/document/9913019
Sì, ma tipo non specificato
30/06/2022-03/07/2022
Rhodes Island, Greece
Crowd counting
Deep learning
Visual counting
Sma
6
partially_open
Avvenuti, M; Bongiovanni, M; Ciampi, L; Falchi, F; Gennaro, C; Messina, N
273
info:eu-repo/semantics/conferenceObject
04 Contributo in convegno::04.01 Contributo in Atti di convegno
   A European Excellence Centre for Media, Society and Democracy
   AI4Media
   H2020
   951911
File in questo prodotto:
File Dimensione Formato  
prod_470912-doc_191113.pdf

accesso aperto

Descrizione: Preprint - A spatio-temporal attentive network for video-based crowd counting
Tipologia: Versione Editoriale (PDF)
Dimensione 1.41 MB
Formato Adobe PDF
1.41 MB Adobe PDF Visualizza/Apri
prod_470912-doc_192352.pdf

solo utenti autorizzati

Descrizione: A spatio-temporal attentive network for video-based crowd counting
Tipologia: Versione Editoriale (PDF)
Dimensione 1.25 MB
Formato Adobe PDF
1.25 MB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/20.500.14243/415245
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 14
  • ???jsp.display-item.citation.isi??? 9
social impact