The indexing process required by content-based video databases is correlated to visual characteristics of their content. A preliminary step consists in the partitioning of the video into a sequence of short dynamic scenes, generally characterised by a set of homogeneous features. Each scene may be therefore characterised by the features of one or more representative frames, i.e. still images. In this paper, a method for scene detection of MPEG-1 and MPEG-2 video sequences is reported. The method does not need to decode the streams, because it is based on the analysis of their external characteristics, such as the frame pattern and the sizes of I-, P- and B-frames. Changes in above characteristics are used to detect the frames representing each scene, using heuristics and statistical considerations. Since the analysis is based on very simple computation, the algorithm performs very fast. Its computational cost is Linear dependent on the number of frames. On the other hand, the method is not well suited for video clip of short length, for which a statistical analysis is not significant. For its low computational cost and accuracy, the method is a suitable tool for preliminary segmentation step of long video sequences.
Scene detection for MPEG video sequences
Lodato C;Lopes S
2001
Abstract
The indexing process required by content-based video databases is correlated to visual characteristics of their content. A preliminary step consists in the partitioning of the video into a sequence of short dynamic scenes, generally characterised by a set of homogeneous features. Each scene may be therefore characterised by the features of one or more representative frames, i.e. still images. In this paper, a method for scene detection of MPEG-1 and MPEG-2 video sequences is reported. The method does not need to decode the streams, because it is based on the analysis of their external characteristics, such as the frame pattern and the sizes of I-, P- and B-frames. Changes in above characteristics are used to detect the frames representing each scene, using heuristics and statistical considerations. Since the analysis is based on very simple computation, the algorithm performs very fast. Its computational cost is Linear dependent on the number of frames. On the other hand, the method is not well suited for video clip of short length, for which a statistical analysis is not significant. For its low computational cost and accuracy, the method is a suitable tool for preliminary segmentation step of long video sequences.I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.