TY - JOUR
T1 - Rapid estimation of camera motion from compressed video with application to video annotation
AU - Tan, Yap Peng
AU - Saur, Drew D.
AU - Kulkarni, Sanjeev R.
AU - Ramadge, Peter J.
N1 - Funding Information:
Manuscript received September 22, 1997; revised August 31, 1998. The research of Y.-P. Tan was supported in part by an IBM Graduate Fellowship. The research of D. D. Saur was supported in part by the New Jersey Center for Multimedia Research. This paper was recommended by Associate Editor R. Stevenson.
PY - 2000
Y1 - 2000
N2 - As digital video becomes more pervasive, efficient ways of searching and annotating video according to content will be increasingly important. Such tasks arise, for example, in the management of digital video libraries for content-based retrieval and browsing. In this paper, we develop tools based on camera motion for analyzing and annotating a class of structured video using the low-level information available directly from MPEG-compressed video. In particular, we show that in certain structured settings, it is possible to obtain reliable estimates of camera motion by directly processing data easily obtained from the MPEG format. Working directly with the compressed video greatly reduces the processing time and enhances storage efficiency. As an illustration of this idea, we have developed a simple basketball annotation system which combines the low-level information extracted from an MPEG stream with the prior knowledge of basketball structure to provide high-level content analysis, annotation, and browsing for events such as wide-angle and close-up views, fast breaks, probable shots at the basket, etc. The methods used in this example should also be useful in the analysis of high-level content of structured video in other domains.
AB - As digital video becomes more pervasive, efficient ways of searching and annotating video according to content will be increasingly important. Such tasks arise, for example, in the management of digital video libraries for content-based retrieval and browsing. In this paper, we develop tools based on camera motion for analyzing and annotating a class of structured video using the low-level information available directly from MPEG-compressed video. In particular, we show that in certain structured settings, it is possible to obtain reliable estimates of camera motion by directly processing data easily obtained from the MPEG format. Working directly with the compressed video greatly reduces the processing time and enhances storage efficiency. As an illustration of this idea, we have developed a simple basketball annotation system which combines the low-level information extracted from an MPEG stream with the prior knowledge of basketball structure to provide high-level content analysis, annotation, and browsing for events such as wide-angle and close-up views, fast breaks, probable shots at the basket, etc. The methods used in this example should also be useful in the analysis of high-level content of structured video in other domains.
UR - http://www.scopus.com/inward/record.url?scp=0033891550&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0033891550&partnerID=8YFLogxK
U2 - 10.1109/76.825867
DO - 10.1109/76.825867
M3 - Article
AN - SCOPUS:0033891550
SN - 1051-8215
VL - 10
SP - 133
EP - 146
JO - IEEE Transactions on Circuits and Systems for Video Technology
JF - IEEE Transactions on Circuits and Systems for Video Technology
IS - 1
ER -