Perceiving Systems, Computer Vision

Analysis of gesture and action in technical talks for video indexing

1997

Conference Paper

ps


In this paper, we present an automatic system for analyzing and annotating video sequences of technical talks. Our method uses a robust motion estimation technique to detect key frames and segment the video sequence into subsequences containing a single overhead slide. The subsequences are stabilized to remove motion that occurs when the speaker adjusts their slides. Any changes remaining between frames in the stabilized sequences may be due to speaker gestures such as pointing or writing and we use active contours to automatically track these potential gestures. Given the constrained domain we define a simple ``vocabulary'' of actions which can easily be recognized based on the active contour shape and motion. The recognized actions provide a rich annotation of the sequence that can be used to access a condensed version of the talk from a web page.

Author(s): Ju, S. X. and Black, M. J. and Minneman, S. and Kimber, D.
Book Title: IEEE Conf. on Computer Vision and Pattern Recognition
Pages: 595-601
Year: 1997
Month: June
Publisher: CVPR-97

Department(s): Perceiving Systems
Bibtex Type: Conference Paper (inproceedings)
Paper Type: Conference

Address: Puerto Rico

Links: pdf

BibTex

@inproceedings{Black:CVPR:1997,
  title = {Analysis of gesture and action in technical talks for video indexing},
  author = {Ju, S. X. and Black, M. J. and Minneman, S. and Kimber, D.},
  booktitle = {IEEE Conf. on Computer Vision and Pattern Recognition},
  pages = {595-601},
  publisher = { CVPR-97},
  address = {Puerto Rico},
  month = jun,
  year = {1997},
  doi = {},
  month_numeric = {6}
}