Volume 111, Issue 2, 2008, Pages 142-154

Audiovisual integration with Segment Models for tennis video parsing

Author keywords

Hidden Markov Models; Multimodal fusion; Segment Models; Video indexing; Video summarization

Indexed keywords

COMPUTATIONAL GRAMMARS; FUSION REACTIONS; IMAGING TECHNIQUES; LEARNING SYSTEMS; MARKOV PROCESSES; NUCLEAR PHYSICS; OBJECT RECOGNITION; PHOTOGRAPHY; RECORDING INSTRUMENTS; SPEECH RECOGNITION; VIDEO RECORDING; VISUAL COMMUNICATION

EID: 47049092919     PISSN: 10773142     EISSN: 1090235X     Source Type: Journal    
DOI: 10.1016/j.cviu.2007.09.002     Document Type: Article
Times cited: 16

References (32)
  • 2
    • 10044236762
    • Multimodal video indexing: a review of the state-of-the-art
    • Snoek C., and Worring M. Multimodal video indexing: a review of the state-of-the-art. Multimedia Tools and Applications 25 1 (2005) 5-35
    • (2005) Multimedia Tools and Applications, vol. 25, Issue 1, pp. 5-35
    • Snoek, C.; Worring, M.
  • 3
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Rabiner L. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77 2 (1989) 257-285
    • (1989) Proceedings of the IEEE, vol. 77, Issue 2, pp. 257-285
    • Rabiner, L.
  • 4
    • 0030705367
    • W. Wolf, Hidden Markov model parsing of video programs, in: Proceedings of ICASSP, 1997, pp. 2609-2611.
  • 6
    • 0030245363
    • From HMMs to segment models: a unified view of stochastic modeling for speech recognition
    • Ostendorf M., Digalakis V., and Kimball O. From HMMs to segment models: a unified view of stochastic modeling for speech recognition. IEEE Transactions on Speech and Audio Processing 4 5 (1996) 360-378
    • (1996) IEEE Transactions on Speech and Audio Processing, vol. 4, Issue 5, pp. 360-378
    • Ostendorf, M.; Digalakis, V.; Kimball, O.
  • 8
    • 78651471921
    • J. Calic, N. Campbell, S. Dasiopoulou, Y. Kompatsiaris, An overview of multimodal video representation for semantic analysis, in: Proceedings of the European Workshop on the Integration of Knowledge, Semantics and Digital Media Technologies (EWIMT 2005), IEE, 2005.
  • 9
    • 47049100015
    • J. Huang, Z. Liu, Y. Wang, Y. Chen, E. Wong, Integration of multimodal features for video classification based on HMM, in: Proceedings of IEEE Signal Processing Society Workshop on Multimedia Signal Processing, 1999, pp. 53-58.
  • 10
    • 0031619139
    • J. Boreczky, L. Wilcox, A hidden Markov model framework for video segmentation using audio and image features, in: Proceedings of ICASSP, 1998, pp. 3741-3744.
  • 11
    • 26444510258
    • T. Bae, S. Jin, Y. Ro, Video segmentation using hidden Markov model with multimodal features, in: Proceedings of the International Conference on Image and Video Retrieval, 2004, pp. 401-409.
  • 12
    • 0035368101
    • Multi-modal dialog scene detection using hidden Markov models for content-based multimedia indexing
    • Alatan A., Akansu A., and Wolf W. Multi-modal dialog scene detection using hidden Markov models for content-based multimedia indexing. Multimedia Tools and Applications 14 2 (2001) 137-151
    • (2001) Multimedia Tools and Applications, vol. 14, Issue 2, pp. 137-151
    • Alatan, A.; Akansu, A.; Wolf, W.
  • 13
    • 84937046785
    • N. Dimitrova, L. Agnihotri, G. Wei, Video classification based on HMM using text and faces, in: Proceedings of the European Signal Processing Conference, 2000.
  • 14
    • 0032629746
    • S. Eickeler, S. Muller, Content-based video indexing of TV broadcast news using Hidden Markov Models, in: IEEE Int. Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1999, pp. 2997-3000.
  • 15
    • 0034846553
    • U. Iurgel, R. Meermeier, S. Eickeler, G. Rigoll, New approaches to audio-visual segmentation of TV news for automatic topic retrieval, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2001, pp. 1397-1400.
  • 17
    • 0030355935
    • H. Bourlard, S. Dupont, A new ASR approach based on independent processing and recombination of partial frequency bands, in: Proceedings of the ICSLP'96, 1, Philadelphia, PA, 1996, pp. 426-429.
  • 18
    • 0034842451
    • H. Glotin, D. Vergyri, C. Neti, G. Potamianos, J. Luettin, Weighting schemes for audio-visual fusion in speech recognition, in: Proceedings of Int. Conf. Acoust. Speech Signal Process, 2001.
  • 19
    • 84898971246
    • An asynchronous hidden Markov model for audio-visual speech recognition
    • Advances in Neural Information Processing Systems. Becker S., Thrun S., and Obermayer K. (Eds), MIT Press
    • Bengio S. An asynchronous hidden Markov model for audio-visual speech recognition. In: Becker S., Thrun S., and Obermayer K. (Eds). Advances in Neural Information Processing Systems. NIPS 15 (2003), MIT Press 1237-1244
    • (2003) NIPS 15, pp. 1237-1244
    • Bengio, S.
  • 21
    • 4944221356
    • Layered representations for learning and inferring office activity from multiple sensory channels
    • Oliver N., Garg A., and Horvitz E. Layered representations for learning and inferring office activity from multiple sensory channels. Computer Vision and Image Understanding 96 2 (2004) 163-180
    • (2004) Computer Vision and Image Understanding, vol. 96, Issue 2, pp. 163-180
    • Oliver, N.; Garg, A.; Horvitz, E.
  • 22
    • 85199275816
    • D. Zhang, D. Gatica-Perez, S. Bengio, I. McCowan, G. Lathoud, Modeling individual and group actions in meetings: a two-layer HMM framework, in: IEEE Workshop on Event Mining at the Conference on Computer Vision and Pattern Recognition, CVPR, vol. 7, 2004, pp. 117-124.
  • 23
    • 2142771243
    • Structure analysis of soccer video with domain knowledge and hidden Markov models
    • Xie L., Xu P., Chang S.-F., Divakaran A., and Sun H. Structure analysis of soccer video with domain knowledge and hidden Markov models. Pattern Recognition Letters 25 7 (2004) 767-775
    • (2004) Pattern Recognition Letters, vol. 25, Issue 7, pp. 767-775
    • Xie, L.; Xu, P.; Chang, S.-F.; Divakaran, A.; Sun, H.
  • 24
    • 34250761822
    • M. Delakis, G. Gravier, P. Gros, Score oriented Viterbi search in sport video structuring using HMM and segment models, in: Proceedings of the International Workshop on Multimedia Signal Processing (MMSP'06), 2006.
  • 25
    • 0034442267
    • B. Truong, C. Dorai, S. Venkatesh, New enhancements to cut, fade, and dissolve detection processes in video segmentation, in: Proceedings of the ACM on Multimedia, 2000, pp. 219-227.
  • 26
    • 85143191520
    • R. Dahyot, A. Kokaram, N. Rea, H. Denman, Joint audio visual retrieval for tennis broadcasts, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3, 2003, pp. 561-564.
  • 27
    • 84908281148
    • D. Zhong, S.-F. Chang, Structure analysis of sports video using domain models, in: IEEE International Conference on Multimedia and Expo, 2001.
  • 29
    • 11244315230
    • M. Betser, G. Gravier, Multiple events tracking in sound tracks, in: Intl. Conf. Multimedia and Exhibition, 2004.
  • 30
    • 47049128594
    • A. Tritschler, A segmentation-enabled speech recognition application using the BIC criterion, Ph.D. thesis, Institut EURECOM, France (1998).
  • 31
    • 0032119668
    • The hierarchical hidden Markov model: analysis and applications
    • Fine S., Singer Y., and Tishby N. The hierarchical hidden Markov model: analysis and applications. Machine Learning 32 1 (1998) 41-62
    • (1998) Machine Learning, vol. 32, Issue 1, pp. 41-62
    • Fine, S.; Singer, Y.; Tishby, N.
  • 32
    • 84883126733
    • C. Snoek, M. Worring, A. Smeulders, Early versus late fusion in semantic video analysis, in: MULTIMEDIA'05: Proceedings of the 13th annual ACM international conference on Multimedia, ACM Press, New York, NY, USA, 2005, pp. 399-402.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.