-
2
-
-
10044236762
-
Multimodal video indexing: a review of the state-of-the-art
-
Snoek C., and Worring M. Multimodal video indexing: a review of the state-of-the-art. Multimedia Tools and Applications 25 1 (2005) 5-35
-
(2005)
Multimedia Tools and Applications
, vol.25
, Issue.1
, pp. 5-35
-
-
Snoek, C.1
Worring, M.2
-
3
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
Rabiner L. A tutorial on hidden Markov models and selected applications in speech recognition. Proceedings of the IEEE 77 2 (1989) 257-285
-
(1989)
Proceedings of the IEEE
, vol.77
, Issue.2
, pp. 257-285
-
-
Rabiner, L.1
-
4
-
-
0030705367
-
-
W. Wolf, Hidden Markov model parsing of video programs, in: Proceedings of ICASSP, 1997, pp. 2609-2611.
-
W. Wolf, Hidden Markov model parsing of video programs, in: Proceedings of ICASSP, 1997, pp. 2609-2611.
-
-
-
-
5
-
-
33749394511
-
Audiovisual integration for tennis broadcast structuring
-
Kijak E., Gravier G., Oisel L., and Gros P. Audiovisual integration for tennis broadcast structuring. Multimedia Tools and Applications 30 3 (2006) 289-311
-
(2006)
Multimedia Tools and Applications
, vol.30
, Issue.3
, pp. 289-311
-
-
Kijak, E.1
Gravier, G.2
Oisel, L.3
Gros, P.4
-
8
-
-
78651471921
-
-
J. Calic, N. Campbell, S. Dasiopoulou, Y. Kompatsiaris, An overview of multimodal video representation for semantic analysis, in: Proceedings of the European Workshop on the Integration of Knowledge, Semantics and Digital Media Technologies (EWIMT 2005), IEE, 2005.
-
J. Calic, N. Campbell, S. Dasiopoulou, Y. Kompatsiaris, An overview of multimodal video representation for semantic analysis, in: Proceedings of the European Workshop on the Integration of Knowledge, Semantics and Digital Media Technologies (EWIMT 2005), IEE, 2005.
-
-
-
-
9
-
-
47049100015
-
-
J. Huang, Z. Liu, Y. Wang, Y. Chen, E. Wong, Integration of multimodal features for video classification based on HMM, in: Proceedings of IEEE Signal Processing Society Workshop on Multimedia Signal Processing, 1999, pp. 53-58.
-
J. Huang, Z. Liu, Y. Wang, Y. Chen, E. Wong, Integration of multimodal features for video classification based on HMM, in: Proceedings of IEEE Signal Processing Society Workshop on Multimedia Signal Processing, 1999, pp. 53-58.
-
-
-
-
10
-
-
0031619139
-
-
J. Boreczky, L. Wilcox, A hidden Markov model framework for video segmentation using audio and image features, in: Proceedings of ICASSP, 1998, pp. 3741-3744.
-
J. Boreczky, L. Wilcox, A hidden Markov model framework for video segmentation using audio and image features, in: Proceedings of ICASSP, 1998, pp. 3741-3744.
-
-
-
-
11
-
-
26444510258
-
-
T. Bae, S. Jin, Y. Ro, Video segmentation using hidden Markov model with multimodal features, in: Proceedings of the International Conference on Image and Video Retrieval, 2004, pp. 401-409.
-
T. Bae, S. Jin, Y. Ro, Video segmentation using hidden Markov model with multimodal features, in: Proceedings of the International Conference on Image and Video Retrieval, 2004, pp. 401-409.
-
-
-
-
12
-
-
0035368101
-
Multi-modal dialog scene detection using hidden Markov models for content-based multimedia indexing
-
Alatan A., Akansu A., and Wolf W. Multi-modal dialog scene detection using hidden Markov models for content-based multimedia indexing. Multimedia Tools and Applications 14 2 (2001) 137-151
-
(2001)
Multimedia Tools and Applications
, vol.14
, Issue.2
, pp. 137-151
-
-
Alatan, A.1
Akansu, A.2
Wolf, W.3
-
13
-
-
84937046785
-
-
N. Dimitrova, L. Agnihorti, G. Wei, Video classification based on HMM using text and faces, in: Proceedings of the European Signal Processing Conference, 2000.
-
N. Dimitrova, L. Agnihorti, G. Wei, Video classification based on HMM using text and faces, in: Proceedings of the European Signal Processing Conference, 2000.
-
-
-
-
14
-
-
0032629746
-
-
S. Eickeler, S. Muller, Content-based video indexing of TV broadcast news using Hidden Markov Models, in: IEEE Int. Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1999, pp. 2997-3000.
-
S. Eickeler, S. Muller, Content-based video indexing of TV broadcast news using Hidden Markov Models, in: IEEE Int. Conference on Acoustics, Speech, and Signal Processing (ICASSP), 1999, pp. 2997-3000.
-
-
-
-
15
-
-
0034846553
-
-
U. Iurgel, R. Meermeier, S. Eickeler, G. Rigoll, New approaches to audio-visual segmentation of TV news for automatic topic retrieval, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2001, pp. 1397-1400.
-
U. Iurgel, R. Meermeier, S. Eickeler, G. Rigoll, New approaches to audio-visual segmentation of TV news for automatic topic retrieval, in: IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), 2001, pp. 1397-1400.
-
-
-
-
16
-
-
4544290191
-
Recent advances in the automatic recognition of audiovisual speech
-
Potamianos G., Neti C., Gravier G., Garg A., and Senior A.W. Recent advances in the automatic recognition of audiovisual speech. Proceedings of the IEEE 91 9 (2003) 1306-1326
-
(2003)
Proceedings of the IEEE
, vol.91
, Issue.9
, pp. 1306-1326
-
-
Potamianos, G.1
Neti, C.2
Gravier, G.3
Garg, A.4
Senior, A.W.5
-
17
-
-
0030355935
-
-
H. Bourlard, S. Dupont, A new ASR approach based on independent processing and recombination of partial frequency bands, in: Proceedings of the ICSLP'96, 1, Philadelphia, PA, 1996, pp. 426-429.
-
H. Bourlard, S. Dupont, A new ASR approach based on independent processing and recombination of partial frequency bands, in: Proceedings of the ICSLP'96, 1, Philadelphia, PA, 1996, pp. 426-429.
-
-
-
-
18
-
-
0034842451
-
-
H. Glotin, D. Vergyri, C. Neti, G. Potamianos, J. Luettin, Weighting schemes for audio-visual fusion in speech recognition, in: Proceedings of Int. Conf. Acoust. Speech Signal Process, 2001.
-
H. Glotin, D. Vergyri, C. Neti, G. Potamianos, J. Luettin, Weighting schemes for audio-visual fusion in speech recognition, in: Proceedings of Int. Conf. Acoust. Speech Signal Process, 2001.
-
-
-
-
19
-
-
84898971246
-
An asynchronous hidden Markov model for audio-visual speech recognition
-
Advances in Neural Information Processing Systems. Becker S., Thrun S., and Obermayer K. (Eds), MIT Press
-
Bengio S. An asynchronous hidden Markov model for audio-visual speech recognition. In: Becker S., Thrun S., and Obermayer K. (Eds). Advances in Neural Information Processing Systems. NIPS 15 (2003), MIT Press 1237-1244
-
(2003)
NIPS 15
, pp. 1237-1244
-
-
Bengio, S.1
-
20
-
-
15044354466
-
Automatic analysis of multimodal group actions in meetings
-
McCowan I., Gatica-Perez D., Bengio S., Lathoud G., Barnard M., and Zhang D. Automatic analysis of multimodal group actions in meetings. IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI) 27 3 (2005) 305-317
-
(2005)
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI)
, vol.27
, Issue.3
, pp. 305-317
-
-
McCowan, I.1
Gatica-Perez, D.2
Bengio, S.3
Lathoud, G.4
Barnard, M.5
Zhang, D.6
-
21
-
-
4944221356
-
Layered representations for learning and inferring office activity from multiple sensory channels
-
Oliver N., Garg A., and Horvitz E. Layered representations for learning and inferring office activity from multiple sensory channels. Computer Vision and Image Understanding 96 2 (2004) 163-180
-
(2004)
Computer Vision and Image Understanding
, vol.96
, Issue.2
, pp. 163-180
-
-
Oliver, N.1
Garg, A.2
Horvitz, E.3
-
22
-
-
85199275816
-
-
D. Zhang, D. Gatica-Perez, S. Bengio, I. McCowan, G. Lathoud, Modeling individual and group actions in meetings: a two-layer HMM framework, in: IEEE Workshop on Event Mining at the Conference on Computer Vision and Pattern Recognition, CVPR, vol. 7, 2004, pp. 117-124.
-
D. Zhang, D. Gatica-Perez, S. Bengio, I. McCowan, G. Lathoud, Modeling individual and group actions in meetings: a two-layer HMM framework, in: IEEE Workshop on Event Mining at the Conference on Computer Vision and Pattern Recognition, CVPR, vol. 7, 2004, pp. 117-124.
-
-
-
-
23
-
-
2142771243
-
Structure analysis of soccer video with domain knowledge and hidden Markov models
-
Xie L., Xu P., Chang S.-F., Divakaran A., and Sun H. Structure analysis of soccer video with domain knowledge and hidden Markov models. Pattern Recognition Letters 25 7 (2004) 767-775
-
(2004)
Pattern Recognition Letters
, vol.25
, Issue.7
, pp. 767-775
-
-
Xie, L.1
Xu, P.2
Chang, S.-F.3
Divakaran, A.4
Sun, H.5
-
24
-
-
34250761822
-
-
M. Delakis, G. Gravier, P. Gros, Score oriented viterbi search in sport video structuring using HMM and segment models, in: Proceedings of the International Workshop on Multimedia Signal Processing (MMSP'06), 2006.
-
M. Delakis, G. Gravier, P. Gros, Score oriented viterbi search in sport video structuring using HMM and segment models, in: Proceedings of the International Workshop on Multimedia Signal Processing (MMSP'06), 2006.
-
-
-
-
25
-
-
0034442267
-
-
B. Truong, C. Dorai, S. Venkatesh, New enhancements to cut, fade, and dissolve detection processes in video segmentation, in: Proceedings of the ACM on Multimedia, 2000, pp. 219-227.
-
B. Truong, C. Dorai, S. Venkatesh, New enhancements to cut, fade, and dissolve detection processes in video segmentation, in: Proceedings of the ACM on Multimedia, 2000, pp. 219-227.
-
-
-
-
26
-
-
85143191520
-
-
R. Dahyot, A. Kokaram, N. Rea, H. Denman, Joint audio visual retrieval for tennis broadcasts, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3, 2003, pp. 561-564.
-
R. Dahyot, A. Kokaram, N. Rea, H. Denman, Joint audio visual retrieval for tennis broadcasts, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, vol. 3, 2003, pp. 561-564.
-
-
-
-
27
-
-
84908281148
-
-
D. Zhong, S.-F. Chang, Structure analysis of sports video using domain models, in: IEEE International Conference on Multimedia and Expo, 2001.
-
D. Zhong, S.-F. Chang, Structure analysis of sports video using domain models, in: IEEE International Conference on Multimedia and Expo, 2001.
-
-
-
-
29
-
-
11244315230
-
-
M. Betser, G. Gravier, Multiple events tracking in sound tracks, in: Intl. Conf. Multimedia and Exhibition, 2004.
-
M. Betser, G. Gravier, Multiple events tracking in sound tracks, in: Intl. Conf. Multimedia and Exhibition, 2004.
-
-
-
-
30
-
-
47049128594
-
-
A. Tritschler, A segmentation-enabled speech recognition application using the BIC criterion, Ph.D. thesis, Institut EURECOM, France (1998).
-
A. Tritschler, A segmentation-enabled speech recognition application using the BIC criterion, Ph.D. thesis, Institut EURECOM, France (1998).
-
-
-
-
31
-
-
0032119668
-
The hierarchical hidden Markov model: analysis and applications
-
Fine S., Singer Y., and Tishby N. The hierarchical hidden Markov model: analysis and applications. Machine Learning 32 1 (1998) 41-62
-
(1998)
Machine Learning
, vol.32
, Issue.1
, pp. 41-62
-
-
Fine, S.1
Singer, Y.2
Tishby, N.3
-
32
-
-
84883126733
-
-
C. Snoek, M. Worring, A. Smeulders, Early versus late fusion in semantic video analysis, in: MULTIMEDIA'05: Proceedings of the 13th annual ACM international conference on Multimedia, ACM Press, New York, NY, USA, 2005, pp. 399-402.
-
C. Snoek, M. Worring, A. Smeulders, Early versus late fusion in semantic video analysis, in: MULTIMEDIA'05: Proceedings of the 13th annual ACM international conference on Multimedia, ACM Press, New York, NY, USA, 2005, pp. 399-402.
-
-
-
|