-
1
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
Dupont, S., Luettin, J.: Audio-visual speech modeling for continuous speech recognition. IEEE Transactions on Multimedia 2 (2000) 141-151
-
(2000)
IEEE Transactions on Multimedia
, vol.2
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
2
-
-
1842854568
-
Multimodal speech processing using asynchronous hidden Markov models
-
Bengio, S.: Multimodal speech processing using asynchronous hidden Markov models. Information Fusion 5 (2004) 81-89
-
(2004)
Information Fusion
, vol.5
, pp. 81-89
-
-
Bengio, S.1
-
3
-
-
0034825241
-
Multi-stream adaptive evidence combination for noise robust ASR
-
Morris, A., Hagen, A., Glotin, H., Bourlard, H.: Multi-stream adaptive evidence combination for noise robust ASR. Speech Communication (2001)
-
(2001)
Speech Communication
-
-
Morris, A.1
Hagen, A.2
Glotin, H.3
Bourlard, H.4
-
4
-
-
15044354466
-
Automatic analysis of multimodal group actions in meetings
-
McCowan, I., Gatica-Perez, D., Bengio, S., Lathoud, G., Barnard, M., Zhang, D.: Automatic analysis of multimodal group actions in meetings. IEEE Transactions on Pattern Analysis and Machine Intelligence 27 (2005) 305-317
-
(2005)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.27
, pp. 305-317
-
-
McCowan, I.1
Gatica-Perez, D.2
Bengio, S.3
Lathoud, G.4
Barnard, M.5
Zhang, D.6
-
5
-
-
15044348559
-
A mixed-state i-particle filter for multi-camera speaker tracking
-
Gatica-Perez, D., Lathoud, G., McCowan, I., Odobez, J.M.: A mixed-state i-particle filter for multi-camera speaker tracking. In: Proc. of WOMTEC. (2003)
-
(2003)
Proc. of WOMTEC
-
-
Gatica-Perez, D.1
Lathoud, G.2
McCowan, I.3
Odobez, J.M.4
-
6
-
-
0034276470
-
Indexing and retrieval of broadcast news
-
Renals, S., Abberley, D., Kirby, D., Robinson, T.: Indexing and retrieval of broadcast news. Speech Communication 32 (2000) 5-20
-
(2000)
Speech Communication
, vol.32
, pp. 5-20
-
-
Renals, S.1
Abberley, D.2
Kirby, D.3
Robinson, T.4
-
7
-
-
0037299489
-
A probabilistic multimedia retrieval model and its evaluation
-
Westerveld, T., de Vries, A.P., van Ballegooij, A., de Jong, F., Hiemstra, D.: A probabilistic multimedia retrieval model and its evaluation. EURASIP Journal on Applied Signal Processing 2 (2003)
-
(2003)
EURASIP Journal on Applied Signal Processing
, vol.2
-
-
Westerveld, T.1
De Vries, A.P.2
Van Ballegooij, A.3
De Jong, F.4
Hiemstra, D.5
-
8
-
-
30244442625
-
Smart clothing: The wearable computer and wearcam
-
Mann, S.: Smart clothing: The wearable computer and wearcam. Personal Technologies (1997) Volume 1, Issue 1.
-
(1997)
Personal Technologies
, vol.1
, Issue.1
-
-
Mann, S.1
-
10
-
-
0031619139
-
A Hidden Markov Model framework for video segmentation using audio and image features
-
Boreczky, J.S., Wilcox, L.D.: A Hidden Markov Model framework for video segmentation using audio and image features. In: Proc. of ICASSP. Volume 6. (1998)
-
(1998)
Proc. of ICASSP
, vol.6
-
-
Boreczky, J.S.1
Wilcox, L.D.2
-
11
-
-
0036288591
-
Structure analysis of soccer video with Hidden Markov Models
-
Xie, L., Chang, S.F., Divakaran, A., Sun, H.: Structure analysis of soccer video with Hidden Markov Models. In: ICASSP. (2002)
-
(2002)
ICASSP
-
-
Xie, L.1
Chang, S.F.2
Divakaran, A.3
Sun, H.4
-
12
-
-
0032629746
-
Content-based video indexing of TV broadcast news using Hidden Markov Models
-
Eickeler, S., Müller, S.: Content-based video indexing of TV broadcast news using Hidden Markov Models. In: Proc. of ICASSP. (1999)
-
(1999)
Proc. of ICASSP
-
-
Eickeler, S.1
Müller, S.2
-
13
-
-
0002629270
-
Maximum-likelihood from incomplete data via the EM algorithm
-
Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum-likelihood from incomplete data via the EM algorithm. Journal of Royal Statistical Society B 39 (1977) 1-38
-
(1977)
Journal of Royal Statistical Society B
, vol.39
, pp. 1-38
-
-
Dempster, A.P.1
Laird, N.M.2
Rubin, D.B.3
-
14
-
-
0014602879
-
A fast sequential decoding algorithm using a stack
-
Jelinek, F.: A fast sequential decoding algorithm using a stack. IBM Journal of Research and Development 13 (1969) 675-685
-
(1969)
IBM Journal of Research and Development
, vol.13
, pp. 675-685
-
-
Jelinek, F.1
-
15
-
-
84935113569
-
Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
-
Viterbi, A.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory (1967) 260-269
-
(1967)
IEEE Transactions on Information Theory
, pp. 260-269
-
-
Viterbi, A.1
-
18
-
-
14544298180
-
Audio-visual automatic speech recognition: An overview
-
Bailly, G., Vatikiotis-Bateson, E., Perrier, P., eds.: MIT Press
-
Potamianos, G., Neti, C., Luettin, J., Matthews, I.: Audio-visual automatic speech recognition: An overview. In Bailly, G., Vatikiotis-Bateson, E., Perrier, P., eds.: Issues in Visual and Audio-Visual Speech Processing. MIT Press (2004)
-
(2004)
Issues in Visual and Audio-visual Speech Processing
-
-
Potamianos, G.1
Neti, C.2
Luettin, J.3
Matthews, I.4
-
19
-
-
0003531950
-
Coupled hidden Markov models for modeling interacting processes
-
MIT Media Lab Vision and Modeling
-
Brand, M.: Coupled hidden Markov models for modeling interacting processes. Technical Report 405, MIT Media Lab Vision and Modeling (1996)
-
(1996)
Technical Report
, vol.405
-
-
Brand, M.1
-
20
-
-
84898971246
-
An asynchronous hidden Markov model for audio-visual speech recognition
-
Becker, S., Thrun, S., Obermayer, K., eds.
-
Bengio, S.: An asynchronous hidden Markov model for audio-visual speech recognition. In Becker, S., Thrun, S., Obermayer, K., eds.: Advances in Neural Information Processing Systems 15. (2003)
-
(2003)
Advances in Neural Information Processing Systems
, vol.15
-
-
Bengio, S.1
-
21
-
-
84932605936
-
Modeling individual and group actions in meetings: A two-layer HMM framework
-
Zhang, D., Gatica-Perez, D., Bengio, S., McCowan, I., Lathoud, G.: Modeling individual and group actions in meetings: A two-layer HMM framework. In: IEEE Workshop on Event Mining at CVPR. (2004)
-
(2004)
IEEE Workshop on Event Mining at CVPR
-
-
Zhang, D.1
Gatica-Perez, D.2
Bengio, S.3
McCowan, I.4
Lathoud, G.5
-
22
-
-
33645993367
-
Towards using hierarchical posteriors for flexible automatic speech recognition systems
-
Bourlard, H., Bengio, S., Doss, M.M., Zhu, Q., Mesot, B., Morgan, N.: Towards using hierarchical posteriors for flexible automatic speech recognition systems. In: Proc. of DARPA EARS Rich Transcription Workshop. (2004)
-
(2004)
Proc. of DARPA EARS Rich Transcription Workshop
-
-
Bourlard, H.1
Bengio, S.2
Doss, M.M.3
Zhu, Q.4
Mesot, B.5
Morgan, N.6
-
24
-
-
0028392483
-
Learning long-term dependencies with gradient descent is difficult
-
Bengio, Y., Simard, P., Frasconi, P.: Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks 5 (1994) 157-166
-
(1994)
IEEE Transactions on Neural Networks
, vol.5
, pp. 157-166
-
-
Bengio, Y.1
Simard, P.2
Frasconi, P.3
-
26
-
-
0004319968
-
The NOISEX-92 study on the effect of additive noise on automatic speech recognition
-
DRA Speech Research Unit
-
Varga, A., Steeneken, H., Tomlinson, M., Jones, D.: The NOISEX-92 study on the effect of additive noise on automatic speech recognition. Technical report, DRA Speech Research Unit (1992)
-
(1992)
Technical Report
-
-
Varga, A.1
Steeneken, H.2
Tomlinson, M.3
Jones, D.4