SCOPUS Information Retrieval Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 3635 LNAI, 2005, Pages 22-36

Multi channel sequence processing

(2) Bengio, Samy a; Bourlard, Hervé a,b

a IDIAP RESEARCH INSTITUTE (Switzerland)

b EPFL (Switzerland)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL COMPLEXITY; CORRELATION METHODS; DATA REDUCTION; IMAGE COMPRESSION; SPEECH RECOGNITION; STATISTICAL METHODS;

MEETING ANALYSIS; MULTI-CHANNEL TASKS; MULTIMODAL PROBLEMS;

DATA PROCESSING;

EID: 33645972231; PISSN: 03029743; EISSN: 16113349; Source Type: Book Series
DOI: 10.1007/11559887_2; Document Type: Conference Paper

Times cited: 3

References (26)

1
- 0034270644
- Audio-visual speech modeling for continuous speech recognition
- Dupont, S., Luettin, J.: Audio-visual speech modeling for continuous speech recognition. IEEE Transactions on Multimedia 2 (2000) 141-151
- (2000) IEEE Transactions on Multimedia , vol.2 , pp. 141-151
- Dupont, S.¹ Luettin, J.²

2
- 1842854568
- Multimodal speech processing using asynchronous hidden markov models
- Bengio, S.: Multimodal speech processing using asynchronous hidden markov models. Information Fusion 5 (2004) 81-89
- (2004) Information Fusion , vol.5 , pp. 81-89
- Bengio, S.¹

3
- 0034825241
- Multi-stream adaptive evidence combination for noise robust ASR
- Morris, A., Hagen, A., Glotin, H., Bourlard, H.: Multi-stream adaptive evidence combination for noise robust ASR. Speech Communication (2001)
- (2001) Speech Communication
- Morris, A.¹ Hagen, A.² Glotin, H.³ Bourlard, H.⁴

4
- 15044354466
- Automatic analysis of multimodal group actions in meetings
- McCowan, I., Gatica-Perez, D., Bengio, S., Lathoud, G., Barnard, M., Zhang, D.: Automatic analysis of multimodal group actions in meetings. IEEE Transactions on Pattern Analysis and Machine Intelligence 27 (2005) 305-317
- (2005) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.27 , pp. 305-317
- McCowan, I.¹ Gatica-Perez, D.² Bengio, S.³ Lathoud, G.⁴ Barnard, M.⁵ Zhang, D.⁶

5
- 15044348559
- A mixed-state i-particle filter for multi-camera speaker tracking
- Gatica-Perez, D., Lathoud, G., McCowan, I., Odobez, J.M.: A mixed-state i-particle filter for multi-camera speaker tracking. In: Proc. of WOMTEC. (2003)
- (2003) Proc. of WOMTEC
- Gatica-Perez, D.¹ Lathoud, G.² McCowan, I.³ Odobez, J.M.⁴

6
- 0034276470
- Indexing and retrieval of broadcast news
- Renals, S., Abberley, D., Kirby, D., Robinson, T.: Indexing and retrieval of broadcast news. Speech Communication 32 (2000) 5-20
- (2000) Speech Communication , vol.32 , pp. 5-20
- Renals, S.¹ Abberley, D.² Kirby, D.³ Robinson, T.⁴

7
- 0037299489
- A probabilistic multimedia retrieval model and its evaluation
- Westerveld, T., de Vries, A.P., van Ballegooij, A., de Jong, F., Hiemstra, D.: A probabilistic multimedia retrieval model and its evaluation. EURASIP Journal on Applied Signal Processing 2 (2003)
- (2003) EURASIP Journal on Applied Signal Processing , vol.2
- Westerveld, T.¹ De Vries, A.P.² Van Ballegooij, A.³ De Jong, F.⁴ Hiemstra, D.⁵

8
- 30244442625
- Smart clothing: The wearable computer and wearcam
- Mann, S.: Smart clothing: The wearable computer and wearcam. Personal Technologies (1997) Volume 1, Issue 1.
- (1997) Personal Technologies , vol.1 , Issue.1
- Mann, S.¹

9
- 0004244302
- Fundamentals of Speech Recognition
- Prentice-Hall
- Rabiner, L.R., Juang, B.H.: Fundamentals of Speech Recognition. Prentice-Hall (1993)
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.R.¹ Juang, B.H.²

10
- 0031619139
- A Hidden Markov Model framework for video segmentation using audio and image features
- Boreczky, J.S., Wilcox, L.D.: A Hidden Markov Model framework for video segmentation using audio and image features. In: Proc. of ICASSP. Volume 6. (1998)
- (1998) Proc. of ICASSP , vol.6
- Boreczky, J.S.¹ Wilcox, L.D.²

11
- 0036288591
- Structure analysis of soccer video with Hidden Markov Models
- Xie, L., Chang, S.F., Divakaran, A., Sun, H.: Structure analysis of soccer video with Hidden Markov Models. In: ICASSP. (2002)
- (2002) ICASSP
- Xie, L.¹ Chang, S.F.² Divakaran, A.³ Sun, H.⁴

12
- 0032629746
- Content-based video indexing of TV broadcast news using Hidden Markov Models
- Eickeler, S., Müller, S.: Content-based video indexing of TV broadcast news using Hidden Markov Models. In: Proc. of ICASSP. (1999)
- (1999) Proc. of ICASSP
- Eickeler, S.¹ Müller, S.²

13
- 0002629270
- Maximum-likelihood from incomplete data via the EM algorithm
- Dempster, A.P., Laird, N.M., Rubin, D.B.: Maximum-likelihood from incomplete data via the EM algorithm. Journal of Royal Statistical Society B 39 (1977) 1-38
- (1977) Journal of Royal Statistical Society B , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

14
- 0014602879
- A fast sequential decoding algorithm using a stack
- Jelinek, F.: A fast sequential decoding algorithm using a stack. IBM Journal of Research and Development 13 (1969) 675-685
- (1969) IBM Journal of Research and Development , vol.13 , pp. 675-685
- Jelinek, F.¹

15
- 84935113569
- Error bounds for convolutional codes and an asymptotically optimum decoding algorithm
- Viterbi, A.: Error bounds for convolutional codes and an asymptotically optimum decoding algorithm. IEEE Transactions on Information Theory (1967) 260-269
- (1967) IEEE Transactions on Information Theory , pp. 260-269
- Viterbi, A.¹

16
- 14944356584
- Layered representations for learning and inferring office activity from multiple sensory channels
- Oliver, N., Horvitz, E., Garg, A.: Layered representations for learning and inferring office activity from multiple sensory channels. In: Proc. of the Int. Conf. on Multimodal Interfaces. (2002)
- (2002) Proc. of the Int. Conf. on Multimodal Interfaces
- Oliver, N.¹ Horvitz, E.² Garg, A.³

17
- 0030643240
- Subband-based speech recognition
- Bourlard, H., Dupont, S.: Subband-based speech recognition. In: Proc. IEEE ICASSP. (1997)
- (1997) Proc. IEEE ICASSP
- Bourlard, H.¹ Dupont, S.²

18
- 14544298180
- Audio-visual automatic speech recognition: An overview
- Bailly, G., Vatikiotis-Bateson, E., Perrier, P., eds.: MIT Press
- Potamianos, G., Neti, C., Luettin, J., Matthews, I.: Audio-visual automatic speech recognition: An overview. In Bailly, G., Vatikiotis-Bateson, E., Perrier, P., eds.: Issues in Visual and Audio-Visual Speech Processing. MIT Press (2004)
- (2004) Issues in Visual and Audio-visual Speech Processing
- Potamianos, G.¹ Neti, C.² Luettin, J.³ Matthews, I.⁴

19
- 0003531950
- Coupled hidden markov models for modeling interacting processes
- MIT Media Lab Vision and Modeling
- Brand, M.: Coupled hidden markov models for modeling interacting processes. Technical Report 405, MIT Media Lab Vision and Modeling (1996)
- (1996) Technical Report , vol.405
- Brand, M.¹

20
- 84898971246
- An asynchronous hidden markov model for audio-visual speech recognition
- Becker, S., Thrun, S., Obermayer, K., eds.
- Bengio, S.: An asynchronous hidden markov model for audio-visual speech recognition. In Becker, S., Thrun, S., Obermayer, K., eds.: Advances in Neural Information Processing Systems 15. (2003)
- (2003) Advances in Neural Information Processing Systems , vol.15
- Bengio, S.¹

21
- 84932605936
- Modeling individual and group actions in meetings: a two-layer hmm framework
- Zhang, D., Gatica-Perez, D., Bengio, S., McCowan, I., Lathoud, G.: Modeling individual and group actions in meetings: a two-layer hmm framework. In: IEEE Workshop on Event Mining at CVPR. (2004)
- (2004) IEEE Workshop on Event Mining at CVPR
- Zhang, D.¹ Gatica-Perez, D.² Bengio, S.³ McCowan, I.⁴ Lathoud, G.⁵

22
- 33645993367
- Towards using hierarchical posteriors for flexible automatic speech recognition systems
- Bourlard, H., Bengio, S., Doss, M.M., Zhu, Q., Mesot, B., Morgan, N.: Towards using hierarchical posteriors for flexible automatic speech recognition systems. In: Proc. of DARPA EARS Rich Transcription Workshop. (2004)
- (2004) Proc. of DARPA EARS Rich Transcription Workshop
- Bourlard, H.¹ Bengio, S.² Doss, M.M.³ Zhu, Q.⁴ Mesot, B.⁵ Morgan, N.⁶

23
- 0003487601
- Neural Networks for Pattern Recognition
- Oxford University Press, London, UK
- Bishop, C.: Neural Networks for Pattern Recognition. Oxford University Press, London, UK (1995)
- (1995) Neural Networks for Pattern Recognition
- Bishop, C.¹

24
- 0028392483
- Learning long-term dependencies with gradient descent is difficult
- Bengio, Y., Simard, P., Frasconi, P.: Learning long-term dependencies with gradient descent is difficult. IEEE Transactions on Neural Networks 5 (1994) 157-166
- (1994) IEEE Transactions on Neural Networks , vol.5 , pp. 157-166
- Bengio, Y.¹ Simard, P.² Frasconi, P.³

25
- 33646001188
- The M2VTS multimodal face database (release 1.00)
- Pigeon, S., Vandendorpe, L.: The M2VTS multimodal face database (release 1.00). In: Proc. of the Conf. on AVBPA. (1997)
- (1997) Proc. of the Conf. on AVBPA
- Pigeon, S.¹ Vandendorpe, L.²

26
- 0004319968
- The noisex-92 study on the effect of additive noise on automatic speech recognition
- DRA Speech Research Unit
- Varga, A., Steeneken, H., Tomlinson, M., Jones, D.: The noisex-92 study on the effect of additive noise on automatic speech recognition. Technical report, DRA Speech Research Unit (1992)
- (1992) Technical Report
- Varga, A.¹ Steeneken, H.² Tomlinson, M.³ Jones, D.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.