SCOPUS Information Retrieval Platform

2006 IEEE ACL Spoken Language Technology Workshop, SLT 2006, Proceedings

Volume , Issue , 2006, Pages 154-157

An asynchronous DBN for audio-visual speech recognition

(2) Saenko, Kate (a); Livescu, Karen (a)

(a) MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Speech recognition

Indexed keywords

AUDIO ACOUSTICS; BAYESIAN NETWORKS; ERROR ANALYSIS; INFERENCE ENGINES; LINGUISTICS; RIVERS; SPEECH; SPEECH ANALYSIS; UNDERWATER ACOUSTICS; ACOUSTIC NOISE;

ASYNCHRONOUS MODELS; ASYNCHRONY; AUDIO VISUAL SPEECH RECOGNITION (AVSR); AUDIO-VISUAL; AUDIO-VISUAL CORPORA; DE SYNCHRONIZATION; DYNAMIC BAYESIAN NETWORK (DBN); ERROR RATE (ER); LIP READING; NUMBER OF STATES; PRONUNCIATION MODELLING; SPOKEN LANGUAGES; TWO-STREAM; VISUAL SPEECH RECOGNITION;

SPEECH RECOGNITION;

ASYNCHRONY; AUDIOVISUAL SPEECH RECOGNITION; DESYNCHRONIZATION; DYNAMIC BAYESIAN NETWORKS; LIPREADING; NETWORK-BASED MODELING; NUMBER OF STATE; PRONUNCIATION MODELLING; TWO-STREAM; VISUAL SPEECH RECOGNITION;

EID: 48749083240 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/SLT.2006.326841 Document Type: Conference Paper

Times cited : (15)

References (16)

1
- 70350617187
- J. Bilmes, "The Graphical Models Toolkit", http://ssli.ee.washington.edu/bilmes/gmtk/.
- The Graphical Models Toolkit
- Bilmes, J.¹

2
- 0036293559
- The Graphical Models Toolkit: An open source software system for speech and time-series processing
- J. Bilmes and G. Zweig, "The Graphical Models Toolkit: An open source software system for speech and time-series processing," in Proc. ICASSP, 2002.
- (2002) Proc. ICASSP
- Bilmes, J.¹ Zweig, G.²

3
- 0030685285
- Coupled hidden Markov models for complex action recognition
- M. Brand, N. Oliver, and A. Pentland, "Coupled hidden Markov models for complex action recognition," in Proc. IEEE International Conference on Computer Vision and Pattern Recognition, pp. 994-999, San Juan, Puerto Rico, June 1997.
- (1997) Proc. IEEE International Conference on Computer Vision and Pattern Recognition , pp. 994-999
- Brand, M.¹ Oliver, N.² Pentland, A.³

4
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," Journal of the Royal Statistical Society, 39:1-38, 1977.
- (1977) Journal of the Royal Statistical Society , vol.39 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

5
- 85156254941
- Factorial hidden Markov models
- Z. Ghahramani and M. Jordan, "Factorial hidden Markov models," in Proc. Conference Advances in Neural Information Processing Systems, D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, eds., vol. 8, pp. 472-478, MIT Press, Cambridge, MA, USA, 1995.
- (1995) Proc. Conference Advances in Neural Information Processing Systems , vol.8 , pp. 472-478
- Ghahramani, Z.¹ Jordan, M.²

6
- 4544343002
- DBN based multi-stream models for audio-visual speech recognition
- J. Gowdy, A. Subramanya, C. Bartels, and J. Bilmes, "DBN based multi-stream models for audio-visual speech recognition," in Proc. ICASSP, 2004.
- (2004) Proc. ICASSP
- Gowdy, J.¹ Subramanya, A.² Bartels, C.³ Bilmes, J.⁴

7
- 0012668146
- Asynchrony modeling for audio-visual speech recognition
- G. Gravier, G. Potamianos, and C. Neti, "Asynchrony modeling for audio-visual speech recognition," in Proc. Human Language Technology Conference, San Diego, 2002.
- (2002) Proc. Human Language Technology Conference
- Gravier, G.¹ Potamianos, G.² Neti, C.³

8
- 14944341906
- Feature-based pronunciation modeling for speech recognition
- K. Livescu and J. Glass, "Feature-based pronunciation modeling for speech recognition," in Proc. HLT/NAACL, 2004.
- (2004) Proc. HLT/NAACL
- Livescu, K.¹ Glass, J.²

9
- 78651465434
- Feature-based pronunciation modeling with trainable asynchrony probabilities
- K. Livescu and J. Glass, "Feature-based pronunciation modeling with trainable asynchrony probabilities," in Proc. ICSLP, 2004.
- (2004) Proc. ICSLP
- Livescu, K.¹ Glass, J.²

10
- 0013288412
- Ph.D. thesis, U.C. Berkeley CS Division
- K. Murphy, Dynamic Bayesian Networks: Representation, Inference and Learning. Ph.D. thesis, U.C. Berkeley CS Division, 2002.
- (2002) Dynamic Bayesian Networks: Representation, Inference and Learning
- Murphy, K.¹

11
- 0035790960
- Large-vocabulary audio-visual speech recognition: A summary of the Johns Hopkins Summer 2000 Workshop
- C. Neti, G. Potamianos, J. Luettin, I. Matthews, H. Glotin, and D. Vergyri, "Large-vocabulary audio-visual speech recognition: A summary of the Johns Hopkins Summer 2000 Workshop," in Proc. Works. Signal Processing, pp. 619-624, Cannes, France, 2001.
- (2001) Proc. Works. Signal Processing , pp. 619-624
- Neti, C.¹ Potamianos, G.² Luettin, J.³ Matthews, I.⁴ Glotin, H.⁵ Vergyri, D.⁶

12
- 84957551318
- Loosely-Coupled HMMs for ASR
- H. Nock and S. Young, "Loosely-Coupled HMMs for ASR," in Proc. ICSLP, 2000.
- (2000) Proc. ICSLP
- Nock, H.¹ Young, S.²

13
- 0036299249
- CUAVE: A new audio-visual database for multimodal human-computer interface research
- E.K. Patterson, S. Gurbuz, Z. Tufekci, and J.N. Gowdy, "CUAVE: A new audio-visual database for multimodal human-computer interface research," in Proc. ICASSP, 2002.
- (2002) Proc. ICASSP
- Patterson, E.K.¹ Gurbuz, S.² Tufekci, Z.³ Gowdy, J.N.⁴

14
- 4544290191
- Recent Advances in the Automatic Recognition of Audio-Visual Speech
- G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. Senior, "Recent Advances in the Automatic Recognition of Audio-Visual Speech", in Proc. IEEE, 2003.
- (2003) Proc. IEEE
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴ Senior, A.⁵

15
- 48749102554
- Visual Speech Recognition with Loosely Synchronized Feature Streams
- K. Saenko, M. Siracusa, K. Wilson, K. Livescu, J. Glass, and T. Darrell, "Visual Speech Recognition with Loosely Synchronized Feature Streams," in Proc. International Conference on Computer Vision, 2006.
- (2006) Proc. International Conference on Computer Vision
- Saenko, K.¹ Siracusa, M.² Wilson, K.³ Livescu, K.⁴ Glass, J.⁵ Darrell, T.⁶

16
- 0141813588
- DBN based multi-stream models for speech
- Y. Zhang, Q. Diao, S. Huang, W. Hu, C. Bartels, and J. Bilmes, "DBN based multi-stream models for speech", in Proc. ICASSP, 2003.
- (2003) Proc. ICASSP
- Zhang, Y.¹ Diao, Q.² Huang, S.³ Hu, W.⁴ Bartels, C.⁵ Bilmes, J.⁶

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.