SCOPUS 정보 검색 플랫폼

Eurasip Journal on Applied Signal Processing

Volumn 2002, Issue 11, 2002, Pages 1274-1288

Dynamic Bayesian networks for audio-visual speech recognition

(5) Nefian, Ara V a Liang, Luhong b Pi, Xiaobo b Liu, Xiaoxing b Murphy, Kevin c

a INTEL CORPORATION (United States)

b INTEL CORPORATION (China)

c UNIVERSITY OF CALIFORNIA (United States)

Author keywords

Audio visual speech recognition; Coupled hidden Markov models; Dynamic Bayesian networks; Factorial hidden Markov models; Hidden Markov models

Indexed keywords

ACOUSTIC NOISE; ACOUSTIC SIGNAL PROCESSING; ALGORITHMS; CORRELATION THEORY; MARKOV PROCESSES; SPEECH SYNTHESIS; VIDEO SIGNAL PROCESSING;

AUDIO-VISUAL SPEECH RECOGNITION (AVSR); BAYESIAN NETWORKS;

SPEECH RECOGNITION;

EID: 0036874999 PISSN: 11108657 EISSN: None Source Type: Journal
DOI: 10.1155/S1110865702206083 Document Type: Article

Times cited : (262)

References (30)

1
- 0017199877
- Hearing lips and seeing voices
- H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, vol. 264, no. 5588, pp. 746-748, 1976.
- (1976) Nature , vol.264 , Issue.5588 , pp. 746-748
- McGurk, H.¹ Macdonald, J.²

2
- 0012667608
- Tech. Rep., Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, Md, USA
- C. Neti, G. Potamianos, J. Luettin, et al., "Audio visual speech recognition, Final workshop 2000 report," Tech. Rep., Center for Language and Speech Processing, The Johns Hopkins University, Baltimore, Md, USA, 2000.
- (2000) Audio Visual Speech Recognition, Final Workshop 2000 Report
- Neti, C.¹ Potamianos, G.² Luettin, J.³

3
- 0030685285
- Coupled hidden Markov models for complex action recognition
- San Juan, Puerto Rico, June
- M. Brand, N. Oliver, and A. Pentland, "Coupled hidden Markov models for complex action recognition," in Proc. IEEE International Conference on Computer Vision and Pattern Recognition, pp. 994-999, San Juan, Puerto Rico, June 1997.
- (1997) Proc. IEEE International Conference on Computer Vision and Pattern Recognition , pp. 994-999
- Brand, M.¹ Oliver, N.² Pentland, A.³

4
- 85156254941
- Factorial hidden Markov models
- D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, Eds., MIT Press, Cambridge, Mass, USA
- Z. Ghahramani and M. I. Jordan, "Factorial hidden Markov models," in Proc. Conf. Advances in Neural Information Processing Systems, D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, Eds., vol. 8, pp. 472-478, MIT Press, Cambridge, Mass, USA, 1995.
- (1995) Proc. Conf. Advances in Neural Information Processing Systems , vol.8 , pp. 472-478
- Ghahramani, Z.¹ Jordan, M.I.²

5
- 0012730694
- A model for reasoning about persistence and causation
- T. Dean and K. Kanazawa, "A model for reasoning about persistence and causation," Artificial Intelligence, vol. 93, no. 1-2, pp. 1-27, 1989.
- (1989) Artificial Intelligence , vol.93 , Issue.1-2 , pp. 1-27
- Dean, T.¹ Kanazawa, K.²

6
- 0004244302
- Prentice-Hall, Englewood Cliffs, NJ, USA
- L. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition, Prentice-Hall, Englewood Cliffs, NJ, USA, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.-H.²

7
- 0030362752
- Speechreading using shape and intensity information
- Philadelphia, Pa, USA
- J. Luettin, N. Thacker, and S. Beet, "Speechreading using shape and intensity information," in Proc. the 4th IEEE International Conf. on Spoken Language Processing, vol. 1, pp. 58-61, Philadelphia, Pa, USA, 1996.
- (1996) Proc. the 4th IEEE International Conf. on Spoken Language Processing , vol.1 , pp. 58-61
- Luettin, J.¹ Thacker, N.² Beet, S.³

8
- 85032752352
- Audiovisual speech processing
- January
- T. Chen, "Audiovisual speech processing," IEEE Signal Processing Magazine, vol. 18, pp. 9-21, January 2001.
- (2001) IEEE Signal Processing Magazine , vol.18 , pp. 9-21
- Chen, T.¹

9
- 0029234004
- Nonlinear manifold learning for visual speech recognition
- Boston, Mass, USA
- [9] C. Bregler and S. Omohundro, "Nonlinear manifold learning for visual speech recognition," in Proc. IEEE International Conf. on Computer Vision, pp. 494-499, Boston, Mass, USA, 1995.
- (1995) Proc. IEEE International Conf. on Computer Vision , pp. 494-499
- Bregler, C.¹ Omohundro, S.²

10
- 0030643952
- Fusion of visual and acoustic signals for command-word recognition
- Munich, Germany, April
- R. Kober, U. Harz, and J. Schiffers, "Fusion of visual and acoustic signals for command-word recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 1495-1497, Munich, Germany, April 1997.
- (1997) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 1495-1497
- Kober, R.¹ Harz, U.² Schiffers, J.³

11
- 0034270644
- Audio-visual speech modeling for continuous speech recognition
- S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, no. 3, pp. 141-151, 2000.
- (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
- Dupont, S.¹ Luettin, J.²

12
- 82055174896
- Audio-visual speech recognition compared across two architectures
- Madrid, Spain
- A. Adjoudani and C. Benoît, "Audio-visual speech recognition compared across two architectures," in European Conference on Speech Communication and Technology, pp. 1563-1566, Madrid, Spain, 1995.
- (1995) European Conference on Speech Communication and Technology , pp. 1563-1566
- Adjoudani, A.¹ Benoît, C.²

13
- 0034853041
- Hierarchical discriminant features for audio-visual LVCSR
- Salt Lake City, Utah, USA, May
- G. Potamianos, J. Luettin, and C. Neti, "Hierarchical discriminant features for audio-visual LVCSR," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 165-168, Salt Lake City, Utah, USA, May 2001.
- (2001) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 165-168
- Potamianos, G.¹ Luettin, J.² Neti, C.³

14
- 0034842342
- Asynchronous stream modeling for large vocabulary audio-visual speech recognition
- Salt Lake City, Utah, USA
- J. Luettin, G. Potamianos, and C. Neti, "Asynchronous stream modeling for large vocabulary audio-visual speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 169-172, Salt Lake City, Utah, USA, 2001.
- (2001) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 169-172
- Luettin, J.¹ Potamianos, G.² Neti, C.³

15
- 0029747053
- Integrating audio and visual information to provide highly robust speech recognition
- Atlanta, Ga, USA, May
- M. J. Tomlinson, M. J. Russell, and N. M. Brooke, "Integrating audio and visual information to provide highly robust speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 821-824, Atlanta, Ga, USA, May 1996.
- (1996) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 821-824
- Tomlinson, M.J.¹ Russell, M.J.² Brooke, N.M.³

16
- 0034502214
- Speaker independent audio-visual speech recognition
- New York, NY, USA
- Y. Zhang, S. Levinson, and T. Huang, "Speaker independent audio-visual speech recognition," in IEEE International Conference on Multimedia and Expo, vol. 2, pp. 1073-1076, New York, NY, USA, 2000.
- (2000) IEEE International Conference on Multimedia and Expo , vol.2 , pp. 1073-1076
- Zhang, Y.¹ Levinson, S.² Huang, T.³

17
- 0012668146
- Asynchrony modeling for audio-visual speech recognition
- San Diego, Calif, USA, March
- G. Gravier, G. Potamianos, and C. Neti, "Asynchrony modeling for audio-visual speech recognition," in Proc. Human Language Technology Conference, San Diego, Calif, USA, March 2002.
- (2002) Proc. Human Language Technology Conference
- Gravier, G.¹ Potamianos, G.² Neti, C.³

18
- 0012705518
- Advanced Multimedia Processing Lab, Carnegie Mellon University, Pittsburgh, Pa, USA
- Advanced Multimedia Processing Lab, http://amp.ece.cmu.edu/projects/AudioVisualSpeechProcessing/, Carnegie Mellon University, Pittsburgh, Pa, USA.

19
- 0003922190
- John Wiley & Sons, New York, NY, USA, 2nd edition
- R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification, John Wiley & Sons, New York, NY, USA, 2nd edition, 2000.
- (2000) Pattern Classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

20
- 0003409574
- Prentice-Hall, Englewood Cliffs, NJ, USA
- K. R. Castleman, Digital Image Processing, Prentice-Hall, Englewood Cliffs, NJ, USA, 1996.
- (1996) Digital Image Processing
- Castleman, K.R.¹

21
- 0002874631
- A computational scheme for reasoning in dynamic probabilistic networks
- Stanford, Calif, USA
- U. Kjaerulff, "A computational scheme for reasoning in dynamic probabilistic networks," in Proc. the 8th International Conference on Uncertainty in Artificial Intelligence, pp. 121-129, Stanford, Calif, USA, 1992.
- (1992) Proc. the 8th International Conference on Uncertainty in Artificial Intelligence , pp. 121-129
- Kjaerulff, U.¹

22
- 0002049440
- Learning dynamic Bayesian networks
- Adaptive Processing of Sequences and Data Structures, C. Giles and M. Gori, Eds., Springer-Verlag, Berlin, Germany
- Z. Ghahramani, "Learning dynamic Bayesian networks," in Adaptive Processing of Sequences and Data Structures, C. Giles and M. Gori, Eds., Lecture Notes in Artificial Intelligence, pp. 168-197, Springer-Verlag, Berlin, Germany, 1998.
- (1998) Lecture Notes in Artificial Intelligence , pp. 168-197
- Ghahramani, Z.¹

23
- 0003391330
- Morgan Kaufmann Publishers, San Mateo, Calif, USA
- J. Pearl, Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference, Morgan Kaufmann Publishers, San Mateo, Calif, USA, 1988.
- (1988) Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference
- Pearl, J.¹

24
- 0003448310
- Springer-Verlag, New York, USA
- F. V. Jensen, Bayesian Networks and Decision Graphs, Springer-Verlag, New York, USA, 2001.
- (2001) Bayesian Networks and Decision Graphs
- Jensen, F.V.¹

25
- 0004158157
- Ph.D. thesis, University of Illinois, Urbana-Champaign, III, USA
- V. Pavlovic, Dynamic Bayesian networks for information fusion with applications to human-computer interfaces, Ph.D. thesis, University of Illinois, Urbana-Champaign, III, USA, 1999.
- (1999) Dynamic Bayesian Networks for Information Fusion with Applications to Human-computer Interfaces
- Pavlovic, V.¹

26
- 85009135946
- Bimodal speech recognition using coupled hidden Markov models
- Beijing, China
- S. Chu and T. Huang, "Bimodal speech recognition using coupled hidden Markov models," in Proc. IEEE International Conf. on Spoken Language Processing, vol. 2, pp. 747-750, Beijing, China, 2000.
- (2000) Proc. IEEE International Conf. on Spoken Language Processing , vol.2 , pp. 747-750
- Chu, S.¹ Huang, T.²

27
- 0036295989
- Audio-visual speech modeling using coupled hidden Markov models
- Orlando, Fla, USA, May
- S. Chu and T. Huang, "Audio-visual speech modeling using coupled hidden Markov models," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 2009-2012, Orlando, Fla, USA, May 2002.
- (2002) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 2009-2012
- Chu, S.¹ Huang, T.²

28
- 0000417467
- Visionary speech: Looking ahead to practical speechreading systems
- Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds., Springer-Verlag, Berlin, Germany
- M. E. Hennecke, D. G. Stork, and K. V. Prasad, "Visionary speech: Looking ahead to practical speechreading systems," in Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds., vol. 150 of NATO ASI Series F: Computer and Systems Sciences, pp. 331-349, Springer-Verlag, Berlin, Germany, 1996.
- (1996) NATO ASI Series F: Computer and Systems Sciences, , vol.150 , pp. 331-349
- Hennecke, M.E.¹ Stork, D.G.² Prasad, K.V.³

29
- 0036297183
- A coupled HMM for audio-visual speech recognition
- Orlando, Fla, USA, May
- A. Nefian, L. Liang, X. Pi, X. Liu, C. Mao, and K. Murphy, "A coupled HMM for audio-visual speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 2013-2016, Orlando, Fla, USA, May 2002.
- (2002) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 2013-2016
- Nefian, A.¹ Liang, L.² Pi, X.³ Liu, X.⁴ Mao, C.⁵ Murphy, K.⁶

30
- 79952493967
- Speaker independent audio-visual continuous speech recognition
- Lausanne, Switzerland, August
- L. Liang, X. Liu, Y. Zhao, X. Pi, and A. Nefian, "Speaker independent audio-visual continuous speech recognition," in IEEE International Conference on Multimedia and Expo, Lausanne, Switzerland, August 2002.
- (2002) IEEE International Conference on Multimedia and Expo
- Liang, L.¹ Liu, X.² Zhao, Y.³ Pi, X.⁴ Nefian, A.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.