메뉴 건너뛰기




Volumn 2002, Issue 11, 2002, Pages 1274-1288

Dynamic Bayesian networks for audio-visual speech recognition

Author keywords

Audio visual speech recognition; Coupled hidden Markov models; Dynamic Bayesian networks; Factorial hidden Markov models; Hidden Markov models

Indexed keywords

ACOUSTIC NOISE; ACOUSTIC SIGNAL PROCESSING; ALGORITHMS; CORRELATION THEORY; MARKOV PROCESSES; SPEECH SYNTHESIS; VIDEO SIGNAL PROCESSING;

EID: 0036874999     PISSN: 11108657     EISSN: None     Source Type: Journal    
DOI: 10.1155/S1110865702206083     Document Type: Article
Times cited : (257)

References (30)
  • 1
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, vol. 264, no. 5588, pp. 746-748, 1976.
    • (1976) Nature , vol.264 , Issue.5588 , pp. 746-748
    • McGurk, H.1    Macdonald, J.2
  • 4
    • 85156254941 scopus 로고
    • Factorial hidden Markov models
    • D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, Eds., MIT Press, Cambridge, Mass, USA
    • Z. Ghahramani and M. I. Jordan, "Factorial hidden Markov models," in Proc. Conf. Advances in Neural Information Processing Systems, D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, Eds., vol. 8, pp. 472-478, MIT Press, Cambridge, Mass, USA, 1995.
    • (1995) Proc. Conf. Advances in Neural Information Processing Systems , vol.8 , pp. 472-478
    • Ghahramani, Z.1    Jordan, M.I.2
  • 5
    • 0012730694 scopus 로고
    • A model for reasoning about persistence and causation
    • T. Dean and K. Kanazawa, "A model for reasoning about persistence and causation," Artificial Intelligence, vol. 93, no. 1-2, pp. 1-27, 1989.
    • (1989) Artificial Intelligence , vol.93 , Issue.1-2 , pp. 1-27
    • Dean, T.1    Kanazawa, K.2
  • 8
    • 85032752352 scopus 로고    scopus 로고
    • Audiovisual speech processing
    • January
    • T. Chen, "Audiovisual speech processing," IEEE Signal Processing Magazine, vol. 18, pp. 9-21, January 2001.
    • (2001) IEEE Signal Processing Magazine , vol.18 , pp. 9-21
    • Chen, T.1
  • 9
  • 11
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, no. 3, pp. 141-151, 2000.
    • (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 14
    • 0034842342 scopus 로고    scopus 로고
    • Asynchronous stream modeling for large vocabulary audio-visual speech recognition
    • Salt Lake City, Utah, USA
    • J. Luettin, G. Potamianos, and C. Neti, "Asynchronous stream modeling for large vocabulary audio-visual speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 169-172, Salt Lake City, Utah, USA, 2001.
    • (2001) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 169-172
    • Luettin, J.1    Potamianos, G.2    Neti, C.3
  • 18
    • 0012705518 scopus 로고    scopus 로고
    • Advanced Multimedia Processing Lab, Carnegie Mellon University, Pittsburgh, Pa, USA
    • Advanced Multimedia Processing Lab, http://amp.ece.cmu.edu/projects/AudioVisualSpeechProcessing/, Carnegie Mellon University, Pittsburgh, Pa, USA.
  • 22
    • 0002049440 scopus 로고    scopus 로고
    • Learning dynamic Bayesian networks
    • Adaptive Processing of Sequences and Data Structures, C. Giles and M. Gori, Eds., Springer-Verlag, Berlin, Germany
    • Z. Ghahramani, "Learning dynamic Bayesian networks," in Adaptive Processing of Sequences and Data Structures, C. Giles and M. Gori, Eds., Lecture Notes in Artificial Intelligence, pp. 168-197, Springer-Verlag, Berlin, Germany, 1998.
    • (1998) Lecture Notes in Artificial Intelligence , pp. 168-197
    • Ghahramani, Z.1
  • 26
    • 85009135946 scopus 로고    scopus 로고
    • Bimodal speech recognition using coupled hidden Markov models
    • Beijing, China
    • S. Chu and T. Huang, "Bimodal speech recognition using coupled hidden Markov models," in Proc. IEEE International Conf. on Spoken Language Processing, vol. 2, pp. 747-750, Beijing, China, 2000.
    • (2000) Proc. IEEE International Conf. on Spoken Language Processing , vol.2 , pp. 747-750
    • Chu, S.1    Huang, T.2
  • 27
    • 0036295989 scopus 로고    scopus 로고
    • Audio-visual speech modeling using coupled hidden Markov models
    • Orlando, Fla, USA, May
    • S. Chu and T. Huang, "Audio-visual speech modeling using coupled hidden Markov models," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, pp. 2009-2012, Orlando, Fla, USA, May 2002.
    • (2002) Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing , pp. 2009-2012
    • Chu, S.1    Huang, T.2
  • 28
    • 0000417467 scopus 로고    scopus 로고
    • Visionary speech: Looking ahead to practical speechreading systems
    • Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds., Springer-Verlag, Berlin, Germany
    • M. E. Hennecke, D. G. Stork, and K. V. Prasad, "Visionary speech: Looking ahead to practical speechreading systems," in Speechreading by Humans and Machines: Models, Systems and Applications, D. G. Stork and M. E. Hennecke, Eds., vol. 150 of NATO ASI Series F: Computer and Systems Sciences, pp. 331-349, Springer-Verlag, Berlin, Germany, 1996.
    • (1996) NATO ASI Series F: Computer and Systems Sciences, , vol.150 , pp. 331-349
    • Hennecke, M.E.1    Stork, D.G.2    Prasad, K.V.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.