메뉴 건너뛰기




Volumn 7, Issue 3, 2005, Pages 495-506

Integration strategies for audio-visual speech processing: Applied to text-dependent speaker recognition

Author keywords

Audio visual speech processing (AVSP); Classifier combination; Integration strategies; Multistream hidden Markov model (HMM); Speaker recognition

Indexed keywords

ERROR ANALYSIS; GRAPH THEORY; INTEGRATION; MARKOV PROCESSES; MATHEMATICAL MODELS; SPEECH PROCESSING;

EID: 20444375102     PISSN: 15209210     EISSN: None     Source Type: Journal    
DOI: 10.1109/TMM.2005.846777     Document Type: Article
Times cited : (33)

References (31)
  • 2
    • 0032074310 scopus 로고    scopus 로고
    • Audio-visual integration in multimodal communication
    • May
    • T. Chen and R. Rao, "Audio-visual integration in multimodal communication," Proc. IEEE, vol. 86, no. 5, pp. 837-852, May 1998.
    • (1998) Proc. IEEE , vol.86 , Issue.5 , pp. 837-852
    • Chen, T.1    Rao, R.2
  • 3
    • 0029270677 scopus 로고
    • Converting speech into lip movements: A multimedia telephone for hard hearing people
    • Mar.
    • F. Lavagetto, "Converting speech into lip movements: A multimedia telephone for hard hearing people," IEEE Trans. Rehab. Eng., vol. 3, no. 1, pp. 90-102, Mar. 1995.
    • (1995) IEEE Trans. Rehab. Eng. , vol.3 , Issue.1 , pp. 90-102
    • Lavagetto, F.1
  • 4
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • Dec.
    • H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, pp. 746-748, Dec. 1976.
    • (1976) Nature , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 5
    • 0003544881 scopus 로고    scopus 로고
    • NATO ASI Series F: Computer and Systems Sciences, Eds., Springer-Verlag, New York
    • Speechreading by Humans and Machines, vol. 150, NATO ASI Series F: Computer and Systems Sciences, D. G. Stork and M. E. Hennecke, Eds., Springer-Verlag, New York, 1996.
    • (1996) Speechreading by Humans and Machines , vol.150
    • Stork, D.G.1    Hennecke, M.E.2
  • 6
    • 0036502797 scopus 로고    scopus 로고
    • A review of speech-based bimodal recognition
    • Mar.
    • C. C. Chibelushi, F. Deravi, and J. S. D. Mason, "A review of speech-based bimodal recognition," IEEE Trans. Multimedia, vol. 4, no. 1, pp. 23-37, Mar. 2002.
    • (2002) IEEE Trans. Multimedia , vol.4 , Issue.1 , pp. 23-37
    • Chibelushi, C.C.1    Deravi, F.2    Mason, J.S.D.3
  • 7
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • Sep.
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, no. 3, pp. 141-151, Sep. 2000.
    • (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 12
    • 22444454265 scopus 로고    scopus 로고
    • Combining classifiers: A theoretical framework
    • J. Kittler, "Combining classifiers: A theoretical framework," Pattern Anal. and Applicat., vol. 1, no. 1, pp. 18-27, 1998.
    • (1998) Pattern Anal. and Applicat. , vol.1 , Issue.1 , pp. 18-27
    • Kittler, J.1
  • 13
    • 0004473740 scopus 로고    scopus 로고
    • Modularity and catastrophic fusion: A Bayesian approach with applications to audio-visual speech recognition
    • USCD, Dept. Cognitive Sci., San Diego, CA
    • J. R. Movellan and P. Mineiro, "Modularity and Catastrophic Fusion: A Bayesian Approach with Applications to Audio-Visual Speech Recognition," USCD, Dept. Cognitive Sci., San Diego, CA, Tech. Rep. 97.01, 1997.
    • (1997) Tech. Rep. 97.01
    • Movellan, J.R.1    Mineiro, P.2
  • 14
    • 0024766457 scopus 로고
    • A family of distortion measures based upon projection operation for robust speech recognition
    • Nov.
    • D. Mansour and B. H. Juang, "A family of distortion measures based upon projection operation for robust speech recognition," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 11, pp. 1659-1671, Nov. 1989.
    • (1989) IEEE Trans. Acoust., Speech, Signal Process. , vol.37 , Issue.11 , pp. 1659-1671
    • Mansour, D.1    Juang, B.H.2
  • 15
    • 35248829639 scopus 로고    scopus 로고
    • Data dependence in combining classifiers
    • T. Windeatt and F. Roli, Eds.
    • M. S. Kamel and N. M. Wanas, "Data dependence in combining classifiers," in Multiple Classifier Systems, T. Windeatt and F. Roli, Eds., 2003, pp. 1-14.
    • (2003) Multiple Classifier Systems , pp. 1-14
    • Kamel, M.S.1    Wanas, N.M.2
  • 19
    • 0022019614 scopus 로고
    • Intermodal timing relations and audio-visual speech recognition
    • Feb.
    • M. McGrath and Q. Summerfield, "Intermodal timing relations and audio-visual speech recognition," J. Acoust. Soc. Amer., vol. 77, no. 2, pp. 678-685, Feb. 1985.
    • (1985) J. Acoust. Soc. Amer. , vol.77 , Issue.2 , pp. 678-685
    • McGrath, M.1    Summerfield, Q.2
  • 25
    • 0037360227 scopus 로고    scopus 로고
    • Improved facial-feature detection for AVSP via unsupervised clustering and discriminant analysis
    • S. Lucey, V. Chandran, and S. Sridharan, "Improved facial-feature detection for AVSP via unsupervised clustering and discriminant analysis," EURASIP J. Appl. Signal Process., no. 3, pp. 264-275, 2003.
    • (2003) EURASIP J. Appl. Signal Process. , Issue.3 , pp. 264-275
    • Lucey, S.1    Chandran, V.2    Sridharan, S.3
  • 27
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 29
    • 85009268624 scopus 로고    scopus 로고
    • A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition
    • S. Lucey, V. Chandran, and S. Sridharan, "A link between cepstral shrinking and the weighted product rule in audio-visual speech recognition," in Proc. Int. Conf. Spoken Language Processing (ICSLP'02), 2002, pp. 1961-1964.
    • (2002) Proc. Int. Conf. Spoken Language Processing (ICSLP'02) , pp. 1961-1964
    • Lucey, S.1    Chandran, V.2    Sridharan, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.