메뉴 건너뛰기




Volumn , Issue , 2009, Pages 604-609

Automatic speech recognition improved by two-layered audio-visual integration for robot audition

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC ENVIRONMENT; ACOUSTIC FEATURES; AUDIO FEATURES; AUDIO VISUAL SPEECH RECOGNITION; AUDIO-VISUAL; AUDIO-VISUAL INTEGRATION; AUTOMATIC SPEECH RECOGNITION; CHANGING ENVIRONMENT; EMPIRICAL RESULTS; ENVIRONMENTAL NOISE; MICROPHONE ARRAY PROCESSING; MICROPHONE ARRAYS; NOISE CONDITIONS; NOISY ENVIRONMENT; RELIABILITY ESTIMATION; ROBOT AUDITION; VISUAL FEATURE; VOICE ACTIVITY; VOICE ACTIVITY DETECTION;

EID: 77950563943     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICHR.2009.5379586     Document Type: Conference Paper
Times cited : (35)

References (17)
  • 5
    • 0030638031 scopus 로고    scopus 로고
    • A post-processing systems to yield reduced word error rates: Recogniz er output voting error reduction (rover)
    • J. Fiscus, "A post-processing systems to yield reduced word error rates : Recogniz er output voting error reduction (rover)," in Proc. of the Workshop on Automatic Speech Recognition and Understanding (ASRU). pp. 347-354, 1997.
    • (1997) Proc. of the Workshop on Automatic Speech Recognition and Understanding (ASRU) , pp. 347-354
    • Fiscus, J.1
  • 8
    • 77950574450 scopus 로고    scopus 로고
    • "http://julius.sourceforge.jp/."
  • 10
    • 34447095008 scopus 로고    scopus 로고
    • Visual voice activity detection as a help for speech source separation from convolutive mixtures
    • B. Rivet, L. Girin, and C. Jutten, "Visual voice activity detection as a help for speech source separation from convolutive mixtures," Speech Communication, Vol. 49, no. 7-8, pp. 667-677, 2007.
    • (2007) Speech Communication , vol.49 , Issue.7-8 , pp. 667-677
    • Rivet, B.1    Girin, L.2    Jutten, C.3
  • 11
    • 0037704976 scopus 로고    scopus 로고
    • Face-to-talk: Audio-visual speech detection for robust speech recognition in noisy environment
    • K. Murai and S. Nakamura, "Face-to-talk: audio-visual speech detection for robust speech recognition in noisy environment," IEICE Trans. Inf. & Syst., vol.E86-D, no. 3, pp. 505-513, 2003.
    • (2003) IEICE Trans. Inf. & Syst. , vol.E86-D , Issue.3 , pp. 505-513
    • Murai, K.1    Nakamura, S.2
  • 13
    • 10444237268 scopus 로고    scopus 로고
    • Improvement of recognition of simultaneous speech signals using av integration and scattering theory for humanoid robots
    • K. Nakadai, D. Matsuura, H. G. Okuno, and H. Tsujino, "Improvement of recognition of simultaneous speech signals using av integration and scattering theory for humanoid robots," Speech Communication, Vol. 44, pp. 97-112, 2004.
    • (2004) Speech Communication , vol.44 , pp. 97-112
    • Nakadai, K.1    Matsuura, D.2    Okuno, H.G.3    Tsujino, H.4
  • 14
    • 85009062588 scopus 로고    scopus 로고
    • Real-time sound source localization and separation system and its application to automatic speech recognition
    • Sep.
    • F. Asano, M. Goto, K. Itou, and H. Asoh, "Real-time sound source localization and separation system and its application to automatic speech recognition." in Proc. of International Conference on Speech Processing (Eurospeech). pp. 1013-1016, Sep. 2001.
    • (2001) Proc. of International Conference on Speech Processing (Eurospeech) , pp. 1013-1016
    • Asano, F.1    Goto, M.2    Itou, K.3    Asoh, H.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.