SCOPUS 정보 검색 플랫폼

9th IEEE-RAS International Conference on Humanoid Robots, HUMANOIDS09

Volumn , Issue , 2009, Pages 604-609

Automatic speech recognition improved by two-layered audio-visual integration for robot audition

(3) Yoshida, Takami a Nakadai, Kazuhiro a,b Okuno, Hiroshi G c

a TOKYO INSTITUTE OF TECHNOLOGY (Japan)

b HONDA RESEARCH INSTITUTE JAPAN CO LTD (Japan)

c KYOTO UNIVERSITY (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC ENVIRONMENT; ACOUSTIC FEATURES; AUDIO FEATURES; AUDIO VISUAL SPEECH RECOGNITION; AUDIO-VISUAL; AUDIO-VISUAL INTEGRATION; AUTOMATIC SPEECH RECOGNITION; CHANGING ENVIRONMENT; EMPIRICAL RESULTS; ENVIRONMENTAL NOISE; MICROPHONE ARRAY PROCESSING; MICROPHONE ARRAYS; NOISE CONDITIONS; NOISY ENVIRONMENT; RELIABILITY ESTIMATION; ROBOT AUDITION; VISUAL FEATURE; VOICE ACTIVITY; VOICE ACTIVITY DETECTION;

ANTHROPOMORPHIC ROBOTS; ARRAY PROCESSING; AUDIO ACOUSTICS; AUDITION; BAYESIAN NETWORKS; INFERENCE ENGINES; MICROPHONES; RELIABILITY THEORY; REMELTING;

SPEECH RECOGNITION;

EID: 77950563943 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICHR.2009.5379586 Document Type: Conference Paper

Times cited : (35)

References (17)

1
- 85122848536
- Active audition for humanoid
- K. Nakadai, T. Lourens, H. G. Okuno, and H. Kitano, "Active audition for humanoid ," in Proc. of 17th National Conference on Artificial Intelligence (AAAI), pp. 832-839, 2000.
- (2000) Proc. of 17th National Conference on Artificial Intelligence (AAAI) , pp. 832-839
- Nakadai, K.¹ Lourens, T.² Okuno, H.G.³ Kitano, H.⁴

2
- 34250652551
- Real-time robot audition system that recognizes simultaneous speech in the real world
- S. Yamamoto, K. Nakadai, M. Nakano, H. Tsujino , J.-M. Valin, K. Komatani, T. Ogata, and H. G. Okuno, "Real-time robot audition system that recognizes simultaneous speech in the real world," in Proc. ofIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 5333-5338, 2006.
- (2006) Proc. OfIEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) , pp. 5333-5338
- Yamamoto, S.¹ Nakadai, K.² Nakano, M.³ Tsujino, H.⁴ Valin, J.-M.⁵ Komatani, K.⁶ Ogata, T.⁷ Okuno, H.G.⁸

3
- 0035386489
- A cascade visual front end for speaker independent automatic speechreading
- G. Potamianos , C. Neti, G. Iyengar, A. Senior, and A. Verma, "A cascade visual front end for speaker independent automatic speechreading," Speech Technology, Special Issue on Multimedia, Vol. 4, pp. 193-208,2001.
- (2001) Speech Technology, Special Issue on Multimedia , vol.4 , pp. 193-208
- Potamianos, G.¹ Neti, C.² Iyengar, G.³ Senior, A.⁴ Verma, A.⁵

4
- 33646814706
- A stream-weight optimization method for multi-stream hmms based on likelihood value normalization
- S. Tamura, K. Iwano, and S. Furui, "A stream-weight optimization method for multi-stream hmms based on likelihood value normalization," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), SP-P5.2, 2005.
- Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), SP-P5.2, 2005
- Tamura, S.¹ Iwano, K.² Furui, S.³

5
- 0030638031
- A post-processing systems to yield reduced word error rates: Recogniz er output voting error reduction (rover)
- J. Fiscus, "A post-processing systems to yield reduced word error rates : Recogniz er output voting error reduction (rover)," in Proc. of the Workshop on Automatic Speech Recognition and Understanding (ASRU). pp. 347-354, 1997.
- (1997) Proc. of the Workshop on Automatic Speech Recognition and Understanding (ASRU) , pp. 347-354
- Fiscus, J.¹

6
- 51349110555
- Coarse speech recognition by audio-visual integration based on missing feature theory
- T. Koiwa, K. Nakadai, and J. Irnura, "Coarse speech recognition by audio-visual integration based on missing feature theory," in Proc. of IEEE/RAS Int. Corf. on Intelligent Robots and Systems (IROS). pp. 1751-1756,2007.
- (2007) Proc. of IEEE/RAS Int. Corf. on Intelligent Robots and Systems (IROS) , pp. 1751-1756
- Koiwa, T.¹ Nakadai, K.² Irnura, J.³

7
- 63549118078
- An open source software system for robot audition HARK and its evaluation
- K. Nakadai, H. Okuno, H. Nakajima, Y. Hasegawa, and H. Tsujino, "An open source software system for robot audition HARK and its evaluation," in Proc. of IEEE-RAS International Conference on Humanoid Robots (Humanoids). pp. 561-566, 2008.
- (2008) Proc. of IEEE-RAS International Conference on Humanoid Robots (Humanoids) , pp. 561-566
- Nakadai, K.¹ Okuno, H.² Nakajima, H.³ Hasegawa, Y.⁴ Tsujino, H.⁵

8
- 77950574450
- "http://julius.sourceforge.jp/."

9
- 4544351504
- Voice activity detection using visual information
- P. Liu and Z. Wang, "Voice activity detection using visual information," in Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 609-612, 2004.
- (2004) Proc. of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pp. 609-612
- Liu, P.¹ Wang, Z.²

10
- 34447095008
- Visual voice activity detection as a help for speech source separation from convolutive mixtures
- B. Rivet, L. Girin, and C. Jutten, "Visual voice activity detection as a help for speech source separation from convolutive mixtures," Speech Communication, Vol. 49, no. 7-8, pp. 667-677, 2007.
- (2007) Speech Communication , vol.49 , Issue.7-8 , pp. 667-677
- Rivet, B.¹ Girin, L.² Jutten, C.³

11
- 0037704976
- Face-to-talk: Audio-visual speech detection for robust speech recognition in noisy environment
- K. Murai and S. Nakamura, "Face-to-talk: audio-visual speech detection for robust speech recognition in noisy environment," IEICE Trans. Inf. & Syst., vol.E86-D, no. 3, pp. 505-513, 2003.
- (2003) IEICE Trans. Inf. & Syst. , vol.E86-D , Issue.3 , pp. 505-513
- Murai, K.¹ Nakamura, S.²

12
- 84901660485
- Fusion of audio and video information for detecting speech events
- F. Asano, Y.Motomura and S. Nakamura, "Fusion of audio and video information for detecting speech events," in Proc. International Conference on Information Fusion, pp. 386-393, 2003.
- (2003) Proc. International Conference on Information Fusion , pp. 386-393
- Asano, F.¹ Motomura, Y.² Nakamura, S.³

13
- 10444237268
- Improvement of recognition of simultaneous speech signals using av integration and scattering theory for humanoid robots
- K. Nakadai, D. Matsuura, H. G. Okuno, and H. Tsujino, "Improvement of recognition of simultaneous speech signals using av integration and scattering theory for humanoid robots," Speech Communication, Vol. 44, pp. 97-112, 2004.
- (2004) Speech Communication , vol.44 , pp. 97-112
- Nakadai, K.¹ Matsuura, D.² Okuno, H.G.³ Tsujino, H.⁴

14
- 85009062588
- Real-time sound source localization and separation system and its application to automatic speech recognition
- Sep.
- F. Asano, M. Goto, K. Itou, and H. Asoh, "Real-time sound source localization and separation system and its application to automatic speech recognition." in Proc. of International Conference on Speech Processing (Eurospeech). pp. 1013-1016, Sep. 2001.
- (2001) Proc. of International Conference on Speech Processing (Eurospeech) , pp. 1013-1016
- Asano, F.¹ Goto, M.² Itou, K.³ Asoh, H.⁴

15
- 14044260635
- Enhanced robot audition based on microphone array source separation with post-filter
- J.-M. Valin, J. Rouat, and F.Michaud , "Enhanced robot audition based on microphone array source separation with post-filter," in Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). pp. 2123-2128, 2004.
- (2004) Proc. of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) , pp. 2123-2128
- Valin, J.-M.¹ Rouat, J.² Michaud, F.³

16
- 37349116539
- Noise-robust speech recognition using multi-band spectral features
- Y. Nishimura, T. Shinozaki , K. Iwano, and S. Furui, "Noise-robust speech recognition using multi-band spectral features," in Proc. of 148th Acoustical Society ofAmerica Meetings, no. IaSC7, 2004.
- (2004) Proc. of 148th Acoustical Society OfAmerica Meetings , Issue.IASC7
- Nishimura, Y.¹ Shinozaki, T.² Iwano, K.³ Furui, S.⁴

17
- 48149111531
- Speech recognition for a humanoid with motor noise utilizing missing feature theory
- Y. Nishimura, M. Ishizuka, K. Nakadai, M. Nakano, and H. Tsujino, "Speech recognition for a humanoid with motor noise utilizing missing feature theory," in Proc. of 6th IEEE-RAS International Conference on Humanoid Robots (Humanoids). pp. 26-33, 2006.
- (2006) Proc. of 6th IEEE-RAS International Conference on Humanoid Robots (Humanoids) , pp. 26-33
- Nishimura, Y.¹ Ishizuka, M.² Nakadai, K.³ Nakano, M.⁴ Tsujino, H.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.