SCOPUS Information Retrieval Platform

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volume , Issue , 2008, Pages 992-995

Soft missing-feature mask generation for simultaneous speech recognition system in robots

(6) Takahashi, Toru (a); Yamamoto, Shun'ichi (a); Nakadai, Kazuhiro (b); Komatani, Kazunori (a); Ogata, Tetsuya (a); Okuno, Hiroshi G. (a)

(a) KYOTO UNIVERSITY (Japan)

(b) HONDA RESEARCH INSTITUTE JAPAN CO LTD (Japan)

Author keywords

Missing feature theory; Robot audition; Simultaneous speech recognition; Soft mask; Speech recognition

Indexed keywords

CONVENTIONAL SYSTEMS; ENERGY ESTIMATION; FREE PARAMETERS; MISSING FEATURE THEORIES; PROBABILITY CALCULATIONS; RECOGNITION PROCESS; RECOGNITION RATES; ROBOT AUDITION; SIGMOID FUNCTION; SIMULTANEOUS SPEECH RECOGNITION; SOFT MASK; SPECTRAL PARAMETERS; SPEECH RECOGNITION SYSTEMS; STATIC AND DYNAMIC; STATIC FEATURES; WORD RECOGNITION;

AUDITION; FEATURE EXTRACTION; ROBOTS; SPEECH COMMUNICATION;

SPEECH RECOGNITION;

EID: 84867201614
PISSN: None
EISSN: 19909772
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited : (7)

References (12)

1
- 0038582145
- One, two, many - Judging the number of concurrent talkers
- Makio Kashino and Tatsuya Hirahara, "One, two, many - judging the number of concurrent talkers," Journal of the Acoustical Society of America, vol. 99, no. 4, Pt. 2, p. 2596, 1996.
- (1996) Journal of the Acoustical Society of America , vol.99 , Issue.4 PART 2 , pp. 2596
- Kashino, M.¹ Hirahara, T.²

2
- 4644317224
- A Bayesian framework for spectrographic mask estimation for missing feature speech recognition
- M. L. Seltzer, B. Raj, and R. M. Stern, "A Bayesian framework for spectrographic mask estimation for missing feature speech recognition," Speech Communication, vol.43, pp.379-393, 2004.
- (2004) Speech Communication , vol.43 , pp. 379-393
- Seltzer, M.L.¹ Raj, B.² Stern, R.M.³

3
- 33846170539
- Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory
- Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Jean Rouat, François Michaud, Tetsuya Ogata, and Hiroshi G. Okuno, "Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory," in Proc. of IEEE ICRA-2005, pp.1489-1494, 2005.
- (2005) Proc. of IEEE ICRA-2005 , pp. 1489-1494
- Yamamoto, S.¹ Valin, J.-M.² Nakadai, K.³ Rouat, J.⁴ Michaud, F.⁵ Ogata, T.⁶ Okuno, H.G.⁷

4
- 85009063707
- Soft decision in missing data techniques for robust automatic speech recognition
- J. Barker, L. Josifovski, M. P. Cooke and P. D. Green, "Soft decision in missing data techniques for robust automatic speech recognition," Proc., ICSLP-2000, 2000.
- (2000) Proc., ICSLP-2000
- Barker, J.¹ Josifovski, L.² Cooke, M.P.³ Green, P.D.⁴

5
- 37349116539
- Noise-Robust Speech Recognition Using Multi-Band Spectral Features
- Yoshitaka Nishimura, Takahiro Shinozaki, Koji Iwano, and Sadaoki Furui, "Noise-Robust Speech Recognition Using Multi-Band Spectral Features," in Proc., 148th Acoustical Society of America Meetings, No. 1aSC7, 2004.
- (2004) Proc., 148th Acoustical Society of America Meetings , Issue.1aSC7
- Nishimura, Y.¹ Shinozaki, T.² Iwano, K.³ Furui, S.⁴

6
- 34250638496
- Multiband Julius, "http://www.furui.cs.titech.ac.jp/mband julius/".
- Multiband Julius

7
- 85009144958
- Free Software Toolkit for Japanese Large Vocabulary Continuous Speech Recognition
- Tatsuya Kawahara and Akinobu Lee, "Free Software Toolkit for Japanese Large Vocabulary Continuous Speech Recognition," in Proc. of ISCA ICSLP-2000, vol.4, pp.476-479, 2000.
- (2000) Proc. of ISCA ICSLP-2000 , vol.4 , pp. 476-479
- Kawahara, T.¹ Lee, A.²

8
- 79957986619
- Making a Robot Recognize Three Simultaneous Sentences in Real-time
- Shun'ichi Yamamoto, Kazuhiro Nakadai, Jean-Marc Valin, Jean Rouat, François Michaud, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno, "Making A Robot Recognize Three Simultaneous Sentences In Real-time," in Proc. of IEEE/RSJ IROS-2005, pp.897-902, 2005.
- (2005) Proc. of IEEE/RSJ IROS-2005 , pp. 897-902
- Yamamoto, S.¹ Nakadai, K.² Valin, J.-M.³ Rouat, J.⁴ Michaud, F.⁵ Komatani, K.⁶ Ogata, T.⁷ Okuno, H.G.⁸

9
- 0036753896
- Geometric Source Separation: Merging Convolutive Source Separation with Geometric Beamforming
- Lucas C. Parra and Christopher V. Alvino, "Geometric Source Separation: Merging Convolutive Source Separation With Geometric Beamforming," IEEE Trans. Speech and Audio Processing, vol.10, no.6, pp.352-362, 2002.
- (2002) IEEE Trans. Speech and Audio Processing , vol.10 , Issue.6 , pp. 352-362
- Parra, L.C.¹ Alvino, C.V.²

10
- 0035500783
- Speech enhancement for non-stationary noise environments
- Israel Cohen and Baruch Berdugo, "Speech enhancement for non-stationary noise environments," Signal Processing, 81(11), pp.2403-2418, 2001.
- (2001) Signal Processing , vol.81 , Issue.11 , pp. 2403-2418
- Cohen, I.¹ Berdugo, B.²

11
- 33746191291
- Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals
- Proc., IEA/AIE-2006 Springer-Verlag
- Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno, "Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals," in Proc., IEA/AIE-2006, LNAI 4031, 2006, pp.207-217, Springer-Verlag.
- (2006) LNAI , vol.4031 , pp. 207-217
- Yamamoto, S.¹ Nakadai, K.² Nakano, M.³ Tsujino, H.⁴ Valin, J.-M.⁵ Takeda, R.⁶ Komatani, K.⁷ Ogata, T.⁸ Okuno, H.G.⁹

12
- 0021892216
- Speech Enhancement Using Minimum Mean-Square Error Log-Spectral Amplitude Estimator
- Y. Ephraim and D. Malah, "Speech Enhancement Using Minimum Mean-Square Error Log-Spectral Amplitude Estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, no.2, pp.443-445, 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-33 , Issue.2 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.