



2008, Pages 992-995

Soft missing-feature mask generation for simultaneous speech recognition system in robots

Author keywords

Missing feature theory; Robot audition; Simultaneous speech recognition; Soft mask; Speech recognition

Indexed keywords

CONVENTIONAL SYSTEMS; ENERGY ESTIMATION; FREE PARAMETERS; MISSING FEATURE THEORIES; PROBABILITY CALCULATIONS; RECOGNITION PROCESS; RECOGNITION RATES; ROBOT AUDITION; SIGMOID FUNCTION; SIMULTANEOUS SPEECH RECOGNITION; SOFT MASK; SPECTRAL PARAMETERS; SPEECH RECOGNITION SYSTEMS; STATIC AND DYNAMIC; STATIC FEATURES; WORD RECOGNITION;

EID: 84867201614     PISSN: None     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (7)

References (12)
  • 1
    • 0038582145
    • One, two, many - Judging the number of concurrent talkers
    • Makio Kashino and Tatsuya Hirahara, "One, two, many - judging the number of concurrent talkers," Journal of the Acoustical Society of America, vol. 99, no. 4, pt. 2, p. 2596, 1996.
    • (1996) Journal of the Acoustical Society of America , vol.99 , Issue.4 PART 2 , pp. 2596
    • Kashino, M.1    Hirahara, T.2
  • 2
    • 4644317224
    • A Bayesian framework for spectrographic mask estimation for missing feature speech recognition
    • M. L. Seltzer, B. Raj, and R. M. Stern, "A Bayesian framework for spectrographic mask estimation for missing feature speech recognition," Speech Communication, vol.43, pp.379-393, 2004.
    • (2004) Speech Communication , vol.43 , pp. 379-393
    • Seltzer, M.L.1    Raj, B.2    Stern, R.M.3
  • 3
    • 33846170539
    • Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory
    • Shun'ichi Yamamoto, Jean-Marc Valin, Kazuhiro Nakadai, Jean Rouat, François Michaud, Tetsuya Ogata, and Hiroshi G. Okuno, "Enhanced Robot Speech Recognition Based on Microphone Array Source Separation and Missing Feature Theory," in Proc. of IEEE ICRA-2005, pp.1489-1494, 2005.
    • (2005) Proc. of IEEE ICRA-2005 , pp. 1489-1494
    • Yamamoto, S.1    Valin, J.-M.2    Nakadai, K.3    Rouat, J.4    Michaud, F.5    Ogata, T.6    Okuno, H.G.7
  • 4
    • 85009063707
    • Soft decision in missing data techniques for robust automatic speech recognition
    • J. Barker, L. Josifovski, M. P. Cooke and P. D. Green, "Soft decision in missing data techniques for robust automatic speech recognition," Proc., ICSLP-2000, 2000.
    • (2000) Proc., ICSLP-2000
    • Barker, J.1    Josifovski, L.2    Cooke, M.P.3    Green, P.D.4
  • 6
    • 34250638496
    • Multiband Julius, "http://www.furui.cs.titech.ac.jp/mband julius/".
    • Multiband Julius
  • 7
    • 85009144958
    • Free Software Toolkit for Japanese Large Vocabulary Continuous Speech Recognition
    • Tatsuya Kawahara and Akinobu Lee, "Free Software Toolkit for Japanese Large Vocabulary Continuous Speech Recognition," in Proc. of ISCA ICSLP-2000, vol.4, pp.476-479, 2000.
    • (2000) Proc. of ISCA ICSLP-2000 , vol.4 , pp. 476-479
    • Kawahara, T.1    Lee, A.2
  • 9
    • 0036753896
    • Geometric Source Separation: Merging Convolutive Source Separation with Geometric Beamforming
    • Lucas C. Parra and Christopher V. Alvino, "Geometric Source Separation: Merging Convolutive Source Separation With Geometric Beamforming," IEEE Trans. Speech and Audio Processing, vol.10, no.6, pp.352-362, 2002.
    • (2002) IEEE Trans. Speech and Audio Processing , vol.10 , Issue.6 , pp. 352-362
    • Parra, L.C.1    Alvino, C.V.2
  • 10
    • 0035500783
    • Speech enhancement for non-stationary noise environments
    • Israel Cohen and Baruch Berdugo, "Speech enhancement for non-stationary noise environments," Signal Processing, vol.81, no.11, pp.2403-2418, 2001.
    • (2001) Signal Processing , vol.81 , Issue.11 , pp. 2403-2418
    • Cohen, I.1    Berdugo, B.2
  • 11
    • 33746191291
    • Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals
    • Proc., IEA/AIE-2006 Springer-Verlag
    • Shun'ichi Yamamoto, Kazuhiro Nakadai, Mikio Nakano, Hiroshi Tsujino, Jean-Marc Valin, Ryu Takeda, Kazunori Komatani, Tetsuya Ogata, and Hiroshi G. Okuno, "Genetic Algorithm-Based Improvement of Robot Hearing Capabilities in Separating and Recognizing Simultaneous Speech Signals," in Proc., IEA/AIE-2006, LNAI 4031, pp.207-217, Springer-Verlag, 2006.
    • (2006) LNAI , vol.4031 , pp. 207-217
    • Yamamoto, S.1    Nakadai, K.2    Nakano, M.3    Tsujino, H.4    Valin, J.-M.5    Takeda, R.6    Komatani, K.7    Ogata, T.8    Okuno, H.G.9
  • 12
    • 0021892216
    • Speech Enhancement Using Minimum Mean-Square Error Log-Spectral Amplitude Estimator
    • Y. Ephraim and D. Malah, "Speech Enhancement Using Minimum Mean-Square Error Log-Spectral Amplitude Estimator," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-33, no.2, pp.443-445, 1985.
    • (1985) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-33 , Issue.2 , pp. 443-445
    • Ephraim, Y.1    Malah, D.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.