메뉴 건너뛰기




Volumn 21, Issue 10, 2013, Pages 2182-2192

Noise model transfer: Novel approach to robustness against nonstationary noise

Author keywords

Meeting speech recognition; nonstationary noise; reverberation; robust speech recognition

Indexed keywords

CHANGING PARAMETER; CONVENTIONAL METHODS; NOISE CHARACTERISTIC; NOISE POWER SPECTRUM; NOISE-POWER SPECTRA; NONSTATIONARY NOISE; OPTIMAL TRANSFORMATION; ROBUST SPEECH RECOGNITION;

EID: 84881043147     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2013.2272513     Document Type: Article
Times cited : (10)

References (32)
  • 1
    • 84901773892 scopus 로고    scopus 로고
    • Springer Handbook of Speech Processing, J. Benesty M. M. Sondhi, and Y. Huang, Eds. New York, NY, USA: Springer
    • J. Droppo and A. Acero, "Environmental robustness," in Springer Handbook of Speech Processing, J. Benesty, M. M. Sondhi, and Y. Huang, Eds. New York, NY, USA: Springer, 2008, pp. 653-679
    • (2008) Environmental Robustness , pp. 653-679
    • Droppo, J.1    Acero, A.2
  • 2
    • 0029725301 scopus 로고    scopus 로고
    • A vector Taylor series approach for environmental-independent speech recognition
    • P. J.Moreno, B. Raj, and R.M. Stern, "A vector Taylor series approach for environmental-independent speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 1996, vol. 2, pp. 733-736
    • (1996) Proc. Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 3
    • 85006734596 scopus 로고    scopus 로고
    • Evaluation of the splice algorithm on the Aurora2 database
    • J. Droppo, L. Deng, and A. Acero, "Evaluation of the splice algorithm on the Aurora2 database," Proc. Eurospeech, pp. 217-220, 2001
    • (2001) Proc. Eurospeech , pp. 217-220
    • Droppo, J.1    Deng, L.2    Acero, A.3
  • 4
    • 85009142179 scopus 로고    scopus 로고
    • Model-based compensation of the additive noise for continuous speech recognition. Experiments using the Aurora II database and tasks
    • J. C. Segura et al., "Model-based compensation of the additive noise for continuous speech recognition. Experiments using the Aurora II database and tasks," Proc. Eurospeech, pp. 221-224, 2001
    • (2001) Proc. Eurospeech , pp. 221-224
    • Segura, J.C.1
  • 6
    • 0032027527 scopus 로고    scopus 로고
    • Nonstationary environment compensation based on sequential estimation
    • N. S. Kim, "Nonstationary environment compensation based on sequential estimation," IEEE Signal Process. Lett., vol. 5, no. 3, pp. 57-59, Mar. 1998 (Pubitemid 128556794)
    • (1998) IEEE Signal Processing Letters , vol.5 , Issue.3 , pp. 57-59
    • Kim, N.S.1
  • 7
    • 0347968277 scopus 로고    scopus 로고
    • Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
    • Nov
    • L. Deng, J. Droppo, and A. Acero, "Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition," IEEE Trans. Speech, Audio Process., vol. 11, no. 6, pp. 568-580, Nov. 2003
    • (2003) IEEE Trans. Speech, Audio Process , vol.11 , Issue.6 , pp. 568-580
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 8
    • 0742324997 scopus 로고    scopus 로고
    • Sequential estimation with optimal forgetting for robust speech recognition
    • Jan
    • M. Afify and O. Siohan, "Sequential estimation with optimal forgetting for robust speech recognition," IEEE Trans. Speech Audio Process., vol. 12, no. 1, pp. 19-26, Jan. 2004
    • (2004) IEEE Trans. Speech Audio Process , vol.12 , Issue.1 , pp. 19-26
    • Afify, M.1    Siohan, O.2
  • 11
    • 50449094088 scopus 로고    scopus 로고
    • Closely coupled array processing and modelbased compensation for microphone array speech recognition
    • Mar
    • X. Zhao and Z. Ou, "Closely coupled array processing and modelbased compensation for microphone array speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1114-1122, Mar. 2007
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 1114-1122
    • Zhao, X.1    Ou, Z.2
  • 12
    • 70350439261 scopus 로고    scopus 로고
    • Enhanced speech features by single-channel joint compensation of noise and reverberation
    • Feb
    • M.Wölfel, "Enhanced speech features by single-channel joint compensation of noise and reverberation," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 2, pp. 312-323, Feb. 2009
    • (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.2 , pp. 312-323
    • Wölfel, M.1
  • 13
    • 79961150469 scopus 로고    scopus 로고
    • A microphone array system integrating beamforming, feature enhancement, and spectral mask-based noise estimation
    • T. Yoshioka and T. Nakatani, "A microphone array system integrating beamforming, feature enhancement, and spectral mask-based noise estimation," Proc. Hands-Free Speech Commun., Microphone Arrays, pp. 219-224, 2011
    • (2011) Proc. Hands-Free Speech Commun., Microphone Arrays , pp. 219-224
    • Yoshioka, T.1    Nakatani, T.2
  • 14
    • 84890498342 scopus 로고    scopus 로고
    • Noise model transfer using affine transformation with application to large vocabulary reverberant speech recognition
    • Accepted for publication
    • T. Yoshioka and T. Nakatani, "Noise model transfer using affine transformation with application to large vocabulary reverberant speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2013, accepted for publication
    • (2013) Proc. Int. Conf. Acoust., Speech, Signal Process
    • Yoshioka, T.1    Nakatani, T.2
  • 16
    • 85009252959 scopus 로고    scopus 로고
    • Double the trouble: Handling noise and reverberation in far-field automatic speech recognition
    • D. Gelbart and N. Morgan, "Double the trouble: handling noise and reverberation in far-field automatic speech recognition," in Proc. Int. Conf. Spoken Lang. Process., 2002, pp. 2185-2188
    • (2002) Proc. Int. Conf. Spoken Lang. Process , pp. 2185-2188
    • Gelbart, D.1    Morgan, N.2
  • 17
    • 40249089621 scopus 로고    scopus 로고
    • Speech enhancement and recognition in meetings with an audio-visual sensor array
    • Nov
    • H. K. Maganti, D. Gatica-Perez, and I. McCowan, "Speech enhancement and recognition in meetings with an audio-visual sensor array," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2257-2269, Nov. 2007
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2257-2269
    • Maganti, H.K.1    Gatica-Perez, D.2    McCowan, I.3
  • 20
    • 33745207361 scopus 로고    scopus 로고
    • A Japanese national project on spontaneous speech corpus and processing technology
    • S. Furui, K. Maekawa, and H. Isahara, "A Japanese national project on spontaneous speech corpus and processing technology," in Proc. Autom. Speech Recognition Workshop, 2000, pp. 244-248
    • (2000) Proc. Autom. Speech Recognition Workshop , pp. 244-248
    • Furui, S.1    Maekawa, K.2    Isahara, H.3
  • 21
    • 78049409757 scopus 로고    scopus 로고
    • Discriminative training based on an integrated view of MPE and MMI in margin and error space
    • E. McDermott, S. Watanabe, and A. Nakamura, "Discriminative training based on an integrated view of MPE and MMI in margin and error space," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2010, pp. 4894-4897
    • (2010) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 4894-4897
    • McDermott, E.1    Watanabe, S.2    Nakamura, A.3
  • 22
    • 77955673019 scopus 로고    scopus 로고
    • Model-based feature enhancement for reverberant speech recognition
    • Sep
    • A. Krueger and R. Haeb-Umbach, "Model-based feature enhancement for reverberant speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1692-1707, Sep. 2010
    • (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.7 , pp. 1692-1707
    • Krueger, A.1    Haeb-Umbach, R.2
  • 23
    • 77955683144 scopus 로고    scopus 로고
    • Reverberation model-based decoding in the logmelspec domain for robust distant-talking speech recognition
    • Sep
    • A. Sehr, R. Maas, and W. Kellermann, "Reverberation model-based decoding in the logmelspec domain for robust distant-talking speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1676-1691, Sep. 2010
    • (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.7 , pp. 1676-1691
    • Sehr, A.1    Maas, R.2    Kellermann, W.3
  • 25
    • 85032751613 scopus 로고    scopus 로고
    • Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition
    • Aug
    • T.Yoshioka, A. Sehr,M.Delcroix, K. Kinoshita, R.Maas, T. Nakatani, and W. Kellermann, "Making machines understand us in reverberant rooms: robustness against reverberation for automatic speech recognition," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 114-126, Aug. 2012
    • (2012) IEEE Signal Process. Mag , vol.29 , Issue.6 , pp. 114-126
    • Yoshioka, T.1    Sehr, A.2    Delcroix, M.3    Kinoshita, K.4    Maas, R.5    Nakatani, T.6    Kellermann, W.7
  • 26
  • 27
    • 14344274593 scopus 로고    scopus 로고
    • A new method based on spectral subtraction for speech dereverberation
    • K. Lebart, J. M. Boucher, and P. N. Denbigh, "A new method based on spectral subtraction for speech dereverberation," Acta Acust. United Acust., vol. 87, pp. 359-366, 2001 (Pubitemid 32699291)
    • (2001) Acta Acustica united with Acustica , vol.87 , Issue.3 , pp. 359-366
    • Lebart, K.1    Boucher, J.M.2    Denbigh, P.N.3
  • 31
    • 70350435249 scopus 로고    scopus 로고
    • Integrated speech enhancement method using noise suppression and dereverberation
    • Feb
    • T. Yoshioka, T. Nakatani, and M. Miyoshi, "Integrated speech enhancement method using noise suppression and dereverberation," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 2, pp. 231-246, Feb. 2009
    • (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.2 , pp. 231-246
    • Yoshioka, T.1    Nakatani, T.2    Miyoshi, M.3
  • 32
    • 77955680097 scopus 로고    scopus 로고
    • Correlation-based and model-based blind single-channel late-reverberation suppression in noisy time-varying acoustical environments
    • Sep
    • J. S. Erkelens and R. Heusdens, "Correlation-based and model-based blind single-channel late-reverberation suppression in noisy time-varying acoustical environments," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1746-1765, Sep. 2010
    • (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.7 , pp. 1746-1765
    • Erkelens, J.S.1    Heusdens, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.