SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 21, Issue 10, 2013, Pages 2182-2192

Noise model transfer: Novel approach to robustness against nonstationary noise

(2) Yoshioka, Takuya a Nakatani, Tomohiro a

a Nippon Telegraph and Telephone Corporation (Japan)

Author keywords

Meeting speech recognition; nonstationary noise; reverberation; robust speech recognition

Indexed keywords

CHANGING PARAMETER; CONVENTIONAL METHODS; NOISE CHARACTERISTIC; NOISE POWER SPECTRUM; NOISE-POWER SPECTRA; NONSTATIONARY NOISE; OPTIMAL TRANSFORMATION; ROBUST SPEECH RECOGNITION;

ESTIMATION; FEATURE EXTRACTION; PARAMETER ESTIMATION; POWER SPECTRUM; SPEECH RECOGNITION;

REVERBERATION;

EID: 84881043147 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2013.2272513 Document Type: Article

Times cited : (10)

References (32)

1
- 84901773892
- Springer Handbook of Speech Processing, J. Benesty M. M. Sondhi, and Y. Huang, Eds. New York, NY, USA: Springer
- J. Droppo and A. Acero, "Environmental robustness," in Springer Handbook of Speech Processing, J. Benesty, M. M. Sondhi, and Y. Huang, Eds. New York, NY, USA: Springer, 2008, pp. 653-679
- (2008) Environmental Robustness , pp. 653-679
- Droppo, J.¹ Acero, A.²

2
- 0029725301
- A vector Taylor series approach for environmental-independent speech recognition
- P. J.Moreno, B. Raj, and R.M. Stern, "A vector Taylor series approach for environmental-independent speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 1996, vol. 2, pp. 733-736
- (1996) Proc. Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 733-736
- Moreno, P.J.¹ Raj, B.² Stern, R.M.³

3
- 85006734596
- Evaluation of the splice algorithm on the Aurora2 database
- J. Droppo, L. Deng, and A. Acero, "Evaluation of the splice algorithm on the Aurora2 database," Proc. Eurospeech, pp. 217-220, 2001
- (2001) Proc. Eurospeech , pp. 217-220
- Droppo, J.¹ Deng, L.² Acero, A.³

4
- 85009142179
- Model-based compensation of the additive noise for continuous speech recognition. Experiments using the Aurora II database and tasks
- J. C. Segura et al., "Model-based compensation of the additive noise for continuous speech recognition. Experiments using the Aurora II database and tasks," Proc. Eurospeech, pp. 221-224, 2001
- (2001) Proc. Eurospeech , pp. 221-224
- Segura, J.C.¹

5
- 50449097354
- Ph.D. dissertation Katholieke University Leuven, Leuven, Belgium
- V. Stouten, "Robust automatic speech recognition in time-varying environments," Ph.D. dissertation, Katholieke University Leuven, Leuven, Belgium, 2006
- (2006) Robust Automatic Speech Recognition in Time-varying Environments
- Stouten, V.¹

6
- 0032027527
- Nonstationary environment compensation based on sequential estimation
- N. S. Kim, "Nonstationary environment compensation based on sequential estimation," IEEE Signal Process. Lett., vol. 5, no. 3, pp. 57-59, Mar. 1998 (Pubitemid 128556794)
- (1998) IEEE Signal Processing Letters , vol.5 , Issue.3 , pp. 57-59
- Kim, N.S.¹

7
- 0347968277
- Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
- Nov
- L. Deng, J. Droppo, and A. Acero, "Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition," IEEE Trans. Speech, Audio Process., vol. 11, no. 6, pp. 568-580, Nov. 2003
- (2003) IEEE Trans. Speech, Audio Process , vol.11 , Issue.6 , pp. 568-580
- Deng, L.¹ Droppo, J.² Acero, A.³

8
- 0742324997
- Sequential estimation with optimal forgetting for robust speech recognition
- Jan
- M. Afify and O. Siohan, "Sequential estimation with optimal forgetting for robust speech recognition," IEEE Trans. Speech Audio Process., vol. 12, no. 1, pp. 19-26, Jan. 2004
- (2004) IEEE Trans. Speech Audio Process , vol.12 , Issue.1 , pp. 19-26
- Afify, M.¹ Siohan, O.²

9
- 33947677142
- Dynamic noise adaptation
- S. Rennie et al., "Dynamic noise adaptation," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2006, pp. 1197-1200
- (2006) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 1197-1200
- Rennie, S.¹

10
- 33745146930
- NewYork NY USA: Springer
- J. Benesty, S. Makino, and J. Chen, Speech Enhancement. NewYork, NY, USA: Springer, 2005
- (2005) Speech Enhancement
- Benesty, J.¹ Makino, S.² Chen, J.³

11
- 50449094088
- Closely coupled array processing and modelbased compensation for microphone array speech recognition
- Mar
- X. Zhao and Z. Ou, "Closely coupled array processing and modelbased compensation for microphone array speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1114-1122, Mar. 2007
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 1114-1122
- Zhao, X.¹ Ou, Z.²

12
- 70350439261
- Enhanced speech features by single-channel joint compensation of noise and reverberation
- Feb
- M.Wölfel, "Enhanced speech features by single-channel joint compensation of noise and reverberation," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 2, pp. 312-323, Feb. 2009
- (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.2 , pp. 312-323
- Wölfel, M.¹

13
- 79961150469
- A microphone array system integrating beamforming, feature enhancement, and spectral mask-based noise estimation
- T. Yoshioka and T. Nakatani, "A microphone array system integrating beamforming, feature enhancement, and spectral mask-based noise estimation," Proc. Hands-Free Speech Commun., Microphone Arrays, pp. 219-224, 2011
- (2011) Proc. Hands-Free Speech Commun., Microphone Arrays , pp. 219-224
- Yoshioka, T.¹ Nakatani, T.²

14
- 84890498342
- Noise model transfer using affine transformation with application to large vocabulary reverberant speech recognition
- Accepted for publication
- T. Yoshioka and T. Nakatani, "Noise model transfer using affine transformation with application to large vocabulary reverberant speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2013, accepted for publication
- (2013) Proc. Int. Conf. Acoust., Speech, Signal Process
- Yoshioka, T.¹ Nakatani, T.²

15
- 34547553730
- Ph.D. dissertation Univ. of Cambridge, Cambridge, U.K
- H. Liao, "Uncertainty decoding for noise robust speech recognition," Ph.D. dissertation, Univ. of Cambridge, Cambridge, U.K., 2007
- (2007) Uncertainty Decoding for Noise Robust Speech Recognition
- Liao, H.¹

16
- 85009252959
- Double the trouble: Handling noise and reverberation in far-field automatic speech recognition
- D. Gelbart and N. Morgan, "Double the trouble: handling noise and reverberation in far-field automatic speech recognition," in Proc. Int. Conf. Spoken Lang. Process., 2002, pp. 2185-2188
- (2002) Proc. Int. Conf. Spoken Lang. Process , pp. 2185-2188
- Gelbart, D.¹ Morgan, N.²

17
- 40249089621
- Speech enhancement and recognition in meetings with an audio-visual sensor array
- Nov
- H. K. Maganti, D. Gatica-Perez, and I. McCowan, "Speech enhancement and recognition in meetings with an audio-visual sensor array," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2257-2269, Nov. 2007
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2257-2269
- Maganti, H.K.¹ Gatica-Perez, D.² McCowan, I.³

18
- 85008590333
- Low-latency real-time meeting recognition and understanding using distant microphones and omni-directional camera
- Feb
- T. Hori, S. Araki, T. Yoshioka, M. Fujimoto, S. Watanabe, T. Oba, A. Ogawa, K. Otsuka, D. Mikami, K. Kinoshita, T. Nakatani, A. Nakamura, and J. Yamato, "Low-latency real-time meeting recognition and understanding using distant microphones and omni-directional camera," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 2, pp. 499-513, Feb. 2011
- (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.20 , Issue.2 , pp. 499-513
- Hori, T.¹ Araki, S.² Yoshioka, T.³ Fujimoto, M.⁴ Watanabe, S.⁵ Oba, T.⁶ Ogawa, A.⁷ Otsuka, K.⁸ Mikami, D.⁹ Kinoshita, K.¹⁰ Nakatani, T.¹¹ Nakamura, A.¹² Yamato, J.¹³

19
- 84881063883
- A neural network based regression approach for recognizing simultaneous speech
- W. Li, K. Kumatani, J. Dines, M. M. Doss, and H. Bourlard, "A neural network based regression approach for recognizing simultaneous speech," Proc. Mach. Learn. Multimodal Interact., pp. 110-118, 2007
- (2007) Proc. Mach. Learn. Multimodal Interact , pp. 110-118
- Li, W.¹ Kumatani, K.² Dines, J.³ Doss, M.M.⁴ Bourlard, H.⁵

20
- 33745207361
- A Japanese national project on spontaneous speech corpus and processing technology
- S. Furui, K. Maekawa, and H. Isahara, "A Japanese national project on spontaneous speech corpus and processing technology," in Proc. Autom. Speech Recognition Workshop, 2000, pp. 244-248
- (2000) Proc. Autom. Speech Recognition Workshop , pp. 244-248
- Furui, S.¹ Maekawa, K.² Isahara, H.³

21
- 78049409757
- Discriminative training based on an integrated view of MPE and MMI in margin and error space
- E. McDermott, S. Watanabe, and A. Nakamura, "Discriminative training based on an integrated view of MPE and MMI in margin and error space," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2010, pp. 4894-4897
- (2010) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 4894-4897
- McDermott, E.¹ Watanabe, S.² Nakamura, A.³

22
- 77955673019
- Model-based feature enhancement for reverberant speech recognition
- Sep
- A. Krueger and R. Haeb-Umbach, "Model-based feature enhancement for reverberant speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1692-1707, Sep. 2010
- (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.7 , pp. 1692-1707
- Krueger, A.¹ Haeb-Umbach, R.²

23
- 77955683144
- Reverberation model-based decoding in the logmelspec domain for robust distant-talking speech recognition
- Sep
- A. Sehr, R. Maas, and W. Kellermann, "Reverberation model-based decoding in the logmelspec domain for robust distant-talking speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1676-1691, Sep. 2010
- (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.7 , pp. 1676-1691
- Sehr, A.¹ Maas, R.² Kellermann, W.³

24
- 79961153040
- Model-based approaches to handling additive noise in reverberant environments
- M. J. F. Gales and J.-Q. Wang, "Model-based approaches to handling additive noise in reverberant environments," in Proc. Joint Workshop Hands-Free Speech Commun. Microphone Arrays, 2011, pp. 121-126
- (2011) Proc. Joint Workshop Hands-Free Speech Commun. Microphone Arrays , pp. 121-126
- Gales, M.J.F.¹ Wang, J.-Q.²

25
- 85032751613
- Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition
- Aug
- T.Yoshioka, A. Sehr,M.Delcroix, K. Kinoshita, R.Maas, T. Nakatani, and W. Kellermann, "Making machines understand us in reverberant rooms: robustness against reverberation for automatic speech recognition," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 114-126, Aug. 2012
- (2012) IEEE Signal Process. Mag , vol.29 , Issue.6 , pp. 114-126
- Yoshioka, T.¹ Sehr, A.² Delcroix, M.³ Kinoshita, K.⁴ Maas, R.⁵ Nakatani, T.⁶ Kellermann, W.⁷

26
- 0003870155
- 5th ed. Oxford U.K.: Spon
- H. Kuttruff, Room Acoustics, 5th ed. Oxford, U.K.: Spon, 2009
- (2009) Room Acoustics
- Kuttruff, H.¹

27
- 14344274593
- A new method based on spectral subtraction for speech dereverberation
- K. Lebart, J. M. Boucher, and P. N. Denbigh, "A new method based on spectral subtraction for speech dereverberation," Acta Acust. United Acust., vol. 87, pp. 359-366, 2001 (Pubitemid 32699291)
- (2001) Acta Acustica united with Acustica , vol.87 , Issue.3 , pp. 359-366
- Lebart, K.¹ Boucher, J.M.² Denbigh, P.N.³

28
- 85009242725
- Evaluation of a noise-robust DSR front-end on Aurora databases
- D. Macho, L. Mauuary, B. Noé, Y. M. Cheng, D. Ealey, D. Jouvet, H. Kelleher, D. Pearce, and F. Saadoun, "Evaluation of a noise-robust DSR front-end on Aurora databases," in Proc. Int. Conf. Spoken Lang. Process., 2002, pp. 17-20
- (2002) Proc. Int. Conf. Spoken Lang. Process , pp. 17-20
- Macho, D.¹ Mauuary, L.² Noé, B.³ Cheng, Y.M.⁴ Ealey, D.⁵ Jouvet, D.⁶ Kelleher, H.⁷ Pearce, D.⁸ Saadoun, F.⁹

29
- 84867584057
- On the application of reverberation suppression to robust speech recognition
- R. Maas, E. A. P. Habets, A. Sehr, and W. Kellermann, "On the application of reverberation suppression to robust speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2012, pp. 297-300
- Proc. Int. Conf. Acoust., Speech, Signal Process , vol.2012 , pp. 297-300
- Maas, R.¹ Habets, E.A.P.² Sehr, A.³ Kellermann, W.⁴

30
- 51449084820
- Ph.D. dissertation Eindhoven Univ. of Technol., Eindhoven, The Netherlands
- E. A. P. Habets, "Single- andmulti-microphone speech dereverberation using spectral enhancement," Ph.D. dissertation, Eindhoven Univ. of Technol., Eindhoven, The Netherlands, 2006
- (2006) Single- Andmulti-microphone Speech Dereverberation Using Spectral Enhancement
- Habets, E.A.P.¹

31
- 70350435249
- Integrated speech enhancement method using noise suppression and dereverberation
- Feb
- T. Yoshioka, T. Nakatani, and M. Miyoshi, "Integrated speech enhancement method using noise suppression and dereverberation," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 2, pp. 231-246, Feb. 2009
- (2009) IEEE Trans. Audio, Speech, Lang. Process , vol.17 , Issue.2 , pp. 231-246
- Yoshioka, T.¹ Nakatani, T.² Miyoshi, M.³

32
- 77955680097
- Correlation-based and model-based blind single-channel late-reverberation suppression in noisy time-varying acoustical environments
- Sep
- J. S. Erkelens and R. Heusdens, "Correlation-based and model-based blind single-channel late-reverberation suppression in noisy time-varying acoustical environments," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 7, pp. 1746-1765, Sep. 2010
- (2010) IEEE Trans. Audio, Speech, Lang. Process , vol.18 , Issue.7 , pp. 1746-1765
- Erkelens, J.S.¹ Heusdens, R.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.