메뉴 건너뛰기




Volumn , Issue , 2014, Pages 5527-5531

Impact of single-microphone dereverberation on DNN-based meeting transcription systems

Author keywords

deep neural network; Environmental robustness; meeting transcription; reverberation; single distant microphone

Indexed keywords

MICROPHONES; REVERBERATION; SPEECH RECOGNITION; TRANSCRIPTION;

EID: 84905247922     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854660     Document Type: Conference Paper
Times cited : (22)

References (21)
  • 2
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependent pre-trained deep neural networks for large-vocabulary speech recognition," IEEE Trans. Audio, Speech, Language Process., vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) IEEE Trans. Audio, Speech, Language Process , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 6
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comp. Speech, Language, vol. 12, no. 2, pp. 75-98, 1998.
    • (1998) Comp. Speech, Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 7
    • 84890492030 scopus 로고    scopus 로고
    • An investigation of deep neural networks for noise robust speech recognition
    • M. L. Seltzer, D. Yu, and Y. Wang, "An investigation of deep neural networks for noise robust speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2013, pp. 7398-7402.
    • (2013) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 7398-7402
    • Seltzer, M.L.1    Yu, D.2    Wang, Y.3
  • 8
    • 66149101303 scopus 로고    scopus 로고
    • Robust speech recognition using a cepstral minimum-meansquare-error-motivated noise suppressor
    • D. Yu, L. Deng, J. Droppo, J. Wu, Y. Gong, and A. Acero, "Robust speech recognition using a cepstral minimum-meansquare-error-motivated noise suppressor," IEEE Trans. Audio, Speech, Language Process., vol. 16, no. 5, pp. 1061-1070, 2008.
    • (2008) IEEE Trans. Audio, Speech, Language Process , vol.16 , Issue.5 , pp. 1061-1070
    • Yu, D.1    Deng, L.2    Droppo, J.3    Wu, J.4    Gong, Y.5    Acero, A.6
  • 9
    • 84890532503 scopus 로고    scopus 로고
    • Noise adaptive front-end normalization based on vector Taylor series for deep neural networks in robust speech recognition
    • B. Li and K. C. Sim, "Noise adaptive front-end normalization based on vector Taylor series for deep neural networks in robust speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2013, pp. 7408-7412.
    • (2013) Proc. Int. Conf. Acoust., Speech, Signal Process , pp. 7408-7412
    • Li, B.1    Sim, K.C.2
  • 11
    • 85032751613 scopus 로고    scopus 로고
    • Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition
    • T. Yoshioka, A. Sehr, M. Delcroix, K. Kinoshita, R. Maas, T. Nakatani, and W. Kellermann, "Making machines understand us in reverberant rooms: Robustness against reverberation for automatic speech recognition," IEEE Signal Process. Mag., vol. 29, no. 6, pp. 114-126, 2012.
    • (2012) IEEE Signal Process. Mag , vol.29 , Issue.6 , pp. 114-126
    • Yoshioka, T.1    Sehr, A.2    Delcroix, M.3    Kinoshita, K.4    Maas, R.5    Nakatani, T.6    Kellermann, W.7
  • 13
  • 14
    • 84867693894 scopus 로고    scopus 로고
    • Generalization of multi-channel linear prediction methods for blind MIMO impulse response shortening
    • T. Yoshioka and T. Nakatani, "Generalization of multi-channel linear prediction methods for blind MIMO impulse response shortening," IEEE Trans. Audio, Speech, Language Process., vol. 20, no. 10, pp. 2707-2720, 2012.
    • (2012) IEEE Trans. Audio, Speech, Language Process , vol.20 , Issue.10 , pp. 2707-2720
    • Yoshioka, T.1    Nakatani, T.2
  • 17
    • 0032638856 scopus 로고    scopus 로고
    • Semi-tied covariance matrices for hidden Markov models
    • M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech, Audio Process., vol. 7, no. 3, pp. 272-281, 1999.
    • (1999) IEEE Trans. Speech, Audio Process , vol.7 , Issue.3 , pp. 272-281
    • Gales, M.J.F.1
  • 18
    • 0032289099 scopus 로고    scopus 로고
    • Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition
    • N. Kumar and A. G. Andreou, "Heteroscedastic discriminant analysis and reduced rank HMMs for improved speech recognition," Speech Commun., vol. 26, no. 14, pp. 283-297, 1998.
    • (1998) Speech Commun , vol.26 , Issue.14 , pp. 283-297
    • Kumar, N.1    Andreou, A.G.2
  • 19
    • 84858976070 scopus 로고    scopus 로고
    • Feature engineering in context-depencent deep neural networks for conversational speech transcription
    • F. Seide, G. Li, X. Chen, and D. Yu, "Feature engineering in context-depencent deep neural networks for conversational speech transcription," in Proc. Workshop. Automat. Speech Recognition, Understanding, 2011, pp. 24-29.
    • (2011) Proc. Workshop. Automat. Speech Recognition, Understanding , pp. 24-29
    • Seide, F.1    Li, G.2    Chen, X.3    Yu, D.4
  • 21
    • 84893675167 scopus 로고    scopus 로고
    • Model-based approaches to handling uncertainty
    • D. Kolossa and R. Haeb-Umbach, Eds Springer
    • M. J. F. Gales, "Model-based approaches to handling uncertainty," in Robust Speech Recognition of Uncertain or Missing Data, D. Kolossa and R. Haeb-Umbach, Eds., pp. 101-125. Springer, 2011.
    • (2011) Robust Speech Recognition of Uncertain or Missing Data , pp. 101-125
    • Gales, M.J.F.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.