메뉴 건너뛰기




Volumn 2015-January, Issue , 2015, Pages 2449-2453

Combating reverberation in large vocabulary continuous speech recognition

Author keywords

Deep neural networks; Reverberation robustness; Robust features; Robust speech recognition

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; MODELING LANGUAGES; REVERBERATION; SPEECH; SPEECH COMMUNICATION; VOCABULARY CONTROL;

EID: 84959111702     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (3)

References (33)
  • 3
    • 0028478507 scopus 로고
    • Combined acoustic echo cancellation, dereverberation and noise reduction: A two microphone approach
    • R. Martin and P. Vary, "Combined Acoustic Echo Cancellation, Dereverberation and Noise Reduction: A Two Microphone Approach, " Journal of Annales des Telecommunications, Vol. 49, Iss. 7-8, pp. 429-438, 1994.
    • (1994) Journal of Annales des Telecommunications , vol.49 , Issue.7-8 , pp. 429-438
    • Martin, R.1    Vary, P.2
  • 4
    • 84964511330 scopus 로고    scopus 로고
    • Single channel blind dereverberation based on auto-correlation functions of frame-wise time sequences of frequency components
    • K. Ohta and M. Yanagida, "Single Channel Blind Dereverberation Based on Auto-Correlation Functions of Frame-Wise Time Sequences of Frequency Components, " Proc. of IWAENC, pp. 1-4, 2006.
    • (2006) Proc. of IWAENC , pp. 1-4
    • Ohta, K.1    Yanagida, M.2
  • 5
    • 33745761716 scopus 로고    scopus 로고
    • A two-stage algorithm for one-microphone reverberant speech enhancement
    • M. Wu and D. L. Wang, "A Two-Stage Algorithm for One-Microphone Reverberant Speech Enhancement, " IEEE Trans. Aud. Speech & Lang. Process., Vol. 14, No. 3, pp. 774-784, 2006.
    • (2006) IEEE Trans. Aud. Speech & Lang. Process. , vol.14 , Issue.3 , pp. 774-784
    • Wu, M.1    Wang, D.L.2
  • 6
    • 4544336156 scopus 로고    scopus 로고
    • Robust automatic speech recognition in reverberant environments by model selection
    • L. Couvreur and C. Couvreur, "Robust Automatic Speech Recognition in Reverberant Environments by Model Selection, " Proc. of HSC, pp. 147-150, 2001.
    • (2001) Proc. of HSC , pp. 147-150
    • Couvreur, L.1    Couvreur, C.2
  • 7
    • 34547517494 scopus 로고    scopus 로고
    • A new concept for feature-domain dereverberation for robust distant-talking ASR
    • A. Sehr and W. Kellermann, "A New Concept for Feature-Domain Dereverberation for Robust Distant-Talking ASR, " Proc. of ICASSP, pp. 369-372, 2007.
    • (2007) Proc. of ICASSP , pp. 369-372
    • Sehr, A.1    Kellermann, W.2
  • 8
    • 70350450398 scopus 로고    scopus 로고
    • Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing
    • M. Delcroix and S. Watanabe, "Static and Dynamic Variance Compensation for Recognition of Reverberant Speech with Dereverberation Preprocessing, " IEEE Trans. on Aud. Speech & Lang. Process., Vol. 17, No. 2, pp. 324-334, 2009.
    • (2009) IEEE Trans. on Aud. Speech & Lang. Process. , vol.17 , Issue.2 , pp. 324-334
    • Delcroix, M.1    Watanabe, S.2
  • 9
    • 84928158251 scopus 로고    scopus 로고
    • Use of multiple front-ends and i-vector-based speaker adaptation for robust speech recognition
    • Md. J. Alam, V. Gupta, P. Kenny, P. Dumouchel, "Use Of Multiple Front-Ends And I-Vector-Based Speaker Adaptation For Robust Speech Recognition, " Proc. of REVERB Challenge, 2014.
    • (2014) Proc. of REVERB Challenge
    • Alam, M.J.1    Gupta, V.2    Kenny, P.3    Dumouchel, P.4
  • 12
    • 84055211743 scopus 로고    scopus 로고
    • Acoustic modeling using deep belief networks
    • A. Mohamed, G. E. Dahl and G. Hinton, "Acoustic modeling using deep belief networks, " IEEE Trans. on ASLP, Vol. 20, no. 1, pp. 14-22, 2012.
    • (2012) IEEE Trans. on ASLP , vol.20 , Issue.1 , pp. 14-22
    • Mohamed, A.1    Dahl, G.E.2    Hinton, G.3
  • 14
    • 84910075252 scopus 로고    scopus 로고
    • Evaluating robust features on Deep Neural Networks for speech recognition in noisy and channel mismatched conditions
    • V. Mitra, W. Wang, H. Franco, Y. Lei, C. Bartels, M. Graciarena, "Evaluating robust features on Deep Neural Networks for speech recognition in noisy and channel mismatched conditions, " in Proc. of Interspeech, pp. 895-899, 2014.
    • (2014) Proc. of Interspeech , pp. 895-899
    • Mitra, V.1    Wang, W.2    Franco, H.3    Lei, Y.4    Bartels, C.5    Graciarena, M.6
  • 15
    • 84893691530 scopus 로고    scopus 로고
    • Speaker adaptation of neural network acoustic models using ivectors
    • G. Saon, H. Soltau, D. Nahamoo and M. Picheny, "Speaker Adaptation of Neural Network Acoustic Models using Ivectors, " Proc. ASRU, 2013.
    • (2013) Proc. ASRU
    • Saon, G.1    Soltau, H.2    Nahamoo, D.3    Picheny, M.4
  • 17
    • 0028996854 scopus 로고
    • WSJCAM0: A british english speech corpus for large vocabulary continuous speech recognition
    • T. Robinson, J. Fransen, D. Pye, J. Foote and S. Renals, "WSJCAM0: A British English Speech Corpus for Large Vocabulary Continuous Speech Recognition, " Proc. ICASSP, pp. 81-84, 1995.
    • (1995) Proc. ICASSP , pp. 81-84
    • Robinson, T.1    Fransen, J.2    Pye, D.3    Foote, J.4    Renals, S.5
  • 19
    • 84906260861 scopus 로고    scopus 로고
    • Damped oscillator cepstral coefficients for robust speech recognition
    • V. Mitra, H. Franco and M. Graciarena, "Damped Oscillator Cepstral Coefficients for Robust Speech Recognition, " Proc. of Interspeech, pp. 886-890, 2013.
    • (2013) Proc. of Interspeech , pp. 886-890
    • Mitra, V.1    Franco, H.2    Graciarena, M.3
  • 20
    • 84867589420 scopus 로고    scopus 로고
    • Normalized amplitude modulation features for large vocabulary noise-robust speech recognition
    • V. Mitra, H. Franco, M. Graciarena, and A. Mandal, "Normalized Amplitude Modulation Features for Large Vocabulary Noise-Robust Speech Recognition, " Proc. of ICASSP, pp. 4117-4120, 2012.
    • (2012) Proc. of ICASSP , pp. 4117-4120
    • Mitra, V.1    Franco, H.2    Graciarena, M.3    Mandal, A.4
  • 21
    • 0027676955 scopus 로고
    • Energy separation in signal modulations with application to speech analysis
    • P. Maragos, J. Kaiser and T. Quatieri, "Energy Separation in Signal Modulations with Application to Speech Analysis, " IEEE Trans. Signal Processing, Vol. 41, pp. 3024-3051, 1993.
    • (1993) IEEE Trans. Signal Processing , vol.41 , pp. 3024-3051
    • Maragos, P.1    Kaiser, J.2    Quatieri, T.3
  • 22
    • 84905269267 scopus 로고    scopus 로고
    • Medium duration modulation cepstral feature for robust speech recognition
    • Florence
    • V. Mitra, H. Franco, M. Graciarena, D. Vergyri, "Medium duration modulation cepstral feature for robust speech recognition, " Proc. of ICASSP, Florence, 2014.
    • (2014) Proc. of ICASSP
    • Mitra, V.1    Franco, H.2    Graciarena, M.3    Vergyri, D.4
  • 24
    • 0019075685 scopus 로고
    • Some observations on oral air flow during phonation
    • H. Teager, "Some Observations on Oral Air Flow During Phonation, " in IEEE Trans. ASSP, pp. 599-601, 1980.
    • (1980) IEEE Trans. ASSP , pp. 599-601
    • Teager, H.1
  • 26
    • 0030638031 scopus 로고    scopus 로고
    • A post-processing system to yield reduced word error rates: Recognizer output voting error reduction. (ROVER)
    • J. G. Fiscus, "A Post-Processing System to Yield Reduced Word Error Rates: Recognizer Output Voting Error Reduction. (ROVER), " Proc. of ASRU, pp. 347-354, 1997.
    • (1997) Proc. of ASRU , pp. 347-354
    • Fiscus, J.G.1
  • 27
    • 84867605836 scopus 로고    scopus 로고
    • Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition
    • O. Abdel-Hamid, A. Mohamed, H. Jiang, and G. Penn, "Applying convolutional neural networks concepts to hybrid NN-HMM model for speech recognition, " Proc. of ICASSP, pp. 4277-4280, 2012.
    • (2012) Proc. of ICASSP , pp. 4277-4280
    • Abdel-Hamid, O.1    Mohamed, A.2    Jiang, H.3    Penn, G.4
  • 29
    • 80051639873 scopus 로고    scopus 로고
    • Gammatone sub-band magnitude-domain dereverberation for ASR
    • K. Kumar, R. Singh, B. Raj, R. Stern, R., "Gammatone sub-band magnitude-domain dereverberation for ASR, " Proc. of ICASSP, pp. 4604-4607, 2011.
    • (2011) Proc. of ICASSP , pp. 4604-4607
    • Kumar, K.1    Singh, R.2    Raj, B.3    Stern, R.R.4
  • 30
    • 84891308106 scopus 로고    scopus 로고
    • SRILM-an extensible language modeling toolkit
    • A. Stolcke, "SRILM-An Extensible Language Modeling Toolkit, " Proc. of ICSLP 2002, pp. 901-904, 2002.
    • (2002) Proc. of ICSLP 2002 , pp. 901-904
    • Stolcke, A.1
  • 31
    • 70349215697 scopus 로고    scopus 로고
    • Data-driven lexicon expansion for Mandarin broadcast news and conversation speech recognition
    • X. Lei and W. Wang and A. Stolcke, "Data-driven Lexicon Expansion for Mandarin Broadcast News and Conversation Speech Recognition, " Proc. of ICASSP, 2009.
    • (2009) Proc. of ICASSP
    • Lei, X.1    Wang, W.2    Stolcke, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.