메뉴 건너뛰기




Volumn , Issue , 2013, Pages 7092-7096

Ideal ratio mask estimation using deep neural networks for robust speech recognition

Author keywords

Aurora 4; Computational Auditory Scene Analysis; instantaneous SNR; noise robust ASR

Indexed keywords

AURORA-4; COMPUTATIONAL AUDITORY SCENE ANALYSIS; DEEP NEURAL NETWORKS; INSTANTANEOUS SNR; MULTI-CONDITION TRAININGS; NOISE ROBUST ASR; ROBUST AUTOMATIC SPEECH RECOGNITIONS (ASR); ROBUST SPEECH RECOGNITION;

EID: 84890493989     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2013.6639038     Document Type: Conference Paper
Times cited : (596)

References (25)
  • 4
    • 0032050110 scopus 로고    scopus 로고
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Computer speech and language, vol. 12, no. 2, pp. 75-98, 1998
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 6
    • 62249130045 scopus 로고    scopus 로고
    • A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions
    • J. Li, L. Deng, D. Yu, Y. Gong, and A. Acero, "A unified framework of HMM adaptation with joint compensation of additive and convolutive distortions," Computer, Speech, and Language, vol. 23, pp. 389-405, 2009
    • (2009) Computer, Speech, and Language , vol.23 , pp. 389-405
    • Li, J.1    Deng, L.2    Yu, D.3    Gong, Y.4    Acero, A.5
  • 7
    • 85032752225 scopus 로고    scopus 로고
    • Missing-feature approaches in speech recognition
    • B. Raj and R. Stern, "Missing-feature approaches in speech recognition," IEEE Signal Processing Magazine, vol. 22, no. 5, pp. 101-116, 2005
    • (2005) IEEE Signal Processing Magazine , vol.22 , Issue.5 , pp. 101-116
    • Raj, B.1    Stern, R.2
  • 11
    • 84892233308 scopus 로고    scopus 로고
    • On ideal binary masks as the computational goal of auditory scene analysis
    • P. Divenyi, Ed.Kluwer Academic, Boston, MA
    • D. L.Wang, "On ideal binary masks as the computational goal of auditory scene analysis," in Speech Separation by Humans and Machines, P. Divenyi, Ed., pp. 181-197. Kluwer Academic, Boston, MA, 2005
    • (2005) Speech Separation by Humans and Machines , pp. 181-197
    • Wang, D.L.1
  • 12
    • 84877594942 scopus 로고    scopus 로고
    • Tech. Rep. OSU-CISRC-7/11-TR21, Department of Computer Science and Engineering, The Ohio State University, Columbus, Ohio, USA
    • W. Hartmann, A. Narayanan, E. Fosler-Lussier, and D. L. Wang, "Nothing doing: Re-evaluating missing feature ASR," Tech. Rep. OSU-CISRC-7/11-TR21, Department of Computer Science and Engineering, The Ohio State University, Columbus, Ohio, USA, 2011, Available: ftp://ftp.cse.ohiostate. edu/pub/tech-report/2011
    • (2011) Nothing Doing: Re-evaluating Missing Feature ASR
    • Hartmann, W.1    Narayanan, A.2    Fosler-Lussier, E.3    Wang, D.L.4
  • 14
    • 4644317224 scopus 로고    scopus 로고
    • A Bayesian classifer for spectrographic mask estimation for missing feature speech recognition
    • M. L. Seltzer, B. Raj, and R. M. Stern, "A Bayesian classifer for spectrographic mask estimation for missing feature speech recognition," Speech Communication, vol. 43, no. 4, pp. 379-393, 2004
    • (2004) Speech Communication , vol.43 , Issue.4 , pp. 379-393
    • Seltzer, M.L.1    Raj, B.2    Stern, R.M.3
  • 15
    • 33750311718 scopus 로고    scopus 로고
    • Binary and ratio time-frequency masks for robust speech recognition
    • S. Srinivasan, N. Roman, and D. L. Wang, "Binary and ratio time-frequency masks for robust speech recognition," Speech Communication, vol. 48, pp. 1486-1501, 2006
    • (2006) Speech Communication , vol.48 , pp. 1486-1501
    • Srinivasan, S.1    Roman, N.2    Wang, D.L.3
  • 18
    • 0038712550 scopus 로고    scopus 로고
    • SNR estimation based on amplitude modulation analysis with applications to noise suppression
    • J. Tchorz and B. Kollmeier, "SNR estimation based on amplitude modulation analysis with applications to noise suppression," IEEE Transactions on Audio, Speech, and Signal Processing, vol. 11, pp. 184-192, 2003
    • (2003) IEEE Transactions on Audio, Speech, and Signal Processing , vol.11 , pp. 184-192
    • Tchorz, J.1    Kollmeier, B.2
  • 22
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • G.E. Hinton, S. Osindero, and Y.W. Teh, "A fast learning algorithm for deep belief nets," Neural computation, vol. 18, no. 7, pp. 1527-1554, 2006
    • (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.W.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.