메뉴 건너뛰기




Volumn , Issue , 2014, Pages 5507-5511

Extension of uncertainty propagation to dynamic MFCCS for noise robust ASR

Author keywords

Automatic speech recognition; noise robustness; uncertainty handling

Indexed keywords

ACOUSTIC NOISE; COVARIANCE MATRIX; SIGNAL PROCESSING; SPEECH RECOGNITION;

EID: 84905216197     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854656     Document Type: Conference Paper
Times cited : (10)

References (25)
  • 4
    • 84893704157 scopus 로고    scopus 로고
    • The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes
    • E. Vincent, J. Barker, S.Watanabe, J. Le Roux, F. Nesta, and M. Matassoni, "The second 'CHiME' speech separation and recognition challenge: An overview of challenge systems and outcomes," in Proc. ASRU, 2013.
    • (2013) Proc. ASRU
    • Vincent, E.1    Barker, J.2    Watanabe, S.3    Le Roux, J.4    Nesta, F.5    Matassoni, M.6
  • 6
    • 84867608537 scopus 로고    scopus 로고
    • Power-normalized cepstral coefficients (PNCC) for robust speech recognition
    • C. Kim and R. Stern, "Power-normalized cepstral coefficients (PNCC) for robust speech recognition," in Proc. ICASSP, 2012, pp. 4101-4104.
    • (2012) Proc. ICASSP , pp. 4101-4104
    • Kim, C.1    Stern, R.2
  • 7
    • 85009070292 scopus 로고    scopus 로고
    • Large vocabulary speech recognition under adverse acoustic environments
    • L. Deng, A. Acero, M. Plumpe, and X. D. Huang, "Large vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP, 2000, pp. 806-809.
    • (2000) Proc. ICSLP , pp. 806-809
    • Deng, L.1    Acero, A.2    Plumpe, M.3    Huang, X.D.4
  • 8
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • June
    • M. Cooke, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Communication, vol. 34, no. 3, pp. 267-285, June 2001.
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1
  • 9
    • 34547528168 scopus 로고    scopus 로고
    • Adaptive training with joint uncertainty decoding for robust recognition of noisy data
    • H. Liao and M. J. F. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data," in Proc. ICASSP, 2007, vol. 4, pp. 389-392.
    • (2007) Proc. ICASSP , vol.4 , pp. 389-392
    • Liao, H.1    Gales, M.J.F.2
  • 10
    • 70350450398 scopus 로고    scopus 로고
    • Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing
    • Jan
    • M. Delcroix, T. Nakatani, and S. Watanabe, "Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing," IEEE Transactions on Audio, Speech, and Language Processing, vol. 17, no. 2, pp. 324-334, Jan 2009.
    • (2009) IEEE Transactions on Audio, Speech, and Language Processing , vol.17 , Issue.2 , pp. 324-334
    • Delcroix, M.1    Nakatani, T.2    Watanabe, S.3
  • 14
    • 84890541336 scopus 로고    scopus 로고
    • Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments
    • H. Kallasjoki, S. Keronen, G. J. Brown, J. F. Gemmeke, U. Remes, and K. J. Palomaki, "Mask estimation and sparse imputation for missing data speech recognition in multisource reverberant environments," in Proc. CHiME, 2011, pp. 58-63.
    • (2011) Proc. CHiME , pp. 58-63
    • Kallasjoki, H.1    Keronen, S.2    Brown, G.J.3    Gemmeke, J.F.4    Remes, U.5    Palomaki, K.J.6
  • 15
    • 84893685019 scopus 로고    scopus 로고
    • A flexible spatial blind source extraction framework for robust speech recognition in noisy environments
    • F. Nesta, M. Matassoni, and R. Astudillo, "A flexible spatial blind source extraction framework for robust speech recognition in noisy environments," in Proc. CHiME, 2013, pp. 33-40.
    • (2013) Proc. CHiME , pp. 33-40
    • Nesta, F.1    Matassoni, M.2    Astudillo, R.3
  • 16
    • 18744401086 scopus 로고    scopus 로고
    • Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
    • May
    • L. Deng, J. Wu, J. Droppo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion," IEEE Transactions on Audio, Speech, and Language Processing, vol. 13, no. 3, pp. 412-421, May 2005.
    • (2005) IEEE Transactions on Audio, Speech, and Language Processing , vol.13 , Issue.3 , pp. 412-421
    • Deng, L.1    Wu, J.2    Droppo, J.3    Acero, A.4
  • 17
    • 84905275072 scopus 로고    scopus 로고
    • Uncertaintybased learning of acoustic models from noisy data
    • Feb
    • A. Ozerov, M. Lagrange, and E. Vincent, "Uncertaintybased learning of acoustic models from noisy data," Computer Speech and Language, vol. 27, no. 3, pp. 874-894, Feb. 2013.
    • (2013) Computer Speech and Language , vol.27 , Issue.3 , pp. 874-894
    • Ozerov, A.1    Lagrange, M.2    Vincent, E.3
  • 18
    • 84897584695 scopus 로고    scopus 로고
    • A general flexible framework for the handling of prior information in audio source separation
    • May
    • A. Ozerov, E. Vincent, and F. Bimbot, "A general flexible framework for the handling of prior information in audio source separation," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 4, pp. 1118-1133, May 2012.
    • (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.4 , pp. 1118-1133
    • Ozerov, A.1    Vincent, E.2    Bimbot, F.3
  • 21
    • 84939730902 scopus 로고
    • Mathematical analysis of random noise
    • S. Rice, "Mathematical analysis of random noise," Bell System Technical Journal, vol. 23, 1944.
    • (1944) Bell System Technical Journal , vol.23
    • Rice, S.1
  • 22
    • 0029725301 scopus 로고    scopus 로고
    • A vector Taylor series approach for environment-independent speech recognition
    • P. J. Moreno, B. Raj, and R. M. Stern, "A vector Taylor series approach for environment-independent speech recognition," in Proc. ICASSP, 1996, vol. 2, pp. 733-736.
    • (1996) Proc. ICASSP , vol.2 , pp. 733-736
    • Moreno, P.J.1    Raj, B.2    Stern, R.M.3
  • 23
    • 84905278580 scopus 로고    scopus 로고
    • S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK book, 2002
    • S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK book, 2002.
  • 24
    • 33847655586 scopus 로고    scopus 로고
    • A generalized divergence measure fon nonnegative matrix factorization
    • Mar
    • R. Kompass, "A generalized divergence measure fon nonnegative matrix factorization," Neural Computation, vol. 19, no. 3, pp. 780-791, Mar. 2007.
    • (2007) Neural Computation , vol.19 , Issue.3 , pp. 780-791
    • Kompass, R.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.