메뉴 건너뛰기




Volumn 22, Issue 2, 2014, Pages 368-380

Estimating uncertainty to improve exemplar-based feature enhancement for noise robust speech recognition

Author keywords

Exemplar based; Noise robustness; Observation uncertainty; Speech recognition; Uncertainty estimation

Indexed keywords

ESTIMATION; SPEECH RECOGNITION; ACOUSTIC NOISE; GAUSSIAN DISTRIBUTION; HUMAN COMPUTER INTERACTION; SEPARATION; SOURCE SEPARATION; SPEECH; UNCERTAINTY ANALYSIS;

EID: 84897933562     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASLP.2013.2292328     Document Type: Article
Times cited : (8)

References (28)
  • 1
    • 18744401086 scopus 로고    scopus 로고
    • Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
    • DOI 10.1109/TSA.2005.845814
    • L. Deng, J. Droppo, and A. Acero, "Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 412-421, May 2005. (Pubitemid 40666175)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.3 , pp. 412-421
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 3
    • 40249103761 scopus 로고    scopus 로고
    • Issues with uncertainty decoding for noise robust automatic speech recognition
    • H. Liao and M. J. F. Gales, "Issues with uncertainty decoding for noise robust automatic speech recognition," Speech Commun., vol. 50, no. 4, pp. 265-277, 2008.
    • (2008) Speech Commun. , vol.50 , Issue.4 , pp. 265-277
    • Liao, H.1    Gales, M.J.F.2
  • 4
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • [Online]. Available
    • M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and unreliable acoustic data," Speech Commun., vol. 34, no. 3, pp. 267-285, 2001 [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0167639300000340
    • (2001) Speech Commun. , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 5
    • 56249136428 scopus 로고    scopus 로고
    • Transforming binary uncertainties for robust speech recognition
    • Sep.
    • S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 2130-2140, Sep. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 2130-2140
    • Srinivasan, S.1    Wang, D.L.2
  • 6
    • 77954583785 scopus 로고    scopus 로고
    • Independent component analysis and time-frequency masking for speech recognition in multitalker conditions
    • [Online]. Available
    • D. Kolossa, R. Fernandez Astudillo, E. Hoffmann, and R. Orglmeister, "Independent component analysis and time-frequency masking for speech recognition in multitalker conditions," EURASIP J. Audio, Speech, Music Process., vol. 2010, no. 1, p. 651420, 2010 [Online]. Available: http://asmp.eurasipjournals.com/content/2010/1/651420
    • (2010) EURASIP J. Audio, Speech, Music Process. , vol.2010 , Issue.1 , pp. 651420
    • Kolossa, D.1    Fernandez Astudillo, R.2    Hoffmann, E.3    Orglmeister, R.4
  • 7
    • 69249159165 scopus 로고    scopus 로고
    • A computational auditory scene analysis system for speech segregation and robust speech recognition
    • [Online]. Available: Speech Separation and Recognition Challenge
    • Y. Shao, S. Srinivasan, Z. Jin, and D. Wang, "A computational auditory scene analysis system for speech segregation and robust speech recognition," Comput. Speech Lang., vol. 24, no. 1, pp. 77-93, 2010 [Online]. Available: http://www.sciencedirect.com/science/article/pii/ S088523080800020X, Speech Separation and Recognition Challenge
    • (2010) Comput. Speech Lang. , vol.24 , Issue.1 , pp. 77-93
    • Shao, Y.1    Srinivasan, S.2    Jin, Z.3    Wang, D.4
  • 8
    • 84940448917 scopus 로고    scopus 로고
    • Uncertainty-based learning of acoustic models from noisy data
    • Jul. [Online]. Available
    • A. Ozerov, M. Lagrange, and E. Vincent, "Uncertainty-based learning of acoustic models from noisy data," Comput. Speech Lang., Jul. 2012 [Online]. Available: http://www.sciencedirect.com/science/article/pii/ S0885230812000502
    • (2012) Comput. Speech Lang.
    • Ozerov, A.1    Lagrange, M.2    Vincent, E.3
  • 9
    • 84893328634 scopus 로고    scopus 로고
    • Integration of beamforming and uncertainty-of-observation techniques for robust asr in multi-source environments
    • [Online]. Available: special on Speech Separation and Recognition in Multisource Environments
    • R. F. Astudillo, D. Kolossa, A. Abad, S. Zeiler, R. Saeidi, P. Mowlaee, J. P. da Silva Neto, and R.Martin, "Integration of beamforming and uncertainty-of-observation techniques for robust asr in multi-source environments," Comput. Speech Lang., vol. 27, no. 3, pp. 837-850, 2013 [Online]. Available: http://www.sciencedirect.com/science/article/pii/ S0885230812000575, special on Speech Separation and Recognition in Multisource Environments
    • (2013) Comput. Speech Lang. , vol.27 , Issue.3 , pp. 837-850
    • Astudillo, R.F.1    Kolossa, D.2    Abad, A.3    Zeiler, S.4    Saeidi, R.5    Mowlaee, P.6    Da Silva Neto, J.P.7    Martin, R.8
  • 12
    • 79960657803 scopus 로고    scopus 로고
    • Exemplar-based sparse representations for noise robust automatic speech recognition
    • Sep.
    • J. Gemmeke, T. Virtanen, and A. Hurmalainen, "Exemplar-based sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2067-2080, Sep. 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process. , vol.19 , Issue.7 , pp. 2067-2080
    • Gemmeke, J.1    Virtanen, T.2    Hurmalainen, A.3
  • 15
    • 79959818117 scopus 로고    scopus 로고
    • Non-negative matrix factorization based compensation of music for automatic speech recognition
    • B. Raj, T. Virtanen, S. Chaudhure, and R. Singh, "Non-negative matrix factorization based compensation of music for automatic speech recognition," in Proc. INTERSPEECH, 2010, pp. 717-720.
    • Proc. INTERSPEECH, 2010 , pp. 717-720
    • Raj, B.1    Virtanen, T.2    Chaudhure, S.3    Singh, R.4
  • 16
    • 2942539074 scopus 로고    scopus 로고
    • Techniques for handling convolutional distortion with 'missing data' automatic speech recognition
    • [Online]. Available
    • K. J. Palomäki, G. J. Brown, and J. P. Barker, "Techniques for handling convolutional distortion with 'missing data' automatic speech recognition," Speech Commun., vol. 43, no. 12, pp. 123-142, 2004 [Online]. Available: http://www.sciencedirect.com/science/article/pii/S0167639304000238
    • (2004) Speech Commun. , vol.43 , Issue.12 , pp. 123-142
    • Palomäki, K.J.1    Brown, G.J.2    Barker, J.P.3
  • 20
    • 79959834868 scopus 로고    scopus 로고
    • Artificial and online acquired noise dictionaries for noise robust ASR
    • J. F. Gemmeke and T. Virtanen, "Artificial and online acquired noise dictionaries for noise robust ASR," in Proc. INTERSPEECH, 2010, pp. 2082-2085.
    • Proc. INTERSPEECH, 2010 , pp. 2082-2085
    • Gemmeke, J.F.1    Virtanen, T.2
  • 21
    • 0034842343 scopus 로고    scopus 로고
    • Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system
    • J. Droppo, A. Acero, and L. Deng, "Efficient on-line acoustic environment estimation for FCDCN in a continuous speech recognition system," in Proc. ICASSP, Salt Lake City, UT, USA, May 2001.
    • Proc. ICASSP, Salt Lake City, UT, USA, May 2001
    • Droppo, J.1    Acero, A.2    Deng, L.3
  • 22
    • 85009089669 scopus 로고    scopus 로고
    • Duration modeling techniques for continuous speech recognition
    • J. Pylkkönen and M. Kurimo, "Duration modeling techniques for continuous speech recognition," in Proc. INTERSPEECH, 2004, pp. 385-388.
    • Proc. INTERSPEECH, 2004 , pp. 385-388
    • Pylkkönen, J.1    Kurimo, M.2
  • 23
    • 33746524944 scopus 로고    scopus 로고
    • Unlimited vocabulary speech recognition with morph language models applied to Finnish
    • DOI 10.1016/j.csl.2005.07.002, PII S0885230805000331
    • T. Hirsimäki, M. Creutz, V. Siivola, M. Kurimo, S. Virpioja, and S. Pylkkönen, "Unlimited vocabulary speech recognition with morph language models applied to Finnish," Comput. Speech Lang., vol. 20, no. 4, pp. 515-541, 2006. (Pubitemid 44142005)
    • (2006) Computer Speech and Language , vol.20 , Issue.4 , pp. 515-541
    • Hirsimaki, T.1    Creutz, M.2    Siivola, V.3    Kurimo, M.4    Virpioja, S.5    Pylkkonen, J.6
  • 25
    • 85009154856 scopus 로고    scopus 로고
    • Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement
    • V. Stouten, H. V. Hamme, and P. Wambacq, "Accounting for the uncertainty of speech estimates in the context of model-based feature enhancement," in Proc. ICSLP, Jeju Island, Korea, Oct. 2004.
    • Proc. ICSLP, Jeju Island, Korea, Oct. 2004
    • Stouten, V.1    Hamme, H.V.2    Wambacq, P.3
  • 27
    • 51449111646 scopus 로고    scopus 로고
    • Bayesian extensions to non-negative matrix factorisation for audio signal modelling
    • T. Virtanen, A. Cemgil, and S. Godsill, "Bayesian extensions to non-negative matrix factorisation for audio signal modelling," in Proc. ICASSP, Las Vegas, NV, USA, 2008, pp. 1825-1828.
    • Proc. ICASSP, Las Vegas, NV, USA, 2008 , pp. 1825-1828
    • Virtanen, T.1    Cemgil, A.2    Godsill, S.3
  • 28
    • 79959845286 scopus 로고    scopus 로고
    • The CHiME corpus: A resource and challenge for Computational Hearing in Multisource Environments
    • H. Christensen, J. Barker, N. Ma, and P. Green, "The CHiME corpus: A resource and challenge for Computational Hearing in Multisource Environments," in Proc. INTERSPEECH, Makuhari, Japan, 2010, pp. 1918-1921.
    • Proc. INTERSPEECH, Makuhari, Japan, 2010 , pp. 1918-1921
    • Christensen, H.1    Barker, J.2    Ma, N.3    Green, P.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.