메뉴 건너뛰기




Volumn 27, Issue 3, 2013, Pages 874-894

Uncertainty-based learning of acoustic models from noisy data

Author keywords

Acoustic model; Classification; Expectation maximization; Gaussian mixture model; Hidden Markov model; Noisy data; Training; Uncertainty

Indexed keywords

AUDIO ACOUSTICS; CLASSIFICATION (OF INFORMATION); COMMUNICATION CHANNELS (INFORMATION THEORY); DECODING; GAUSSIAN DISTRIBUTION; HIDDEN MARKOV MODELS; IMAGE SEGMENTATION; MARKOV PROCESSES; MAXIMUM LIKELIHOOD; MAXIMUM LIKELIHOOD ESTIMATION; MAXIMUM PRINCIPLE; OBJECT RECOGNITION; PERSONNEL TRAINING; TRELLIS CODES; UNCERTAINTY ANALYSIS;

EID: 84905275072     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2012.07.002     Document Type: Article
Times cited : (21)

References (40)
  • 1
    • 84863754643 scopus 로고    scopus 로고
    • An uncertainty estimation approach for the extraction of individual source features in multisource recordings
    • K. Adiloʇlu, and E. Vincent An uncertainty estimation approach for the extraction of individual source features in multisource recordings EUSIPCO, 19th European Signal Processing Conference 2011 1663 1667
    • (2011) EUSIPCO, 19th European Signal Processing Conference , pp. 1663-1667
    • Adiloʇlu, K.1    Vincent, E.2
  • 2
    • 84858069176 scopus 로고    scopus 로고
    • A tractable framework for estimating and combining spectral source models for audio source separation
    • S. Arberet, A. Ozerov, F. Bimbot, and R. Gribonval A tractable framework for estimating and combining spectral source models for audio source separation Signal Processing 92 8 2012 1886 1901
    • (2012) Signal Processing , vol.92 , Issue.8 , pp. 1886-1901
    • Arberet, S.1    Ozerov, A.2    Bimbot, F.3    Gribonval, R.4
  • 5
    • 11144316019 scopus 로고    scopus 로고
    • Decoding speech in the presence of other sources
    • J.P. Barker, M.P. Cooke, and D.P.W. Ellis Decoding speech in the presence of other sources Speech Communication 45 1 2005 5 25
    • (2005) Speech Communication , vol.45 , Issue.1 , pp. 5-25
    • Barker, J.P.1    Cooke, M.P.2    Ellis, D.P.W.3
  • 7
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cooke Robust automatic speech recognition with missing and unreliable acoustic data Speech Communication 34 3 2001, June 267 285
    • (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
    • Cooke, M.1
  • 9
    • 70350450398 scopus 로고    scopus 로고
    • Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing
    • M. Delcroix, T. Nakatani, and S. Watanabe Static and dynamic variance compensation for recognition of reverberant speech with dereverberation preprocessing IEEE Transactions on Audio, Speech and Language Processing 17 2 2009 324 334
    • (2009) IEEE Transactions on Audio, Speech and Language Processing , vol.17 , Issue.2 , pp. 324-334
    • Delcroix, M.1    Nakatani, T.2    Watanabe, S.3
  • 12
    • 18744401086 scopus 로고    scopus 로고
    • Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion
    • L. Deng, J. Droppo, and A. Acero Dynamic compensation of HMM variances using the feature enhancement uncertainty computed from a parametric model of speech distortion IEEE Transactions on Speech and Audio Processing 13 3 2005 412 421
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.3 , pp. 412-421
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 13
    • 84901773892 scopus 로고    scopus 로고
    • Environmental robustness
    • J. Benesty, M.M. Sondhi, Y. Huang, Springer
    • J. Droppo, and A. Acero Environmental robustness J. Benesty, M.M. Sondhi, Y. Huang, Handbook of Speech Processing 2008 Springer pp. 653-680
    • (2008) Handbook of Speech Processing , pp. 653-680
    • Droppo, J.1    Acero, A.2
  • 14
    • 84948598244 scopus 로고
    • Statistical-model-based speech enhancement systems
    • Y. Ephraim Statistical-model-based speech enhancement systems Proceedings of the IEEE 80 10 1992 1526 1555
    • (1992) Proceedings of the IEEE , vol.80 , Issue.10 , pp. 1526-1555
    • Ephraim, Y.1
  • 18
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J.-L. Gauvain, and C.-H. Lee Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains IEEE Transactions on Speech and Audio Processing 2 2 1994 291 298
    • (1994) IEEE Transactions on Speech and Audio Processing , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 23
    • 0009623939 scopus 로고
    • Flexible speaker adaptation using maximum likelihood linear regression
    • C. Leggetter, and P. Woodland Flexible speaker adaptation using maximum likelihood linear regression ARPA Spoken Lang. Technol. Workshop 1995 104 109
    • (1995) ARPA Spoken Lang. Technol. Workshop , pp. 104-109
    • Leggetter, C.1    Woodland, P.2
  • 27
    • 0031221099 scopus 로고    scopus 로고
    • Filtering time sequences of spectral parameters for speech recognition
    • C. Nadeu, P. Pachès-Leal, and B.-H. Juang Filtering time sequences of spectral parameters for speech recognition Speech Communication 22 1997 315 332
    • (1997) Speech Communication , vol.22 , pp. 315-332
    • Nadeu, C.1    Pachès-Leal, P.2    Juang, B.-H.3
  • 30
    • 51449094735 scopus 로고    scopus 로고
    • Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs
    • A. Ozerov, P. Philippe, F. Bimbot, and R. Gribonval Adaptation of Bayesian models for single-channel source separation and its application to voice/music separation in popular songs IEEE Transactions on Audio, Speech and Language Processing 15 5 2007 1564 1578
    • (2007) IEEE Transactions on Audio, Speech and Language Processing , vol.15 , Issue.5 , pp. 1564-1578
    • Ozerov, A.1    Philippe, P.2    Bimbot, F.3    Gribonval, R.4
  • 32
    • 84897584695 scopus 로고    scopus 로고
    • A general flexible framework for the handling of prior information in audio source separation
    • A. Ozerov, E. Vincent, and F. Bimbot A general flexible framework for the handling of prior information in audio source separation IEEE Transactions on Audio, Speech and Language Processing 20 4 2012 1118 1133
    • (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.4 , pp. 1118-1133
    • Ozerov, A.1    Vincent, E.2    Bimbot, F.3
  • 33
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L. Rabiner A tutorial on hidden Markov models and selected applications in speech recognition Proceedings of the IEEE 77 2 1989 257 286
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.1
  • 34
    • 0029277506 scopus 로고
    • Large population speaker identification using clean and telephone speech
    • D. Reynolds Large population speaker identification using clean and telephone speech IEEE Signal Processing Letters 2 3 1995 46 48
    • (1995) IEEE Signal Processing Letters , vol.2 , Issue.3 , pp. 46-48
    • Reynolds, D.1
  • 35
    • 27544443176 scopus 로고    scopus 로고
    • Accounting for probe-level noise in principal component analysis of microarray data
    • G. Sanguinetti, M. Milo, M. Rattray, and N.D. Lawrence Accounting for probe-level noise in principal component analysis of microarray data Bioinformatics 21 19 2005 3748 3754
    • (2005) Bioinformatics , vol.21 , Issue.19 , pp. 3748-3754
    • Sanguinetti, G.1    Milo, M.2    Rattray, M.3    Lawrence, N.D.4
  • 36
    • 69249159165 scopus 로고    scopus 로고
    • A computational auditory scene analysis system for speech segregation and robust speech recognition
    • Y. Shao, S. Srinivasan, Z. Jin, and D. Wang A computational auditory scene analysis system for speech segregation and robust speech recognition Computer Speech & Language 24 1 2010 77 93
    • (2010) Computer Speech & Language , vol.24 , Issue.1 , pp. 77-93
    • Shao, Y.1    Srinivasan, S.2    Jin, Z.3    Wang, D.4
  • 37
    • 0004082513 scopus 로고    scopus 로고
    • Tech. rep., Interval Research Corporation
    • Slaney, M., 1998. Auditory toolbox version 2. Tech. rep., Interval Research Corporation.
    • (1998) Auditory Toolbox Version 2
    • Slaney, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.