메뉴 건너뛰기




Volumn 21, Issue 1, 2011, Pages 36-53

The integration of principal component analysis and cepstral mean subtraction in parallel model combination for robust speech recognition

Author keywords

Automatic speech recognition; Cepstral mean subtraction (CMS); Parallel model combination (PMC); PCA CMS based PMC (PC PMC); Principal component analysis (PCA); Robustness

Indexed keywords

ADDITIVE NOISE; CONTINUOUS SPEECH RECOGNITION; CONVOLUTION; COVARIANCE MATRIX; MAXIMUM LIKELIHOOD; ROBUSTNESS (CONTROL SYSTEMS); SPEECH;

EID: 78649946011     PISSN: 10512004     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.dsp.2010.07.004     Document Type: Article
Times cited : (19)

References (73)
  • 2
    • 67249104021 scopus 로고    scopus 로고
    • M.S. thesis, Computer Engineering Department, Sharif University of Technology, Tehran, Iran, November 2005
    • H. Veisi, Model-based methods for noise robust speech recognition systems, M.S. thesis, Computer Engineering Department, Sharif University of Technology, Tehran, Iran, November 2005.
    • Model-based Methods for Noise Robust Speech Recognition Systems
    • Veisi, H.1
  • 3
    • 78649915021 scopus 로고    scopus 로고
    • On the using of principal component analysis on the Farsi continuous speech recognition for feature robustness and feature reductions
    • Tehran, Iran
    • H. Veisi, H. Sameti, H.R. Abutalebi, On the using of principal component analysis on the Farsi continuous speech recognition for feature robustness and feature reductions, in: Annual Computer Society of Iran Computer Conference (CSICC'05), Tehran, Iran, 2005, pp. 242-251.
    • (2005) Annual Computer Society of Iran Computer Conference (CSICC'05) , pp. 242-251
    • Veisi, H.1    Sameti, H.2    Abutalebi, H.R.3
  • 9
    • 56949089751 scopus 로고    scopus 로고
    • Feature compensation in the cepstral domain employing model combination
    • W. Kim, and J.H.L. Hansen Feature compensation in the cepstral domain employing model combination Speech Comm. 51 2 2009 83 96
    • (2009) Speech Comm. , vol.51 , Issue.2 , pp. 83-96
    • Kim, W.1    Hansen, J.H.L.2
  • 10
    • 85009106519 scopus 로고    scopus 로고
    • Robust ASR based on clean speech models: An evaluation of missing data techniques for connected digit recognition in noise
    • J. Barker, M. Cooke, P. Green, Robust ASR based on clean speech models: an evaluation of missing data techniques for connected digit recognition in noise, in: Eur. Conf. on Speech Communication and Technology (Eurospeech'01), 2001, pp. 213-216.
    • (2001) Eur. Conf. on Speech Communication and Technology (Eurospeech'01) , pp. 213-216
    • Barker, J.1    Cooke, M.2    Green, P.3
  • 11
    • 0035342414 scopus 로고    scopus 로고
    • Robust automatic speech recognition with missing and unreliable acoustic data
    • M. Cook, P. Green, L. Josifovski, and A. Vizinho Robust automatic speech recognition with missing and unreliable acoustic data Speech Comm. 34 3 2001 267 285
    • (2001) Speech Comm. , vol.34 , Issue.3 , pp. 267-285
    • Cook, M.1    Green, P.2    Josifovski, L.3    Vizinho, A.4
  • 12
    • 4644336054 scopus 로고    scopus 로고
    • Reconstruction of missing features for robust speech recognition
    • B. Raj, M.L. Seltzer, and R.M. Stern Reconstruction of missing features for robust speech recognition Speech Comm. 34 4 2004 275 296
    • (2004) Speech Comm. , vol.34 , Issue.4 , pp. 275-296
    • Raj, B.1    Seltzer, M.L.2    Stern, R.M.3
  • 17
    • 0027622731 scopus 로고
    • Cepstral parameter compensation for HMM recognition in noise
    • M.J.F. Gales, and S.J. Young Cepstral parameter compensation for HMM recognition in noise Speech Comm. 12 1993 231 239
    • (1993) Speech Comm. , vol.12 , pp. 231-239
    • Gales, M.J.F.1    Young, S.J.2
  • 18
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density Hidden Markov models
    • C.J. Leggetter, and P.C. Woodland Maximum likelihood linear regression for speaker adaptation of continuous density Hidden Markov models Comp. Speech Language 9 1995 171 185
    • (1995) Comp. Speech Language , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 19
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • M.J.F. Gales, and P.C. Woodland Mean and variance adaptation within the MLLR framework Comp. Speech Language 10 1996 249 264
    • (1996) Comp. Speech Language , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 22
  • 27
    • 34547553730 scopus 로고    scopus 로고
    • Uncertainty decoding for noise robust speech recognition
    • University of Cambridge
    • H. Liao, M.J.F. Gales, Uncertainty decoding for noise robust speech recognition, Technical report, CUED/F-INFENG/TR499, University of Cambridge, 2004.
    • (2004) Technical Report, CUED/F-INFENG/TR499
    • Liao, H.1    Gales, M.J.F.2
  • 30
    • 0036293691 scopus 로고    scopus 로고
    • Comparing Jacobian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition
    • Orlando, FL, USA
    • K. Parssinen, P. Salmela, M. Harju, I. Kiss, Comparing Jacobian adaptation with cepstral mean normalization and parallel model combination for noise robust speech recognition, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP'02), Orlando, FL, USA, 2002, pp. I-193-I-196.
    • (2002) IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP'02)
    • Parssinen, K.1    Salmela, P.2    Harju, M.3    Kiss, I.4
  • 33
    • 0003089362 scopus 로고
    • Spectral subtraction based on minimum statistics
    • Edinburgh, Scotland
    • R. Martin, Spectral subtraction based on minimum statistics, in: European Signal Processing Conference (Eusipco'94), Edinburgh, Scotland, 1994, pp. 1182-1185.
    • (1994) European Signal Processing Conference (Eusipco'94) , pp. 1182-1185
    • Martin, R.1
  • 35
    • 0034832359 scopus 로고    scopus 로고
    • Assessing local noise level estimation methods: Application to noise robust ASR
    • C. Ris, and S. Dupont Assessing local noise level estimation methods: application to noise robust ASR Speech Comm. 34 1-2 2001 141 158
    • (2001) Speech Comm. , vol.34 , Issue.12 , pp. 141-158
    • Ris, C.1    Dupont, S.2
  • 36
    • 0032048385 scopus 로고    scopus 로고
    • Speech recognition in noisy environments using first order vector Taylor series
    • D.Y. Kim, C.K. Un, and N. S Kim Speech recognition in noisy environments using first order vector Taylor series Speech Comm. 24 1998 39 49
    • (1998) Speech Comm. , vol.24 , pp. 39-49
    • Kim, D.Y.1    Un, C.K.2    Kim N, S.3
  • 37
    • 0347968277 scopus 로고    scopus 로고
    • Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
    • L. Deng, J. Droppo, and A. Acero Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition IEEE Trans. Speech Audio Process. 11 6 2003 568 580
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.6 , pp. 568-580
    • Deng, L.1    Droppo, J.2    Acero, A.3
  • 38
    • 27644486095 scopus 로고    scopus 로고
    • A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition
    • Y. Gong A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition IEEE Trans. Speech Audio Process. 13 5 2005 975 983
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 975-983
    • Gong, Y.1
  • 41
    • 34250899234 scopus 로고    scopus 로고
    • A comparative study of approximations for parallel model combination of static and dynamic parameters
    • Denver, Colorado, USA
    • Y. Gong, A comparative study of approximations for parallel model combination of static and dynamic parameters, in: Int. Conf. on Spoken Language Processing (ICSLP'02), Denver, Colorado, USA, 2002, pp. 1029-1032.
    • (2002) Int. Conf. on Spoken Language Processing (ICSLP'02) , pp. 1029-1032
    • Gong, Y.1
  • 42
    • 0035510538 scopus 로고    scopus 로고
    • New approaches for domain transformation and parameter combination for improved accuracy in parallel model combination (PMC) techniques
    • J. Hung, J. Shen, and L. Lee New approaches for domain transformation and parameter combination for improved accuracy in parallel model combination (PMC) techniques IEEE Trans. Speech Audio Process. 9 2001 842 855
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , pp. 842-855
    • Hung, J.1    Shen, J.2    Lee, L.3
  • 46
    • 0032657798 scopus 로고    scopus 로고
    • Improved parallel model combination techniques with split gaussian mixtures for speech recognition under noisy conditions
    • Phoenix, AZ, USA
    • J. Hung, J. Shen, L. Lee, Improved parallel model combination techniques with split gaussian mixtures for speech recognition under noisy conditions, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP'9), Phoenix, AZ, USA, 1999, pp. 437-440.
    • (1999) IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP'9) , pp. 437-440
    • Hung, J.1    Shen, J.2    Lee, L.3
  • 47
    • 78649967824 scopus 로고    scopus 로고
    • Weighted parallel model combination for noisy speech recognition
    • Sydney, Australia
    • T.H. Hwang, H.C. Wang, Weighted parallel model combination for noisy speech recognition, in: Int. Conf. on Spoken Language Processing (ICSLP'98), Sydney, Australia, 1998, pp. 1527-1530.
    • (1998) Int. Conf. on Spoken Language Processing (ICSLP'98) , pp. 1527-1530
    • Hwang, T.H.1    Wang, H.C.2
  • 48
    • 0031631071 scopus 로고    scopus 로고
    • Improved robustness for speech recognition under noisy conditions using correlated parallel model combination
    • Seattle, WA, USA
    • J. Hung, J. Shen, L. Lee, Improved robustness for speech recognition under noisy conditions using correlated parallel model combination, in: IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP'98), Seattle, WA, USA, 1998, pp. 553-556.
    • (1998) IEEE Int. Conf. on Acoustics, Speech, and Signal Processing (ICASSP'98) , pp. 553-556
    • Hung, J.1    Shen, J.2    Lee, L.3
  • 54
    • 85009107937 scopus 로고    scopus 로고
    • Model composition by Lagrange polynomial approximation for robust speech recognition in noisy environment
    • Jeju Island, Korea
    • C.K. Raut, T. Nishimoto, S. Sagayama, Model composition by Lagrange polynomial approximation for robust speech recognition in noisy environment, in: Int. Conf. on Spoken Language Processing (ICSLP'04), Jeju Island, Korea, 2004.
    • (2004) Int. Conf. on Spoken Language Processing (ICSLP'04)
    • Raut, C.K.1    Nishimoto, T.2    Sagayama, S.3
  • 58
  • 62
    • 78449235724 scopus 로고    scopus 로고
    • The combination of CMS with PMC for improving the robustness of speech recognition systems
    • Kish Island, Iran, 2008 Springer-Verlag
    • H. Veisi, and H. Sameti The combination of CMS with PMC for improving the robustness of speech recognition systems Annual Computer Society of Iran Computer Conference (CSICC'08) Kish Island, Iran, 2008 2008 Springer-Verlag
    • (2008) Annual Computer Society of Iran Computer Conference (CSICC'08)
    • Veisi, H.1    Sameti, H.2
  • 68
    • 35248891610 scopus 로고    scopus 로고
    • A comparative intelligibility study of single-microphone noise reduction algorithms
    • DOI 10.1121/1.2766778
    • Y. Hu, and P. Loizou A comparative intelligibility study of single-microphone noise reduction algorithms J. Acoust. Soc. Amer. 122 3 2007 1777 1786 (Pubitemid 47560539)
    • (2007) Journal of the Acoustical Society of America , vol.122 , Issue.3 , pp. 1777-1786
    • Hu, Y.1    Loizou, P.C.2
  • 69
    • 0030366664 scopus 로고    scopus 로고
    • Iterative unsupervised adaptation using maximum likelihood linear regression
    • Philadelphia, PA, US, 3-6 October 1996
    • P.C. Woodland, D. Pye, M.J.F. Gales, Iterative unsupervised adaptation using maximum likelihood linear regression, in: Int. Conf. on Spoken Language Processing (ICSLP'96), vol. 2, Philadelphia, PA, US, 3-6 October 1996, pp. 1133-1136.
    • Int. Conf. on Spoken Language Processing (ICSLP'96) , vol.2 , pp. 1133-1136
    • Woodland, P.C.1    Pye, D.2    Gales, M.J.F.3
  • 70
    • 85009110676 scopus 로고    scopus 로고
    • Improved MLLR speaker adaptation using confidence measures for conversational speech recognition
    • Beijing, China
    • M. Pitz, F. Wessel, H. Ney, Improved MLLR speaker adaptation using confidence measures for conversational speech recognition, in: Int. Conf. on Spoken Language Processing (ICSLP'00), Beijing, China, 2000.
    • (2000) Int. Conf. on Spoken Language Processing (ICSLP'00)
    • Pitz, M.1    Wessel, F.2    Ney, H.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.