



Volume 20, Issue 7, 2012, Pages 2149-2158

Speaker and noise factorization for robust speech recognition

Author keywords

Acoustic factorization; noise robustness; speaker adaptation; vector Taylor series (VTS)

Indexed keywords

ACOUSTIC FACTORS; BACKGROUND NOISE; CLEAN SPEECH; FACTORIZATION APPROACH; FLEXIBLE FRAMEWORK; MULTIPLE FACTORS; NOISE CONDITIONS; NOISE FACTOR; NOISE ROBUSTNESS; ROBUST SPEECH RECOGNITION; SPEAKER ADAPTATION; SPEAKER CHARACTERISTICS; SPEECH RECOGNITION SYSTEMS; TRANSMISSION CHANNELS; VECTOR TAYLOR SERIES;

EID: 84862293102     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2198059     Document Type: Article
Times cited : (47)

References (33)
  • 1
    • 80051617808
    • Speaker and noise factorisation on the AURORA4 task
    • Y.-Q. Wang and M. J. F. Gales, "Speaker and noise factorisation on the AURORA4 task," in Proc. ICASSP'11, 2011, pp. 4584-4587.
    • (2011) Proc. ICASSP'11 , pp. 4584-4587
    • Wang, Y.-Q.1    Gales, M.J.F.2
  • 3
    • 0029288202
    • Speech recognition in noisy environments: A survey
    • Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, no. 3, pp. 261-291, 1995.
    • (1995) Speech Commun. , vol.16 , Issue.3 , pp. 261-291
    • Gong, Y.1
  • 4
    • 0029747183
    • Speaker normalization using efficient frequency warping procedures
    • L. Lee and R. C. Rose, "Speaker normalization using efficient frequency warping procedures," in Proc. ICASSP'96, pp. 353-356.
    • Proc. ICASSP'96 , pp. 353-356
    • Lee, L.1    Rose, R.C.2
  • 5
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-186, 1995.
    • (1995) Comput. Speech Lang. , vol.9 , pp. 171-186
    • Leggetter, C.1    Woodland, P.C.2
  • 6
    • 0032050110
    • Maximum likelihood linear transformations for HMM-based speech recognition
    • M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998. (Pubitemid 128383747)
    • (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
    • Gales, M.J.F.1
  • 7
    • 0034227757
    • Cluster adaptive training of hidden Markov models
    • M. J. F. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 417-428, 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 417-428
    • Gales, M.J.F.1
  • 8
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J. L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.L.1    Lee, C.-H.2
  • 10
    • 70450163444
    • Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition
    • D. Kim and M. J. F. Gales, "Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition," in Proc. Interspeech'09, pp. 2382-2386.
    • Proc. Interspeech'09 , pp. 2382-2386
    • Kim, D.1    Gales, M.J.F.2
  • 11
    • 0003904183
    • Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments
    • P. Nguyen, C. Wellekens, and J. C. Junqua, "Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments," in Proc. Eurospeech'99.
    • Proc. Eurospeech'99
    • Nguyen, P.1    Wellekens, C.2    Junqua, J.C.3
  • 13
    • 34547528168
    • Adaptive training with joint uncertainty decoding for robust recognition of noisy data
    • H. Liao and M. J. F. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data," in Proc. ICASSP'07, pp. 389-392.
    • Proc. ICASSP'07 , pp. 389-392
    • Liao, H.1    Gales, M.J.F.2
  • 14
    • 70349194599
    • Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition
    • O. Kalinli, M. L. Seltzer, and A. Acero, "Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition," in Proc. ICASSP'09, pp. 3825-3828.
    • Proc. ICASSP'09 , pp. 3825-3828
    • Kalinli, O.1    Seltzer, M.L.2    Acero, A.3
  • 15
    • 38849170676
    • Distributed Speech Recognition; Advanced Frontend Feature Extraction Algorithm; Compression Algorithms Tech. Rep. ES 202 050 v1.1.3
    • Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Frontend Feature Extraction Algorithm; Compression Algorithms, 2003, Tech. Rep. ES 202 050 v1.1.3.
    • (2003) Speech Processing Transmission and Quality Aspects (STQ)
  • 16
    • 85009070292
    • Large vocabulary speech recognition under adverse acoustic environments
    • L. Deng, A. Acero, M. Plumpe, and X. Huang, "Large vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP'00.
    • Proc. ICSLP'00
    • Deng, L.1    Acero, A.2    Plumpe, M.3    Huang, X.4
  • 17
    • 33750376174
    • Model-based feature enhancement with uncertainty decoding for noise robust ASR
    • DOI 10.1016/j.specom.2005.12.006, PII S0167639306000057
    • V. Stouten, H. Van hamme, and P. Wambacq, "Model-based feature enhancement with uncertainty decoding for noise robust ASR," Speech Commun., vol. 48, no. 11, pp. 1502-1514, 2006. (Pubitemid 44634766)
    • (2006) Speech Communication , vol.48 , Issue.11 , pp. 1502-1514
    • Stouten, V.1    Van Hamme, H.2    Wambacq, P.3
  • 20
    • 85009113852
    • HMM adaptation using vector Taylor series for noisy speech recognition
    • A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP'00.
    • Proc. ICSLP'00
    • Acero, A.1    Deng, L.2    Kristjansson, T.3    Zhang, J.4
  • 21
    • 68549095140
    • High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series
    • J. Li, D. Yu, Y. Gong, and A. Acero, "High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series," in Proc. ASRU'07.
    • Proc. ASRU'07
    • Li, J.1    Yu, D.2    Gong, Y.3    Acero, A.4
  • 22
    • 27644486095
    • A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition
    • DOI 10.1109/TSA.2005.851963
    • Y. Gong, "A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 975-983, Sep. 2005. (Pubitemid 41558911)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 975-983
    • Gong, Y.1
  • 23
    • 44849122740
    • Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions
    • Y. Hu and Q. Huo, "Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions," in Proc. Interspeech'07, pp. 1042-1045.
    • Proc. Interspeech'07 , pp. 1042-1045
    • Hu, Y.1    Huo, Q.2
  • 24
    • 85008564998
    • Unsupervised data-driven feature vector normalization with acoustic model adaptation for robust speech recognition
    • L. Buera, A. Miguel, O. Saz, A. Ortega, and E. Lleida, "Unsupervised data-driven feature vector normalization with acoustic model adaptation for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 2, pp. 296-309, 2010.
    • (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.2 , pp. 296-309
    • Buera, L.1    Miguel, A.2    Saz, O.3    Ortega, A.4    Lleida, E.5
  • 25
    • 0032139556
    • Predictive model-based compensation schemes for robust speech recognition
    • PII S0167639398000296
    • M. J. F. Gales, "Predictive model-based compensation schemes for robust speech recognition," Speech Commun., vol. 25, no. 1-3, pp. 49-74, 1998. (Pubitemid 128413634)
    • (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 49-74
    • Gales, M.J.F.1
  • 26
    • 0347960645
    • Separating speaker and environment variabilities for improved recognition in non-stationary conditions
    • L. Rigazio, P. Nguyen, D. Kryze, and J.-C. Junqua, "Separating speaker and environment variabilities for improved recognition in non-stationary conditions," in Proc. Eurospeech'01.
    • Proc. Eurospeech'01
    • Rigazio, L.1    Nguyen, P.2    Kryze, D.3    Junqua, J.-C.4
  • 28
    • 4544253619
    • Adaptive training using structured transforms
    • K. Yu and M. J. F. Gales, "Adaptive training using structured transforms," in Proc. ICASSP'04, pp. 317-320.
    • Proc. ICASSP'04 , pp. 317-320
    • Yu, K.1    Gales, M.J.F.2
  • 29
    • 0346126988
    • Robust speech recognition in noise-Performance of the IBM continuous speech recogniser on the ARPA noise spoke task
    • R. A. Gopinath et al., "Robust speech recognition in noise-Performance of the IBM continuous speech recogniser on the ARPA noise spoke task," in Proc. ARPA Workshop Spoken Lang. Syst. Technol., 1995, pp. 127-130.
    • (1995) Proc. ARPA Workshop Spoken Lang. Syst. Technol. , pp. 127-130
    • Gopinath, R.A.1
  • 31
    • 33646806075
    • Adaptation of precision matrix models on LVCSR
    • K. C. Sim and M. J. F. Gales, "Adaptation of precision matrix models on LVCSR," in Proc. ICASSP'05, pp. 97-100.
    • Proc. ICASSP'05 , pp. 97-100
    • Sim, K.C.1    Gales, M.J.F.2
  • 32
    • 56149125973
    • Aurora working group: DSR front end LVCSR evaluation AU/384/02
    • N. Parihar and J. Picone, "Aurora working group: DSR front end LVCSR evaluation AU/384/02," Inst. for Signal and Information Process, Mississippi State Univ., Tech. Rep.
    • Inst. for Signal and Information Process
    • Parihar, N.1    Picone, J.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.