SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 20, Issue 7, 2012, Pages 2149-2158

Speaker and noise factorization for robust speech recognition

(2) Wang, Yongqiang a Gales, Mark J F a

a UNIVERSITY OF CAMBRIDGE (United Kingdom)

Author keywords

Acoustic factorization; noise robustness; speaker adaptation; vector Taylor series (VTS)

Indexed keywords

ACOUSTIC FACTORS; BACKGROUND NOISE; CLEAN SPEECH; FACTORIZATION APPROACH; FLEXIBLE FRAMEWORK; MULTIPLE FACTORS; NOISE CONDITIONS; NOISE FACTOR; NOISE ROBUSTNESS; ROBUST SPEECH RECOGNITION; SPEAKER ADAPTATION; SPEAKER CHARACTERISTICS; SPEECH RECOGNITION SYSTEMS; TRANSMISSION CHANNELS; VECTOR TAYLOR SERIES;

FACTORIZATION; SPEECH RECOGNITION;

ACOUSTIC NOISE;

EID: 84862293102 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2012.2198059 Document Type: Article

Times cited : (47)

References (33)

1
- 80051617808
- Speaker and noise factorisation on the AURORA4 task
- Y.-Q. Wang and M. J. F. Gales, "Speaker and noise factorisation on the AURORA4 task," in Proc. ICASSP'11, 2011, pp. 4584-4587.
- (2011) Proc. ICASSP'11 , pp. 4584-4587
- Wang, Y.-Q.¹ Gales, M.J.F.²

2
- 0346528936
- Speaker adaptation for continuous density HMMs: A review
- P. C. Woodland, "Speaker adaptation for continuous density HMMs: A review," in Proc. ISCA ITR-Workshop on Adaptation Methods for Speech Recognition, 2001.
- (2001) Proc. ISCA ITR-Workshop on Adaptation Methods for Speech Recognition
- Woodland, P.C.¹

3
- 0029288202
- Speech recognition in noisy environments: A survey
- Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, no. 3, pp. 261-291, 1995.
- (1995) Speech Commun. , vol.16 , Issue.3 , pp. 261-291
- Gong, Y.¹

4
- 0029747183
- Speaker normalization using efficient frequency warping procedures
- L. Lee and R. C. Rose, "Speaker normalization using efficient frequency warping procedures," in Proc. ICASSP'96, pp. 353-356.
- Proc. ICASSP'96 , pp. 353-356
- Lee, L.¹ Rose, R.C.²

5
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-186, 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 171-186
- Leggetter, C.¹ Woodland, P.C.²

6
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, pp. 75-98, 1998. (Pubitemid 128383747)
- (1998) Computer Speech and Language , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

7
- 0034227757
- Cluster adaptive training of hidden Markov models
- M. J. F. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Process., vol. 8, no. 4, pp. 417-428, 2002.
- (2002) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 417-428
- Gales, M.J.F.¹

8
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr.
- J. L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, no. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
- Gauvain, J.L.¹ Lee, C.-H.²

9
- 0000392884
- Eigenvoices for speaker adaptation
- R. Kuhn, P. Nguyen, J. C. Junqua, L. Goldwasser, N. Niedzielski, S. Fincke, K. Field, and M. Contolini, "Eigenvoices for speaker adaptation," in Proc. ICSLP'98.
- Proc. ICSLP'98
- Kuhn, R.¹ Nguyen, P.² Junqua, J.C.³ Goldwasser, L.⁴ Niedzielski, N.⁵ Fincke, S.⁶ Field, K.⁷ Contolini, M.⁸

10
- 70450163444
- Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition
- D. Kim and M. J. F. Gales, "Adaptive training with noisy constrained maximum likelihood linear regression for noise robust speech recognition," in Proc. Interspeech'09, pp. 2382-2386.
- Proc. Interspeech'09 , pp. 2382-2386
- Kim, D.¹ Gales, M.J.F.²

11
- 0003904183
- Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments
- P. Nguyen, C. Wellekens, and J. C. Junqua, "Maximum likelihood eigenspace and MLLR for speech recognition in noisy environments," in Proc. Eurospeech'99.
- Proc. Eurospeech'99
- Nguyen, P.¹ Wellekens, C.² Junqua, J.C.³

12
- 0030362995
- A compact model for speaker adaptive training
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker adaptive training," in Proc. ICSLP'96, pp. 1137-1140.
- Proc. ICSLP'96 , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

13
- 34547528168
- Adaptive training with joint uncertainty decoding for robust recognition of noisy data
- H. Liao and M. J. F. Gales, "Adaptive training with joint uncertainty decoding for robust recognition of noisy data," in Proc. ICASSP'07, pp. 389-392.
- Proc. ICASSP'07 , pp. 389-392
- Liao, H.¹ Gales, M.J.F.²

14
- 70349194599
- Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition
- O. Kalinli, M.L.Seltzer, andA.Acero, "Noise adaptive training using a vector Taylor series approach for noise robust automatic speech recognition," in Proc. ICASSP'09, pp. 3825-3828.
- Proc. ICASSP'09 , pp. 3825-3828
- Kalinli, O.¹ Seltzer, M.L.² Acero, A.³

15
- 38849170676
- Distributed Speech Recognition; Advanced Frontend Feature Extraction Algorithm; Compression Algorithms Tech. Rep. ES 202 050 v1.1.3
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Frontend Feature Extraction Algorithm; Compression Algorithms, 2003, Tech. Rep. ES 202 050 v1.1.3.
- (2003) Speech Processing Transmission and Quality Aspects (STQ)

16
- 85009070292
- Large vocabulary speech recognition under adverse acoustic environments
- L. Deng, A. Acero, M. Plumpe, and X. Huang, "Large vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP'00.
- Proc. ICSLP'00
- Deng, L.¹ Acero, A.² Plumpe, M.³ Huang, X.⁴

17
- 33750376174
- Model-based feature enhancement with uncertainty decoding for noise robust ASR
- DOI 10.1016/j.specom.2005.12.006, PII S0167639306000057
- V. Stouten, H. V. hamme, and P. Wambacq, "Model-based feature enhancement with uncertainty decoding for noise robust ASR," Speech Commun., vol. 48, no. 11, pp. 1502-1514, 2006. (Pubitemid 44634766)
- (2006) Speech Communication , vol.48 , Issue.11 , pp. 1502-1514
- Stouten, V.¹ Van Hamme, H.² Wambacq, P.³

18
- 65549153550
- Ph.D. dissertation, Carnegie Mellon Univ., Pittsburgh, PA
- P. Moreno, "Speech recognition in noisy environments," Ph.D. dissertation, Carnegie Mellon Univ., Pittsburgh, PA, 1996.
- (1996) Speech Recognition in Noisy Environments
- Moreno, P.¹

19
- 0003671941
- Ph.D. dissertation Cambridge Univ., Cambridge, U.K.
- M. J. F. Gales, "Model-based techniques for noise robust speech recognition," Ph.D. dissertation, Cambridge Univ., Cambridge, U.K., 1995.
- (1995) Model-based Techniques for Noise Robust Speech Recognition
- Gales, M.J.F.¹

20
- 85009113852
- HMM adaptation using vector Taylor series for noisy speech recognition
- A. Acero, L. Deng, T. Kristjansson, and J. Zhang, "HMM adaptation using vector Taylor series for noisy speech recognition," in Proc. ICSLP'00.
- Proc. ICSLP'00
- Acero, A.¹ Deng, L.² Kristjansson, T.³ Zhang, J.⁴

21
- 68549095140
- High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series
- J. Li, D. Yu, Y. Gong, and A. Acero, "High-performance HMM adaptation with joint compensation of additive and convolutive distortions via vector Taylor series," in Proc. ASRU'07.
- Proc. ASRU'07
- Li, J.¹ Yu, D.² Gong, Y.³ Acero, A.⁴

22
- 27644486095
- A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition
- DOI 10.1109/TSA.2005.851963
- Y. Gong, "A method of joint compensation of additive and convolutive distortions for speaker-independent speech recognition," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 975-983, Sep. 2005. (Pubitemid 41558911)
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.5 , pp. 975-983
- Gong, Y.¹

23
- 44849122740
- Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions
- Y. Hu and Q. Huo, "Irrelevant variability normalization based HMM training using VTS approximation of an explicit model of environmental distortions," in Proc. Interspeech'07, pp. 1042-1045.
- Proc. Interspeech'07 , pp. 1042-1045
- Hu, Y.¹ Huo, Q.²

24
- 85008564998
- Unsupervised data-driven feature vector normalization with acoustic model adaptation for robust speech recognition
- L. Buera, A. Miguel, O. Saz, A. Ortega, and E. Lleida, "Unsupervised data-driven feature vector normalization with acoustic model adaptation for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 2, pp. 296-309, 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.2 , pp. 296-309
- Buera, L.¹ Miguel, A.² Saz, O.³ Ortega, A.⁴ Lleida, E.⁵

25
- 0032139556
- Predictive model-based compensation schemes for robust speech recognition
- PII S0167639398000296
- M. J. F. Gales, "Predictive model-based compensation schemes for robust speech recognition,"Speech Commun., vol. 25, no. 1-3, pp. 49-74, 1998. (Pubitemid 128413634)
- (1998) Speech Communication , vol.25 , Issue.1-3 , pp. 49-74
- Gales, M.J.F.¹

26
- 0347960645
- Separating speaker and environment variabilities for improved recognition in non-stationary conditions
- L. Rigazio, P. Nguyen, D. Kryze, and J.-C. Junqua, "Separating speaker and environment variabilities for improved recognition in non-stationary conditions," in Proc. Eurospeech'01.
- Proc. Eurospeech'01
- Rigazio, L.¹ Nguyen, P.² Kryze, D.³ Junqua, J.-C.⁴

27
- 84862274942
- Acoustic factorisation
- M. J. F. Gales, "Acoustic factorisation," in Proc. ASRU'01.
- Proc. ASRU'01
- Gales, M.J.F.¹

28
- 4544253619
- Adaptive training using structured transforms
- K. Yu and M. J. F. Gales, "Adaptive training using structured transforms," in Proc. ICASSP'04, pp. 317-320.
- Proc. ICASSP'04 , pp. 317-320
- Yu, K.¹ Gales, M.J.F.²

29
- 0346126988
- Robust speech recognition in noise-Performance of the IBM continuous speech recogniser on the ARPA noise spoke task
- R. A. Gopinath et al., "Robust speech recognition in noise-Performance of the IBM continuous speech recogniser on the ARPA noise spoke task," in Proc. APRA Workshop Spoken Lang. Syst. Technol., 1995, pp. 127-130.
- (1995) Proc. APRA Workshop Spoken Lang. Syst. Technol. , pp. 127-130
- Gopinath, R.A.¹

30
- 34547537573
- Cambridge, U.K. Tech. Rep. CUED/F-INFENG/TR552
- H. Liao and M. J. F. Gales, Joint uncertainty decoding for robust large vocabulary speech recognition Univ. of Cambridge, Cambridge, U.K., 2006, Tech. Rep. CUED/F-INFENG/TR552.
- (2006) Joint Uncertainty Decoding for Robust Large Vocabulary Speech Recognition Univ. of Cambridge
- Liao, H.¹ Gales, M.J.F.²

31
- 33646806075
- Adaptation of precision matrix models on LVCSR
- K. C. Sim and M. J. F. Gales, "Adaptation of precision matrix models on LVCSR," in Proc. ICASSP'05, pp. 97-100.
- Proc. ICASSP'05 , pp. 97-100
- Sim, K.C.¹ Gales, M.J.F.²

32
- 56149125973
- Aurora working group: DSR front end LVCSR evaluation AU/384/02
- Mississippi State Univ., Tech. Rep.
- N. Parihar and J. Picone, "Aurora working group: DSR front end LVCSR evaluation AU/384/02," Inst. for Signal and Information Process, Mississippi State Univ., Tech. Rep.
- Inst. for Signal and Information Process
- Parihar, N.¹ Picone, J.²

33
- 79959822392
- Feature versus model based noise robustness
- K. Demuynck, X. Zhang, D. Van Compernolle, and H. Van Hamme, "Feature versus model based noise robustness," in Proc. Inter-speech'10, pp. 721-724.
- Proc. Inter-speech'10 , pp. 721-724
- Demuynck, K.¹ Zhang, X.² Van Compernolle, D.³ Van Hamme, H.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.