SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 6, 2006, Pages 2147-2155

An environment-compensated minimum classification error training approach based on stochastic vector mapping

Author keywords

Feature compensation; Hidden Markov model (HMM); Minimum classification error training (MCE); Noise robustness; Robust speech recognition; Stochastic vector mapping

Indexed keywords

FEATURE COMPENSATION; HIDDEN MARKOV MODEL (HMM); MINIMUM CLASSIFICATION ERROR TRAINING (MCE); NOISE ROBUSTNESS; ROBUST SPEECH RECOGNITION; STOCHASTIC VECTOR MAPPING;

ACOUSTIC NOISE; ERROR COMPENSATION; HIDDEN MARKOV MODELS; MAXIMUM LIKELIHOOD; OBJECT RECOGNITION; RANDOM PROCESSES; SPEECH ANALYSIS; STOCHASTIC MODELS; VECTORS;

SPEECH RECOGNITION;

EID: 44849090158 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.872616 Document Type: Article

Times cited : (20)

References (40)

1
- 0004319970
- Norwell, MA: Kluwer
- A. Acero, Acoustic and Environment Robustness in Automatic Speech Recognition. Norwell, MA: Kluwer, 1993.
- (1993) Acoustic and Environment Robustness in Automatic Speech Recognition
- Acero, A.¹

2
- 0030362995
- A compact model for speaker-adaptive training
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training," in Proc. ICSLP, 1996, pp. 1137-1140.
- (1996) Proc. ICSLP , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

3
- 0027221210
- A. Biem and S. Katagiri, Feature extraction based on minimum classification error/generalized probabilistic descent method, in Proc. ICASSP, 1993, pp. II-275-II-278.
- A. Biem and S. Katagiri, "Feature extraction based on minimum classification error/generalized probabilistic descent method," in Proc. ICASSP, 1993, pp. II-275-II-278.

4
- 0035250280
- An application of discriminative feature extraction to filter-bank-based speech recognition
- Mar
- A. Biem, S. Katagiri, E. McDermott, and B.-H. Juang, "An application of discriminative feature extraction to filter-bank-based speech recognition," IEEE Trans. Speech Audio Process., vol. 9, no. 2, pp. 96-110, Mar. 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.2 , pp. 96-110
- Biem, A.¹ Katagiri, S.² McDermott, E.³ Juang, B.-H.⁴

5
- 0031146514
- HMM-based speech recognition using state-dependent, discriminatively derived transforms on Mel-Warped DFT features
- May
- R. Chengalvarayan and L. Deng, "HMM-based speech recognition using state-dependent, discriminatively derived transforms on Mel-Warped DFT features," IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 243-256, May 1997.
- (1997) IEEE Trans. Speech Audio Process , vol.5 , Issue.3 , pp. 243-256
- Chengalvarayan, R.¹ Deng, L.²

6
- 85009072507
- Evaluation of front-end features and noise compensation methods for robust Mandarin speech recognition
- Aalborg, Denmark
- R. Chengalvarayan, "Evaluation of front-end features and noise compensation methods for robust Mandarin speech recognition," in Proc. Eurospeech, Aalborg, Denmark, 2001, pp. 897-900.
- (2001) Proc. Eurospeech , pp. 897-900
- Chengalvarayan, R.¹

7
- 85135190638
- Signal conditioned minimum error rate training
- W. Chou, M. G. Rahim, and E. Buhrke, "Signal conditioned minimum error rate training," in Proc. Eurospeech, 1995, pp. 495-498.
- (1995) Proc. Eurospeech , pp. 495-498
- Chou, W.¹ Rahim, M.G.² Buhrke, E.³

8
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," in J. Roy. Statist. Soc. Ser. B, 1977, vol. 39, no. 1, pp. 1-38.
- (1977) J. Roy. Statist. Soc. Ser. B , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

9
- 85009070292
- Large-vocabulary speech recognition under adverse acoustic environments
- Oct, pp. III-806-809
- L. Deng, A. Acero, M. Plumpe, and X.-D. Huang, "Large-vocabulary speech recognition under adverse acoustic environments," in Proc. ICSLP, Oct. 2000, pp. III-806-809.
- (2000) Proc. ICSLP
- Deng, L.¹ Acero, A.² Plumpe, M.³ Huang, X.-D.⁴

10
- 0034855352
- High-performance robust speech recognition using stereo training data
- pp. I-301-I-304
- L. Deng, A. Acero, L. Jiang, J. Droppo, and X.-D. Huang, "High-performance robust speech recognition using stereo training data," in Proc. ICASSP, 2001, pp. I-301-I-304.
- Proc. ICASSP , pp. 2001
- Deng, L.¹ Acero, A.² Jiang, L.³ Droppo, J.⁴ Huang, X.-D.⁵

11
- 0347968277
- Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition
- Nov
- L. Deng, J. Droppo, and A. Acero, "Recursive estimation of nonstationary noise using iterative stochastic approximation for robust speech recognition," IEEE Trans. Speech Audio Process., vol. 11, no. 6, pp. 568-580, Nov. 2003.
- (2003) IEEE Trans. Speech Audio Process , vol.11 , Issue.6 , pp. 568-580
- Deng, L.¹ Droppo, J.² Acero, A.³

12
- 85006734596
- Evaluation of the SPLICE algorithm on the Aurora2 database
- Aalborg, Denmark, Sep
- J. Droppo, L. Deng, and A. Acero, "Evaluation of the SPLICE algorithm on the Aurora2 database," in Proc. Eurospeech, Aalborg, Denmark, Sep. 2001, pp. 217-220.
- (2001) Proc. Eurospeech , pp. 217-220
- Droppo, J.¹ Deng, L.² Acero, A.³

13
- 0442317754
- ETSI ES 202 050 v1.1.1, Oct, ETSI standard document. 2002
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithms, ETSI ES 202 050 v1.1.1 (2002-10), Oct. 2002, ETSI standard document.
- (2010) Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Front-End Feature Extraction Algorithm; Compression Algorithms

14
- 0032050110
- Maximum likelihood linear transformations for HMMbased speech recognition
- M. J. F. Gales, "Maximum likelihood linear transformations for HMMbased speech recognition," in Comput. Speech Lang., 1998, vol. 12, pp. 75-98.
- (1998) Comput. Speech Lang , vol.12 , pp. 75-98
- Gales, M.J.F.¹

15
- 0347321460
- Source normalization training for HMM applied to noisy telephone speech recognition
- Y. Gong, "Source normalization training for HMM applied to noisy telephone speech recognition," in Proc. Eurospeech, 1997, pp. 1555-1558.
- (1997) Proc. Eurospeech , pp. 1555-1558
- Gong, Y.¹

16
- 0038669544
- The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions
- Paris, France, Sep
- H. G. Hirsch and D. Pearce, "The AURORA experimental framework for the performance evaluations of speech recognition systems under noisy conditions," in ISCA ITRW ASR, Paris, France, Sep. 2000, pp. 181-188.
- (2000) ISCA ITRW ASR , pp. 181-188
- Hirsch, H.G.¹ Pearce, D.²

17
- 0033888153
- A robust training algorithm for adverse speech recognition
- W.-T. Hong and S.-H. Chen, "A robust training algorithm for adverse speech recognition," Speech Commun., vol. 30, no. 4, pp. 273-293, 2000.
- (2000) Speech Commun , vol.30 , Issue.4 , pp. 273-293
- Hong, W.-T.¹ Chen, S.-H.²

18
- 0141480138
- A discriminative and robust training algorithm for noisy speech recognition
- pp. I-8-I-11
- W.-T. Hong, "A discriminative and robust training algorithm for noisy speech recognition," in Proc. ICASSP, 2003, pp. I-8-I-11.
- Proc. ICASSP , pp. 2003
- Hong, W.-T.¹

19
- 0026982122
- Discriminative learning for minimum error classification
- Dec
- B.-H. Juang and S. Katagiri, "Discriminative learning for minimum error classification," IEEE Trans. Signal Process., vol. 40, no. 12, pp. 3043-3054, Dec. 1992.
- (1992) IEEE Trans. Signal Process , vol.40 , Issue.12 , pp. 3043-3054
- Juang, B.-H.¹ Katagiri, S.²

20
- 0031139839
- Minimum classification error rate methods for speech recognition
- May
- B.-H. Juang, W. Chou, and C.-H. Lee, "Minimum classification error rate methods for speech recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 3, pp. 257-265, May 1997.
- (1997) IEEE Trans. Speech Audio Process , vol.5 , Issue.3 , pp. 257-265
- Juang, B.-H.¹ Chou, W.² Lee, C.-H.³

21
- 0032651723
- Integrated bias removal techniques for robust speech recognition
- C. Lawrence and M. Rahim, "Integrated bias removal techniques for robust speech recognition," in Comput. Speech . Lang., 1999, vol. 13, pp. 283-298.
- (1999) Comput. Speech . Lang , vol.13 , pp. 283-298
- Lawrence, C.¹ Rahim, M.²

22
- 0021226391
- R. G. Leonard, A database for speaker-independent digit recognition, in Proc. ICASSP, 1984, pp. 42.11.1-42.11.4.
- R. G. Leonard, "A database for speaker-independent digit recognition," in Proc. ICASSP, 1984, pp. 42.11.1-42.11.4.

23
- 0023263708
- Multi-style training for robust isolated-word speech recognition
- R. P. Lippmann, E. A. Martin, and D. B. Paul, "Multi-style training for robust isolated-word speech recognition," in Proc. ICASSP, 1987, pp. 705-708.
- (1987) Proc. ICASSP , pp. 705-708
- Lippmann, R.P.¹ Martin, E.A.² Paul, D.B.³

24
- 0742272653
- Discriminative auditory-based features for robust speech recognition
- Jan
- B. Mak, Y.-C. Tam, and P. Li, "Discriminative auditory-based features for robust speech recognition," IEEE Trans. Speech Audio Process., vol. 12, no. 1, pp. 27-36, Jan. 2004.
- (2004) IEEE Trans. Speech Audio Process , vol.12 , Issue.1 , pp. 27-36
- Mak, B.¹ Tam, Y.-C.² Li, P.³

25
- 0036294871
- On maximum mutual information speaker-adapted training
- pp. I-601-I-604
- J. McDonough, T. Schaaf, and A. Waibel, "On maximum mutual information speaker-adapted training," in Proc. ICASSP, 2002, pp. I-601-I-604.
- Proc. ICASSP , pp. 2002
- McDonough, J.¹ Schaaf, T.² Waibel, A.³

26
- 65549153550
- Speech Recognition in Noisy Environments,
- Ph.D. dissertation, Dept. Elect. Comput. Eng, Carnegie Mellon Univ, Pittsburgh, PA
- P. Moreno, "Speech Recognition in Noisy Environments," Ph.D. dissertation, Dept. Elect. Comput. Eng., Carnegie Mellon Univ., Pittsburgh, PA, 1996.
- (1996)
- Moreno, P.¹

27
- 0029215955
- Simultaneous design of feature extractor and pattern classifier using the minimum classification error training algorithm
- K. K. Paliwal, M. Bacchini, and Y. Sagisaka, "Simultaneous design of feature extractor and pattern classifier using the minimum classification error training algorithm," in Proc. NNSP, 1995, pp. 67-76.
- (1995) Proc. NNSP , pp. 67-76
- Paliwal, K.K.¹ Bacchini, M.² Sagisaka, Y.³

28
- 33646905361
- Simultaneous feature and HMM design using string-based minimum classification error training criterion
- M. Rahim and C.-H. Lee, "Simultaneous feature and HMM design using string-based minimum classification error training criterion," in Proc. ICSLP, 1996, pp. 1820-1823.
- (1996) Proc. ICSLP , pp. 1820-1823
- Rahim, M.¹ Lee, C.-H.²

29
- 0030127017
- Signal conditioning techniques for robust speech recognition
- Apr
- M. Rahim, B.-H. Juang, W. Chou, and E. Buhrke, "Signal conditioning techniques for robust speech recognition," IEEE Signal Process. Lett., vol. 3, no. 4, pp. 107-109, Apr. 1996.
- (1996) IEEE Signal Process. Lett , vol.3 , Issue.4 , pp. 107-109
- Rahim, M.¹ Juang, B.-H.² Chou, W.³ Buhrke, E.⁴

30
- 0030149866
- A maximum-likelihood approach to stochastic matching for robust speech recognition
- May
- A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Process., vol. 4, no. 3, pp. 190-202, May 1996.
- (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.3 , pp. 190-202
- Sankar, A.¹ Lee, C.-H.²

31
- 85009217371
- Signal and feature compensation methods for robust speech recognition
- G. M. Davis, Ed. Boca Raton, FL: CRC
- R. Singh, R. M. Stern, and B. Raj, "Signal and feature compensation methods for robust speech recognition," in Noise Reduction Speech Applications, G. M. Davis, Ed. Boca Raton, FL: CRC, 2002, pp. 219-244.
- (2002) Noise Reduction Speech Applications , pp. 219-244
- Singh, R.¹ Stern, R.M.² Raj, B.³

32
- 0002788784
- Signal processing for robust speech recognition
- C.-H. Lee, F. Soong, and K. K. Paliwal, Eds. Norwell, MA: Kluwer
- R. M. Stern, A. Acero, F.-H. Liu, and Y. Ohshima, "Signal processing for robust speech recognition," in Automatic Speech and Speaker Recognition: Advanced Topics, C.-H. Lee, F. Soong, and K. K. Paliwal, Eds. Norwell, MA: Kluwer, 1996, pp. 351-378.
- (1996) Automatic Speech and Speaker Recognition: Advanced Topics , pp. 351-378
- Stern, R.M.¹ Acero, A.² Liu, F.-H.³ Ohshima, Y.⁴

33
- 0030379378
- An application of minimum classification error to feature space transformations for speech recognition
- A. Torre, A. M. Peinado, A. J. Rubio, V. E. Sanchez, and J. E. Diaz, "An application of minimum classification error to feature space transformations for speech recognition," Speech Commun., vol. 20, pp. 273-290, 1996.
- (1996) Speech Commun , vol.20 , pp. 273-290
- Torre, A.¹ Peinado, A.M.² Rubio, A.J.³ Sanchez, V.E.⁴ Diaz, J.E.⁵

34
- 0141477730
- Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation
- S. Tsakalidis, V. Doumpiotis, and W. Byrne, "Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation," in Proc. ICSLP, 2002, pp. 2585-2588.
- (2002) Proc. ICSLP , pp. 2585-2588
- Tsakalidis, S.¹ Doumpiotis, V.² Byrne, W.³

35
- 4544345461
- Discriminative adaptive training using the MPE criterion
- L. Wang and P. C. Woodland, "Discriminative adaptive training using the MPE criterion," in Proc. ASRU, 2003, pp. 279-284.
- (2003) Proc. ASRU , pp. 279-284
- Wang, L.¹ Woodland, P.C.²

36
- 85009257847
- An environment compensated minimum classification error training approach and its evaluation on Aurora2 database
- Denver, CO, pp. I-453-I-456
- J. Wu and Q. Huo, "An environment compensated minimum classification error training approach and its evaluation on Aurora2 database," in Proc. ICSLP, Denver, CO, 2002, pp. I-453-I-456.
- Proc. ICSLP , pp. 2002
- Wu, J.¹ Huo, Q.²

37
- 85009181040
- Several HKU approaches for robust speech recognition and their evaluation on Aurora connected digit recognition tasks
- Geneva, Switzerland
- -, "Several HKU approaches for robust speech recognition and their evaluation on Aurora connected digit recognition tasks," in Proc. Eurospeech, Geneva, Switzerland, 2003, pp. 21-24.
- (2003) Proc. Eurospeech , pp. 21-24

38
- 20444395560
- An environment compensated maximum likelihood training approach based on stochastic vector mapping
- Philadelphia, PA
- J. Wu, Q. Huo, and D.-L. Zhu, "An environment compensated maximum likelihood training approach based on stochastic vector mapping," in Proc. ICASSP, Philadelphia, PA, 2005, pp. I-429-I-432.
- (2005) Proc. ICASSP
- Wu, J.¹ Huo, Q.² Zhu, D.-L.³

39
- 64649107002
- Speaker normalization by input space optimization for continuous density hidden Markov models
- Hong Kong, China, Apr
- J.-X. Wu, Z. Qi, C. Chan, and J. Li, "Speaker normalization by input space optimization for continuous density hidden Markov models," in 1994 Int. Symp. Speech, Image Process. Neural Netw., Hong Kong, China, Apr. 1994, pp. 682-685.
- (1994) 1994 Int. Symp. Speech, Image Process. Neural Netw , pp. 682-685
- Wu, J.-X.¹ Qi, Z.² Chan, C.³ Li, J.⁴

40
- 64649098651
- S. Young et al., The HTK Book (for HTK V3.0) July 2000.
- S. Young et al., The HTK Book (for HTK V3.0) July 2000.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.