SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 13, Issue 3, 2005, Pages 367-376

Discriminative linear transforms for feature normalization and speaker adaptation in HMM estimation

(3) Tsakalidis, Stavros a Doumpiotis, Vlasios a Byrne, William a

a Johns Hopkins University (United States)

Author keywords

Adaptive training; Correlation modeling; Discrim inative training

Indexed keywords

ACOUSTIC SIGNAL PROCESSING; ALGORITHMS; CORRELATION METHODS; MARKOV PROCESSES; MATHEMATICAL MODELS; MATHEMATICAL TRANSFORMATIONS; MATRIX ALGEBRA; MAXIMUM LIKELIHOOD ESTIMATION; SPEECH PROCESSING; VECTORS;

ADAPTIVE TRAINING; CORRELATION MODELING; DISCRIMINATIVE TRAINING; HMM ESTIMATION;

FEATURE EXTRACTION;

EID: 18744406714 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.845806 Document Type: Article

Times cited : (19)

References (23)

1
- 84892187452
- Maximum likelihood modeling with Gaussian distributions for classification
- May
- R. A. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 2, May 1998, pp. 661-664.
- (1998) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.2 , pp. 661-664
- Gopinath, R.A.¹

2
- 0032638856
- Semi-tied covariance matrices for hidden Markov models
- May
- M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 272-281, May 1999.
- (1999) IEEE Trans. Speech Audio Process , vol.7 , Issue.5 , pp. 272-281
- Gales, M.J.F.¹

3
- 0030362995
- A compact model for speaker-adaptive training
- T. Anastasakos, J. McDonough, R. Schwartz, and J. Makhoul, "A compact model for speaker-adaptive training," in Proc. Int. Conf. Spoken Language Processing, 1996, pp. 1137-1140.
- (1996) Proc. Int. Conf. Spoken Language Processing , pp. 1137-1140
- Anastasakos, T.¹ McDonough, J.² Schwartz, R.³ Makhoul, J.⁴

4
- 0032050110
- Maximum likelihood linear transformations for HMM-based speech recognition
- Apr.
- M. J. F. Gales, "Maximum likelihood linear transformations for HMM-based speech recognition," Comput. Speech Lang., vol. 12, no. 2, pp. 75-98, Apr. 1998.
- (1998) Comput. Speech Lang. , vol.12 , Issue.2 , pp. 75-98
- Gales, M.J.F.¹

5
- 0002756136
- Maximum mutual information estimation of hidden Markov models
- C.-H. Lee, F. K. Soong, and K. K. Paliwal, Eds. Norwell, MA: Kluwer, ch. 3
- Y. Normandin, "Maximum mutual information estimation of hidden Markov models," in Automatic Speech and Speaker Recognition: Advanced Topics, C.-H. Lee, F. K. Soong, and K. K. Paliwal, Eds. Norwell, MA: Kluwer, 1996, ch. 3, pp. 57-81.
- (1996) Automatic Speech and Speaker Recognition: Advanced Topics , pp. 57-81
- Normandin, Y.¹

6
- 0002867698
- Large scale discriminative training for speech recognition
- P. C. Woodland and D. Povey, "Large scale discriminative training for speech recognition," in Proc. Tutorial Research Workshop on Automatic Speech Recognition, 2000, pp. 7-16.
- (2000) Proc. Tutorial Research Workshop on Automatic Speech Recognition , pp. 7-16
- Woodland, P.C.¹ Povey, D.²

7
- 18744388722
- Discriminative linear transforms for speaker adaptation
- L. F. Uebel and P. C. Woodland, "Discriminative linear transforms for speaker adaptation," in Proc. Tutorial Research Workshop on Adaptation Methods for Speech Recognition, 2001, pp. 61-64.
- (2001) Proc. Tutorial Research Workshop on Adaptation Methods for Speech Recognition , pp. 61-64
- Uebel, L.F.¹ Woodland, P.C.²

8
- 0034855183
- Improvements in linear transforms based speaker adaptation
- May
- _, "Improvements in linear transforms based speaker adaptation," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, May 2001, pp. 49-52.
- (2001) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 49-52

9
- 85009119467
- Discriminative speaker adaptation with conditional maximum likelihood linear regression
- A. Gunawardana and W. Byrne, "Discriminative speaker adaptation with conditional maximum likelihood linear regression," in Proc. Eur. Conf. Speech Communication and Technology, 2001, pp. 1203-1206.
- (2001) Proc. Eur. Conf. Speech Communication and Technology , pp. 1203-1206
- Gunawardana, A.¹ Byrne, W.²

10
- 0036294871
- On maximum mutual information speaker-adapted training
- May
- J. McDonough, T. Schaaf, and A. Waibel, "On maximum mutual information speaker-adapted training," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, May 2002, pp. 601-604.
- (2002) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 601-604
- McDonough, J.¹ Schaaf, T.² Waibel, A.³

11
- 18744393254
- The AT&T LVCSR-2001 system
- A. Ljolje, "The AT&T LVCSR-2001 system," presented at the NIST LVCSR Workshop, 2001.
- (2001) NIST LVCSR Workshop
- Ljolje, A.¹

12
- 0024905238
- A comparison of several acoustic representations for speech recognition with degraded and undegraded speech
- May
- M. Hunt and C. Lefèbvre, "A comparison of several acoustic representations for speech recognition with degraded and undegraded speech," in Proc. Int. Conf. Acoustics, Speech, Signal Processing, vol. 1, May 1989, pp. 262-265.
- (1989) Proc. Int. Conf. Acoustics, Speech, Signal Processing , vol.1 , pp. 262-265
- Hunt, M.¹ Lefèbvre, C.²

13
- 0009643121
- Ph.D. dissertation, RWTH Aachen- Univ. Technol., Aachen, Germany
- R. Schlüter, "Investigations on Discriminative Training Criteria," Ph.D. dissertation, RWTH Aachen- Univ. Technol., Aachen, Germany, 2000.
- (2000) Investigations on Discriminative Training Criteria
- Schlüter, R.¹

14
- 4244119525
- Maximum mutual information estimation of acoustic HMM emission densities
- Johns Hopkins Univerisity, CLSP, Baltimore, MD
- A. Gunawardana, "Maximum Mutual Information Estimation of Acoustic HMM Emission Densities," Johns Hopkins Univerisity, CLSP, Baltimore, MD, Tech. Rep. CLSP Reasearch Note no. 40, 2001.
- (2001) Tech. Rep. CLSP Reasearch Note No. 40 , vol.40
- Gunawardana, A.¹

15
- 1642372928
- Variance compensation within the MLLR framework
- Eng. Dept., Univ. Cambridge, Cambridge, U.K.
- M. J. F. Gales and P. Woodland, "Variance Compensation Within the MLLR Framework," Eng. Dept., Univ. Cambridge, Cambridge, U.K., Tech. Rep. CUED/F-INFENT/TR242, 1996.
- (1996) Tech. Rep. , vol.CUED-F-INFENT-TR242
- Gales, M.J.F.¹ Woodland, P.²

16
- 0003822743
- S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book, 3.0 ed., 2000.
- (2000) The HTK Book, 3.0 Ed.
- Young, S.¹ Kershaw, D.² Odell, J.³ Ollason, D.⁴ Valtchev, V.⁵ Woodland, P.⁶

17
- 2442597230
- The JHU march 2001 Hub-5 conversational speech transcription system
- W. Byrne, "The JHU march 2001 Hub-5 conversational speech transcription system," presented at the NIST LVCSR Workshop, 2001.
- (2001) NIST LVCSR Workshop
- Byrne, W.¹

18
- 0025041264
- Perceptual linear predictive (PLP) analysis of speech
- Apr.
- H. Hermansky, "Perceptual linear predictive (PLP) analysis of speech," J. Acoust. Soc. America, vol. 87, no. 4, pp. 1738-1752, Apr. 1990.
- (1990) J. Acoust. Soc. America , vol.87 , Issue.4 , pp. 1738-1752
- Hermansky, H.¹

19
- 85135259135
- Integrated context-dependent networks in very large vocabulary speech recognition
- M. Mohri and M. Riley, "Integrated context-dependent networks in very large vocabulary speech recognition," in Proc. Eur. Conf. Speech Commun. Technol., 1999, pp. 811-814.
- (1999) Proc. Eur. Conf. Speech Commun. Technol. , pp. 811-814
- Mohri, M.¹ Riley, M.²

20
- 0037519295
- The SRI march 2000 Hub-5 conversational speech transcription system
- A. Stolcke, H. Bratt, J. Butzberger, H. Franco, V. R. Rao Gadde, M. Plauché, C. Richey, E. Shriberg, K. Sönmez, F. Weng, and J. Zheng, 'The SRI march 2000 Hub-5 conversational speech transcription system," presented at the Speech Transcription Workshop, 2000.
- (2000) Speech Transcription Workshop
- Stolcke, A.¹ Bratt, H.² Butzberger, J.³ Franco, H.⁴ Gadde, V.R.R.⁵ Plauché, M.⁶ Richey, C.⁷ Shriberg, E.⁸ Sönmez, K.⁹ Weng, F.¹⁰ Zheng, J.¹¹

21
- 18744376446
- The 2000 NIST evaluation for recognition of conversational speech over the telephone
- A. Martin, M. Przybocki, J. Fiscus, and D. Pallett, "The 2000 NIST evaluation for recognition of conversational speech over the telephone," presented at the Speech Transcription Workshop, 2000.
- (2000) Speech Transcription Workshop
- Martin, A.¹ Przybocki, M.² Fiscus, J.³ Pallett, D.⁴

22
- 18744389914
- The evaluation: Word error rates and confidence analysis
- Linthicum Heights, MD, [Online]
- A. Martin, J. Fiscus, M. Przybocki, and B. Fisher, "The evaluation: Word error rates and confidence analysis," presented at the Hub-5 Workshop, Linthicum Heights, MD, 1998. [Online]. Available: http://www.nist.gov/speech/ tests/ctr/hub5e_98/hub5e_98.htm.
- (1998) Hub-5 Workshop
- Martin, A.¹ Fiscus, J.² Przybocki, M.³ Fisher, B.⁴

23
- 84962901028
- Adaptive training for robust ASR
- Dec.
- M. J. F. Gales, "Adaptive training for robust ASR," in Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, Dec. 2001, pp. 15-20.
- (2001) Proc. IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 15-20
- Gales, M.J.F.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.