SCOPUS 정보 검색 플랫폼

IEEE Transactions on Speech and Audio Processing

Volumn 13, Issue 4, 2005, Pages 554-563

Rapid discriminative acoustic model based on eigenspace mapping for fast speaker adaptation

(2) Zhou, Bowen a,b Hansen, John H L a,c

a UNIVERSITY OF COLORADO (United States)

b IBM (United States)

c UNIVERSITY OF TEXAS AT DALLAS (United States)

Author keywords

Discriminative acoustic model; Eigenspace mapping; Hidden markov models; Rapid speaker adaptation; Speech recognition

Indexed keywords

ADAPTIVE ALGORITHMS; CORRELATION METHODS; EIGENVALUES AND EIGENFUNCTIONS; MARKOV PROCESSES; MATHEMATICAL MODELS; SPEECH RECOGNITION;

DISCRIMINATIVE ACOUSTIC MODEL; EIGENSPACE MAPPING; HIDDEN MARKOV MODELS; RAPID SPEAKER ADAPTATION;

ACOUSTICS;

EID: 22544443963 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.845808 Document Type: Article

Times cited : (21)

References (33)

1
- 0031177213
- Combined bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models
- S. M. Ahadi and P. C. Woodland, "Combined bayesian and predictive techniques for rapid speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 11, pp. 187-206, 1997.
- (1997) Comput. Speech Lang , vol.11 , pp. 187-206
- Ahadi, S.M.¹ Woodland, P.C.²

2
- 11844281179
- Within-utterance correlation for speech recognition
- Budapest, Hungary
- M. Blomberg, "Within-utterance correlation for speech recognition," in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 2479-2482.
- (1999) Proc. Eurospeech , pp. 2479-2482
- Blomberg, M.¹

3
- 84871620008
- Discounted likelihood linear regression for rapid speaker adaptation
- Budapest, Hungary
- W. Byrne and A. Gunawardana, "Discounted likelihood linear regression for rapid speaker adaptation," in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 203-206.
- (1999) Proc. Eurospeech , pp. 203-206
- Byrne, W.¹ Gunawardana, A.²

4
- 0002595416
- Speaker, environment and channel change detection and clustering via the Bayesian information criterion
- Feb.
- S. Chen and P. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the Bayesian information criterion," in Proc. Broadcast News Transcription Understanding Workshop, Feb. 1998, pp. 127-132.
- (1998) Proc. Broadcast News Transcription Understanding Workshop , pp. 127-132
- Chen, S.¹ Gopalakrishnan, P.²

5
- 85135272864
- Maximum a posterior linear regression for hidden Markov model adaptation
- Budapest, Hungary
- C. Chesta, O. Siohan, and C. H. Lee, "Maximum a posterior linear regression for hidden Markov model adaptation," in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 203-206.
- (1999) Proc. Eurospeech , pp. 203-206
- Chesta, C.¹ Siohan, O.² Lee, C.H.³

6
- 84874875877
- Maximum a posterior linear regression with elliptically symmetric matrix priors
- Budapest, Hungary
- W. Chou, "Maximum a posterior linear regression with elliptically symmetric matrix priors," in Proc. Eurospeech, Budapest, Hungary, 1999, pp. 1-4.
- (1999) Proc. Eurospeech , pp. 1-4
- Chou, W.¹

7
- 0002629270
- Maximum likelihood estimation from incomplete data via the em algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood estimation from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. B39, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. , vol.B39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

8
- 0032097263
- New York: Academic
- K. Fukunaga, Introduction to Statistical Pattern Recognition, 2nd ed. New York: Academic, 1990.
- (1990) Introduction to Statistical Pattern Recognition, 2nd Ed.
- Fukunaga, K.¹

9
- 0022667694
- Speaker-independent isolated word recognition using dynamic features of speech spectrum
- S. Furui, "Speaker-independent isolated word recognition using dynamic features of speech spectrum," IEEE Trans. Acoust., Speech, Signal Process., vol. 34, no. 1, pp. 52-59, 1986.
- (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.34 , Issue.1 , pp. 52-59
- Furui, S.¹

10
- 0032638856
- Semi-tied covariance matrices for hidden Markov models
- M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Acoust., Speech, Signal Process., vol. 7, no. 3, 1999.
- (1999) IEEE Trans. Acoust., Speech, Signal Process. , vol.7 , Issue.3
- Gales, M.J.F.¹

11
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- Apr.
- J. L. Gauvain and C. H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol. 2, pp. 291-298, Apr. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 291-298
- Gauvain, J.L.¹ Lee, C.H.²

12
- 0004134563
- New York: Marcel Dekker, ch. 10
- N. C. Giri, Multivariate Statistical Analysis. New York: Marcel Dekker, 1995, ch. 10.
- (1995) Multivariate Statistical Analysis
- Giri, N.C.¹

13
- 84892187452
- Maximum likelihood modeling with Gaussian distributions for classification
- Seattle, WA
- R. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. ICASSP, Seattle, WA, 1998.
- (1998) Proc. ICASSP
- Gopinath, R.¹

14
- 33745220319
- CU-move: Analysis & corpus development for interactive in-vehicle speech systems
- Aalborg, Denmark, Sep.
- J. H. L. Hansen, P. Angkititrakul, J. Plucienkowski, S. Gallant, U. Yapanel, B. Pellom, W. Ward, and R. Cole, "CU-move: Analysis & corpus development for interactive in-vehicle speech systems," in Proc. Eurospeech, Aalborg, Denmark, Sep. 2001.
- (2001) Proc. Eurospeech
- Hansen, J.H.L.¹ Angkititrakul, P.² Plucienkowski, J.³ Gallant, S.⁴ Yapanel, U.⁵ Pellom, B.⁶ Ward, W.⁷ Cole, R.⁸

15
- 1842797157
- Ph.D. dissertation, Oregon Graduate Institute of Science and Technology, Portland
- Z. Hu, "Understanding and Adapting to Speaker Variability using Correlation-Based Principal Component Analysis," Ph.D. dissertation, Oregon Graduate Institute of Science and Technology, Portland, 1999.
- (1999) Understanding and Adapting to Speaker Variability using Correlation-based Principal Component Analysis
- Hu, Z.¹

16
- 0141588436
- M.S. thesis, Univ. Cambridge, Cambridge, U.K.
- S. Johnson, "Speaker Tracking," M.S. thesis, Univ. Cambridge, Cambridge, U.K., 1997.
- (1997) Speaker Tracking
- Johnson, S.¹

17
- 84871609195
- EigenVoices for speaker adaptation
- Sydney, Australia, Nov.
- R. Kuhn, P. Nguyen, J. C. Junqua, L. Goldwasser, N. Niedzielski, S. Fincke, K. Field, and M. Contolini, "EigenVoices for speaker adaptation," in Proc. ICSLP, Sydney, Australia, Nov. 1998.
- (1998) Proc. ICSLP
- Kuhn, R.¹ Nguyen, P.² Junqua, J.C.³ Goldwasser, L.⁴ Niedzielski, N.⁵ Fincke, S.⁶ Field, K.⁷ Contolini, M.⁸

18
- 0034857758
- Very fast adaptation with a compact context-dependent eigenvoice model
- Salt Lake City, UT, May
- R. Kuhn, F. Perronnin, P. Nguyen, J.-C. Junqua, and L. Rigazio, "Very fast adaptation with a compact context-dependent eigenvoice model," in Proc. ICASSP, Salt Lake City, UT, May 2001.
- (2001) Proc. ICASSP
- Kuhn, R.¹ Perronnin, F.² Nguyen, P.³ Junqua, J.-C.⁴ Rigazio, L.⁵

19
- 0003871508
- Ph.D. dissertation, Johns Hopkins Univ., Baltimore, MD
- N. Kumar, "Investigation of Silicon-Auditory Models and Generalization of Linear Discriminant Analysis for Improved Speech Recognition," Ph.D. dissertation, Johns Hopkins Univ., Baltimore, MD, 1997.
- (1997) Investigation of Silicon-auditory Models and Generalization of Linear Discriminant Analysis for Improved Speech Recognition
- Kumar, N.¹

20
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. Leggetter and P. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
- (1995) Comput. Speech Lang. , vol.9 , pp. 171-185
- Leggetter, C.¹ Woodland, P.²

21
- 85135155427
- A comparative study of speaker adaptation techniques
- L. R. Neumeyer, A. Sankar, and V. V. Digalakis, "A comparative study of speaker adaptation techniques," in Proc. Eurospeech, 1995, pp. 1127-1130.
- (1995) Proc. Eurospeech , pp. 1127-1130
- Neumeyer, L.R.¹ Sankar, A.² Digalakis, V.V.³

22
- 0037584177
- Lawrence Erlbaum, Princeton, NJ
- Invariance and Variability in Speech Processes, J. S. Perkell and D. H. Klatt, Eds., Lawrence Erlbaum, Princeton, NJ, 1986.
- (1986) Invariance and Variability in Speech Processes
- Perkell, J.S.¹ Klatt, D.H.²

23
- 0008746009
- The 1996 hub-4 sphinx-3 system
- P. Placeway, S. Chen, M. Eskenazi, U. Jain, V. Parikh, B. Raj, M. Ravishankar, R. Rosenfeld, K. Seymore, M. Siegler, R. Stern, and E. Thayer, "The 1996 hub-4 sphinx-3 system," in Proc. DARPA Speech Recognition Workshop, 1997.
- (1997) Proc. DARPA Speech Recognition Workshop
- Placeway, P.¹ Chen, S.² Eskenazi, M.³ Jain, U.⁴ Parikh, V.⁵ Raj, B.⁶ Ravishankar, M.⁷ Rosenfeld, R.⁸ Seymore, K.⁹ Siegler, M.¹⁰ Stern, R.¹¹ Thayer, E.¹²

24
- 0033677121
- Maximum likelihood discriminant feature spaces
- Istanbul, Turkey, Jun.
- G. Saon, M. Padmanabhan, R. Gopinath, and S. Chen, "Maximum likelihood discriminant feature spaces," in Proc. ICASSP, Istanbul, Turkey, Jun. 2000.
- (2000) Proc. ICASSP
- Saon, G.¹ Padmanabhan, M.² Gopinath, R.³ Chen, S.⁴

25
- 0030640789
- Structural MAP speaker adaptation using hierarchical priors
- Santa Barbara, CA
- K. Shinoda and C. H. Lee, "Structural MAP speaker adaptation using hierarchical priors," in Proc. IEEE Workshop on Automatic Speech Recognition and Understanding, Santa Barbara, CA, 1997, pp. 381-388.
- (1997) Proc. IEEE Workshop on Automatic Speech Recognition and Understanding , pp. 381-388
- Shinoda, K.¹ Lee, C.H.²

26
- 0036461005
- Structural maximum a posteriori linear regression for fast HMM adaptation
- Jan.
- O. Siohan, T. A. Myrvoll, and C. H. Lee, "Structural maximum a posteriori linear regression for fast HMM adaptation," Comput. Speech Lang., vol. 16, no. 1, pp. 5-24, Jan. 2002.
- (2002) Comput. Speech Lang , vol.16 , Issue.1 , pp. 5-24
- Siohan, O.¹ Myrvoll, T.A.² Lee, C.H.³

27
- 0003840620
- M.Phil, thesis, Univ. Cambridge, Cambridge, U.K.
- R. J. Westwood, "Speaker Adaptation using Eigenvoices," M.Phil, thesis, Univ. Cambridge, Cambridge, U.K., 1999.
- (1999) Speaker Adaptation Using Eigenvoices
- Westwood, R.J.¹

28
- 21444449963
- Rapid speaker adaptation using MLLR and subspace regression classes
- Aalborg, Denmark, Sep.
- K. Wong and B. Mak, "Rapid speaker adaptation using MLLR and subspace regression classes," in Proc. Eurospeech, Aalborg, Denmark, Sep. 2001.
- (2001) Proc. Eurospeech
- Wong, K.¹ Mak, B.²

29
- 0002615167
- Speaker adaptation: Techniques and challenges
- Keystone, CO
- P. C. Woodland, "Speaker adaptation: Techniques and challenges," in Proc. IEEE Workshop on Automatic Speech Recognition & Understanding, Keystone, CO, 1999, pp. 85-90.
- (1999) Proc. IEEE Workshop on Automatic Speech Recognition & Understanding , pp. 85-90
- Woodland, P.C.¹

30
- 85009084294
- A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping
- Aalborg, Denmark, Sep.
- B. Zhou and J. H. L. Hansen, "A novel algorithm for rapid speaker adaptation based on structural maximum likelihood eigenspace mapping," in Proc. Eurospeech, vol. 2, Aalborg, Denmark, Sep. 2001, pp. 1215-1218.
- (2001) Proc. Eurospeech , vol.2 , pp. 1215-1218
- Zhou, B.¹ Hansen, J.H.L.²

31
- 85009288404
- Improved structural maximum likelihood eigenspace mapping for speaker adaptation
- Denver, CO
- _, "Improved structural maximum likelihood eigenspace mapping for speaker adaptation," in Proc. ICSLP'2002, Denver, CO, 2002, pp. 1433-14367.
- (2002) Proc. ICSLP'2002 , pp. 1433-14367

32
- 85009275098
- SpeechFind: An experimental on-line spoken document retrieval system for historical audio archives
- Denver, CO
- _, "SpeechFind: An experimental on-line spoken document retrieval system for historical audio archives," in Proc. ICSLP'2002, vol. 3, Denver, CO, 2002, pp. 1969-1972.
- (2002) Proc. ICSLP'2002 , vol.3 , pp. 1969-1972

33
- 85009089453
- Unsupervised audio stream segmentation and clustering via the Bayesian information criterion
- Beijing, China, Oct.
- _, "Unsupervised audio stream segmentation and clustering via the Bayesian information criterion," in Proc. ICSLP'2000, Beijing, China, Oct. 2000, pp. 714-717.
- (2000) Proc. ICSLP'2000 , pp. 714-717

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.