SCOPUS Information Search Platform

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volume , Issue , 2010, Pages 4422-4425

An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition

Authors (4): Tsao, Yu (a); Sun, Hanwu (b); Li, Haizhou (b); Lee, Chin Hui (c)

a NATIONAL INSTITUTE OF INFORMATION AND COMMUNICATIONS TECHNOLOGY (Japan)

b INSTITUTE FOR INFOCOMM RESEARCH (Singapore)

c GEORGIA INSTITUTE OF TECHNOLOGY (United States)

Author keywords

Acoustic segment model; Speaker recognition

Indexed keywords

CHARACTER RECOGNITION; MODELING LANGUAGES; SIGNAL PROCESSING; SPEECH RECOGNITION;

ACOUSTIC SEGMENT MODELS; LANGUAGE MODEL; MODEL-BASED OPC; MODELING APPROACH; MULTIPLE SET; SPEAKER MODEL; SPEAKER RECOGNITION; TEMPORAL INFORMATION; TEXT INDEPENDENTS; UNIVERSAL BACKGROUND MODEL;

COMPUTATIONAL LINGUISTICS;

EID: 78049411640 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2010.5495617 Document Type: Conference Paper

Times cited : (16)

References (18)

1
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, vol. 10, pp. 19-41, 2000.
- (2000) Digital Signal Processing , vol.10 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

2
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Proc., vol. 3, pp. 72-83, 1995.
- (1995) IEEE Trans. Speech Audio Proc. , vol.3 , pp. 72-83
- Reynolds, D.A.¹ Rose, R.C.²

3
- 84867191111
- Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions
- C.-L. Huang, B. Ma, C.-H. Wu, B. Mak, and H. Li, "Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions," Proc. Interspeech, pp. 1897-1900, 2008.
- (2008) Proc. Interspeech , pp. 1897-1900
- Huang, C.-L.¹ Ma, B.² Wu, C.-H.³ Mak, B.⁴ Li, H.⁵

4
- 0023800699
- A segment model based approach to speech recognition
- C.-H. Lee, F. K. Soong, and B.-H. Juang, "A segment model based approach to speech recognition," Proc. ICASSP, pp. 501-541, 1988.
- (1988) Proc. ICASSP , pp. 501-541
- Lee, C.-H.¹ Soong, F.K.² Juang, B.-H.³

5
- 0024610919
- A tutorial on hidden Markov models and selected applications in speech recognition
- L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, 1989.
- (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
- Rabiner, L.R.¹

6
- 34547502608
- A vector space modeling approach to spoken language identification
- H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. Audio, Speech, and Language Proc., vol. 15, pp. 271-284, 2007.
- (2007) IEEE Trans. Audio, Speech, and Language Proc. , vol.15 , pp. 271-284
- Li, H.¹ Ma, B.² Lee, C.-H.³

7
- 84873444148
- A study on music genre classification based on universal acoustic models
- J. Reed and C.-H. Lee, "A study on music genre classification based on universal acoustic models," Proc. ISMIR, pp. 89-94, 2006.
- (2006) Proc. ISMIR , pp. 89-94
- Reed, J.¹ Lee, C.-H.²

8
- 0023211850
- On the automatic segmentation of speech signals
- T. Svendsen and F. K. Soong, "On the automatic segmentation of speech signals," Proc. ICASSP, pp. 77-80, 1987.
- (1987) Proc. ICASSP , pp. 77-80
- Svendsen, T.¹ Soong, F.K.²

9
- 43249126081
- Compensation of nuisance factors for speaker and language recognition
- F. Castaldo, D. Colibro, E. Dalmasso, P. Laface, and C. Vair, "Compensation of nuisance factors for speaker and language recognition," IEEE Trans. on Audio, Speech, and Language Processing, vol. 15, pp. 1969-1978, 2007.
- (2007) IEEE Trans. on Audio, Speech, and Language Processing , vol.15 , pp. 1969-1978
- Castaldo, F.¹ Colibro, D.² Dalmasso, E.³ Laface, P.⁴ Vair, C.⁵

10
- 0028419019
- Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
- J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Proc., vol. 2, pp. 291-299, 1994.
- (1994) IEEE Trans. Speech Audio Proc. , vol.2 , pp. 291-299
- Gauvain, J.-L.¹ Lee, C.-H.²

11
- 0035279111
- A structural Bayes approach to speaker adaptation
- K. Shinoda and C.-H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Processing, vol. 9, pp. 276-287, 2001.
- (2001) IEEE Trans. Speech Audio Processing , vol.9 , pp. 276-287
- Shinoda, K.¹ Lee, C.-H.²

12
- 0030149866
- A maximum-likelihood approach to stochastic matching for robust speech recognition
- A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Proc., vol. 4, pp. 190-202, 1996.
- (1996) IEEE Trans. Speech Audio Proc. , vol.4 , pp. 190-202
- Sankar, A.¹ Lee, C.-H.²

13
- 85046873967
- The DET curve in assessment of detection task performance
- A. Martin, G. Doddington, T. Kamm, M. Ordowski, and M. Przybocki, "The DET curve in assessment of detection task performance," Proc. Eurospeech, pp. 1895-1898, 1997.
- (1997) Proc. Eurospeech , pp. 1895-1898
- Martin, A.¹ Doddington, G.² Kamm, T.³ Ordowski, M.⁴ Przybocki, M.⁵

14
- 0004097709
- The NIST Year 2001 Speaker Recognition Evaluation Plan. http://www.itl.nist.gov/iad/mig/tests/spk/2001/.
- The NIST Year 2001 Speaker Recognition Evaluation Plan

15
- 0028517164
- RASTA processing of speech
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Proc., vol. 2, pp. 578-589, 1994.
- (1994) IEEE Trans. Speech Audio Proc. , vol.2 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

16
- 70349200796
- Joint MAP adaptation of feature transformation and Gaussian mixture model for speaker recognition
- D. Zhu, B. Ma, and H. Li, "Joint MAP adaptation of feature transformation and Gaussian mixture model for speaker recognition," Proc. ICASSP, pp. 4045-4048, 2009.
- (2009) Proc. ICASSP , pp. 4045-4048
- Zhu, D.¹ Ma, B.² Li, H.³

17
- 0033884857
- Score normalization for text-independent speaker verification systems
- R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems," Digital Signal Processing, vol. 10, pp. 42-54, 2000.
- (2000) Digital Signal Processing , vol.10 , pp. 42-54
- Auckenthaler, R.¹ Carey, M.² Lloyd-Thomas, H.³

18
- 67651177785
- An ensemble speaker and speaking environment modeling approach to robust speech recognition
- Y. Tsao and C.-H. Lee, "An ensemble speaker and speaking environment modeling approach to robust speech recognition," IEEE Trans. on Audio, Speech, and Language Proc., vol. 17, pp. 1025-1037, 2009.
- (2009) IEEE Trans. on Audio, Speech, and Language Proc. , vol.17 , pp. 1025-1037
- Tsao, Y.¹ Lee, C.-H.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.