메뉴 건너뛰기




Volumn , Issue , 2010, Pages 4422-4425

An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition

Author keywords

Acoustic segment model; Speaker recognition

Indexed keywords

CHARACTER RECOGNITION; MODELING LANGUAGES; SIGNAL PROCESSING; SPEECH RECOGNITION;

EID: 78049411640     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2010.5495617     Document Type: Conference Paper
Times cited : (16)

References (18)
  • 1
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn, "Speaker verification using adapted Gaussian mixture models," Digital Signal Processing, vol. 10, pp. 19-41, 2000.
    • (2000) Digital Signal Processing , vol.10 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 2
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Proc., vol. 3, pp. 72-83, 1995.
    • (1995) IEEE Trans. Speech Audio Proc. , vol.3 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 3
    • 84867191111 scopus 로고    scopus 로고
    • Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions
    • C.-L. Huang, B. Ma, C.-H. Wu, B. Mak, and H. Li, "Robust speaker verification using short-time frequency with long-time window and fusion of multi-resolutions," Proc. Interspeech, pp. 1897-1900, 2008.
    • (2008) Proc. Interspeech , pp. 1897-1900
    • Huang, C.-L.1    Ma, B.2    Wu, C.-H.3    Mak, B.4    Li, H.5
  • 4
    • 0023800699 scopus 로고
    • A segment model based approach to speech recognition
    • C.-H. Lee, F. K. Soong, and B.-H. Juang, "A segment model based approach to speech recognition," Proc. ICASSP, pp. 501-541, 1988.
    • (1988) Proc. ICASSP , pp. 501-541
    • Lee, C.-H.1    Soong, F.K.2    Juang, B.-H.3
  • 5
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected application in speech recognition
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected application in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 6
    • 34547502608 scopus 로고    scopus 로고
    • A vector space modeling approach to spoken language identification
    • H. Li, B. Ma, and C.-H. Lee, "A vector space modeling approach to spoken language identification," IEEE Trans. Audio, Speech, and Language Proc., vol. 15, pp. 271-284, 2007.
    • (2007) IEEE Trans. Audio, Speech, and Language Proc. , vol.15 , pp. 271-284
    • Li, H.1    Ma, B.2    Lee, C.-H.3
  • 7
    • 84873444148 scopus 로고    scopus 로고
    • A study on music genre classification based on universal acoustic models
    • J. Reed and C.-H. Lee, "A study on music genre classification based on universal acoustic models," Proc. ISMIR, pp. 89-94, 2006.
    • (2006) Proc. ISMIR , pp. 89-94
    • Reed, J.1    Lee, C.-H.2
  • 8
    • 0023211850 scopus 로고
    • On the automatic segmentation of speech signals
    • T. Svendsen and F. K. Soong, "On the automatic segmentation of speech signals," Proc. ICASSP, pp. 77-80, 1987.
    • (1987) Proc. ICASSP , pp. 77-80
    • Svendsen, T.1    Soong, F.K.2
  • 10
    • 0028419019 scopus 로고
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Proc., vol. 2, pp. 291-99, 1994.
    • (1994) IEEE Trans. Speech Audio Proc. , vol.2 , pp. 291-299
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 11
    • 0035279111 scopus 로고    scopus 로고
    • A structural Bayes approach to speaker adaptation
    • K. Shinoda and C.-H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Processing, vol. 9, pp. 276-287, 2001.
    • (2001) IEEE Trans. Speech Audio Processing , vol.9 , pp. 276-287
    • Shinoda, K.1    Lee, C.-H.2
  • 12
    • 0030149866 scopus 로고    scopus 로고
    • A maximum-likelihood approach to stochastic matching for robust speech recognition
    • A. Sankar and C.-H. Lee, "A maximum-likelihood approach to stochastic matching for robust speech recognition," IEEE Trans. Speech Audio Proc., vol. 4, pp. 190-202, 1996.
    • (1996) IEEE Trans. Speech Audio Proc. , vol.4 , pp. 190-202
    • Sankar, A.1    Lee, C.-H.2
  • 16
    • 70349200796 scopus 로고    scopus 로고
    • Joint MAP adaptation of feature transformation and Gaussian mixture model for speaker recognition
    • D. Zhu, B. Ma, and H. Li, "Joint MAP adaptation of feature transformation and Gaussian mixture model for speaker recognition," Proc. ICASSP, pp. 4045-4048, 2009.
    • (2009) Proc. ICASSP , pp. 4045-4048
    • Zhu, D.1    Ma, B.2    Li, H.3
  • 17
    • 0033884857 scopus 로고    scopus 로고
    • Score normalization for text-independent speaker verification systems
    • R. Auckenthaler, M. Carey, and H. Lloyd-Thomas, "Score normalization for text-independent speaker verification systems," Digital Signal Processing, vol. 10, pp. 42-54, 2000.
    • (2000) Digital Signal Processing , vol.10 , pp. 42-54
    • Auckenthaler, R.1    Carey, M.2    Lloyd-Thomas, H.3
  • 18
    • 67651177785 scopus 로고    scopus 로고
    • An ensemble speaker and speaking environment modeling approach to robust speech recognition
    • Y. Tsao and C.-H. Lee, "An ensemble speaker and speaking environment modeling approach to robust speech recognition," IEEE Trans. on Audio, Speech, and Language Proc., vol. 17, pp. 1025-1037, 2009.
    • (2009) IEEE Trans. on Audio, Speech, and Language Proc. , vol.17 , pp. 1025-1037
    • Tsao, Y.1    Lee, C.-H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.