
Volume E86-D, Issue 3, 2003, Pages 464-473

Continuous speech recognition using an on-line speaker adaptation method based on automatic speaker clustering

Author keywords

MAP; MLLR; Speaker adaptation; Speaker clustering; Speech recognition

Indexed keywords

COMMUNICATION CHANNELS (INFORMATION THEORY); MAXIMUM LIKELIHOOD ESTIMATION; PROBABILITY; REGRESSION ANALYSIS; VECTOR QUANTIZATION

EID: 0038381713     PISSN: 09168532     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Article
Times cited: 2

References (18)
  • 1
    • 0033708392
    • On-line incremental speaker adaptation with automatic speaker change detection
    • Z-P. Zhang, S. Furui, and K. Ohtsuki, "On-line incremental speaker adaptation with automatic speaker change detection," Proc. ICASSP'2000, pp.961-964, 2000.
    • (2000) Proc. ICASSP'2000 , pp. 961-964
    • Zhang, Z.-P.1    Furui, S.2    Ohtsuki, K.3
  • 2
    • 0032091375
    • Text-independent speaker recognition using non-linear frame likelihood transformation
    • K. Markov and S. Nakagawa, "Text-independent speaker recognition using non-linear frame likelihood transformation," Speech Commun., vol.24, no.3, pp.193-209, 1998.
    • (1998) Speech Commun. , vol.24 , Issue.3 , pp. 193-209
    • Markov, K.1    Nakagawa, S.2
  • 3
    • 0027311597
    • A new speech recognition method based on VQ-distortion measure and HMM
    • S. Nakagawa and H. Suzuki, "A new speech recognition method based on VQ-distortion measure and HMM," Proc. ICASSP'93, pp.676-679, 1993.
    • (1993) Proc. ICASSP'93 , pp. 676-679
    • Nakagawa, S.1    Suzuki, H.2
  • 4
    • 0034857759
    • Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition
    • K. Mori and S. Nakagawa, "Speaker change detection and speaker clustering using VQ distortion for broadcast news speech recognition," Proc. ICASSP'2001, pp.412-416, 2001.
    • (2001) Proc. ICASSP'2001 , pp. 412-416
    • Mori, K.1    Nakagawa, S.2
  • 5
    • 0029288633
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • Sept.
    • C.J. Leggetter and P.C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Computer Speech and Language, pp.171-185, Sept. 1995.
    • (1995) Computer Speech and Language , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 6
    • 0028419019
    • Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains
    • April
    • J.-L. Gauvain and C.-H. Lee, "Maximum a posteriori estimation for multivariate Gaussian mixture observations of Markov chains," IEEE Trans. Speech Audio Process., vol.2, no.2, pp.291-298, April 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.2 , pp. 291-298
    • Gauvain, J.-L.1    Lee, C.-H.2
  • 7
    • 0036642566
    • On-line incremental speaker adaptation for broadcast news transcription
    • Z. Zhang, S. Furui, and K. Ohtsuki, "On-line incremental speaker adaptation for broadcast news transcription," Speech Commun., vol.37, no.3-4, pp.271-281, 2002.
    • (2002) Speech Commun. , vol.37 , Issue.3-4 , pp. 271-281
    • Zhang, Z.1    Furui, S.2    Ohtsuki, K.3
  • 8
    • 0011510430
    • An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation
    • Y. Tsurumi and S. Nakagawa, "An unsupervised speaker adaptation method for continuous parameter HMM by maximum a posteriori probability estimation," Proc. ICSLP'94, pp.431-434, 1994.
    • (1994) Proc. ICSLP'94 , pp. 431-434
    • Tsurumi, Y.1    Nakagawa, S.2
  • 11
    • 0031704151
    • Speaker clustering and transformation for speaker adaptation in speech recognition systems
    • M. Padmanabhan, L.R. Bahl, D. Nahamoo, and M.A. Picheny, "Speaker clustering and transformation for speaker adaptation in speech recognition systems," IEEE Trans. Speech Audio Process., vol.6, no.1, pp.71-77, 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.1 , pp. 71-77
    • Padmanabhan, M.1    Bahl, L.R.2    Nahamoo, D.3    Picheny, M.A.4
  • 12
    • 0034227757
    • Cluster adaptive training of hidden Markov models
    • M.J.F. Gales, "Cluster adaptive training of hidden Markov models," IEEE Trans. Speech Audio Process., vol.8, no.4, pp.417-428, 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.4 , pp. 417-428
    • Gales, M.J.F.1
  • 13
    • 0038338004
    • Speaker adaptation using phoneme-dependent tree-structured speaker clustering
    • June
    • M. Suzuki, T. Abe, H. Mori, S. Makino, and H. Aso, "Speaker adaptation using phoneme-dependent tree-structured speaker clustering," IEICE Trans. Inf. & Syst., vol.82-D-II, no.6, pp.981-989, June 1999.
    • (1999) IEICE Trans. Inf. & Syst. , vol.82 D-II , Issue.6 , pp. 981-989
    • Suzuki, M.1    Abe, T.2    Mori, H.3    Makino, S.4    Aso, H.5
  • 15
    • 0038000063
    • Acoustic model adaptation by selective training using 2-stage clustering
    • Feb.
    • S. Sato, H. Segi, K. Onoe, E. Miyasaka, H. Isone, T. Imai, and A. Ando, "Acoustic model adaptation by selective training using 2-stage clustering," IEICE Trans. Inf. & Syst., vol.85-D-II, no.2, pp.174-183, Feb. 2002.
    • (2002) IEICE Trans. Inf. & Syst. , vol.85 D-II , Issue.2 , pp. 174-183
    • Sato, S.1    Segi, H.2    Onoe, K.3    Miyasaka, E.4    Isone, H.5    Imai, T.6    Ando, A.7
  • 16
    • 0036293689
    • Speaker detection and tracking for telephone transaction
    • J. McLaughlin and D.A. Reynolds, "Speaker detection and tracking for telephone transaction," Proc. ICASSP, pp. 129-132, 2002.
    • (2002) Proc. ICASSP , pp. 129-132
    • McLaughlin, J.1    Reynolds, D.A.2
  • 17
    • 0038000062
    • Unsupervised adaptation of an acoustic model based on decoding strategies using word and phoneme posterior probabilities
    • Spring Acoustic Soc. Japan
    • J. Ogata and Y. Ariki, "Unsupervised adaptation of an acoustic model based on decoding strategies using word and phoneme posterior probabilities," Conference Record, Spring Acoustic Soc. Japan, pp. 137-138, 2002.
    • (2002) Conference Record , pp. 137-138
    • Ogata, J.1    Ariki, Y.2
  • 18
    • 0038337984
    • Continuous speech recognition using a sequential speaker adaptation method based on automatic speaker clustering
    • Spring Acoustic Soc. Japan
    • W. Zhang and S. Nakagawa, "Continuous speech recognition using a sequential speaker adaptation method based on automatic speaker clustering," Conference Record, Spring Acoustic Soc. Japan, pp. 121-122, 2002.
    • (2002) Conference Record , pp. 121-122
    • Zhang, W.1    Nakagawa, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.