



Volume 14, Issue 1, 2006, Pages 266-275

Automatic segmentation and identification of mixed-language speech using delta-BIC and LSA-based GMMs

Author keywords

Gaussian mixture model; Language identification; Latent semantic analysis; Mixed language speech; Single language speech

Indexed keywords

GAUSSIAN MIXTURE MODEL; LANGUAGE IDENTIFICATION; LATENT SEMANTIC ANALYSIS; MIXED-LANGUAGE SPEECH; SINGLE-LANGUAGE SPEECH;

EID: 33745000055     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.852992     Document Type: Conference Paper
Times cited : (43)

References (30)
  • 1
    • 0012327341
    • Multilinguality in speech and spoken language systems
    • A. Waibel, P. Geutner, and L. M. Tomokiyo et al., "Multilinguality in speech and spoken language systems," Proc. IEEE, vol. 88, no. 8, pp. 1297-1313, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1297-1313
    • Waibel, A.1    Geutner, P.2    Tomokiyo, L.M.3
  • 4
    • 0029733178
    • Comparison of four approaches to automatic language identification of telephone speech
    • M. A. Zissman, "Comparison of four approaches to automatic language identification of telephone speech," IEEE Trans. Speech Audio Processing, vol. 4, no. 1, pp. 31-44, 1996.
    • (1996) IEEE Trans. Speech Audio Processing , vol.4 , Issue.1 , pp. 31-44
    • Zissman, M.A.1
  • 5
    • 85009208002
    • NIST 2003 language recognition evaluation
    • A. P. Martin and M. A. Przybocki, "NIST 2003 language recognition evaluation," in Proc. EUROSPEECH'03, 2003, pp. 1341-1344.
    • (2003) Proc. EUROSPEECH'03 , pp. 1341-1344
    • Martin, A.P.1    Przybocki, M.A.2
  • 6
    • 85009275225
    • Approaches to language identification using Gaussian mixture models and shifted delta cepstral features
    • P. A. Torres-Carrasquillo et al., "Approaches to language identification using Gaussian mixture models and shifted delta cepstral features," in Proc. ICSLP'02, 2002, pp. 89-92.
    • (2002) Proc. ICSLP'02 , pp. 89-92
    • Torres-Carrasquillo, P.A.1
  • 7
    • 0033154048
    • Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification
    • K.-H. You and H.-C. Wang, "Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification," Speech Commun., vol. 28, pp. 227-241, 1999.
    • (1999) Speech Commun. , vol.28 , pp. 227-241
    • You, K.-H.1    Wang, H.-C.2
  • 8
    • 0035426911
    • Multilingual phone models for vocabulary-independent speech recognition tasks
    • J. Köhler, "Multilingual phone models for vocabulary-independent speech recognition tasks," Speech Commun., vol. 35, pp. 21-30, 2001.
    • (2001) Speech Commun. , vol.35 , pp. 21-30
    • Köhler, J.1
  • 9
    • 0036497598
    • Discriminative training of Gaussian mixture bi-gram models with application to Chinese dialect identification
    • W.-H. Tsai and W.-W. Chang, "Discriminative training of Gaussian mixture bi-gram models with application to Chinese dialect identification," Speech Commun., vol. 36, pp. 317-326, 2002.
    • (2002) Speech Commun. , vol.36 , pp. 317-326
    • Tsai, W.-H.1    Chang, W.-W.2
  • 10
    • 0035510539
    • Noise robust speech parameterization using multiresolution feature extraction
    • R. Hariharan, I. Kiss, and O. Viikki, "Noise robust speech parameterization using multiresolution feature extraction," IEEE Trans. Speech Audio Processing, vol. 9, no. 8, pp. 856-865, 2001.
    • (2001) IEEE Trans. Speech Audio Processing , vol.9 , Issue.8 , pp. 856-865
    • Hariharan, R.1    Kiss, I.2    Viikki, O.3
  • 11
    • 0033884177
    • Maximum likelihood and minimum classification error factor analysis for automatic speech recognition
    • L. K. Saul and M. G. Rahim, "Maximum likelihood and minimum classification error factor analysis for automatic speech recognition," IEEE Trans. Speech Audio Processing, vol. 8, no. 2, pp. 115-125, 2000.
    • (2000) IEEE Trans. Speech Audio Processing , vol.8 , Issue.2 , pp. 115-125
    • Saul, L.K.1    Rahim, M.G.2
  • 12
    • 0034227923
    • Automatic language identification: An alternative approach to phonetic modeling
    • F. Pellegrino and R. Andre-Obrecht, "Automatic language identification: an alternative approach to phonetic modeling," Signal Process., vol. 80, pp. 1231-1244, 2000.
    • (2000) Signal Process. , vol.80 , pp. 1231-1244
    • Pellegrino, F.1    Andre-Obrecht, R.2
  • 13
    • 0035441593
    • Spoken language recognition - A step toward multilinguality in speech processing
    • J. Navratil, "Spoken language recognition - a step toward multilinguality in speech processing," IEEE Trans. Speech Audio Processing, vol. 9, no. 6, pp. 678-685, 2001.
    • (2001) IEEE Trans. Speech Audio Processing , vol.9 , Issue.6 , pp. 678-685
    • Navratil, J.1
  • 15
    • 78650540904
    • Improved speaker segmentation and segments clustering using the Bayesian information criterion
    • A. Tritschler and R. Gopinath, "Improved speaker segmentation and segments clustering using the Bayesian information criterion," in Proc. EUROSPEECH'99, vol. 2, 1999, pp. 679-682.
    • (1999) Proc. EUROSPEECH'99 , vol.2 , pp. 679-682
    • Tritschler, A.1    Gopinath, R.2
  • 16
    • 0000274403
    • Exploiting latent semantic information in statistical language modeling
    • J. R. Bellegarda, "Exploiting latent semantic information in statistical language modeling," Proc. IEEE, vol. 88, no. 8, pp. 1279-1296, 2000.
    • (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1279-1296
    • Bellegarda, J.R.1
  • 19
    • 0002629270
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc. , vol.39 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 23
    • 33744980231
    • The modality words in modern Mandarin
    • L. L. Chang, "The Modality Words in Modern Mandarin," Tech. Rep., CKIP 93-06, 1993.
    • (1993) Tech. Rep. , vol.CKIP 93-06
    • Chang, L.L.1
  • 24
    • 0036836706
    • Generation of robust phonetic set and decision tree for Mandarin using chi-square testing
    • Y.-J. Chen, C.-H. Wu, Y.-H. Chiu, and H.-C. Liao, "Generation of robust phonetic set and decision tree for Mandarin using chi-square testing," Speech Commun., vol. 38, no. 3-4, pp. 349-364, 2002.
    • (2002) Speech Commun. , vol.38 , Issue.3-4 , pp. 349-364
    • Chen, Y.-J.1    Wu, C.-H.2    Chiu, Y.-H.3    Liao, H.-C.4
  • 27
    • 0141740992
    • Establish Taiwanese 7-tones syllable-based synthesis units database for the prototype development of text-to-speech system
    • Y.-J. Sher, K.-C. Chung, and C.-H. Wu, "Establish Taiwanese 7-tones syllable-based synthesis units database for the prototype development of text-to-speech system," in Proc. ROCLING XII, 1999, pp. 15-35.
    • (1999) Proc. ROCLING XII , pp. 15-35
    • Sher, Y.-J.1    Chung, K.-C.2    Wu, C.-H.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.