SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 1, 2006, Pages 266-275

Automatic segmentation and identification of mixed-language speech using delta-BIC and LSA-based GMMs

(4) Wu, Chung Hsien a,b,c,d Chiu, Yu Hsien b Shia, Chi Jiun b Lin, Chun Yu b

a IEEE (Taiwan)

b NATIONAL CHENG KUNG UNIVERSITY (Taiwan)

c International Speech Communication Association (Taiwan)

d International Speech Communication Association ^* (Taiwan)

Author keywords

Gaussian mixture model; Language identification; Latent semantic analysis; Mixed language speech; Single language speech

Indexed keywords

GAUSSIAN MIXTURE MODEL; LANGUAGE IDENTIFICATION; LATENT SEMANTIC ANALYSIS; MIXED-LANGUAGE SPEECH; SINGLE-LANGUAGE SPEECH;

ACOUSTIC WAVES; DYNAMIC PROGRAMMING; MATHEMATICAL MODELS; MAXIMUM LIKELIHOOD ESTIMATION; SEMANTICS; VECTORS;

SPEECH RECOGNITION;

EID: 33745000055 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.852992 Document Type: Conference Paper

Times cited : (43)

References (30)

1
- 0012327341
- Multilinguality in speech and spoken language systems
- A. Waibel, P. Geutner, and L. M. Tomokiyo et al., "Multilinguality in speech and spoken language systems," Proc. IEEE, vol. 88, no. 8, pp. 1297-1313, 2000.
- (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1297-1313
- Waibel, A.¹ Geutner, P.² Tomokiyo, L.M.³

2
- 0003673081
- Ph.D. dissertation, Oregon Grad. Inst. Sci. Technol., Beaverton
- Y. K. Muthusamy, "A segmental approach to automatic language identification," Ph.D. dissertation, Oregon Grad. Inst. Sci. Technol., Beaverton, 1993.
- (1993) A Segmental Approach to Automatic Language Identification
- Muthusamy, Y.K.¹

3
- 0028516964
- Reviewing automatic language identification
- Y. K. Muthusamy, E. Barnard, and R. A. Cole, "Reviewing automatic language identification," IEEE Signal Processing Mag., vol. 11, no. 4, pp. 33-41, 1994.
- (1994) IEEE Signal Processing Mag. , vol.11 , Issue.4 , pp. 33-41
- Muthusamy, Y.K.¹ Barnard, E.² Cole, R.A.³

4
- 0029733178
- Comparison of four approaches to automatic language identification of telephone speech
- M. A. Zissman, "Comparison of four approaches to automatic language identification of telephone speech," IEEE Trans. Speech Audio Processing, vol. 4, no. 1, pp. 31-44, 1996.
- (1996) IEEE Trans. Speech Audio Processing , vol.4 , Issue.1 , pp. 31-44
- Zissman, M.A.¹

5
- 85009208002
- NIST 2003 language recognition evaluation
- A. P. Martin and M. A. Przybocki, "NIST 2003 language recognition evaluation," in Proc. EUROSPEECH'03, 2003, pp. 1341-1344.
- (2003) Proc. EUROSPEECH'03 , pp. 1341-1344
- Martin, A.P.¹ Przybocki, M.A.²

6
- 85009275225
- Approaches to language identification using Gaussian mixture models and shift delta ceptral features
- P. A. Torres-Carrasquillo et al., "Approaches to language identification using Gaussian mixture models and shift delta ceptral features," in Proc. ICSLP'02, 2002, pp. 89-92.
- (2002) Proc. ICSLP'02 , pp. 89-92
- Torres-Carrasquillo, P.A.¹

7
- 0033154048
- Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification
- K.-H. You and H.-C. Wang, "Joint estimation of feature transformation parameters and Gaussian mixture model for speaker identification," Speech Commun., vol. 28, pp. 227-241, 1999.
- (1999) Speech Commun. , vol.28 , pp. 227-241
- You, K.-H.¹ Wang, H.-C.²

8
- 0035426911
- Multilingual phone models for vocabulary-independent speech recognition tasks
- J. Köhler, "Multilingual phone models for vocabulary- independent speech recognition tasks," Speech Commun., vol. 35, pp. 21-30, 2001.
- (2001) Speech Commun. , vol.35 , pp. 21-30
- Köhler, J.¹

9
- 0036497598
- Discriminative training of Gaussian mixture bi-gram models with application to Chinese dialect identification
- W.-H. Tsai and W.-W. Chang, "Discriminative training of Gaussian mixture bi-gram models with application to Chinese dialect identification," Speech Commun., vol. 36, pp. 317-326, 2002.
- (2002) Speech Commun. , vol.36 , pp. 317-326
- Tsai, W.-H.¹ Chang, W.-W.²

10
- 0035510539
- Noise robust speech parameterization using multiresolution feature extraction
- R. Hariharan, I. Kiss, and O. Viikki, "Noise robust speech parameterization using multiresolution feature extraction," IEEE Trans. Speech Audio Processing, vol. 9, no. 8, pp. 856-865, 2001.
- (2001) IEEE Trans. Speech Audio Processing , vol.9 , Issue.8 , pp. 856-865
- Hariharan, R.¹ Kiss, I.² Viikki, O.³

11
- 0033884177
- Maximum likelihood and minimum classification error factor analysis for automatic speech recognition
- L. K. Saul and M. G. Rahim, "Maximum likelihood and minimum classification error factor analysis for automatic speech recognition," IEEE Trans. Speech Audio Processing, vol. 8, no. 2, pp. 115-125, 2000.
- (2000) IEEE Trans. Speech Audio Processing , vol.8 , Issue.2 , pp. 115-125
- Saul, L.K.¹ Rahim, M.G.²

12
- 0034227923
- Automatic language identification: An alternative approach to phonetic modeling
- F. Pellegrino and R. Andre-Obrecht, "Automatic language identification: an alternative approach to phonetic modeling," Signal Process., vol. 80, pp. 1231-1244, 2000.
- (2000) Signal Process. , vol.80 , pp. 1231-1244
- Pellegrino, F.¹ Andre-Obrecht, R.²

13
- 0035441593
- Spoken language recognition - A step toward multilinguality in speech processing
- J. Navratil, "Spoken language recognition - a step toward multilinguality in speech processing," IEEE Trans. Speech Audio Processing, vol. 9, no. 6, pp. 678-685, 2001.
- (2001) IEEE Trans. Speech Audio Processing , vol.9 , Issue.6 , pp. 678-685
- Navratil, J.¹

14
- 4544345457
- Model selection criteria for acoustic segmentation
- Paris, France
- M. Cettolo and A. Federico, "Model selection criteria for acoustic segmentation," in Proc. ISCA ITRW ASR '00 Automatic Speech Recognition, Paris, France, 2000, pp. 221-227.
- (2000) Proc. ISCA ITRW ASR '00 Automatic Speech Recognition , pp. 221-227
- Cettolo, M.¹ Federico, A.²

15
- 78650540904
- Improved speaker segmentation and segments clustering using the Bayesian information criterion
- A. Tritschler and R. Gopinath, "Improved speaker segmentation and segments clustering using the Bayesian information criterion," in Proc. EUROSPEECH'99, vol. 2, 1999, pp. 679-682.
- (1999) Proc. EUROSPEECH'99 , vol.2 , pp. 679-682
- Tritschler, A.¹ Gopinath, R.²

16
- 0000274403
- Exploiting latent semantic information in statistical language modeling
- J. R. Bellegarda, "Exploiting latent semantic information in statistical language modeling," Proc. IEEE, vol. 88, no. 8, pp. 1279-1296, 2000.
- (2000) Proc. IEEE , vol.88 , Issue.8 , pp. 1279-1296
- Bellegarda, J.R.¹

17
- 0003612818
- Cambridge, MA: MIT Press
- C. D. Manning and H. Schutze, Foundations of Statistical Natural Language Processing. Cambridge, MA: MIT Press, 1999.
- (1999) Foundations of Statistical Natural Language Processing
- Manning, C.D.¹ Schutze, H.²

18
- 0003708826
- New York: Wiley
- A. C. Rencher, Multivariate Statistical Inference and Applications. New York: Wiley, 1998.
- (1998) Multivariate Statistical Inference and Applications
- Rencher, A.C.¹

19
- 0002629270
- Maximum likelihood from incomplete data via the em algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc. , vol.39 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

20
- 0004056285
- Englewood Cliffs, NJ: Prentice-Hall
- X. Huang, A. Acero, and H.-W. Hon, Spoken Language Processing: A Guide to Theory, Algorithm, and System Development. Englewood Cliffs, NJ: Prentice-Hall, 2001.
- (2001) Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
- Huang, X.¹ Acero, A.² Hon, H.-W.³

21
- 0031120321
- Inducing features of random fields
- Apr.
- S. D. Pietra, V. D. Pietra, and J. Lafferty, "Inducing features of random fields," IEEE Trans. Pattern Anal. Machine Intell., vol. 19, no. 4, pp. 380-393, Apr. 1997.
- (1997) IEEE Trans. Pattern Anal. Machine Intell. , vol.19 , Issue.4 , pp. 380-393
- Pietra, S.D.¹ Pietra, V.D.² Lafferty, J.³

22
- 0003837293
- Englewood Cliffs, NJ: Prentice-Hall
- S. M. Kay, Fundamentals of Statistical Signal Processing: Detection Theory. Englewood Cliffs, NJ: Prentice-Hall, 1998.
- (1998) Fundamentals of Statistical Signal Processing: Detection Theory
- Kay, S.M.¹

23
- 33744980231
- The modality words in modern mandarin
- L. L. Chang, "The Modality Words in Modern Mandarin," Tech. Rep., CKIP 93-06, 1993.
- (1993) Tech. Rep. , vol.CKIP 93-06
- Chang, L.L.¹

24
- 0036836706
- Generation of robust phonetic set and decision tree for Mandarin using chi-square testing
- Y.-J. Chen, C.-H, Wu, Y.-H. Chiu, and H.-C. Liao, "Generation of robust phonetic set and decision tree for Mandarin using chi-square testing," Speech Commun., vol. 38, no. 3-4, pp. 349-364, 2002.
- (2002) Speech Commun. , vol.38 , Issue.3-4 , pp. 349-364
- Chen, Y.-J.¹ Wu, C.-H.² Chiu, Y.-H.³ Liao, H.-C.⁴

25
- 33745006164
- [Online]
- Carnegie Mellon University Pronouncing Dictionary, [Online], Available: http://www.speech.cs.cmu.edu/cgi-bin/cmudict.
- Carnegie Mellon University Pronouncing Dictionary

26
- 33744983078
- Taiwan EDUTECH Foundation Press
- K. Liim, Taiwan Dictionary of Words With Modern Spelling: Taiwan EDUTECH Foundation Press, 1988.
- (1988) Taiwan Dictionary of Words with Modern Spelling
- Liim, K.¹

27
- 0141740992
- Establish Taiwanese 7-tones syllable-based synthesis units database for the prototype development of text-to-speech system
- Y.-J. Sher, K.-C. Chung, and C.-H. Wu, "Establish Taiwanese 7-tones syllable-based synthesis units database for the prototype development of text-to-speech system," in Proc. ROCUNG XII, 1999, pp. 15-35.
- (1999) Proc. ROCUNG XII , pp. 15-35
- Sher, Y.-J.¹ Chung, K.-C.² Wu, C.-H.³

28
- 0009634526
- Ph.D. Thesis, National Cheng Kung Univ., Tainan, Taiwan, R.O.C.
- Y. J. Chen, "A Study on Conversational Speech Recognition and Verification in Computer Telephony Integration," Ph.D. Thesis, National Cheng Kung Univ., Tainan, Taiwan, R.O.C., 2000.
- (2000) A Study on Conversational Speech Recognition and Verification in Computer Telephony Integration
- Chen, Y.J.¹

29
- 0005540823
- Reading, MA: Addison-Wesley
- R. Baeza-Yates and B. Ribeiro-Neto, Modern Information Retrieval. Reading, MA: Addison-Wesley, 1999.
- (1999) Modern Information Retrieval
- Baeza-Yates, R.¹ Ribeiro-Neto, B.²

30
- 0004006791
- New York: Birkhäuser
- J. Chen and A. K. Gupta, Parametric Statistical Change Point Analysis. New York: Birkhäuser, 2000.
- (2000) Parametric Statistical Change Point Analysis
- Chen, J.¹ Gupta, A.K.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.