SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 2, 2007, Pages 453-464

Dialect/accent classification using unrestricted audio

(3) Huang, Rongqing a,b Hansen, John H L a Angkititrakul, Pongtep a

a The University of Texas at Dallas (United States)

Author keywords

Accent dialect classification; AdaBoost algorithm; Context adapted trianing; Dialect dependency information; Limited training data; Robust acoustic modeling; Word based modeling

Indexed keywords

ACCENT/DIALECT CLASSIFICATION; ADABOOST ALGORITHM; CONTEXT ADAPTED TRIANING; DIALECT DEPENDENCY INFORMATION; LIMITED TRAINING DATA; ROBUST ACOUSTIC MODELING; WORD-BASED MODELING;

ACOUSTICS; ADAPTIVE BOOSTING; ALGORITHMS; CLASSIFIERS; CONTINUOUS SPEECH RECOGNITION; DIELECTRIC WAVEGUIDES; HIDDEN MARKOV MODELS; LEARNING SYSTEMS; NATURAL LANGUAGE PROCESSING SYSTEMS; OBJECT RECOGNITION; PROBABILITY DENSITY FUNCTION; SOFTWARE AGENTS;

TEXT PROCESSING;

EID: 64149085238 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2006.881695 Document Type: Article

Times cited : (42)

References (44)

1
- 85009231014
- Use of trajectory model for automatic accent classification
- Geneva, Switzerland, Sep
- P. Angkititrakul and J. H. L. Hansen, "Use of trajectory model for automatic accent classification," in Proc. EuroSpeech, Geneva, Switzerland, Sep. 2003, pp. 1353-1356.
- (2003) Proc. EuroSpeech , pp. 1353-1356
- Angkititrakul, P.¹ Hansen, J.H.L.²

2
- 51449095035
- Pittsburgh, PA: Carnegie Mellon Univ, Online, Available
- The CMU Pronunciation Dictionary. Pittsburgh, PA: Carnegie Mellon Univ. [Online]. Available: http://www.speech.cs.cmu.edu/cgibin/cmudict
- The CMU Pronunciation Dictionary

3
- 0030715936
- Development of dialect-specific speech recognizers using adaptation methods
- Munich, Germany, Apr
- V. Diakoloukas, V. Digalakis, L. Neumeyer, and J. Kaja, "Development of dialect-specific speech recognizers using adaptation methods," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process.,Munich, Germany, Apr. 1997, vol. 2, pp. 1455-1458.
- (1997) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 1455-1458
- Diakoloukas, V.¹ Digalakis, V.² Neumeyer, L.³ Kaja, J.⁴

4
- 4544236424
- Boosting HMMs with an application to speech recognition
- Montreal, QC, Canada, May
- C. Dimitrakakis and S. Bengio, "Boosting HMMs with an application to speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, May 2004, vol. 5, pp. 621-624.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.5 , pp. 621-624
- Dimitrakakis, C.¹ Bengio, S.²

5
- 0141590276
- A boosted multi-HMM classifier for recognition of visual speech elements
- Hong Kong, China, Apr
- S. W. Foo and L. Dong, "A boosted multi-HMM classifier for recognition of visual speech elements," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, China, Apr. 2003, vol. 2, pp. 285-288.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 285-288
- Foo, S.W.¹ Dong, L.²

6
- 0031211090
- A decision-theoretic generalization of on-line learning and an application to boosting
- Y. Freund and R. E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting," J. Comput. Syst. Sci., vol. 55, no. 1, pp. 119-139, 1997.
- (1997) J. Comput. Syst. Sci , vol.55 , Issue.1 , pp. 119-139
- Freund, Y.¹ Schapire, R.E.²

7
- 0030263447
- Mean and variance adaptation within the MLLR framework
- M. J. F. Gales and P. C. Woodland, "Mean and variance adaptation within the MLLR framework," in Comput. Speech Lang., 1996, vol. 10, pp. 249-264.
- (1996) Comput. Speech Lang , vol.10 , pp. 249-264
- Gales, M.J.F.¹ Woodland, P.C.²

8
- 0039881085
- On the origins of speech intelligibility in the realworld
- Pont-a-Mousson, France
- S. Greenberg, "On the origins of speech intelligibility in the realworld," in Proc. ESCA Workshop on Robust Speech Recognition for Unknown Communication Channels, Pont-a-Mousson, France, 1997, vol. 1, pp. 23-32.
- (1997) Proc. ESCA Workshop on Robust Speech Recognition for Unknown Communication Channels , vol.1 , pp. 23-32
- Greenberg, S.¹

9
- 0141591602
- Speaker and text independent language identification using predictive error histogram vectors
- Hong Kong, China, Apr
- Q. Gu and T. Shibata, "Speaker and text independent language identification using predictive error histogram vectors," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, China, Apr. 2003, vol. 1, pp. 36-39.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 36-39
- Gu, Q.¹ Shibata, T.²

10
- 0020141497
- Effect of speaker accent on the performance of a speaker-independent, isolated word recognizer
- V. Gupta and P. Mermelstein, "Effect of speaker accent on the performance of a speaker-independent, isolated word recognizer," J. Acoust. Soc. Amer., vol. 71, pp. 1581-1587, 1982.
- (1982) J. Acoust. Soc. Amer , vol.71 , pp. 1581-1587
- Gupta, V.¹ Mermelstein, P.²

11
- 0030643681
- Robust spoken language identification using large vocabulary speech recognition
- Munich, Germany, Apr
- J. L. Hieronymus and S. Kadambe, "Robust spoken language identification using large vocabulary speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Munich, Germany, Apr. 1997, vol. 2, pp. 1111-1114.
- (1997) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 1111-1114
- Hieronymus, J.L.¹ Kadambe, S.²

12
- 85009113198
- Analysis of speaker variability
- Aalborg, Denmark, Sep
- C. Huang, T. Chen, S. Li, E. Chang, and J. L. Zhou, "Analysis of speaker variability," in Proc. EuroSpeech, Aalborg, Denmark, Sep. 2001, vol. 2, pp. 1377-1380.
- (2001) Proc. EuroSpeech , vol.2 , pp. 1377-1380
- Huang, C.¹ Chen, T.² Li, S.³ Chang, E.⁴ Zhou, J.L.⁵

13
- 4544369704
- Advances in unsupervised audio segmentation for the broadcast news and NGSW corpora
- Montreal, QC, Canada, May
- R. Huang and J. H. L. Hansen, "Advances in unsupervised audio segmentation for the broadcast news and NGSW corpora," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, May 2004, vol. 1, pp. 741-744.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 741-744
- Huang, R.¹ Hansen, J.H.L.²

14
- 0031631064
- The use of accent-specific pronunciation dictionaries in acoustic model training
- Seattle,WA,May
- J. J. Humphries and P. C. Woodland, "The use of accent-specific pronunciation dictionaries in acoustic model training," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Seattle,WA,May 1998, vol. 1, pp. 317-320.
- (1998) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 317-320
- Humphries, J.J.¹ Woodland, P.C.²

15
- 64149098292
- quot;IViE, British dialect corpus, [Online]. Available: http://www.phon.ox.ac.uk/̃esther/ivyweb/
- quot;IViE, British dialect corpus," [Online]. Available: http://www.phon.ox.ac.uk/̃esther/ivyweb/

16
- 0141480122
- Language identification using parallel sub-word recognition
- Hong Kong, China, Apr
- A. Sai Jayram, V. Ramasubramanian, and T. Sreenivas, "Language identification using parallel sub-word recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, China, Apr. 2003, vol. 1, pp. 32-35.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 32-35
- Sai Jayram, A.¹ Ramasubramanian, V.² Sreenivas, T.³

17
- 0022018101
- A probabilistic distance measure for hidden Markov models
- B.-H. Juang and L. R. Rabiner, "A probabilistic distance measure for hidden Markov models," AT&T Tech. J., vol. 64, no. 2, pp. 391-408, 1985.
- (1985) AT&T Tech. J , vol.64 , Issue.2 , pp. 391-408
- Juang, B.-H.¹ Rabiner, L.R.²

18
- 0028996640
- Language identification with phonological and lexical models
- Detroit, MI,May
- S. Kadambe and J. L. Hieronymus, "Language identification with phonological and lexical models," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Detroit, MI,May 1995, vol. 5, pp. 3507-3510.
- (1995) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.5 , pp. 3507-3510
- Kadambe, S.¹ Hieronymus, J.L.²

19
- 85135151046
- Foreign speaker accent classification using phoneme-dependent accent discrimination models and comparisons with human perception benchmarks
- Rhodos, Greece, Sep
- K. Kumpf and R. W. King, "Foreign speaker accent classification using phoneme-dependent accent discrimination models and comparisons with human perception benchmarks," in Proc. EuroSpeech, Rhodos, Greece, Sep. 1997, vol. 4, pp. 2323-2326.
- (1997) Proc. EuroSpeech , vol.4 , pp. 2323-2326
- Kumpf, K.¹ King, R.W.²

20
- 34548727590
- Effect of foreign accent on speech recognition in the NATO N-4 corpus
- Geneva, Switzerland, Sep
- A. Lawson, D. Harris, and J. Grieco, "Effect of foreign accent on speech recognition in the NATO N-4 corpus," in Proc. EuroSpeech, Geneva, Switzerland, Sep. 2003, vol. 3, pp. 1505-1508.
- (2003) Proc. EuroSpeech , vol.3 , pp. 1505-1508
- Lawson, A.¹ Harris, D.² Grieco, J.³

21
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," in Comput. Speech Lang., 1995, vol. 9, pp. 171-185.
- (1995) Comput. Speech Lang , vol.9 , pp. 171-185
- Leggetter, C.J.¹ Woodland, P.C.²

22
- 85128396783
- A comparison of two unsupervised approaches to accent identification
- Sydney, Australia, Nov
- M. Lincoln, S. Cox, and S. Ringland, "A comparison of two unsupervised approaches to accent identification," in Proc. Int. Conf. Spoken Language Processing, Sydney, Australia, Nov. 1998, vol. 1, pp. 109-112.
- (1998) Proc. Int. Conf. Spoken Language Processing , vol.1 , pp. 109-112
- Lincoln, M.¹ Cox, S.² Ringland, S.³

23
- 0033719637
- Mandarin accent adaptation based on context-independent/context-dependent pronunciation modeling
- Istanbul, Turkey, Jun
- M. K. Liu, B. Xu, T. Y. Huang, Y. G. Deng, and C. R. Li, "Mandarin accent adaptation based on context-independent/context-dependent pronunciation modeling," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Istanbul, Turkey, Jun. 2000, vol. 2, pp. 1025-1028.
- (2000) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 1025-1028
- Liu, M.K.¹ Xu, B.² Huang, T.Y.³ Deng, Y.G.⁴ Li, C.R.⁵

24
- 78649294756
- Language identification incorporating lexical information
- Sydney, Australia, Dec
- D. Matrouf, M. Adda-Decker, L. F. Lamel, and J. L. Gauvain, "Language identification incorporating lexical information," in Proc. Int. Conf. Spoken Lang. Process., Sydney, Australia, Dec. 1998, vol. 1, pp. 181-185.
- (1998) Proc. Int. Conf. Spoken Lang. Process , vol.1 , pp. 181-185
- Matrouf, D.¹ Adda-Decker, M.² Lamel, L.F.³ Gauvain, J.L.⁴

25
- 0029725760
- Automatic language identification using large vocabulary continuous speech recognition
- Atlanta, GA, May
- S. Mendoma, L. Gillick, Y. Ito, S. Lowe, and M. Newman, "Automatic language identification using large vocabulary continuous speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Atlanta, GA, May 1996, vol. 2, pp. 785-788.
- (1996) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 785-788
- Mendoma, S.¹ Gillick, L.² Ito, Y.³ Lowe, S.⁴ Newman, M.⁵

26
- 0036293851
- Utterance-level boosting of HMM speech recognizers
- Orlando, FL, May
- C. Meyer, "Utterance-level boosting of HMM speech recognizers," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Orlando, FL, May 2002, vol. 1, pp. 109-112.
- (2002) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 109-112
- Meyer, C.¹

27
- 0030366921
- Statistical dialect classification based on mean phonetic features
- Philadelphia, PA, Oct
- D. Miller and J. Trischitta, "Statistical dialect classification based on mean phonetic features," in Proc. Int. Conf. Spoken Lang. Process., Philadelphia, PA, Oct. 1996, vol. 4, pp. 2025-2027.
- (1996) Proc. Int. Conf. Spoken Lang. Process , vol.4 , pp. 2025-2027
- Miller, D.¹ Trischitta, J.²

28
- 0141703394
- Multi-stream language identification using data-driven dependency selection
- Hong Kong, China, Apr
- S. Parandekar and K. Kirchhoff, "Multi-stream language identification using data-driven dependency selection," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, China, Apr. 2003, vol. 1, pp. 28-31.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 28-31
- Parandekar, S.¹ Kirchhoff, K.²

29
- 0141589558
- Univ. Colorado, Boulder, Tech. Rep. TR-CSLR, Mar
- B. Pellom, "Sonic: The University of Colorado Continuous Speech Recognizer," Univ. Colorado, Boulder, Tech. Rep. TR-CSLR-2001-01, Mar. 2001.
- (2001) Sonic: The University of Colorado Continuous Speech Recognizer , pp. 2001
- Pellom, B.¹

30
- 0033084277
- Perceptual and phonetic experiments on American English dialect identification
- Mar
- T. Purnell, W. Idsardi, and J. Baugh, "Perceptual and phonetic experiments on American English dialect identification," J. Lang. Soc. Psychol., vol. 18, no. 1, pp. 10-30, Mar. 1999.
- (1999) J. Lang. Soc. Psychol , vol.18 , Issue.1 , pp. 10-30
- Purnell, T.¹ Idsardi, W.² Baugh, J.³

31
- 0004656028
- Language identification with embedded word models
- Yokohama, Japan, Sep
- P. Ramesh and E. Roe, "Language identification with embedded word models," in Proc. Int. Conf. Spoken Lang. Process., Yokohama, Japan, Sep. 1994, vol. 4, pp. 1887-1890.
- (1994) Proc. Int. Conf. Spoken Lang. Process , vol.4 , pp. 1887-1890
- Ramesh, P.¹ Roe, E.²

32
- 84871413421
- Modeling prosody for language identification on read and spontaneous speech
- Hong Kong, China, Apr
- J.-L. Rouas, J. Farinas, F. Pellegrino, and R. Andre-Obrecht, "Modeling prosody for language identification on read and spontaneous speech," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, China, Apr. 2003, vol. 1, pp. 40-43.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 40-43
- Rouas, J.-L.¹ Farinas, J.² Pellegrino, F.³ Andre-Obrecht, R.⁴

33
- 0033281701
- Improved boosting algorithms using confidence-rated predictions
- R. E. Schapire and Y. Singer, "Improved boosting algorithms using confidence-rated predictions," Mach. Learn., vol. 37, no. 3, pp. 297-336, 1999.
- (1999) Mach. Learn , vol.37 , Issue.3 , pp. 297-336
- Schapire, R.E.¹ Singer, Y.²

34
- 0029725380
- LVCSR-based language identification
- Atlanta, GA, May
- T. Schultz, I. Rogina, and A. Waibel, "LVCSR-based language identification," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Atlanta, GA, May 1996, vol. 2, pp. 781-784.
- (1996) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 781-784
- Schultz, T.¹ Rogina, I.² Waibel, A.³

35
- 0003970124
- 2nd ed. Oxford, U.K, Blackwell
- P. Trudgill, The Dialects of England, 2nd ed. Oxford, U.K.: Blackwell, 1999.
- (1999) The Dialects of England
- Trudgill, P.¹

36
- 33745227196
- Lexicon adaptation for LVCSR: Speaker idiosyncracies, non-native speakers, and pronunciation choice
- presented at the, Estes Park, CO, Sep
- W.Ward, H. Krech, X. Yu, K. Herold, G. Figgs, A. Ikeno, D. Jurafsky, and W. Byrne, "Lexicon adaptation for LVCSR: speaker idiosyncracies, non-native speakers, and pronunciation choice," presented at the ISCA Workshop Pronunciation Modeling and Lexicon Adaptation, Estes Park, CO, Sep. 2002.
- (2002) ISCA Workshop Pronunciation Modeling and Lexicon Adaptation
- Ward, W.¹ Krech, H.² Yu, X.³ Herold, K.⁴ Figgs, G.⁵ Ikeno, A.⁶ Jurafsky, D.⁷ Byrne, W.⁸

37
- 0004283130
- Cambridge, U.K, Cambridge University Press, II, III
- J. C. Wells, Accents of English. Cambridge, U.K.: Cambridge University Press, 1982, vol. I, II, III.
- (1982) Accents of English , vol.1
- Wells, J.C.¹

38
- 64149103667
- quot;WSJ0 corpus, [Online]. Available: http://www.ldc.upenn.edu/ Catalog/CatalogEntry.jsp?catalogId=LDC93S6A
- quot;WSJ0 corpus," [Online]. Available: http://www.ldc.upenn.edu/ Catalog/CatalogEntry.jsp?catalogId=LDC93S6A

39
- 0141590573
- Analysis, modeling and synthesis of formants of British, American and Australian accents
- Hong Kong, China, Apr
- Q. Yan and S. Vaseghi, "Analysis, modeling and synthesis of formants of British, American and Australian accents," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, China, Apr. 2003, vol. 1, pp. 712-715.
- (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 712-715
- Yan, Q.¹ Vaseghi, S.²

40
- 0029733178
- Comparison of four approaches to automatic language identification of telephone speech
- Jan
- M. A. Zissman, "Comparison of four approaches to automatic language identification of telephone speech," IEEE Trans. Speech Audio Process., vol. 4, no. 1, pp. 31-44, Jan. 1996.
- (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.1 , pp. 31-44
- Zissman, M.A.¹

41
- 0035427178
- Automatic language identification
- M. A. Zissman and K. M. Berkling, "Automatic language identification," Speech Commun., vol. 35, pp. 115-124, 2001.
- (2001) Speech Commun , vol.35 , pp. 115-124
- Zissman, M.A.¹ Berkling, K.M.²

42
- 85009089453
- Unsupervised audio stream segmentation and clustering via the Bayesian information criterion
- Beijing, China, Oct
- B. Zhou and J. H. L. Hansen, "Unsupervised audio stream segmentation and clustering via the Bayesian information criterion," in Proc. Int. Conf. Spoken Lang. Process., Beijing, China, Oct. 2000, vol. 1, pp. 714-717.
- (2000) Proc. Int. Conf. Spoken Lang. Process , vol.1 , pp. 714-717
- Zhou, B.¹ Hansen, J.H.L.²

43
- 22544475615
- Efficient audio stream segmentation via the combined T BIC statistic and Bayesian information criterion
- Jul
- B. Zhou and J. H. L. Hansen, "Efficient audio stream segmentation via the combined T BIC statistic and Bayesian information criterion," IEEE Trans. Speech Audio Process., vol. 13, no. 4, pp. 467-474, Jul. 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.4 , pp. 467-474
- Zhou, B.¹ Hansen, J.H.L.²

44
- 64149103404
- quot;WSJCAM0 corpus, [Online]. Available: http://www.ldc.upenn.edu/ Catalog/CatalogEntry.jsp?catalogId=LDC95S24
- quot;WSJCAM0 corpus," [Online]. Available: http://www.ldc.upenn.edu/ Catalog/CatalogEntry.jsp?catalogId=LDC95S24

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.