메뉴 건너뛰기




Volumn 15, Issue 2, 2007, Pages 453-464

Dialect/accent classification using unrestricted audio

Author keywords

Accent dialect classification; AdaBoost algorithm; Context adapted trianing; Dialect dependency information; Limited training data; Robust acoustic modeling; Word based modeling

Indexed keywords

ACCENT/DIALECT CLASSIFICATION; ADABOOST ALGORITHM; CONTEXT ADAPTED TRIANING; DIALECT DEPENDENCY INFORMATION; LIMITED TRAINING DATA; ROBUST ACOUSTIC MODELING; WORD-BASED MODELING;

EID: 64149085238     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2006.881695     Document Type: Article
Times cited : (42)

References (44)
  • 1
    • 85009231014 scopus 로고    scopus 로고
    • Use of trajectory model for automatic accent classification
    • Geneva, Switzerland, Sep
    • P. Angkititrakul and J. H. L. Hansen, "Use of trajectory model for automatic accent classification," in Proc. EuroSpeech, Geneva, Switzerland, Sep. 2003, pp. 1353-1356.
    • (2003) Proc. EuroSpeech , pp. 1353-1356
    • Angkititrakul, P.1    Hansen, J.H.L.2
  • 2
    • 51449095035 scopus 로고    scopus 로고
    • Pittsburgh, PA: Carnegie Mellon Univ, Online, Available
    • The CMU Pronunciation Dictionary. Pittsburgh, PA: Carnegie Mellon Univ. [Online]. Available: http://www.speech.cs.cmu.edu/cgibin/cmudict
    • The CMU Pronunciation Dictionary
  • 4
    • 4544236424 scopus 로고    scopus 로고
    • Boosting HMMs with an application to speech recognition
    • Montreal, QC, Canada, May
    • C. Dimitrakakis and S. Bengio, "Boosting HMMs with an application to speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, May 2004, vol. 5, pp. 621-624.
    • (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.5 , pp. 621-624
    • Dimitrakakis, C.1    Bengio, S.2
  • 5
    • 0141590276 scopus 로고    scopus 로고
    • A boosted multi-HMM classifier for recognition of visual speech elements
    • Hong Kong, China, Apr
    • S. W. Foo and L. Dong, "A boosted multi-HMM classifier for recognition of visual speech elements," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, China, Apr. 2003, vol. 2, pp. 285-288.
    • (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 285-288
    • Foo, S.W.1    Dong, L.2
  • 6
    • 0031211090 scopus 로고    scopus 로고
    • A decision-theoretic generalization of on-line learning and an application to boosting
    • Y. Freund and R. E. Schapire, "A decision-theoretic generalization of on-line learning and an application to boosting," J. Comput. Syst. Sci., vol. 55, no. 1, pp. 119-139, 1997.
    • (1997) J. Comput. Syst. Sci , vol.55 , Issue.1 , pp. 119-139
    • Freund, Y.1    Schapire, R.E.2
  • 7
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • M. J. F. Gales and P. C. Woodland, "Mean and variance adaptation within the MLLR framework," in Comput. Speech Lang., 1996, vol. 10, pp. 249-264.
    • (1996) Comput. Speech Lang , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 9
    • 0141591602 scopus 로고    scopus 로고
    • Speaker and text independent language identification using predictive error histogram vectors
    • Hong Kong, China, Apr
    • Q. Gu and T. Shibata, "Speaker and text independent language identification using predictive error histogram vectors," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, China, Apr. 2003, vol. 1, pp. 36-39.
    • (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 36-39
    • Gu, Q.1    Shibata, T.2
  • 10
    • 0020141497 scopus 로고
    • Effect of speaker accent on the performance of a speaker-independent, isolated word recognizer
    • V. Gupta and P. Mermelstein, "Effect of speaker accent on the performance of a speaker-independent, isolated word recognizer," J. Acoust. Soc. Amer., vol. 71, pp. 1581-1587, 1982.
    • (1982) J. Acoust. Soc. Amer , vol.71 , pp. 1581-1587
    • Gupta, V.1    Mermelstein, P.2
  • 11
    • 0030643681 scopus 로고    scopus 로고
    • Robust spoken language identification using large vocabulary speech recognition
    • Munich, Germany, Apr
    • J. L. Hieronymus and S. Kadambe, "Robust spoken language identification using large vocabulary speech recognition," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Munich, Germany, Apr. 1997, vol. 2, pp. 1111-1114.
    • (1997) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 1111-1114
    • Hieronymus, J.L.1    Kadambe, S.2
  • 12
    • 85009113198 scopus 로고    scopus 로고
    • Analysis of speaker variability
    • Aalborg, Denmark, Sep
    • C. Huang, T. Chen, S. Li, E. Chang, and J. L. Zhou, "Analysis of speaker variability," in Proc. EuroSpeech, Aalborg, Denmark, Sep. 2001, vol. 2, pp. 1377-1380.
    • (2001) Proc. EuroSpeech , vol.2 , pp. 1377-1380
    • Huang, C.1    Chen, T.2    Li, S.3    Chang, E.4    Zhou, J.L.5
  • 13
    • 4544369704 scopus 로고    scopus 로고
    • Advances in unsupervised audio segmentation for the broadcast news and NGSW corpora
    • Montreal, QC, Canada, May
    • R. Huang and J. H. L. Hansen, "Advances in unsupervised audio segmentation for the broadcast news and NGSW corpora," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Montreal, QC, Canada, May 2004, vol. 1, pp. 741-744.
    • (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 741-744
    • Huang, R.1    Hansen, J.H.L.2
  • 14
    • 0031631064 scopus 로고    scopus 로고
    • The use of accent-specific pronunciation dictionaries in acoustic model training
    • Seattle,WA,May
    • J. J. Humphries and P. C. Woodland, "The use of accent-specific pronunciation dictionaries in acoustic model training," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Seattle,WA,May 1998, vol. 1, pp. 317-320.
    • (1998) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 317-320
    • Humphries, J.J.1    Woodland, P.C.2
  • 15
    • 64149098292 scopus 로고    scopus 로고
    • quot;IViE, British dialect corpus, [Online]. Available: http://www.phon.ox.ac.uk/̃esther/ivyweb/
    • quot;IViE, British dialect corpus," [Online]. Available: http://www.phon.ox.ac.uk/̃esther/ivyweb/
  • 17
    • 0022018101 scopus 로고
    • A probabilistic distance measure for hidden Markov models
    • B.-H. Juang and L. R. Rabiner, "A probabilistic distance measure for hidden Markov models," AT&T Tech. J., vol. 64, no. 2, pp. 391-408, 1985.
    • (1985) AT&T Tech. J , vol.64 , Issue.2 , pp. 391-408
    • Juang, B.-H.1    Rabiner, L.R.2
  • 19
    • 85135151046 scopus 로고    scopus 로고
    • Foreign speaker accent classification using phoneme-dependent accent discrimination models and comparisons with human perception benchmarks
    • Rhodos, Greece, Sep
    • K. Kumpf and R. W. King, "Foreign speaker accent classification using phoneme-dependent accent discrimination models and comparisons with human perception benchmarks," in Proc. EuroSpeech, Rhodos, Greece, Sep. 1997, vol. 4, pp. 2323-2326.
    • (1997) Proc. EuroSpeech , vol.4 , pp. 2323-2326
    • Kumpf, K.1    King, R.W.2
  • 20
    • 34548727590 scopus 로고    scopus 로고
    • Effect of foreign accent on speech recognition in the NATO N-4 corpus
    • Geneva, Switzerland, Sep
    • A. Lawson, D. Harris, and J. Grieco, "Effect of foreign accent on speech recognition in the NATO N-4 corpus," in Proc. EuroSpeech, Geneva, Switzerland, Sep. 2003, vol. 3, pp. 1505-1508.
    • (2003) Proc. EuroSpeech , vol.3 , pp. 1505-1508
    • Lawson, A.1    Harris, D.2    Grieco, J.3
  • 21
    • 0029288633 scopus 로고
    • Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
    • C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," in Comput. Speech Lang., 1995, vol. 9, pp. 171-185.
    • (1995) Comput. Speech Lang , vol.9 , pp. 171-185
    • Leggetter, C.J.1    Woodland, P.C.2
  • 22
    • 85128396783 scopus 로고    scopus 로고
    • A comparison of two unsupervised approaches to accent identification
    • Sydney, Australia, Nov
    • M. Lincoln, S. Cox, and S. Ringland, "A comparison of two unsupervised approaches to accent identification," in Proc. Int. Conf. Spoken Language Processing, Sydney, Australia, Nov. 1998, vol. 1, pp. 109-112.
    • (1998) Proc. Int. Conf. Spoken Language Processing , vol.1 , pp. 109-112
    • Lincoln, M.1    Cox, S.2    Ringland, S.3
  • 23
    • 0033719637 scopus 로고    scopus 로고
    • Mandarin accent adaptation based on context-independent/context-dependent pronunciation modeling
    • Istanbul, Turkey, Jun
    • M. K. Liu, B. Xu, T. Y. Huang, Y. G. Deng, and C. R. Li, "Mandarin accent adaptation based on context-independent/context-dependent pronunciation modeling," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Istanbul, Turkey, Jun. 2000, vol. 2, pp. 1025-1028.
    • (2000) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.2 , pp. 1025-1028
    • Liu, M.K.1    Xu, B.2    Huang, T.Y.3    Deng, Y.G.4    Li, C.R.5
  • 26
    • 0036293851 scopus 로고    scopus 로고
    • Utterance-level boosting of HMM speech recognizers
    • Orlando, FL, May
    • C. Meyer, "Utterance-level boosting of HMM speech recognizers," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Orlando, FL, May 2002, vol. 1, pp. 109-112.
    • (2002) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 109-112
    • Meyer, C.1
  • 27
    • 0030366921 scopus 로고    scopus 로고
    • Statistical dialect classification based on mean phonetic features
    • Philadelphia, PA, Oct
    • D. Miller and J. Trischitta, "Statistical dialect classification based on mean phonetic features," in Proc. Int. Conf. Spoken Lang. Process., Philadelphia, PA, Oct. 1996, vol. 4, pp. 2025-2027.
    • (1996) Proc. Int. Conf. Spoken Lang. Process , vol.4 , pp. 2025-2027
    • Miller, D.1    Trischitta, J.2
  • 28
    • 0141703394 scopus 로고    scopus 로고
    • Multi-stream language identification using data-driven dependency selection
    • Hong Kong, China, Apr
    • S. Parandekar and K. Kirchhoff, "Multi-stream language identification using data-driven dependency selection," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, China, Apr. 2003, vol. 1, pp. 28-31.
    • (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 28-31
    • Parandekar, S.1    Kirchhoff, K.2
  • 30
    • 0033084277 scopus 로고    scopus 로고
    • Perceptual and phonetic experiments on American English dialect identification
    • Mar
    • T. Purnell, W. Idsardi, and J. Baugh, "Perceptual and phonetic experiments on American English dialect identification," J. Lang. Soc. Psychol., vol. 18, no. 1, pp. 10-30, Mar. 1999.
    • (1999) J. Lang. Soc. Psychol , vol.18 , Issue.1 , pp. 10-30
    • Purnell, T.1    Idsardi, W.2    Baugh, J.3
  • 31
    • 0004656028 scopus 로고
    • Language identification with embedded word models
    • Yokohama, Japan, Sep
    • P. Ramesh and E. Roe, "Language identification with embedded word models," in Proc. Int. Conf. Spoken Lang. Process., Yokohama, Japan, Sep. 1994, vol. 4, pp. 1887-1890.
    • (1994) Proc. Int. Conf. Spoken Lang. Process , vol.4 , pp. 1887-1890
    • Ramesh, P.1    Roe, E.2
  • 33
    • 0033281701 scopus 로고    scopus 로고
    • Improved boosting algorithms using confidence-rated predictions
    • R. E. Schapire and Y. Singer, "Improved boosting algorithms using confidence-rated predictions," Mach. Learn., vol. 37, no. 3, pp. 297-336, 1999.
    • (1999) Mach. Learn , vol.37 , Issue.3 , pp. 297-336
    • Schapire, R.E.1    Singer, Y.2
  • 37
    • 0004283130 scopus 로고
    • Cambridge, U.K, Cambridge University Press, II, III
    • J. C. Wells, Accents of English. Cambridge, U.K.: Cambridge University Press, 1982, vol. I, II, III.
    • (1982) Accents of English , vol.1
    • Wells, J.C.1
  • 38
    • 64149103667 scopus 로고    scopus 로고
    • quot;WSJ0 corpus, [Online]. Available: http://www.ldc.upenn.edu/ Catalog/CatalogEntry.jsp?catalogId=LDC93S6A
    • quot;WSJ0 corpus," [Online]. Available: http://www.ldc.upenn.edu/ Catalog/CatalogEntry.jsp?catalogId=LDC93S6A
  • 39
    • 0141590573 scopus 로고    scopus 로고
    • Analysis, modeling and synthesis of formants of British, American and Australian accents
    • Hong Kong, China, Apr
    • Q. Yan and S. Vaseghi, "Analysis, modeling and synthesis of formants of British, American and Australian accents," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Hong Kong, China, Apr. 2003, vol. 1, pp. 712-715.
    • (2003) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process , vol.1 , pp. 712-715
    • Yan, Q.1    Vaseghi, S.2
  • 40
    • 0029733178 scopus 로고    scopus 로고
    • Comparison of four approaches to automatic language identification of telephone speech
    • Jan
    • M. A. Zissman, "Comparison of four approaches to automatic language identification of telephone speech," IEEE Trans. Speech Audio Process., vol. 4, no. 1, pp. 31-44, Jan. 1996.
    • (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.1 , pp. 31-44
    • Zissman, M.A.1
  • 41
    • 0035427178 scopus 로고    scopus 로고
    • Automatic language identification
    • M. A. Zissman and K. M. Berkling, "Automatic language identification," Speech Commun., vol. 35, pp. 115-124, 2001.
    • (2001) Speech Commun , vol.35 , pp. 115-124
    • Zissman, M.A.1    Berkling, K.M.2
  • 42
    • 85009089453 scopus 로고    scopus 로고
    • Unsupervised audio stream segmentation and clustering via the Bayesian information criterion
    • Beijing, China, Oct
    • B. Zhou and J. H. L. Hansen, "Unsupervised audio stream segmentation and clustering via the Bayesian information criterion," in Proc. Int. Conf. Spoken Lang. Process., Beijing, China, Oct. 2000, vol. 1, pp. 714-717.
    • (2000) Proc. Int. Conf. Spoken Lang. Process , vol.1 , pp. 714-717
    • Zhou, B.1    Hansen, J.H.L.2
  • 43
    • 22544475615 scopus 로고    scopus 로고
    • Efficient audio stream segmentation via the combined T BIC statistic and Bayesian information criterion
    • Jul
    • B. Zhou and J. H. L. Hansen, "Efficient audio stream segmentation via the combined T BIC statistic and Bayesian information criterion," IEEE Trans. Speech Audio Process., vol. 13, no. 4, pp. 467-474, Jul. 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.4 , pp. 467-474
    • Zhou, B.1    Hansen, J.H.L.2
  • 44
    • 64149103404 scopus 로고    scopus 로고
    • quot;WSJCAM0 corpus, [Online]. Available: http://www.ldc.upenn.edu/ Catalog/CatalogEntry.jsp?catalogId=LDC95S24
    • quot;WSJCAM0 corpus," [Online]. Available: http://www.ldc.upenn.edu/ Catalog/CatalogEntry.jsp?catalogId=LDC95S24


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.