메뉴 건너뛰기




Volumn 69, Issue , 2016, Pages 277-285

Dynamic time warping in phoneme modeling for fast pronunciation error detection

Author keywords

CAPT systems; DTW algorithm; Phoneme modeling; Pronunciation error detection; Word structure analysis

Indexed keywords

ERROR DETECTION; ERRORS; MARKOV PROCESSES; SPEECH RECOGNITION;

EID: 84957841078     PISSN: 00104825     EISSN: 18790534     Source Type: Journal    
DOI: 10.1016/j.compbiomed.2015.12.004     Document Type: Article
Times cited : (17)

References (50)
  • 1
    • 77957867161 scopus 로고    scopus 로고
    • The use of speech technology in foreign language pronunciation training
    • Demenko G., Wagner A., Cylwik N. The use of speech technology in foreign language pronunciation training. Arch. Acoust. 2010, 35(5):309-330.
    • (2010) Arch. Acoust. , vol.35 , Issue.5 , pp. 309-330
    • Demenko, G.1    Wagner, A.2    Cylwik, N.3
  • 2
    • 84941074343 scopus 로고    scopus 로고
    • A computer-aided Chinese pronunciation training program for English-speaking learners
    • Y. Qin, G. Wang, A computer-aided Chinese pronunciation training program for English-speaking learners, in: 2014 International Conference on Asian Language Processing (IALP), 2014, pp. 154-157, . doi:10.1109/IALP.2014.6973499.
    • (2014) 2014 International Conference on Asian Language Processing (IALP) , pp. 154-157
    • Qin, Y.1    Wang, G.2
  • 3
    • 84923277426 scopus 로고    scopus 로고
    • A recursive dialogue game for personalized computer-aided pronunciation training
    • Su P.-H., Wu C.-H., Lee L.-S. A recursive dialogue game for personalized computer-aided pronunciation training. IEEE/ACM Trans. Audio Speech Lang. Process. 2015, 23(1):127-141. 10.1109/TASLP.2014.2375572.
    • (2015) IEEE/ACM Trans. Audio Speech Lang. Process. , vol.23 , Issue.1 , pp. 127-141
    • Su, P.-H.1    Wu, C.-H.2    Lee, L.-S.3
  • 4
    • 84874226647 scopus 로고    scopus 로고
    • Automatic Chinese pronunciation error detection using SVM trained with structural features
    • SLT, IEEE, Miami, Florida
    • T. Zhao, A. Hoshino, M. Suzuki, N. Minematsu, K. Hirose, Automatic Chinese pronunciation error detection using SVM trained with structural features, in: SLT, IEEE, Miami, Florida, 2012, ISBN: 978-1-4673-5125-6; . http://dx.doi.org/10.1109/SLT.2012.6424270.
    • (2012)
    • Zhao, G.1    Hoshino, A.2    Suzuki, M.3    Minematsu, N.4    Hirose, K.5
  • 5
    • 67650764397 scopus 로고    scopus 로고
    • Comparing classifiers for pronunciation error detection
    • INTERSPEECH, ISCA
    • H. Strik, K.P. Truong, F. de Wet, C. Cucchiarini, Comparing classifiers for pronunciation error detection, in: INTERSPEECH, ISCA, 2007, pp. 1837-1840.
    • (2007) , pp. 1837-1840
    • Strik, H.1    Truong, K.P.2    de Wet, F.3    Cucchiarini, C.4
  • 6
    • 70349206329 scopus 로고    scopus 로고
    • Automatic pronunciation error detection based on linguistic knowledge and pronunciation space
    • ICASSP, IEEE, Taipei, Taiwan
    • S. Xu, J. Jiang, Z. Chen, B. Xu, Automatic pronunciation error detection based on linguistic knowledge and pronunciation space, in: ICASSP, IEEE, Taipei, Taiwan, 2009, pp. 4841-4844. http://dx.doi.org/10.1109/ICASSP.2009.4960715.
    • (2009) , pp. 4841-4844
    • Xu, S.1    Jiang, J.2    Chen, Z.3    Xu, B.4
  • 8
    • 84922704934 scopus 로고    scopus 로고
    • Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers
    • W. Hu, Y. Qian, F.K. Song, Y. Wang, Improved mispronunciation detection with deep neural network trained acoustic models and transfer learning based logistic regression classifiers, Speech Commun. 67 (2015) 154-166, . http://dx.doi.org/10.1016/j.specom.2014.12.008.
    • (2015) Speech Commun. , vol.67 , pp. 154-166
    • Hu, W.1    Qian, Y.2    Song, F.K.3    Wang, Y.4
  • 9
    • 0345852475 scopus 로고    scopus 로고
    • The STAR system: an interactive pronunciation tutor for young children
    • M. Russell, R.W. Series, J.L. Wallace, C. Brown, A. Skilling, The STAR system: an interactive pronunciation tutor for young children, Comput Speech Lang. (2000) 161-175, . http://dx.doi.org/10.1006/csla.2000.0139.
    • (2000) Comput Speech Lang. , pp. 161-175
    • Russell, M.1    Series, R.W.2    Wallace, J.L.3    Brown, C.4    Skilling, A.5
  • 10
    • 84969387033 scopus 로고    scopus 로고
    • Applying Speech and Language Technology to Foreign Language Education
    • G. Demenko, N. Cylwik, A. Wagner, Applying Speech and Language Technology to Foreign Language Education.
    • Demenko, G.1    Cylwik, N.2    Wagner, A.3
  • 11
    • 84969400356 scopus 로고    scopus 로고
    • An audiovisual feedback system for acquiring l2 pronunciation and l2 prosody
    • G. Demenko, A. Wagner, N. Cylwik, O. Jokisch, An audiovisual feedback system for acquiring l2 pronunciation and l2 prosody, in: SLaTE 2009.
    • (2009) SLaTE
    • Demenko, G.1    Wagner, A.2    Cylwik, N.3    Jokisch, O.4
  • 12
    • 85133211643 scopus 로고    scopus 로고
    • The EURONOUNCE corpus of non-native polish for ASR-based pronunciation tutoring system
    • N. Cylwik, A. Wagner, G. Demenko, The EURONOUNCE corpus of non-native polish for ASR-based pronunciation tutoring system, in: SLaTE, 2009.
    • (2009) SLaTE
    • Cylwik, N.1    Wagner, A.2    Demenko, G.3
  • 13
    • 84905223885 scopus 로고    scopus 로고
    • Phonological modeling of mispronunciation gradations in L2 English speech of L1 Chinese learners
    • Florence, Italy, May 4-9
    • H. Wang, X. Qian, H. Meng, Phonological modeling of mispronunciation gradations in L2 English speech of L1 Chinese learners, in: IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014, Florence, Italy, May 4-9, 2014, pp. 7714-7718, . doi:10.1109/ICASSP.2014.6855101.
    • (2014) IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP 2014 , pp. 7714-7718
    • Wang, H.1    Qian, X.2    Meng, H.3
  • 14
    • 15044362454 scopus 로고    scopus 로고
    • Use of speech recognition in computer-assisted language learning
    • doctoral dissertation, University of Cambridge, November
    • S.M. Witt, Use of speech recognition in computer-assisted language learning, doctoral dissertation, University of Cambridge, November 1999.
    • (1999)
    • Witt, S.M.1
  • 15
    • 0034140966 scopus 로고    scopus 로고
    • Phone-level pronunciation scoring and assessment for interactive language learning
    • Witt S.M., Young S.J. Phone-level pronunciation scoring and assessment for interactive language learning. Speech Commun. 2000, 30(2-3):95-108.
    • (2000) Speech Commun. , vol.30 , Issue.2-3 , pp. 95-108
    • Witt, S.M.1    Young, S.J.2
  • 16
    • 84912109010 scopus 로고    scopus 로고
    • A new neural network based logistic regression classifier for improving mispronunciation detection of L2 language learners
    • W. Hu, Y. Qian, F. Soong, A new neural network based logistic regression classifier for improving mispronunciation detection of L2 language learners, in: 2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2014, pp. 245-249, . doi:10.1109/ISCSLP.2014.6936712.
    • (2014) 2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP) , pp. 245-249
    • Hu, W.1    Qian, Y.2    Soong, F.3
  • 17
    • 84912101692 scopus 로고    scopus 로고
    • Mispronunciation detection and diagnosis in l2 English speech using multi-distribution deep neural networks
    • K. Li, H. Meng, Mispronunciation detection and diagnosis in l2 English speech using multi-distribution deep neural networks, in: 2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP), 2014, pp. 255-259, . doi:10.1109/ISCSLP.2014.6936724.
    • (2014) 2014 9th International Symposium on Chinese Spoken Language Processing (ISCSLP) , pp. 255-259
    • Li, K.1    Meng, H.2
  • 19
    • 84901048193 scopus 로고    scopus 로고
    • A prototype of an adaptive Chinese pronunciation training system
    • Liao H.-C., Guan Y.-H., Tu J.-J., Chen J.-C. A prototype of an adaptive Chinese pronunciation training system. System 2014, 45:52-66.
    • (2014) System , vol.45 , pp. 52-66
    • Liao, H.-C.1    Guan, Y.-H.2    Tu, J.-J.3    Chen, J.-C.4
  • 22
    • 84879839980 scopus 로고    scopus 로고
    • Improving mispronunciation detection using adaptive frequency scale
    • Ge Z., Sharma S.R., Smith M.J.T. Improving mispronunciation detection using adaptive frequency scale. Comput. Electr. Eng. 2013, 39(5):1464-1472. 10.1016/j.compeleceng.2012.12.001.
    • (2013) Comput. Electr. Eng. , vol.39 , Issue.5 , pp. 1464-1472
    • Ge, Z.1    Sharma, S.R.2    Smith, M.J.T.3
  • 23
    • 84856609002 scopus 로고    scopus 로고
    • Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition
    • Sahidullah M., Saha G. Design, analysis and experimental evaluation of block based transformation in MFCC computation for speaker recognition. Speech Commun. 2012, 54(4):543-565.
    • (2012) Speech Commun. , vol.54 , Issue.4 , pp. 543-565
    • Sahidullah, M.1    Saha, G.2
  • 24
    • 84881410102 scopus 로고    scopus 로고
    • Identification of language using mel-frequency cepstral coefficients (MFCC)
    • International Conference on Modelling, Optimization and Computing
    • Koolagudi S.G., Rastogi D., Rao K.S. Identification of language using mel-frequency cepstral coefficients (MFCC). Proc. Eng. 2012, 38(0):3391-3398. International Conference on Modelling, Optimization and Computing.
    • (2012) Proc. Eng. , vol.38 , pp. 3391-3398
    • Koolagudi, S.G.1    Rastogi, D.2    Rao, K.S.3
  • 25
    • 84876245465 scopus 로고    scopus 로고
    • On mispronunciation analysis of individual foreign speakers using auditory periphery models
    • Koniaris C., Salvi G., Engwall O. On mispronunciation analysis of individual foreign speakers using auditory periphery models. Speech Commun. 2013, 55(5):691-706.
    • (2013) Speech Commun. , vol.55 , Issue.5 , pp. 691-706
    • Koniaris, C.1    Salvi, G.2    Engwall, O.3
  • 26
    • 84890507052 scopus 로고    scopus 로고
    • Toward unsupervised discovery of pronunciation error patterns using universal phoneme posteriorgram for computer-assisted language learning
    • Y.-B. Wang, L.-S. Lee, Toward unsupervised discovery of pronunciation error patterns using universal phoneme posteriorgram for computer-assisted language learning, in: 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2013, pp. 8232-8236, . doi:10.1109/ICASSP.2013.6639270.
    • (2013) 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) , pp. 8232-8236
    • Wang, Y.-B.1    Lee, L.-S.2
  • 27
    • 0023168578 scopus 로고
    • A connected speech recognition system based on spotting diphone-like segments-preliminary results
    • ICASSP [U+05F3]87
    • A. Rosenberg, A. Colla, A connected speech recognition system based on spotting diphone-like segments-preliminary results, in: IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP [U+05F3]87, vol. 12, 1987, pp. 85-88.
    • (1987) IEEE International Conference on Acoustics, Speech, and Signal Processing , vol.12 , pp. 85-88
    • Rosenberg, A.1    Colla, A.2
  • 28
    • 0000114416 scopus 로고    scopus 로고
    • Pronunciation modeling by sharing Gaussian densities across phonetic models
    • M. Saraclar, H.J. Nock, S. Khudanpur, Pronunciation modeling by sharing Gaussian densities across phonetic models. Comput. Speech Lang. (2000) 137-160, . http://dx.doi.org/10.1006/csla.2000.0140.
    • (2000) Comput. Speech Lang. , pp. 137-160
    • Saraclar, M.1    Nock, H.J.2    Khudanpur, S.3
  • 29
    • 4544318800 scopus 로고    scopus 로고
    • Pronunciation change in conversational speech and its implications for automatic speech recognition
    • M. Saraclar, S. Khudanpur, Pronunciation change in conversational speech and its implications for automatic speech recognition, Comput. Speech Lang. (2004) 375-395.
    • (2004) Comput. Speech Lang. , pp. 375-395
    • Saraclar, M.1    Khudanpur, S.2
  • 31
    • 84867615078 scopus 로고    scopus 로고
    • Syllable: A self-contained unit to model pronunciation variation
    • ICASSP, IEEE, Kyoto, Japan
    • R.W.M. Ng, K. Hirose, Syllable: A self-contained unit to model pronunciation variation, in: ICASSP, IEEE, Kyoto, Japan, 2012, pp. 4457-4460. ISBN: 978-1-4673-0046-9; . http://dx.doi.org/10.1109/ICASSP.2012.6288909.
    • (2012) , pp. 4457-4460
    • Ng, R.W.M.1    Hirose, K.2
  • 32
    • 70349317164 scopus 로고    scopus 로고
    • Triphone statistics for polish language
    • Z. Vetulani, H. Uszkoreit (Eds.), LTC, Springer, Berlin, Heidelberg
    • B. Ziółko, J. Gałka, S. Manandhar, R.C. Wilson, M. Ziółko, Triphone statistics for polish language, in: Z. Vetulani, H. Uszkoreit (Eds.), LTC, Lecture Notes in Computer Science, vol. 5603, Springer, Berlin, Heidelberg, 2007, pp. 63-73.
    • (2007) Lecture Notes in Computer Science , vol.5603 , pp. 63-73
    • Ziółko, B.1    Gałka, J.2    Manandhar, S.3    Wilson, R.C.4    Ziółko, M.5
  • 33
    • 85007884499 scopus 로고    scopus 로고
    • A look at the research on computer-based technology use in second language learning: a review of the literature from 1990-2000
    • M. Liu, Z. Moore, L. Graham, S. Lee, A look at the research on computer-based technology use in second language learning: a review of the literature from 1990-2000, J. Res. Technol. Educ. 34(3) (2002).
    • (2002) J. Res. Technol. Educ. , vol.34 , Issue.3
    • Liu, M.1    Moore, Z.2    Graham, L.3    Lee, S.4
  • 34
    • 33646761676 scopus 로고    scopus 로고
    • Pronunciation learning and foreign accent reduction by an audiovisual feedback system.
    • J. Tao, T. Tan, R.W. Picard (Eds.), ACII, Springer
    • O. Jokisch, U. Koloska, D. Hirschfeld, R. Hoffmann, Pronunciation learning and foreign accent reduction by an audiovisual feedback system., in: J. Tao, T. Tan, R.W. Picard (Eds.), ACII, Lecture Notes in Computer Science, vol. 3784, Springer, 2005, pp. 419-425.
    • (2005) Lecture Notes in Computer Science , vol.3784 , pp. 419-425
    • Jokisch, O.1    Koloska, U.2    Hirschfeld, D.3    Hoffmann, R.4
  • 36
    • 85045160643 scopus 로고    scopus 로고
    • The isle corpus. Italian and German spoken learners English
    • Atwell E., Howarth P., Souter C. The isle corpus. Italian and German spoken learners English. ICAME J. 2003, 27:5-18.
    • (2003) ICAME J. , vol.27 , pp. 5-18
    • Atwell, E.1    Howarth, P.2    Souter, C.3
  • 37
    • 84957848808 scopus 로고    scopus 로고
    • Pronunciation error detection using dynamic time warping algorithm
    • E. Piȩtka, J. Kawa, W. Wiȩcławek (Eds.), vol. 284, Springer International Publishing, Gliwice
    • M. Bugdol, Z. Segiet, M. Krȩcichwost, Pronunciation error detection using dynamic time warping algorithm, in: E. Piȩtka, J. Kawa, W. Wiȩcławek (Eds.), Information Technologies in Biomedicine, vol. 4, Advances in Intelligent Systems and Computing, vol. 284, Springer International Publishing, Gliwice, 2014, pp. 345-354, ISBN: 978-3-319-06595-3, . doi:10.1007/978-3-319-06596-0_32.
    • (2014) Information Technologies in Biomedicine, Advances in Intelligent Systems and Computing , vol.4 , pp. 345-354
    • Bugdol, M.1    Segiet, Z.2    Krȩcichwost, M.3
  • 38
    • 0036722013 scopus 로고    scopus 로고
    • A DTW-based probability model for speaker feature analysis and data mining
    • Liu J., Cheng Q., Zheng Z., Qian M. A DTW-based probability model for speaker feature analysis and data mining. Pattern Recognit. Lett. 2002, 23(11):1271-1276. 10.1016/S0167-8655(02)00068-5.
    • (2002) Pattern Recognit. Lett. , vol.23 , Issue.11 , pp. 1271-1276
    • Liu, J.1    Cheng, Q.2    Zheng, Z.3    Qian, M.4
  • 39
    • 84858419571 scopus 로고    scopus 로고
    • Speech recognition based on efficient DTW algorithm and its DSP implementation
    • International Workshop on Information and Electronics Engineering
    • Jing XinXing, Shi Xu, Speech recognition based on efficient DTW algorithm and its DSP implementation, Proc. Eng. 29 (2012) 832-836, 2012, International Workshop on Information and Electronics Engineering, . doi:10.1016/j.proeng.2012.01.050.
    • (2012) Proc. Eng. , vol.29 , Issue.2012 , pp. 832-836
    • XinXing, J.1    Xu, S.2
  • 40
    • 84957855281 scopus 로고    scopus 로고
    • Implementation of grapheme-to-phoneme rules and extended SAMPA alphabet in polish text-to-speech synthesis, Poznań
    • G. Demenko, M. Wypych, E. Baranowska, Implementation of grapheme-to-phoneme rules and extended SAMPA alphabet in polish text-to-speech synthesis, Poznań 7(17) (2003).
    • (2003) , vol.7 , Issue.17
    • Demenko, G.1    Wypych, M.2    Baranowska, E.3
  • 41
    • 85016611268 scopus 로고    scopus 로고
    • Polish phoneme statistics obtained on large set of written texts
    • B. Ziolko, J. Galka, M. Ziolko, Polish phoneme statistics obtained on large set of written texts, Comput. Sci. 10(3).
    • Comput. Sci. , vol.10 , Issue.3
    • Ziolko, B.1    Galka, J.2    Ziolko, M.3
  • 42
    • 84957855283 scopus 로고    scopus 로고
    • The SAMPA Homepage, 〈〉
    • J. Wells, The SAMPA Homepage, 〈〉. http://www.phon.ucl.ac.uk/home/sampa/index.html.
    • Wells, J.1
  • 43
    • 84957855284 scopus 로고    scopus 로고
    • The goodness of pronunciation algorithm: a detailed performance study, in: SLaTE 2009
    • S. Kanters, C. Cucchiarini, H. Strik, The goodness of pronunciation algorithm: a detailed performance study, in: SLaTE 2009, 2009.
    • (2009)
    • Kanters, S.1    Cucchiarini, C.2    Strik, H.3
  • 44
    • 84957839622 scopus 로고
    • Using dynamic time warping to find patterns in time series, KDD-94: AAAI Workshop on Knowledge Discovery in Databases, Seattle, Washington, July
    • D.J. Bemdt, J. Clifford, Using dynamic time warping to find patterns in time series, KDD-94: AAAI Workshop on Knowledge Discovery in Databases, Seattle, Washington, pp. 359-370 (July 1994).
    • (1994) , pp. 359-370
    • Bemdt, D.J.1    Clifford, J.2
  • 45
    • 56349107806 scopus 로고
    • Considerations in dynamic time warping algorithms for discrete word recognition
    • Rabiner L.R. Considerations in dynamic time warping algorithms for discrete word recognition. Acoust. Soc. Am. J. 1978, 63:79. 10.1121/1.2016831.
    • (1978) Acoust. Soc. Am. J. , vol.63 , pp. 79
    • Rabiner, L.R.1
  • 46
    • 41749090269 scopus 로고    scopus 로고
    • Toward accurate dynamic time warping in linear time and space
    • Salvador S., Chan P. Toward accurate dynamic time warping in linear time and space. Intell. Data Anal. 2007, 11(5):561-580.
    • (2007) Intell. Data Anal. , vol.11 , Issue.5 , pp. 561-580
    • Salvador, S.1    Chan, P.2
  • 47
    • 33746089199 scopus 로고
    • Dynamic programming algorithm optimization for spoken word recognition
    • in: Readings in Speech Recognition, Morgan Kaufmann Publishers Inc., San Francisco, CA, USA
    • H. Sakoe, S. Chiba, Dynamic programming algorithm optimization for spoken word recognition, in: Readings in Speech Recognition, Morgan Kaufmann Publishers Inc., San Francisco, CA, USA, 1990, pp. 159-165.
    • (1990) , pp. 159-165
    • Sakoe, H.1    Chiba, S.2
  • 48
    • 84957855285 scopus 로고    scopus 로고
    • Everything you know about dynamic time warping is wrong, 3rd Workshop on Mining Temporal and Sequential Data
    • Knowledge Discovery and Data Mining, Seattle
    • C. A. Ratanamahatana, E. Keogh, Everything you know about dynamic time warping is wrong, 3rd Workshop on Mining Temporal and Sequential Data, in conjunction with 10th ACM SIGKDD Int. Conf. Knowledge Discovery and Data Mining, Seattle (2004).
    • (2004) conjunction with 10th ACM SIGKDD Int. Conf.
    • Ratanamahatana, C.A.1    Keogh, E.2
  • 49
    • 84957855286 scopus 로고    scopus 로고
    • Zastosowanie parametryzacji miesznej w systemie rozpoznawania mowy polskiej, Technical report, Instytut Radioelektroniki, Politechnika Warszawska, Warszawa
    • S. Wydra, Zastosowanie parametryzacji miesznej w systemie rozpoznawania mowy polskiej, Technical report, Instytut Radioelektroniki, Politechnika Warszawska, Warszawa, 2006.
    • (2006)
    • Wydra, S.1
  • 50
    • 84867609992 scopus 로고    scopus 로고
    • Improved approaches of modeling and detecting error patterns with empirical analysis for computer-aided pronunciation training
    • 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
    • Y.-B. Wang, L. shan Lee, Improved approaches of modeling and detecting error patterns with empirical analysis for computer-aided pronunciation training, in: 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2012, pp. 5049-5052, . doi:10.1109/ICASSP.2012.6289055.
    • (2012) , pp. 5049-5052
    • Wang, Y.-B.1    Shan Lee, L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.