Volume 49, Issue 12, 2007, Pages 861-873

Highly accurate children's speech recognition for interactive reading tutors using subword units

Author keywords

Language modeling; Literacy tutors; Reading tracking; Subword unit based speech recognition

Indexed keywords

DATA ACQUISITION; MATHEMATICAL MODELS; STATISTICAL METHODS;

EID: 34748820596     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2007.05.004     Document Type: Article
Times cited: 69

References (42)
  • 1
    • 34748870995
    • Aist, G., Chan, P., Huang, X., Jiang, L., Kennedy, R., Latimer, D., Mostow, J., Yeung, C., 1998. How effective is unsupervised data collection for children's speech recognition? In: Proc. ICSLP 98 Sydney, Australia.
  • 2
    • 85009136762
    • D'Arcy, S., Wong, L., Russell, M., 2004. Recognition of read and spontaneous children's speech using two new corpora. In: Proc. ICSLP 2004, Jeju Island, Korea.
  • 3
    • 51849142134
    • Banerjee, S., Beck, J., Mostow, J., 2003a. Evaluating the effect of predicting oral reading miscues. In: Proc. Eurospeech 2003, Geneva, Switzerland.
  • 4
    • 34748885796
    • Banerjee, S., Mostow, J., Beck, J., Tam, W., 2003b. Improving language models by learning from speech recognition errors in a reading tutor that listens. In: Proc. Second Internat. Conf. on Applied Artificial Intelligence 2003, Fort Panhala, Kolhapur, India.
  • 5
    • 34748915431
    • Bazzi, I., 2002. Modelling out-of-vocabulary words for robust speech recognition. Ph.D. Thesis, MIT, June 2002, Department of Electrical Engineering and Computer Science.
  • 6
    • 34748914474
    • Cole, R., Hosom, P., Pellom, B., 2006a. University of Colorado Prompted and Read Children's Speech Corpus. Technical Report TR-CSLR-2006-02, Center for Spoken Language Research, University of Colorado, Boulder.
  • 7
    • 34748914851
    • Cole, R., Pellom, B., 2006b. University of Colorado Read and Summarized Stories Corpus. Technical Report TR-CSLR-2006-03, Center for Spoken Language Research, University of Colorado, Boulder.
  • 9
    • 34748874316
    • Cole, R., Wise, B., Van Vuuren, S., 2006. How Marni teaches children to read. Educ. Technol. 47 (1), 14-18.
  • 10
    • 34748845388
    • COLit, 2004. Colorado Literacy Tutor Project. .
  • 11
    • 34748823216
    • Cosi, P., Pellom, B., 2005. Italian Children's speech recognition for advanced interactive literacy tutors. In: Proc. Eurospeech 2005, Lisbon, Portugal.
  • 12
    • 34748865091
    • Creutz, M., Lagus, K., 2002. Unsupervised discovery of morphemes. In: Proc. Workshop on Morphological and Phonological Learning of ACL-02, Philadelphia, pp. 21-30.
  • 13
    • 0031644298
    • Das, S., Nix D., Picheny, M., 1998. Improvements in children's speech recognition performance. In: Proc. ICASSP 98, Seattle, WA.
  • 14
    • 33745197755
    • Eskenazi, M., 1996. KIDS: A database of children's speech. J. Acoust. Soc. Amer. 100 (4, Part 2).
  • 15
    • 34748824399
    • Fogarty, J., Dabbish, L., Steck, D.M., Mostow, J., 2001. Mining a database of reading mistakes: For what should an automated Reading Tutor listen? In: Proc. Tenth Internat. Conf. on Artificial Intelligence in Education (AI-ED) 2001, San Antonio, Texas.
  • 16
    • 34748853374
    • Gales, M., 1997. Maximum likelihood linear transformations for HMM-based speech recognition. Technical Report, CUED/F-INFENG/TR291, Cambridge University.
  • 17
    • 85143190560
    • Giuliani, D., Gerosa, M., 2003. Investigating recognition of children's speech. In: Proc. ICASSP 2003, Hong Kong.
  • 18
    • 56149113752
    • Gustafson, J., Sjolander, K., 2002. Voice transformations for improving children's speech recognition in a publicly available dialogue system. In: Proc. ICSLP 2002, Denver, Colorado.
  • 19
    • 33846253444
    • Hacioglu, K., Pellom, B., Ciloglu, T., Ozturk, O., Kurimo, M., Creutz, M., 2003. On lexicon creation for Turkish LVCSR. In: Proc. Eurospeech 2003, Geneva, Switzerland.
  • 20
    • 34748816452
    • Hagen, A., Pellom, B., 2005a. A Multi-layered lexical-tree based token passing architecture for efficient recognition of subword speech units. In: The 2nd Language and Tech. Conf., Poznan, Poland.
  • 21
    • 34748826432
    • Hagen, A., Pellom, B., 2005b. Data driven subword unit modeling for speech recognition and its application to interactive reading tutors. In: Interspeech 2005, Lisbon, Portugal.
  • 22
    • 84946707630
    • Hagen, A., Pellom, B., Cole, R., 2003. Children's speech recognition with application to interactive books and tutors. In: IEEE Automatic Speech Recognition and Understanding (ASRU) Workshop, St. Thomas.
  • 23
    • 34748861752
    • Hagen, A., Pellom, B., Van Vuuren, S., Cole, R., 2004. Advances in children's speech recognition within an interactive literacy tutor. HLT-NAACL, Boston, May 2004.
  • 24
    • 34748865655
    • Lee, S., Potamianos, A., Narayanan, S., 1997. Analysis of children's speech: duration, pitch and formants. In: Proc. EUROSPEECH 97, Rhodes, Greece.
  • 25
    • 0032969462
    • Lee, S., Potamianos, A., Narayanan, S., 1999. Acoustics of children's speech: developmental changes of temporal and spectral parameters. J. Acoust. Soc. Amer. 105, 1455-1468.
  • 26
    • 34748819630
    • Lee, K., Hagen, A., Romanyshyn, N., Martin, S., Pellom, B., 2004. Analysis and detection of reading miscues for interactive literacy tutors. COLING, Geneva, Switzerland.
  • 27
    • 85009291880
    • Li, Q., Russell, M., 2002. An analysis of the causes of increased error rates in children's speech recognition. In: Proc. ICSLP 02, Denver, Colorado.
  • 28
    • 34748868113
    • McCandless, M., 1992. Word rejection for a literacy tutor. S.B. Thesis, MIT, May 1992, Department of Electrical Engineering and Computer Science.
  • 29
    • 0028601241
    • Mostow, J., Roth, S.F., Hauptmann, A.G., Kane, M., 1994. A prototype reading coach that listens. In: Proc. of AAAI-94, Seattle, WA, pp. 785-792.
  • 30
    • 85009262059
    • Mostow, J., Beck, J., Winter, S., Wang, S., Tobin, B., 2002. Predicting oral reading miscues. In: ICSLP 2002, Denver, Colorado.
  • 31
    • 34748916051
    • Pellom, B., 2001. SONIC: The University of Colorado Continuous Speech Recognizer. Technical Report TR-CSLR-2001-01, University of Colorado.
  • 32
    • 0141591620
    • Pellom, B., Hacioglu, K., 2003. Recent improvements in the CU SONIC ASR system for noisy speech: the SPINE task. In: Proc. ICASSP 2003, Hong Kong.
  • 34
    • 34748822003
    • Potamianos, A., Narayanan, S., Lee, S., 1997. Automatic speech recognition for children. In: Proc. EUROSPEECH 97, Rhodes, Greece.
  • 35
    • 85009064115
    • Shobaki, K., Hosom, J.P., Cole, R., 2000. The OGI Kids' Speech Corpus and recognizers. In: Proc. ICSLP 2000, Beijing, China.
  • 36
    • 0036461005
    • Siohan, O., Myrvoll, T., Lee, C.H., 2002. Structural maximum a posteriori linear regression for fast HMM adaptation. Computer Speech and Language 16, 5-24.
  • 37
    • 34748912638
    • Spache, G.D., 1981. Diagnostic Reading Scales. Del Monte Research Park, Monterey, CA 93940: CTB, Macmillan/McGraw-Hill.
  • 38
    • 51849105470
    • Tam, Y.C., Mostow, J., Beck, J., Banerjee, S., 2003. Training a confidence measure for a reading tutor that listens. In: Proc. Eurospeech 2003, Geneva, Switzerland.
  • 39
    • 34748925505
    • van Vuuren, S., Cole, R., Ngampatipatpong, N., 2006. Providing feedback to students while reading out loud in interactive books. Technical Report TR-CSLR-2006-01, Center for Spoken Language Research, University of Colorado, Boulder.
  • 40
    • 34748836637
    • Welling, L., Kanthak, S., Ney, H., 1999. Improved methods for vocal tract length normalization. In: Proc. ICASSP 99, Phoenix, Arizona.
  • 42
    • 34748816451
    • Young, S.J., Russell, N.H., Thornton, J.H.S., 1989. Token passing: a simple conceptual model for connected speech recognition systems. Cambridge University, Technical Report CUED/F-INFENG/TR.38.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.