메뉴 건너뛰기




Volumn 15, Issue 7, 2007, Pages 2160-2168

Knowledge-based adaptive decision tree state tying for conversational speech recognition

Author keywords

Acoustic modeling; Approximate Bayesian; Decision tree state tying; Implicit prior; Speech recognition

Indexed keywords

ACOUSTIC MODELING; ACOUSTIC MODELS; APPROXIMATE BAYESIAN; BAYESIAN LEARNING FRAMEWORKS; CONVERSATIONAL SPEECH RECOGNITION; DECISION RULES; DECISION TREE STATE TYING; DOMAIN SPECIFICS; GREEDY SEARCHES; IMPLICIT PRIOR; LARGE DATUM; MODEL QUALITIES; PHONETIC DECISION TREES; PRIOR KNOWLEDGE; PRONUNCIATION VARIATIONS; RECOGNITION ACCURACIES; SPEAKER ADAPTATIONS; TRANSFORMATION OF TREES; TREE GROWING; TREE STRUCTURES;

EID: 64549109650     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2007.901830     Document Type: Article
Times cited : (6)

References (25)
  • 1
    • 1842527766 scopus 로고    scopus 로고
    • The use of subword linguistic modeling for multiple tasks in speech recognition
    • Apr
    • S. Seneff, "The use of subword linguistic modeling for multiple tasks in speech recognition," Speech Commun., vol. 42, pp. 373-390, Apr. 2004.
    • (2004) Speech Commun , vol.42 , pp. 373-390
    • Seneff, S.1
  • 2
    • 0033357399 scopus 로고    scopus 로고
    • Speaking in shorthand-A syllable-centric perspective for understanding pronunciation variation
    • Nov
    • S. Greenberg, "Speaking in shorthand-A syllable-centric perspective for understanding pronunciation variation," Speech Commun., vol. 29, no. 2-4, pp. 159-176, Nov. 1999.
    • (1999) Speech Commun , vol.29 , Issue.2-4 , pp. 159-176
    • Greenberg, S.1
  • 4
    • 19944415893 scopus 로고    scopus 로고
    • Implicit modeling of pronunciation variation in automatic speech recognition
    • T. Hain, "Implicit modeling of pronunciation variation in automatic speech recognition," Speech Commun., vol. 26, pp. 171-188, 2005.
    • (2005) Speech Commun , vol.26 , pp. 171-188
    • Hain, T.1
  • 6
    • 0000114416 scopus 로고    scopus 로고
    • Pronunciation modeling by sharing Gaussian densities across phonetic models
    • M. Saraclar, H. J. Nock, and S. Khudanpur, "Pronunciation modeling by sharing Gaussian densities across phonetic models," Comput. Speech Lang., vol. 14, pp. 137-160, 2000.
    • (2000) Comput. Speech Lang , vol.14 , pp. 137-160
    • Saraclar, M.1    Nock, H.J.2    Khudanpur, S.3
  • 7
    • 0034273299 scopus 로고    scopus 로고
    • Robust decision tree state tying for continuous speech recognition
    • Sep
    • W. Reichl and W. Chou, "Robust decision tree state tying for continuous speech recognition," IEEE Trans. Speech Audio Process., vol. 8, no. 5, pp. 555-566, Sep. 2000.
    • (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.5 , pp. 555-566
    • Reichl, W.1    Chou, W.2
  • 8
    • 18744376902 scopus 로고    scopus 로고
    • Predictive hidden Markov model selection for speech recognition
    • May
    • J.-T. Chien and S. Furui, "Predictive hidden Markov model selection for speech recognition," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 377-387, May 2005.
    • (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.3 , pp. 377-387
    • Chien, J.-T.1    Furui, S.2
  • 9
    • 0141906266 scopus 로고    scopus 로고
    • Acoustic model clustering based on syllable structure
    • I. Shafran and M. Ostendorf, "Acoustic model clustering based on syllable structure," Comput. Speech Lang., vol. 17, no. 4, pp. 311-328, 2003.
    • (2003) Comput. Speech Lang , vol.17 , Issue.4 , pp. 311-328
    • Shafran, I.1    Ostendorf, M.2
  • 10
    • 0035440798 scopus 로고    scopus 로고
    • Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation
    • Sep
    • S. Wang and Y. Zhao, " Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, no. 6, pp. 663-677, Sep. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.6 , pp. 663-677
    • Wang, S.1    Zhao, Y.2
  • 11
    • 0035279111 scopus 로고    scopus 로고
    • A structural Bayes approach to speaker adaptation
    • Mar
    • K. Shinoda and C.-H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 276-287, Mar. 2001.
    • (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.3 , pp. 276-287
    • Shinoda, K.1    Lee, C.-H.2
  • 13
    • 0033335618 scopus 로고    scopus 로고
    • Modeling pronunciation variation for ASR: A survey of the literature
    • H. Strik and C. Cucchiarini, "Modeling pronunciation variation for ASR: A survey of the literature," Speech Commun., vol. 29, pp. 225-246, 1999.
    • (1999) Speech Commun , vol.29 , pp. 225-246
    • Strik, H.1    Cucchiarini, C.2
  • 15
    • 0003637516 scopus 로고
    • A Theory of Learning Classification Rules,
    • Ph.D. dissertation, School of Comput. Sci, Univ. Technology, Sydney
    • W. L. Buntine, "A Theory of Learning Classification Rules," Ph.D. dissertation, School of Comput. Sci., Univ. Technology, Sydney, 1992.
    • (1992)
    • Buntine, W.L.1
  • 18
    • 0000120766 scopus 로고
    • Estimating the dimension of a model
    • G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol. 6, no. 2, pp. 465-471, 1978.
    • (1978) Ann. Statist , vol.6 , Issue.2 , pp. 465-471
    • Schwarz, G.1
  • 19
    • 0001822107 scopus 로고
    • Catalan numbers, their generalization, and their uses
    • P. Hilton and J. Pedersen, "Catalan numbers, their generalization, and their uses," Math. Intell., vol. 13, no. 2, pp. 64-75, 1991.
    • (1991) Math. Intell , vol.13 , Issue.2 , pp. 64-75
    • Hilton, P.1    Pedersen, J.2
  • 20
    • 0038676761 scopus 로고    scopus 로고
    • Towards knowledge- based features forHMMbased large vocabulary automatic speech recognition
    • B. Launay, O. Siohan, A. Surendran, and C.-H. Lee, "Towards knowledge- based features forHMMbased large vocabulary automatic speech recognition," in Proc. ICASSP02, 2002, vol. 1, pp. I-817-I-820.
    • (2002) Proc. ICASSP02 , vol.1
    • Launay, B.1    Siohan, O.2    Surendran, A.3    Lee, C.-H.4
  • 21
    • 64549085552 scopus 로고    scopus 로고
    • quot;The HTK Toolkit. [Online]. Available: http://htk.eng.cam.ac. uk/
    • quot;The HTK Toolkit." [Online]. Available: http://htk.eng.cam.ac. uk/
  • 22
    • 0028996876 scopus 로고
    • Improved backing-off for M-gram language modeling
    • R. R. Kneser and H. Ney, "Improved backing-off for M-gram language modeling," in Proc. ICASSP, 1995, pp. 181-184.
    • (1995) Proc. ICASSP , pp. 181-184
    • Kneser, R.R.1    Ney, H.2
  • 23
    • 84891308106 scopus 로고    scopus 로고
    • SRILM-An extensible language modeling toolkit
    • Denver, CO, Sep
    • A. Stolcke, "SRILM-An extensible language modeling toolkit," in Proc. ICSLP, Denver, CO, Sep. 2002, pp. 901-904.
    • (2002) Proc. ICSLP , pp. 901-904
    • Stolcke, A.1
  • 24
    • 34248589754 scopus 로고    scopus 로고
    • A novel method of language modeling for automatic captioning in tc video teleconferencing
    • May
    • X. Zhang, Y. Zhao, and L. Schopp, "A novel method of language modeling for automatic captioning in tc video teleconferencing," IEEE Trans. Inf. Technol. Biomed., vol. 11, no. 3, pp. 332-337, May 2007.
    • (2007) IEEE Trans. Inf. Technol. Biomed , vol.11 , Issue.3 , pp. 332-337
    • Zhang, X.1    Zhao, Y.2    Schopp, L.3
  • 25
    • 33749555597 scopus 로고    scopus 로고
    • A fast and memory-efficient N-gram language model lookup method for large vocabulary continuous speech recognition
    • X. Li and Y. Zhao, "A fast and memory-efficient N-gram language model lookup method for large vocabulary continuous speech recognition," Comput. Speech Lang., vol. 21, pp. 1-25, 2007.
    • (2007) Comput. Speech Lang , vol.21 , pp. 1-25
    • Li, X.1    Zhao, Y.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.