SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 15, Issue 7, 2007, Pages 2160-2168

Knowledge-based adaptive decision tree state tying for conversational speech recognition

(2) Hu, Rusheng a,b Zhao, Yunxin a

a University of Missouri Columbia (United States)

b Capital One (United States)

Author keywords

Acoustic modeling; Approximate Bayesian; Decision tree state tying; Implicit prior; Speech recognition

Indexed keywords

ACOUSTIC MODELING; ACOUSTIC MODELS; APPROXIMATE BAYESIAN; BAYESIAN LEARNING FRAMEWORKS; CONVERSATIONAL SPEECH RECOGNITION; DECISION RULES; DECISION TREE STATE TYING; DOMAIN SPECIFICS; GREEDY SEARCHES; IMPLICIT PRIOR; LARGE DATUM; MODEL QUALITIES; PHONETIC DECISION TREES; PRIOR KNOWLEDGE; PRONUNCIATION VARIATIONS; RECOGNITION ACCURACIES; SPEAKER ADAPTATIONS; TRANSFORMATION OF TREES; TREE GROWING; TREE STRUCTURES;

ACOUSTICS; BAYESIAN NETWORKS; DECISION TREES; KNOWLEDGE BASED SYSTEMS; SPEECH ANALYSIS; TELEMEDICINE;

SPEECH RECOGNITION;

EID: 64549109650 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2007.901830 Document Type: Article

Times cited : (6)

References (25)

1
- 1842527766
- The use of subword linguistic modeling for multiple tasks in speech recognition
- Apr
- S. Seneff, "The use of subword linguistic modeling for multiple tasks in speech recognition," Speech Commun., vol. 42, pp. 373-390, Apr. 2004.
- (2004) Speech Commun , vol.42 , pp. 373-390
- Seneff, S.¹

2
- 0033357399
- Speaking in shorthand-A syllable-centric perspective for understanding pronunciation variation
- Nov
- S. Greenberg, "Speaking in shorthand-A syllable-centric perspective for understanding pronunciation variation," Speech Commun., vol. 29, no. 2-4, pp. 159-176, Nov. 1999.
- (1999) Speech Commun , vol.29 , Issue.2-4 , pp. 159-176
- Greenberg, S.¹

3
- 0034853397
- What kind of pronunciation variation is hard for triphones to model?
- Salt Lake City, UT, May
- D. Jurafsky, W.Ward, J. Zhang, K. Herold, X. Yu, and S. Zhang, "What kind of pronunciation variation is hard for triphones to model?," in Proc. 2001 IEEE Int. Conf. Acoust., Speech, Signal Process., Salt Lake City, UT, May 2001, pp. 577-580.
- (2001) Proc. 2001 IEEE Int. Conf. Acoust., Speech, Signal Process , pp. 577-580
- Jurafsky, D.¹ Ward, W.² Zhang, J.³ Herold, K.⁴ Yu, X.⁵ Zhang, S.⁶

4
- 19944415893
- Implicit modeling of pronunciation variation in automatic speech recognition
- T. Hain, "Implicit modeling of pronunciation variation in automatic speech recognition," Speech Commun., vol. 26, pp. 171-188, 2005.
- (2005) Speech Commun , vol.26 , pp. 171-188
- Hain, T.¹

5
- 0033353288
- Stochastic pronunciation modeling from hand-labeled phonetic corpra
- Nov
- M. Riley, B. Byrne, M. Finke, S. Khudanpur, A. Ljolje, J. McDonough, H. Nock, M. Saraclar, C. Wooters, and G. Zavaliagkos, "Stochastic pronunciation modeling from hand-labeled phonetic corpra," Speech Commun., vol. 29, pp. 209-224, Nov. 1999.
- (1999) Speech Commun , vol.29 , pp. 209-224
- Riley, M.¹ Byrne, B.² Finke, M.³ Khudanpur, S.⁴ Ljolje, A.⁵ McDonough, J.⁶ Nock, H.⁷ Saraclar, M.⁸ Wooters, C.⁹ Zavaliagkos, G.¹⁰

6
- 0000114416
- Pronunciation modeling by sharing Gaussian densities across phonetic models
- M. Saraclar, H. J. Nock, and S. Khudanpur, "Pronunciation modeling by sharing Gaussian densities across phonetic models," Comput. Speech Lang., vol. 14, pp. 137-160, 2000.
- (2000) Comput. Speech Lang , vol.14 , pp. 137-160
- Saraclar, M.¹ Nock, H.J.² Khudanpur, S.³

7
- 0034273299
- Robust decision tree state tying for continuous speech recognition
- Sep
- W. Reichl and W. Chou, "Robust decision tree state tying for continuous speech recognition," IEEE Trans. Speech Audio Process., vol. 8, no. 5, pp. 555-566, Sep. 2000.
- (2000) IEEE Trans. Speech Audio Process , vol.8 , Issue.5 , pp. 555-566
- Reichl, W.¹ Chou, W.²

8
- 18744376902
- Predictive hidden Markov model selection for speech recognition
- May
- J.-T. Chien and S. Furui, "Predictive hidden Markov model selection for speech recognition," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 377-387, May 2005.
- (2005) IEEE Trans. Speech Audio Process , vol.13 , Issue.3 , pp. 377-387
- Chien, J.-T.¹ Furui, S.²

9
- 0141906266
- Acoustic model clustering based on syllable structure
- I. Shafran and M. Ostendorf, "Acoustic model clustering based on syllable structure," Comput. Speech Lang., vol. 17, no. 4, pp. 311-328, 2003.
- (2003) Comput. Speech Lang , vol.17 , Issue.4 , pp. 311-328
- Shafran, I.¹ Ostendorf, M.²

10
- 0035440798
- Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation
- Sep
- S. Wang and Y. Zhao, " Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, no. 6, pp. 663-677, Sep. 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.6 , pp. 663-677
- Wang, S.¹ Zhao, Y.²

11
- 0035279111
- A structural Bayes approach to speaker adaptation
- Mar
- K. Shinoda and C.-H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 276-287, Mar. 2001.
- (2001) IEEE Trans. Speech Audio Process , vol.9 , Issue.3 , pp. 276-287
- Shinoda, K.¹ Lee, C.-H.²

12
- 0002144369
- Tree-based state tying for high accuracy acoustic modeling
- Mar
- S. J. Young, J. J. Odell, and P. C. Woodland, "Tree-based state tying for high accuracy acoustic modeling," in Proc. ARPA Human Lang. Technol. Workshop, Mar. 1994, pp. 307-312.
- (1994) Proc. ARPA Human Lang. Technol. Workshop , pp. 307-312
- Young, S.J.¹ Odell, J.J.² Woodland, P.C.³

13
- 0033335618
- Modeling pronunciation variation for ASR: A survey of the literature
- H. Strik and C. Cucchiarini, "Modeling pronunciation variation for ASR: A survey of the literature," Speech Commun., vol. 29, pp. 225-246, 1999.
- (1999) Speech Commun , vol.29 , pp. 225-246
- Strik, H.¹ Cucchiarini, C.²

14
- 33947715150
- An automatic captioning system for telemedicine
- Toulouse, France
- Y. Zhao, X. Zhang, R-S. Hu, J. Xue, X. Li, L. Che, R. Hu, and L. Schopp, "An automatic captioning system for telemedicine," in Proc. ICASSP, Toulouse, France, 2006, pp. I-957-I-960.
- (2006) Proc. ICASSP
- Zhao, Y.¹ Zhang, X.² Hu, R.-S.³ Xue, J.⁴ Li, X.⁵ Che, L.⁶ Hu, R.⁷ Schopp, L.⁸

15
- 0003637516
- A Theory of Learning Classification Rules,
- Ph.D. dissertation, School of Comput. Sci, Univ. Technology, Sydney
- W. L. Buntine, "A Theory of Learning Classification Rules," Ph.D. dissertation, School of Comput. Sci., Univ. Technology, Sydney, 1992.
- (1992)
- Buntine, W.L.¹

16
- 0032346848
- Bayesian CART model search
- Sep
- H. A. Chipman, E. I. George, and R. E. McCulloch, "Bayesian CART model search," J. Amer. Statist. Assoc., vol. 93, no. 443, pp. 935-948, Sep. 1998.
- (1998) J. Amer. Statist. Assoc , vol.93 , Issue.443 , pp. 935-948
- Chipman, H.A.¹ George, E.I.² McCulloch, R.E.³

17
- 0004224632
- New York:Wiley
- D. Denison, C. Holmes, B. Malick, and A. Smith, Bayesian Methods for Nonlinear Classification and Regression. New York:Wiley, 2002.
- (2002) Bayesian Methods for Nonlinear Classification and Regression
- Denison, D.¹ Holmes, C.² Malick, B.³ Smith, A.⁴

18
- 0000120766
- Estimating the dimension of a model
- G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol. 6, no. 2, pp. 465-471, 1978.
- (1978) Ann. Statist , vol.6 , Issue.2 , pp. 465-471
- Schwarz, G.¹

19
- 0001822107
- Catalan numbers, their generalization, and their uses
- P. Hilton and J. Pedersen, "Catalan numbers, their generalization, and their uses," Math. Intell., vol. 13, no. 2, pp. 64-75, 1991.
- (1991) Math. Intell , vol.13 , Issue.2 , pp. 64-75
- Hilton, P.¹ Pedersen, J.²

20
- 0038676761
- Towards knowledge- based features forHMMbased large vocabulary automatic speech recognition
- B. Launay, O. Siohan, A. Surendran, and C.-H. Lee, "Towards knowledge- based features forHMMbased large vocabulary automatic speech recognition," in Proc. ICASSP02, 2002, vol. 1, pp. I-817-I-820.
- (2002) Proc. ICASSP02 , vol.1
- Launay, B.¹ Siohan, O.² Surendran, A.³ Lee, C.-H.⁴

21
- 64549085552
- quot;The HTK Toolkit. [Online]. Available: http://htk.eng.cam.ac. uk/
- quot;The HTK Toolkit." [Online]. Available: http://htk.eng.cam.ac. uk/

22
- 0028996876
- Improved backing-off for M-gram language modeling
- R. R. Kneser and H. Ney, "Improved backing-off for M-gram language modeling," in Proc. ICASSP, 1995, pp. 181-184.
- (1995) Proc. ICASSP , pp. 181-184
- Kneser, R.R.¹ Ney, H.²

23
- 84891308106
- SRILM-An extensible language modeling toolkit
- Denver, CO, Sep
- A. Stolcke, "SRILM-An extensible language modeling toolkit," in Proc. ICSLP, Denver, CO, Sep. 2002, pp. 901-904.
- (2002) Proc. ICSLP , pp. 901-904
- Stolcke, A.¹

24
- 34248589754
- A novel method of language modeling for automatic captioning in tc video teleconferencing
- May
- X. Zhang, Y. Zhao, and L. Schopp, "A novel method of language modeling for automatic captioning in tc video teleconferencing," IEEE Trans. Inf. Technol. Biomed., vol. 11, no. 3, pp. 332-337, May 2007.
- (2007) IEEE Trans. Inf. Technol. Biomed , vol.11 , Issue.3 , pp. 332-337
- Zhang, X.¹ Zhao, Y.² Schopp, L.³

25
- 33749555597
- A fast and memory-efficient N-gram language model lookup method for large vocabulary continuous speech recognition
- X. Li and Y. Zhao, "A fast and memory-efficient N-gram language model lookup method for large vocabulary continuous speech recognition," Comput. Speech Lang., vol. 21, pp. 1-25, 2007.
- (2007) Comput. Speech Lang , vol.21 , pp. 1-25
- Li, X.¹ Zhao, Y.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.