-
1
-
-
1842527766
-
The use of subword linguistic modeling for multiple tasks in speech recognition
-
Apr
-
S. Seneff, "The use of subword linguistic modeling for multiple tasks in speech recognition," Speech Commun., vol. 42, pp. 373-390, Apr. 2004.
-
(2004)
Speech Commun
, vol.42
, pp. 373-390
-
-
Seneff, S.1
-
2
-
-
0033357399
-
Speaking in shorthand-A syllable-centric perspective for understanding pronunciation variation
-
Nov
-
S. Greenberg, "Speaking in shorthand-A syllable-centric perspective for understanding pronunciation variation," Speech Commun., vol. 29, no. 2-4, pp. 159-176, Nov. 1999.
-
(1999)
Speech Commun
, vol.29
, Issue.2-4
, pp. 159-176
-
-
Greenberg, S.1
-
3
-
-
0034853397
-
What kind of pronunciation variation is hard for triphones to model?
-
Salt Lake City, UT, May
-
D. Jurafsky, W.Ward, J. Zhang, K. Herold, X. Yu, and S. Zhang, "What kind of pronunciation variation is hard for triphones to model?," in Proc. 2001 IEEE Int. Conf. Acoust., Speech, Signal Process., Salt Lake City, UT, May 2001, pp. 577-580.
-
(2001)
Proc. 2001 IEEE Int. Conf. Acoust., Speech, Signal Process
, pp. 577-580
-
-
Jurafsky, D.1
Ward, W.2
Zhang, J.3
Herold, K.4
Yu, X.5
Zhang, S.6
-
4
-
-
19944415893
-
Implicit modeling of pronunciation variation in automatic speech recognition
-
T. Hain, "Implicit modeling of pronunciation variation in automatic speech recognition," Speech Commun., vol. 26, pp. 171-188, 2005.
-
(2005)
Speech Commun
, vol.26
, pp. 171-188
-
-
Hain, T.1
-
5
-
-
0033353288
-
Stochastic pronunciation modeling from hand-labeled phonetic corpra
-
Nov
-
M. Riley, B. Byrne, M. Finke, S. Khudanpur, A. Ljolje, J. McDonough, H. Nock, M. Saraclar, C. Wooters, and G. Zavaliagkos, "Stochastic pronunciation modeling from hand-labeled phonetic corpra," Speech Commun., vol. 29, pp. 209-224, Nov. 1999.
-
(1999)
Speech Commun
, vol.29
, pp. 209-224
-
-
Riley, M.1
Byrne, B.2
Finke, M.3
Khudanpur, S.4
Ljolje, A.5
McDonough, J.6
Nock, H.7
Saraclar, M.8
Wooters, C.9
Zavaliagkos, G.10
-
6
-
-
0000114416
-
Pronunciation modeling by sharing Gaussian densities across phonetic models
-
M. Saraclar, H. J. Nock, and S. Khudanpur, "Pronunciation modeling by sharing Gaussian densities across phonetic models," Comput. Speech Lang., vol. 14, pp. 137-160, 2000.
-
(2000)
Comput. Speech Lang
, vol.14
, pp. 137-160
-
-
Saraclar, M.1
Nock, H.J.2
Khudanpur, S.3
-
7
-
-
0034273299
-
Robust decision tree state tying for continuous speech recognition
-
Sep
-
W. Reichl and W. Chou, "Robust decision tree state tying for continuous speech recognition," IEEE Trans. Speech Audio Process., vol. 8, no. 5, pp. 555-566, Sep. 2000.
-
(2000)
IEEE Trans. Speech Audio Process
, vol.8
, Issue.5
, pp. 555-566
-
-
Reichl, W.1
Chou, W.2
-
8
-
-
18744376902
-
Predictive hidden Markov model selection for speech recognition
-
May
-
J.-T. Chien and S. Furui, "Predictive hidden Markov model selection for speech recognition," IEEE Trans. Speech Audio Process., vol. 13, no. 3, pp. 377-387, May 2005.
-
(2005)
IEEE Trans. Speech Audio Process
, vol.13
, Issue.3
, pp. 377-387
-
-
Chien, J.-T.1
Furui, S.2
-
9
-
-
0141906266
-
Acoustic model clustering based on syllable structure
-
I. Shafran and M. Ostendorf, "Acoustic model clustering based on syllable structure," Comput. Speech Lang., vol. 17, no. 4, pp. 311-328, 2003.
-
(2003)
Comput. Speech Lang
, vol.17
, Issue.4
, pp. 311-328
-
-
Shafran, I.1
Ostendorf, M.2
-
10
-
-
0035440798
-
Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation
-
Sep
-
S. Wang and Y. Zhao, " Online Bayesian tree-structured transformation of HMMs with optimal model selection for speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, no. 6, pp. 663-677, Sep. 2001.
-
(2001)
IEEE Trans. Speech Audio Process
, vol.9
, Issue.6
, pp. 663-677
-
-
Wang, S.1
Zhao, Y.2
-
11
-
-
0035279111
-
A structural Bayes approach to speaker adaptation
-
Mar
-
K. Shinoda and C.-H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 276-287, Mar. 2001.
-
(2001)
IEEE Trans. Speech Audio Process
, vol.9
, Issue.3
, pp. 276-287
-
-
Shinoda, K.1
Lee, C.-H.2
-
12
-
-
0002144369
-
Tree-based state tying for high accuracy acoustic modeling
-
Mar
-
S. J. Young, J. J. Odell, and P. C. Woodland, "Tree-based state tying for high accuracy acoustic modeling," in Proc. ARPA Human Lang. Technol. Workshop, Mar. 1994, pp. 307-312.
-
(1994)
Proc. ARPA Human Lang. Technol. Workshop
, pp. 307-312
-
-
Young, S.J.1
Odell, J.J.2
Woodland, P.C.3
-
13
-
-
0033335618
-
Modeling pronunciation variation for ASR: A survey of the literature
-
H. Strik and C. Cucchiarini, "Modeling pronunciation variation for ASR: A survey of the literature," Speech Commun., vol. 29, pp. 225-246, 1999.
-
(1999)
Speech Commun
, vol.29
, pp. 225-246
-
-
Strik, H.1
Cucchiarini, C.2
-
14
-
-
33947715150
-
An automatic captioning system for telemedicine
-
Toulouse, France
-
Y. Zhao, X. Zhang, R-S. Hu, J. Xue, X. Li, L. Che, R. Hu, and L. Schopp, "An automatic captioning system for telemedicine," in Proc. ICASSP, Toulouse, France, 2006, pp. I-957-I-960.
-
(2006)
Proc. ICASSP
-
-
Zhao, Y.1
Zhang, X.2
Hu, R.-S.3
Xue, J.4
Li, X.5
Che, L.6
Hu, R.7
Schopp, L.8
-
15
-
-
0003637516
-
A Theory of Learning Classification Rules,
-
Ph.D. dissertation, School of Comput. Sci, Univ. Technology, Sydney
-
W. L. Buntine, "A Theory of Learning Classification Rules," Ph.D. dissertation, School of Comput. Sci., Univ. Technology, Sydney, 1992.
-
(1992)
-
-
Buntine, W.L.1
-
16
-
-
0032346848
-
Bayesian CART model search
-
Sep
-
H. A. Chipman, E. I. George, and R. E. McCulloch, "Bayesian CART model search," J. Amer. Statist. Assoc., vol. 93, no. 443, pp. 935-948, Sep. 1998.
-
(1998)
J. Amer. Statist. Assoc
, vol.93
, Issue.443
, pp. 935-948
-
-
Chipman, H.A.1
George, E.I.2
McCulloch, R.E.3
-
18
-
-
0000120766
-
Estimating the dimension of a model
-
G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol. 6, no. 2, pp. 465-471, 1978.
-
(1978)
Ann. Statist
, vol.6
, Issue.2
, pp. 465-471
-
-
Schwarz, G.1
-
19
-
-
0001822107
-
Catalan numbers, their generalization, and their uses
-
P. Hilton and J. Pedersen, "Catalan numbers, their generalization, and their uses," Math. Intell., vol. 13, no. 2, pp. 64-75, 1991.
-
(1991)
Math. Intell
, vol.13
, Issue.2
, pp. 64-75
-
-
Hilton, P.1
Pedersen, J.2
-
20
-
-
0038676761
-
Towards knowledge- based features forHMMbased large vocabulary automatic speech recognition
-
B. Launay, O. Siohan, A. Surendran, and C.-H. Lee, "Towards knowledge- based features forHMMbased large vocabulary automatic speech recognition," in Proc. ICASSP02, 2002, vol. 1, pp. I-817-I-820.
-
(2002)
Proc. ICASSP02
, vol.1
-
-
Launay, B.1
Siohan, O.2
Surendran, A.3
Lee, C.-H.4
-
21
-
-
64549085552
-
-
quot;The HTK Toolkit. [Online]. Available: http://htk.eng.cam.ac. uk/
-
quot;The HTK Toolkit." [Online]. Available: http://htk.eng.cam.ac. uk/
-
-
-
-
22
-
-
0028996876
-
Improved backing-off for M-gram language modeling
-
R. R. Kneser and H. Ney, "Improved backing-off for M-gram language modeling," in Proc. ICASSP, 1995, pp. 181-184.
-
(1995)
Proc. ICASSP
, pp. 181-184
-
-
Kneser, R.R.1
Ney, H.2
-
23
-
-
84891308106
-
SRILM-An extensible language modeling toolkit
-
Denver, CO, Sep
-
A. Stolcke, "SRILM-An extensible language modeling toolkit," in Proc. ICSLP, Denver, CO, Sep. 2002, pp. 901-904.
-
(2002)
Proc. ICSLP
, pp. 901-904
-
-
Stolcke, A.1
-
24
-
-
34248589754
-
A novel method of language modeling for automatic captioning in tc video teleconferencing
-
May
-
X. Zhang, Y. Zhao, and L. Schopp, "A novel method of language modeling for automatic captioning in tc video teleconferencing," IEEE Trans. Inf. Technol. Biomed., vol. 11, no. 3, pp. 332-337, May 2007.
-
(2007)
IEEE Trans. Inf. Technol. Biomed
, vol.11
, Issue.3
, pp. 332-337
-
-
Zhang, X.1
Zhao, Y.2
Schopp, L.3
-
25
-
-
33749555597
-
A fast and memory-efficient N-gram language model lookup method for large vocabulary continuous speech recognition
-
X. Li and Y. Zhao, "A fast and memory-efficient N-gram language model lookup method for large vocabulary continuous speech recognition," Comput. Speech Lang., vol. 21, pp. 1-25, 2007.
-
(2007)
Comput. Speech Lang
, vol.21
, pp. 1-25
-
-
Li, X.1
Zhao, Y.2
|