-
1
-
-
0028466072
-
The importance of cepstral parameter correlation in speech recognition
-
A. Ljolje, "The importance of cepstral parameter correlation in speech recognition," Comput. Speech Lang., vol. 8, pp. 223-232, 1994.
-
(1994)
Comput. Speech Lang
, vol.8
, pp. 223-232
-
-
Ljolje, A.1
-
2
-
-
0019053271
-
Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences
-
Aug
-
S. B. Davis and P. Mermelstein, "Comparison of parametric representations for monosyllabic word recognition in continuously spoken sentences," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-28, no. 4, p. 357, Aug. 1980.
-
(1980)
IEEE Trans. Acoust., Speech, Signal Process
, vol.ASSP-28
, Issue.4
, pp. 357
-
-
Davis, S.B.1
Mermelstein, P.2
-
4
-
-
85017287487
-
Linear discriminant analysis for improved large vocabulary continuous speech recognition
-
R. Haeb-Umbach and H. Ney, "Linear discriminant analysis for improved large vocabulary continuous speech recognition," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1992, vol. 1, pp. 13-16.
-
(1992)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.1
, pp. 13-16
-
-
Haeb-Umbach, R.1
Ney, H.2
-
5
-
-
17344383223
-
Continuous mixture densities and linear discriminant analysis for improved context-dependent acoustic models
-
X. Aubert, R. Haeb-Umbach, and H. Ney, "Continuous mixture densities and linear discriminant analysis for improved context-dependent acoustic models," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1993, vol. 2, pp. 27-30.
-
(1993)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.2
, pp. 27-30
-
-
Aubert, X.1
Haeb-Umbach, R.2
Ney, H.3
-
6
-
-
0033677121
-
Maximum likelihood discriminant feature spaces
-
G. Saon, M. Padmanabhan, R. Gopinath, and S. Chen, "Maximum likelihood discriminant feature spaces," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2000, vol. 2, pp. 1129-1132.
-
(2000)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.2
, pp. 1129-1132
-
-
Saon, G.1
Padmanabhan, M.2
Gopinath, R.3
Chen, S.4
-
7
-
-
84892187452
-
Maximum likelihood modeling with Gaussian distributions for classification
-
R. A. Gopinath, "Maximum likelihood modeling with Gaussian distributions for classification," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1998, vol. 2, pp. 661-664.
-
(1998)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.2
, pp. 661-664
-
-
Gopinath, R.A.1
-
8
-
-
0003871508
-
Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition,
-
Ph.D. dissertation, Johns Hopkins Univ, Baltimore, MD
-
N.Kumar, "Investigation of silicon-auditory models and generalization of linear discriminant analysis for improved speech recognition," Ph.D. dissertation, , Johns Hopkins Univ., Baltimore, MD, 1997.
-
(1997)
-
-
Kumar, N.1
-
9
-
-
0141703325
-
Automatic complexity control forHLDAsystems
-
X. Liu, M. F. J. Gales, and P. C. Woodland, "Automatic complexity control forHLDAsystems," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2003, vol. 1, pp. 132-135.
-
(2003)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.1
, pp. 132-135
-
-
Liu, X.1
Gales, M.F.J.2
Woodland, P.C.3
-
10
-
-
0033677061
-
Full covariance modeling and adaptation in sub-bands
-
B. Doherty, S.Vaseghi, and P. McCourt, "Full covariance modeling and adaptation in sub-bands," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2000, vol. 2, pp. 969-972.
-
(2000)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.2
, pp. 969-972
-
-
Doherty, B.1
Vaseghi, S.2
McCourt, P.3
-
11
-
-
0033677172
-
Factored sparse inverse covariance matrices
-
J. A. Bilmes, "Factored sparse inverse covariance matrices," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 2000, vol. 2, pp. 1009-1012.
-
(2000)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.2
, pp. 1009-1012
-
-
Bilmes, J.A.1
-
12
-
-
0032638856
-
Semi-tied covariance matrices for hidden Markov models
-
May
-
M. J. F. Gales, "Semi-tied covariance matrices for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 272-281, May 1999.
-
(1999)
IEEE Trans. Speech Audio Process
, vol.7
, Issue.3
, pp. 272-281
-
-
Gales, M.J.F.1
-
13
-
-
0036475982
-
Maximum likelihood multiple subspace projections for hidden Markov models
-
Feb
-
-, "Maximum likelihood multiple subspace projections for hidden Markov models," IEEE Trans. Speech Audio Process., vol. 10, no. 2, pp. 37-47, Feb. 2002.
-
(2002)
IEEE Trans. Speech Audio Process
, vol.10
, Issue.2
, pp. 37-47
-
-
Gales, M.J.F.1
-
14
-
-
2442457791
-
Mixtures of inverse covariance
-
May
-
V. Vanhoucke and A. Sankar, "Mixtures of inverse covariance," IEEE Trans. Speech Audio Process., vol. 12, no. 3, pp. 250-264, May 2004.
-
(2004)
IEEE Trans. Speech Audio Process
, vol.12
, Issue.3
, pp. 250-264
-
-
Vanhoucke, V.1
Sankar, A.2
-
15
-
-
0742272654
-
Modeling inverse covariance matrices by basis expansion
-
Jan
-
P. A. Olsen and R. A. Gopinath, "Modeling inverse covariance matrices by basis expansion," IEEE Trans. Acoust., Speech, Signal Process., vol. 12, no. 1, pp. 37-46, Jan. 2004.
-
(2004)
IEEE Trans. Acoust., Speech, Signal Process
, vol.12
, Issue.1
, pp. 37-46
-
-
Olsen, P.A.1
Gopinath, R.A.2
-
16
-
-
85009289957
-
Modeling with a subspace constraint on inverse covariance matrices
-
S. Axelrod, R. Gopinath, and P. Olsen, "Modeling with a subspace constraint on inverse covariance matrices," in Proc. Int. Conf. Spoken Language Processing, 2001, vol. 9, pp. 2177-2180.
-
(2001)
Proc. Int. Conf. Spoken Language Processing
, vol.9
, pp. 2177-2180
-
-
Axelrod, S.1
Gopinath, R.2
Olsen, P.3
-
17
-
-
0024610919
-
A tutorial on hidden Markov models and selected applications in speech recognition
-
Feb
-
L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 7, no. 2, pp. 257-286, Feb. 1989.
-
(1989)
Proc. IEEE
, vol.7
, Issue.2
, pp. 257-286
-
-
Rabiner, L.R.1
-
18
-
-
0035279111
-
A structural Bayes approach to speaker adaptation
-
Mar
-
K. Shinoda and C. H. Lee, "A structural Bayes approach to speaker adaptation," IEEE Trans. Speech Audio Process., vol. 9, no. 3, pp. 276-287, Mar. 2001.
-
(2001)
IEEE Trans. Speech Audio Process
, vol.9
, Issue.3
, pp. 276-287
-
-
Shinoda, K.1
Lee, C.H.2
-
19
-
-
85009064348
-
Constrained maximum likelihood linear regression for speaker adaptation
-
M. Afify and O. Siohan, "Constrained maximum likelihood linear regression for speaker adaptation," in Proc. Int. Conf. Spoken Language Processing, 2000, pp. 861-864.
-
(2000)
Proc. Int. Conf. Spoken Language Processing
, pp. 861-864
-
-
Afify, M.1
Siohan, O.2
-
20
-
-
0029288633
-
Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
-
C. J. Leggetter and P. C. Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput. Speech Lang., vol. 9, pp. 171-185, 1995.
-
(1995)
Comput. Speech Lang
, vol.9
, pp. 171-185
-
-
Leggetter, C.J.1
Woodland, P.C.2
-
21
-
-
0000133385
-
Speech recognition using tree-structured probability density function
-
T. Watanabe, K. Shinoda, K. Takagi, and E. Yamada, "Speech recognition using tree-structured probability density function," in Proc. Int. Conf. Spoken Language Processing, 1994, pp. 223-226.
-
(1994)
Proc. Int. Conf. Spoken Language Processing
, pp. 223-226
-
-
Watanabe, T.1
Shinoda, K.2
Takagi, K.3
Yamada, E.4
-
22
-
-
0000120766
-
Estimating the dimension of a model
-
G. Schwarz, "Estimating the dimension of a model," Ann. Statist., vol. 6, pp. 461-464, 1973.
-
(1973)
Ann. Statist
, vol.6
, pp. 461-464
-
-
Schwarz, G.1
-
24
-
-
0023776398
-
A database for continuous speech recognition in a 1000-word domain
-
P. Price, W. Fisher, J. Bernstein, and D. Pallett, "A database for continuous speech recognition in a 1000-word domain," in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing, 1988, vol. 2, pp. 651-654.
-
(1988)
Proc. IEEE Int. Conf. Acoustics, Speech, Signal Processing
, vol.2
, pp. 651-654
-
-
Price, P.1
Fisher, W.2
Bernstein, J.3
Pallett, D.4
-
25
-
-
64549152628
-
-
S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P.Woodland, The HTK Book for HTK Version 3.0, 2000 [Online, Available
-
S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P.Woodland, The HTK Book (for HTK Version 3.0). : , 2000 [Online]. Available: http://htk.eng.cam.ac.uk/
-
-
-
-
26
-
-
64549145995
-
Speech lab in a box: AMandarin speech toolbox to jump start speech related research toolbox
-
E. Chang, Y. Shi, J. Zhou, and C. Huang, "Speech lab in a box: aMandarin speech toolbox to jump start speech related research toolbox," in Proc. Eur. Conf. Speech Communication and Technology, 2001, pp. 2782-2799.
-
(2001)
Proc. Eur. Conf. Speech Communication and Technology
, pp. 2782-2799
-
-
Chang, E.1
Shi, Y.2
Zhou, J.3
Huang, C.4
-
27
-
-
85009126501
-
Large vocabulary Mandarin speech recognition with different approaches in modeling tones
-
E. Chang, J. Zhou, C. Huang, and K. F. Lee, "Large vocabulary Mandarin speech recognition with different approaches in modeling tones," in Proc. Int. Conf. Spoken Language Processing, 2000, pp. 983-986.
-
(2000)
Proc. Int. Conf. Spoken Language Processing
, pp. 983-986
-
-
Chang, E.1
Zhou, J.2
Huang, C.3
Lee, K.F.4
-
29
-
-
1642377925
-
Factor analyzed hidden Markov models for speech recognition
-
A.-V. I. Rosti and M. J. F. Gales, "Factor analyzed hidden Markov models for speech recognition," Comput. Speech Lang., vol. 18, no. 2, pp. 181-200, 2003.
-
(2003)
Comput. Speech Lang
, vol.18
, Issue.2
, pp. 181-200
-
-
Rosti, A.-V.I.1
Gales, M.J.F.2
-
30
-
-
33646798740
-
The IBM 2004 conversational telephony system for rich transcription
-
H. Soltau, B. Kingsbury, L. Mangu, D. Povey, G. Saon, and G. Zweig, "The IBM 2004 conversational telephony system for rich transcription," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, 2005, vol. 1, pp. 205-208.
-
(2005)
Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing
, vol.1
, pp. 205-208
-
-
Soltau, H.1
Kingsbury, B.2
Mangu, L.3
Povey, D.4
Saon, G.5
Zweig, G.6
-
31
-
-
85009192356
-
An architecture for rapid decoding of large vocabulary conversational speech
-
G. Saon, G. Zweig, B. Kingsbury, L. Mangu, and U. Chaudhari, "An architecture for rapid decoding of large vocabulary conversational speech," in Proc. Eur. Conf. Speech Communication and Technology, 2003, pp. 1977-1980.
-
(2003)
Proc. Eur. Conf. Speech Communication and Technology
, pp. 1977-1980
-
-
Saon, G.1
Zweig, G.2
Kingsbury, B.3
Mangu, L.4
Chaudhari, U.5
|