-
1
-
-
0028516073
-
"How do humans process and recognize speech?"
-
Oct
-
J. Allen, "How do humans process and recognize speech?" IEEE Trans. Speech Audio Processing, vol. 2, no. 4, pp. 567-577, Oct. 1994.
-
(1994)
IEEE Trans. Speech Audio Processing
, vol.2
, Issue.4
, pp. 567-577
-
-
Allen, J.1
-
2
-
-
0016067897
-
"Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification"
-
B. Atal, "Effectiveness of linear prediction characteristics of the speech wave for automatic speaker identification and verification," J. Acoust. Soc. America, vol. 55, no. 6, pp. 1304-1312, 1974.
-
(1974)
J. Acoust. Soc. America
, vol.55
, Issue.6
, pp. 1304-1312
-
-
Atal, B.1
-
3
-
-
84863773378
-
"Frequency-domain linear prediction for temporal features"
-
M. Athineos and D.P.W. Ellis, "Frequency-domain linear prediction for temporal features," in Proc. ASRU, 2003, pp. 261-266.
-
(2003)
Proc. ASRU
, pp. 261-266
-
-
Athineos, M.1
Ellis, D.P.W.2
-
4
-
-
53049096459
-
"LP-TRAP: Linear predictive temporal patterns"
-
M. Athineos, H. Hermansky, and D. Ellis, "LP-TRAP: Linear predictive temporal patterns," in Proc. ICSLP, 2004, pp. 949-952.
-
(2004)
Proc. ICSLP
, pp. 949-952
-
-
Athineos, M.1
Hermansky, H.2
Ellis, D.3
-
5
-
-
27144453376
-
"PLP²: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns"
-
Jeju, Korea, Oct
-
M. Athineos, H. Hermansky, and D. Ellis, "PLP²: Autoregressive modeling of auditory-like 2-D spectro-temporal patterns," in Proc. ISCA Tutorial Research Workshop Statistical and Perceptual Audio Processing SAPA-04, Jeju, Korea, Oct. 2004, pp. 37-42.
-
(2004)
Proc. ISCA Tutorial Research Workshop Statistical and Perceptual Audio Processing SAPA-04
, pp. 37-42
-
-
Athineos, M.1
Hermansky, H.2
Ellis, D.3
-
6
-
-
0031619381
-
"Maximum mutual information based reduction strategies for cross-correlation based joint distributional modeling"
-
Seattle
-
J. Bilmes, "Maximum mutual information based reduction strategies for cross-correlation based joint distributional modeling," in Proc. ICASSP-98, Seattle, 1998, pp. 469-472.
-
(1998)
Proc. ICASSP-98
, pp. 469-472
-
-
Bilmes, J.1
-
7
-
-
0030142722
-
"Towards increasing speech recognition error rates"
-
May
-
H. Bourlard, H. Hermansky, and N. Morgan, "Towards increasing speech recognition error rates," Speech Commun., vol. 18, no. 3, pp. 205-231, May 1996.
-
(1996)
Speech Commun.
, vol.18
, Issue.3
, pp. 205-231
-
-
Bourlard, H.1
Hermansky, H.2
Morgan, N.3
-
8
-
-
27144520907
-
"Multi-rate and variable-rate modeling of speech at phone and syllable time scales"
-
Ö. Çetin and M. Ostendorf, "Multi-rate and variable-rate modeling of speech at phone and syllable time scales," in Proc. ICASSP 2005, pp. I-665-668.
-
(2005)
Proc. ICASSP
-
-
Çetin, Ö.1
Ostendorf, M.2
-
9
-
-
85032762662
-
"A CTS task for meaningful fast-turnaround experiments"
-
IBM Palisades Center, Nov
-
B. Chen, Ö. Çetin, G. Doddington, D. Morgan, M. Ostendorf, T. Shinozaki, and Q. Zhu, "A CTS task for meaningful fast-turnaround experiments," in Proc. RT-04 Workshop, IBM Palisades Center, Nov. 2004.
-
(2004)
Proc. RT-04 Workshop
-
-
Chen, B.1
Çetin, Ö.2
Doddington, G.3
Morgan, D.4
Ostendorf, M.5
Shinozaki, T.6
Zhu, Q.7
-
10
-
-
27144509179
-
"Learning long term temporal features in LVCSR using neural networks"
-
B. Chen, Q. Zhu, and N. Morgan, "Learning long term temporal features in LVCSR using neural networks," in Proc. ICSLP, 2004, pp. 612-615.
-
(2004)
Proc. ICSLP
, pp. 612-615
-
-
Chen, B.1
Zhu, Q.2
Morgan, N.3
-
11
-
-
27144558023
-
"Eyes and ears for computers"
-
E. Davis and O. Selfridge, "Eyes and ears for computers," Proc. IRE, vol. 50, pp. 1093-1101, 1962.
-
(1962)
Proc. IRE
, vol.50
, pp. 1093-1101
-
-
Davis, E.1
Selfridge, O.2
-
12
-
-
0002629270
-
"Maximum likelihood from incomplete data via the EM algorithm"
-
A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. Royal Statist. Soc. Series B, vol. 39, pp. 1-38, 1977.
-
(1977)
J. Royal Statist. Soc. Series B
, vol.39
, pp. 1-38
-
-
Dempster, A.1
Laird, N.2
Rubin, D.3
-
13
-
-
85079090910
-
"Phonetic classification and recognition using HMM representation of overlapping articulatory features for all classes of English sounds"
-
Apr
-
L. Deng and D. Sun, "Phonetic classification and recognition using HMM representation of overlapping articulatory features for all classes of English sounds," Proc. ICASSP, Apr. 1994, pp. 45-48.
-
(1994)
Proc. ICASSP
, pp. 45-48
-
-
Deng, L.1
Sun, D.2
-
14
-
-
0002174507
-
"The vocoder"
-
Dec
-
H. Dudley, "The vocoder," Bell Labs Record, vol. 17, pp. 122-126, Dec. 1939.
-
(1939)
Bell Labs Record
, vol.17
, pp. 122-126
-
-
Dudley, H.1
-
15
-
-
0005029290
-
"The road not taken"
-
New York: Henry Holt and Co
-
R. Frost, "The road not taken," in Mountain Interval. New York: Henry Holt and Co., 1920.
-
(1920)
Mountain Interval
-
-
Frost, R.1
-
16
-
-
0022667694
-
"Speaker independent isolated word recognizer using dynamic features of speech spectrum"
-
S. Furui, "Speaker independent isolated word recognizer using dynamic features of speech spectrum," IEEE Trans. Acoust., Speech, Signal Processing, vol. 34, no. 1, pp. 52-59, 1986.
-
(1986)
IEEE Trans. Acoust., Speech, Signal Processing
, vol.34
, Issue.1
, pp. 52-59
-
-
Furui, S.1
-
17
-
-
0027239233
-
"Improvements in connected digit recognition using linear discriminant analysis and mixture densities"
-
Adelaide, Australia
-
R. Haeb-Umbach, D. Geller, and H. Ney, "Improvements in connected digit recognition using linear discriminant analysis and mixture densities," Proc. IEEE Int. Conf. Acoustics Speech Signal Processing, Adelaide, Australia, 1994, vol. 2, pp. 239-242.
-
(1994)
Proc. IEEE Int. Conf. Acoustics Speech Signal Processing
, vol.2
, pp. 239-242
-
-
Haeb-Umbach, R.1
Geller, D.2
Ney, H.3
-
18
-
-
0028517164
-
"RASTA processing of speech"
-
Oct
-
H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Processing (Special Issue on Robust Speech Recognition), vol. 2, no. 4, pp. 578-589, Oct. 1994.
-
(1994)
IEEE Trans. Speech Audio Processing (Special Issue on Robust Speech Recognition)
, vol.2
, Issue.4
, pp. 578-589
-
-
Hermansky, H.1
Morgan, N.2
-
19
-
-
85009254284
-
"TRAPS - Classifiers of temporal patterns"
-
Sydney
-
H. Hermansky and S. Sharma, "TRAPS - Classifiers of temporal patterns," in Proc. ICSLP-98, Sydney, 1998, vol. 3, pp. 1003-1006.
-
(1998)
Proc. ICSLP-98
, vol.3
, pp. 1003-1006
-
-
Hermansky, H.1
Sharma, S.2
-
20
-
-
27144439262
-
"Data-derived nonlinear mapping for feature extraction in HMM"
-
Keystone, CO
-
H. Hermansky, S. Sharma, and P. Jain, "Data-derived nonlinear mapping for feature extraction in HMM," in Proc. ASRU-99, Keystone, CO, 1999, pp. I-63-66.
-
(1999)
Proc. ASRU-99
-
-
Hermansky, H.1
Sharma, S.2
Jain, P.3
-
21
-
-
0024905238
-
"A comparison of several acoustic representations for speech recognition with degraded and undegraded speech"
-
Glasgow, Scotland
-
M. Hunt and C. Lefebvre, "A comparison of several acoustic representations for speech recognition with degraded and undegraded speech," in Proc. IEEE Conf. Acoustics, Speech, Signal Processing, Glasgow, Scotland, 1989, pp. 262-265.
-
(1989)
Proc. IEEE Conf. Acoustics, Speech, Signal Processing
, pp. 262-265
-
-
Hunt, M.1
Lefebvre, C.2
-
22
-
-
85009233038
-
"Improving word accuracy with Gabor feature extraction"
-
Denver, CO, Sept
-
M. Kleinschmidt and D. Gelbart, "Improving word accuracy with Gabor feature extraction," in Proc. ICSLP-2002, Denver, CO, Sept. 2002, pp. 25-28.
-
(2002)
Proc. ICSLP-2002
, pp. 25-28
-
-
Kleinschmidt, M.1
Gelbart, D.2
-
23
-
-
0346262152
-
"Real-time probabilistic segmentation for segment-based speech recognition"
-
Sydney
-
S. Lee and J. Glass, "Real-time probabilistic segmentation for segment-based speech recognition," in Proc. ICSLP-1998, Sydney, 1998, pp. 1803-1806.
-
(1998)
Proc. ICSLP-1998
, pp. 1803-1806
-
-
Lee, S.1
Glass, J.2
-
24
-
-
0030245363
-
"From HMMs to segment models: A unified view of stochastic modeling for speech recognition"
-
M. Ostendorf, V. Digalakis, and O. Kimball, "From HMMs to segment models: A unified view of stochastic modeling for speech recognition," IEEE Trans. Speech Audio Processing, vol. 4, no. 5, pp. 369-378, 1996.
-
(1996)
IEEE Trans. Speech Audio Processing
, vol.4
, Issue.5
, pp. 369-378
-
-
Ostendorf, M.1
Digalakis, V.2
Kimball, O.3
-
25
-
-
85032765253
-
"fMPE: Discriminatively trained features for speech recognition"
-
IBM Palisades Center, Nov
-
D. Povey, B. Kingsbury, L. Mangu, G. Saon, H. Soltau, and G. Zweig, "fMPE: Discriminatively trained features for speech recognition," in Proc. RT-04 Workshop, IBM Palisades Center, Nov. 2004.
-
(2004)
Proc. RT-04 Workshop
-
-
Povey, D.1
Kingsbury, B.2
Mangu, L.3
Saon, G.4
Soltau, H.5
Zweig, G.6
-
26
-
-
85079097438
-
"IPA: Improved phone modelling with recurrent neural networks"
-
Apr
-
A. Robinson, M. Hochberg, and S. Renals, "IPA: Improved phone modelling with recurrent neural networks," in Proc. ICASSP-94, Apr. 1994, pp. 37-40.
-
(1994)
Proc. ICASSP-94
, pp. 37-40
-
-
Robinson, A.1
Hochberg, M.2
Renals, S.3
-
27
-
-
85009115694
-
"Consonant discrimination in elicited and spontaneous speech: A case for signal-adaptive front ends in ASR"
-
Beijing, China, Oct
-
M. Sonmez, M. Plauche, E. Shriberg, and H. Franco, "Consonant discrimination in elicited and spontaneous speech: A case for signal-adaptive front ends in ASR," in Proc. ICSLP-2000, Beijing, China, Oct. 2000, pp. 548-551.
-
(2000)
Proc. ICSLP-2000
, pp. 548-551
-
-
Sonmez, M.1
Plauche, M.2
Shriberg, E.3
Franco, H.4
-
28
-
-
0002915083
-
"Relevance of time-frequency features for phonetic and speaker-channel classification"
-
H. Yang, S. Van Vuuren, S. Sharma and H. Hermansky, "Relevance of time-frequency features for phonetic and speaker-channel classification," Speech Commun., vol. 31, no. 1, pp. 35-50, 2000.
-
(2000)
Speech Commun.
, vol.31
, Issue.1
, pp. 35-50
-
-
Yang, H.1
Van Vuuren, S.2
Sharma, S.3
Hermansky, H.4