-
1
-
-
0036642567
-
Combining acoustic and articulatory feature information for robust speech recognition
-
K. Kirchhoff, G. Fink, and G. Sagerer, "Combining acoustic and articulatory feature information for robust speech recognition", Speech Comm., vol. 37, pp. 303-319, 2000.
-
(2000)
Speech Comm.
, vol.37
, pp. 303-319
-
-
Kirchhoff, K.1
Fink, G.2
Sagerer, G.3
-
3
-
-
0037697284
-
Hidden-articulator Markov models for speech recognition
-
M. Richardson, J. Bilmes and C. Diorio, "Hidden-articulator Markov models for speech recognition", Speech Comm., 41(2-3), pp. 511-529, 2003.
-
(2003)
Speech Comm.
, vol.41
, Issue.2-3
, pp. 511-529
-
-
Richardson, M.1
Bilmes, J.2
Diorio, C.3
-
5
-
-
0026854213
-
A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
-
L. Deng, "A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal", Sig. Proc., 27(1), pp. 65-78, 1992.
-
(1992)
Sig. Proc.
, vol.27
, Issue.1
, pp. 65-78
-
-
Deng, L.1
-
6
-
-
0028234947
-
A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features
-
DOI 10.1121/1.409839
-
L. Deng and D. Sun, "A statistical approach to ASR using atomic units constructed from overlapping articulatory features", J. of Acoust. Soc. Am., 95, pp. 2702-2719, 1994. (Pubitemid 24152864)
-
(1994)
Journal of the Acoustical Society of America
, vol.95
, Issue.5
, pp. 2702-2719
-
-
Deng, L.1
Sun, D.X.2
-
7
-
-
0027627252
-
Hidden Markov model representation of quantized articulatory features for speech recognition
-
DOI 10.1006/csla.1993.1014
-
K. Erler and L. Deng, "Hidden Markov model representation of quantized articulatory features for speech recognition", Comp., Speech & Lang., Vol. 7, pp. 265-282, 1993. (Pubitemid 23705305)
-
(1993)
Computer Speech and Language
, vol.7
, Issue.3
, pp. 265-282
-
-
Erler, K.1
Deng, L.2
-
8
-
-
58849145971
-
ASR - Articulatory speech recognition
-
Denmark
-
J. Frankel and S. King, "ASR - Articulatory Speech Recognition", Proc. of Eurospeech, pp. 599-602, Denmark, 2001.
-
(2001)
Proc. of Eurospeech
, pp. 599-602
-
-
Frankel, J.1
King, S.2
-
9
-
-
84994254645
-
An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces
-
J. Frankel, K. Richmond, S. King and P. Taylor, "An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces", Proc. of ICSLP, Vol. 4, pp. 254-257, 2000.
-
(2000)
Proc. of ICSLP
, vol.4
, pp. 254-257
-
-
Frankel, J.1
Richmond, K.2
King, S.3
Taylor, P.4
-
10
-
-
34547541459
-
Articulatory feature-based methods for acoustic and audio-visual speech recognition: Summary from the 2006 JHU summer workshop
-
K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss and K. Saenko, "Articulatory feature-based methods for acoustic and audio-visual speech recognition: Summary from the 2006 JHU Summer Workshop", Proc. of ICASSP, Vol. 4, pp. 621-624, 2007.
-
(2007)
Proc. of ICASSP
, vol.4
, pp. 621-624
-
-
Livescu, K.1
Cetin, O.2
Hasegawa-Johnson, M.3
King, S.4
Bartels, C.5
Borges, N.6
Kantor, A.7
Lal, P.8
Yung, L.9
Bezman, A.10
Dawson-Haggerty, S.11
Woods, B.12
Frankel, J.13
Magimai-Doss, M.14
Saenko, K.15
-
11
-
-
70450174439
-
Articulatory phonological code for word classification
-
UK
-
X. Zhuang, H. Nam, M. Hasegawa-Johnson, L. Goldstein and E. Saltzman, "Articulatory Phonological Code for Word Classification", Proc. of Interspeech, pp. 2763-2766, UK, 2009.
-
(2009)
Proc. of Interspeech
, pp. 2763-2766
-
-
Zhuang, X.1
Nam, H.2
Hasegawa-Johnson, M.3
Goldstein, L.4
Saltzman, E.5
-
12
-
-
0001622923
-
On defining coarticulation
-
R. Daniloff and R. Hammarberg, "On defining coarticulation", J. of Phonetics, Vol. 1, pp. 239-248, 1973.
-
(1973)
J. of Phonetics
, vol.1
, pp. 239-248
-
-
Daniloff, R.1
Hammarberg, R.2
-
13
-
-
84971737266
-
Articulatory gestures as phonological units
-
C. Browman and L. Goldstein, "Articulatory Gestures as Phonological Units", Phonology, 6: 201-251, 1989.
-
(1989)
Phonology
, vol.6
, pp. 201-251
-
-
Browman, C.1
Goldstein, L.2
-
14
-
-
0027024362
-
Articulatory phonology: An overview
-
C. Browman and L. Goldstein, "Articulatory Phonology: An Overview", Phonetica, 49: 155-180, 1992.
-
(1992)
Phonetica
, vol.49
, pp. 155-180
-
-
Browman, C.1
Goldstein, L.2
-
15
-
-
78649390043
-
Retrieving tract variables from acoustics: A comparison of different machine learning strategies
-
V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman and L. Goldstein, "Retrieving Tract Variables from Acoustics: a comparison of different Machine Learning strategies", IEEE J. of Selected Topics on Sig. Proc., Vol. 4(6), pp. 1027-1045, 2010.
-
(2010)
IEEE J. of Selected Topics on Sig. Proc.
, vol.4
, Issue.6
, pp. 1027-1045
-
-
Mitra, V.1
Nam, H.2
Espy-Wilson, C.3
Saltzman, E.4
Goldstein, L.5
-
16
-
-
0015613574
-
Articulatory model for the study of speech production
-
P. Mermelstein, "Articulatory model for the study of speech production", J. Acoust. Soc. of Am., 53(4), pp. 1070-1082, 1973.
-
(1973)
J. Acoust. Soc. of Am.
, vol.53
, Issue.4
, pp. 1070-1082
-
-
Mermelstein, P.1
-
17
-
-
84955535347
-
Gestural specification using dynamically-defined articulatory structures
-
C. Browman and L. Goldstein, "Gestural specification using dynamically-defined articulatory structures", J. of Phonetics, Vol. 18, pp. 299-320, 1990.
-
(1990)
J. of Phonetics
, vol.18
, pp. 299-320
-
-
Browman, C.1
Goldstein, L.2
-
18
-
-
79959813685
-
Robust word recognition using articulatory trajectories and gestures
-
Japan
-
V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman and L. Goldstein, "Robust word recognition using articulatory trajectories and Gestures", Proc. of Interspeech, pp. 2038-2041, Japan, 2010.
-
(2010)
Proc. of Interspeech
, pp. 2038-2041
-
-
Mitra, V.1
Nam, H.2
Espy-Wilson, C.3
Saltzman, E.4
Goldstein, L.5
-
19
-
-
0038669544
-
The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
-
Paris, France
-
H.G. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions", In Proc. ISCA ITRW ASR2000, pp. 181-188, Paris, France, 2000.
-
(2000)
Proc. ISCA ITRW ASR2000
, pp. 181-188
-
-
Hirsch, H.G.1
Pearce, D.2
-
20
-
-
80051649631
-
Gesture-based dynamic Bayesian network for noise robust speech recognition
-
V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman and L. Goldstein, "Gesture-based Dynamic Bayesian Network for Noise robust Speech Recognition", Proc. of ICASSP, pp. 5172-5175, 2011.
-
(2011)
Proc. of ICASSP
, pp. 5172-5175
-
-
Mitra, V.1
Nam, H.2
Espy-Wilson, C.3
Saltzman, E.4
Goldstein, L.5
-
23
-
-
84858956763
-
Speaker identification on the SCOTUS corpus
-
J. Yuan and M. Liberman, "Speaker identification on the SCOTUS corpus", J. Acoust. Soc. of Am., 123(5), pp. 3878, 2008.
-
(2008)
J. Acoust. Soc. of Am.
, vol.123
, Issue.5
, pp. 3878
-
-
Yuan, J.1
Liberman, M.2
-
24
-
-
79959846806
-
A procedure for estimating gestural scores from natural speech
-
Japan
-
H. Nam, V. Mitra, M. Tiede, E. Saltzman, L. Goldstein, C. Espy-Wilson and M. Hasegawa-Johnson, "A procedure for estimating gestural scores from natural speech", Proc. of Interspeech, pp. 30-33, Japan, 2010.
-
(2010)
Proc. of Interspeech
, pp. 30-33
-
-
Nam, H.1
Mitra, V.2
Tiede, M.3
Saltzman, E.4
Goldstein, L.5
Espy-Wilson, C.6
Hasegawa-Johnson, M.7
-
25
-
-
70349207706
-
Tada: An enhanced, portable task dynamics model in matlab
-
2
-
H. Nam, L. Goldstein, E. Saltzman and D. Byrd, "Tada: An enhanced, portable task dynamics model in matlab", J. Acoust. Soc. of Am., 115(5), 2, pp. 2430, 2004.
-
(2004)
J. Acoust. Soc. of Am.
, vol.115
, Issue.5
, pp. 2430
-
-
Nam, H.1
Goldstein, L.2
Saltzman, E.3
Byrd, D.4
-
27
-
-
70349213974
-
From acoustics to vocal tract time functions
-
V. Mitra, I. Özbek, H. Nam, X. Zhou and C. Espy-Wilson, "From Acoustics to Vocal Tract Time Functions", Proc. of International Conference on Acoustics, Speech and Signal Processing, ICASSP, pp. 4497-4500, 2009.
-
(2009)
Proc. of International Conference on Acoustics, Speech and Signal Processing, ICASSP
, pp. 4497-4500
-
-
Mitra, V.1
Özbek, I.2
Nam, H.3
Zhou, X.4
Espy-Wilson, C.5
-
28
-
-
80051617129
-
Speech inversion: Benefits of tract variables over pellet trajectories
-
Prague, Czeck Rep.
-
V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman and L. Goldstein, "Speech Inversion: Benefits of Tract Variables over Pellet Trajectories", Proc. of International Conference on Acoustics, Speech and Signal Processing, ICASSP, pp. 5188-5191, Prague, Czeck Rep., 2011.
-
(2011)
Proc. of International Conference on Acoustics, Speech and Signal Processing, ICASSP
, pp. 5188-5191
-
-
Mitra, V.1
Nam, H.2
Espy-Wilson, C.3
Saltzman, E.4
Goldstein, L.5
-
29
-
-
77955810460
-
A study on the generalization capability of acoustic models for robust speech recognition
-
X. Xiao, J. Li, E.S. Chng, H. Li and C. Lee, "A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition", IEEE Trans. Audio, Speech & Lang. Process, 18(6), pp. 1158-1169, 2010.
-
(2010)
IEEE Trans. Audio, Speech & Lang. Process
, vol.18
, Issue.6
, pp. 1158-1169
-
-
Xiao, X.1
Li, J.2
Chng, E.S.3
Li, H.4
Lee, C.5
-
30
-
-
42549139762
-
MVA processing of speech features
-
C. Chen and J. Bilmes, "MVA Processing of Speech Features", IEEE Trans. on Audio, Speech and Lang. Processing, 15(1), pp. 257-270, 2007.
-
(2007)
IEEE Trans. on Audio, Speech and Lang. Processing
, vol.15
, Issue.1
, pp. 257-270
-
-
Chen, C.1
Bilmes, J.2
-
31
-
-
84858952822
-
-
http://portal.etsi.org/stq/kta/DSR/dsr.asp
-
-
-
-
32
-
-
27744539597
-
Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR
-
DOI 10.1109/TSA.2005.853002
-
X. Cui and A. Alwan, "Noise Robust Speech Recognition Using Feature Compensation Based on Polynomial Regression of Utterance SNR", IEEE Transs. on Speech and Audio Processing, Vol. 13(6), pp. 1161-1172, 2005. (Pubitemid 41605019)
-
(2005)
IEEE Transactions on Speech and Audio Processing
, vol.13
, Issue.6
, pp. 1161-1172
-
-
Cui, X.1
Alwan, A.2
-
33
-
-
33750368310
-
An audio-visual corpus for speech perception and automatic speech recognition
-
DOI 10.1121/1.2229005
-
M. Cooke, J. Barker, S. Cunningham and X. Shao, "An audio-visual corpus for speech perception and automatic speech recognition", Journal of Acoustic Society of America, Vol. 120, pp 2421-2424, 2006. (Pubitemid 44631681)
-
(2006)
Journal of the Acoustical Society of America
, vol.120
, Issue.5
, pp. 2421-2424
-
-
Cooke, M.1
Barker, J.2
Cunningham, S.3
Shao, X.4
-
34
-
-
79960545035
-
Tract variables for noise robust speech recognition
-
V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman, L. Goldstein, "Tract variables for noise robust speech recognition", IEEE Trans. on Audio, Speech and Language Processing, pp. 1913-1924, 2011
-
(2011)
IEEE Trans. on Audio, Speech and Language Processing
, pp. 1913-1924
-
-
Mitra, V.1
Nam, H.2
Espy-Wilson, C.3
Saltzman, E.4
Goldstein, L.5
|