



2011, Pages 131-136

Robust speech recognition using articulatory gestures in a dynamic Bayesian network framework

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC INFORMATION; ARTICULATORY GESTURES; DYNAMIC BAYESIAN NETWORK; HIDDEN VARIABLE; MICRO BEAMS; NOISY DATA; PHONE RECOGNITION; PROPOSED ARCHITECTURES; RECOGNITION SYSTEMS; ROBUST SPEECH RECOGNITION; SPATIO-TEMPORAL; SPEECH RECOGNITION ARCHITECTURES; UNIVERSITY OF WISCONSIN; VOCAL-TRACTS; WORD RECOGNITION;

EID: 84858964876     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ASRU.2011.6163918     Document Type: Conference Paper
Times cited: 8

References (34)
  • 1
    • 0036642567
    • Combining acoustic and articulatory feature information for robust speech recognition
    • K. Kirchhoff, G. Fink, and G. Sagerer, "Combining acoustic and articulatory feature information for robust speech recognition", Speech Comm., vol. 37, pp. 303-319, 2000.
    • (2000) Speech Comm. , vol.37 , pp. 303-319
    • Kirchhoff, K.1    Fink, G.2    Sagerer, G.3
  • 3
    • 0037697284
    • Hidden-articulator Markov models for speech recognition
    • M. Richardson, J. Bilmes and C. Diorio, "Hidden-articulator Markov models for speech recognition", Speech Comm., 41(2-3), pp. 511-529, 2003.
    • (2003) Speech Comm. , vol.41 , Issue.2-3 , pp. 511-529
    • Richardson, M.1    Bilmes, J.2    Diorio, C.3
  • 5
    • 0026854213
    • A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
    • L. Deng, "A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal", Sig. Proc., 27(1), pp. 65-78, 1992.
    • (1992) Sig. Proc. , vol.27 , Issue.1 , pp. 65-78
    • Deng, L.1
  • 6
    • 0028234947
    • A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features
    • DOI 10.1121/1.409839
    • L. Deng and D. Sun, "A statistical approach to ASR using atomic units constructed from overlapping articulatory features", J. of Acoust. Soc. Am., 95, pp. 2702-2719, 1994. (Pubitemid 24152864)
    • (1994) Journal of the Acoustical Society of America , vol.95 , Issue.5 , pp. 2702-2719
    • Deng, L.1    Sun, D.X.2
  • 7
    • 0027627252
    • Hidden Markov model representation of quantized articulatory features for speech recognition
    • DOI 10.1006/csla.1993.1014
    • K. Erler and L. Deng, "Hidden Markov model representation of quantized articulatory features for speech recognition", Comp., Speech & Lang., Vol. 7, pp. 265-282, 1993. (Pubitemid 23705305)
    • (1993) Computer Speech and Language , vol.7 , Issue.3 , pp. 265-282
    • Erler, K.1    Deng, L.2
  • 8
    • 58849145971
    • ASR - Articulatory speech recognition
    • Denmark
    • J. Frankel and S. King, "ASR - Articulatory Speech Recognition", Proc. of Eurospeech, pp. 599-602, Denmark, 2001.
    • (2001) Proc. of Eurospeech , pp. 599-602
    • Frankel, J.1    King, S.2
  • 9
    • 84994254645
    • An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces
    • J. Frankel, K. Richmond, S. King and P. Taylor, "An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces", Proc. of ICSLP, Vol. 4, pp. 254-257, 2000.
    • (2000) Proc. of ICSLP , vol.4 , pp. 254-257
    • Frankel, J.1    Richmond, K.2    King, S.3    Taylor, P.4
  • 12
    • 0001622923
    • On defining coarticulation
    • R. Daniloff and R. Hammarberg, "On defining coarticulation", J. of Phonetics, Vol. 1, pp. 239-248, 1973.
    • (1973) J. of Phonetics , vol.1 , pp. 239-248
    • Daniloff, R.1    Hammarberg, R.2
  • 13
    • 84971737266
    • Articulatory gestures as phonological units
    • C. Browman and L. Goldstein, "Articulatory Gestures as Phonological Units", Phonology, 6: 201-251, 1989.
    • (1989) Phonology , vol.6 , pp. 201-251
    • Browman, C.1    Goldstein, L.2
  • 14
    • 0027024362
    • Articulatory phonology: An overview
    • C. Browman and L. Goldstein, "Articulatory Phonology: An Overview", Phonetica, 49: 155-180, 1992.
    • (1992) Phonetica , vol.49 , pp. 155-180
    • Browman, C.1    Goldstein, L.2
  • 15
    • 78649390043
    • Retrieving tract variables from acoustics: A comparison of different machine learning strategies
    • V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman and L. Goldstein, "Retrieving Tract Variables from Acoustics: a comparison of different Machine Learning strategies", IEEE J. of Selected Topics on Sig. Proc., Vol. 4(6), pp. 1027-1045, 2010.
    • (2010) IEEE J. of Selected Topics on Sig. Proc. , vol.4 , Issue.6 , pp. 1027-1045
    • Mitra, V.1    Nam, H.2    Espy-Wilson, C.3    Saltzman, E.4    Goldstein, L.5
  • 16
    • 0015613574
    • Articulatory model for the study of speech production
    • P. Mermelstein, "Articulatory model for the study of speech production", J. Acoust. Soc. of Am., 53(4), pp. 1070-1082, 1973.
    • (1973) J. Acoust. Soc. of Am. , vol.53 , Issue.4 , pp. 1070-1082
    • Mermelstein, P.1
  • 17
    • 84955535347
    • Gestural specification using dynamically-defined articulatory structures
    • C. Browman and L. Goldstein, "Gestural specification using dynamically-defined articulatory structures", J. of Phonetics, Vol. 18, pp. 299-320, 1990.
    • (1990) J. of Phonetics , vol.18 , pp. 299-320
    • Browman, C.1    Goldstein, L.2
  • 19
    • 0038669544
    • The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
    • Paris, France
    • H.G. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions", In Proc. ISCA ITRW ASR2000, pp. 181-188, Paris, France, 2000.
    • (2000) Proc. ISCA ITRW ASR2000 , pp. 181-188
    • Hirsch, H.G.1    Pearce, D.2
  • 20
    • 80051649631
    • Gesture-based dynamic Bayesian network for noise robust speech recognition
    • V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman and L. Goldstein, "Gesture-based Dynamic Bayesian Network for Noise robust Speech Recognition", Proc. of ICASSP, pp. 5172-5175, 2011.
    • (2011) Proc. of ICASSP , pp. 5172-5175
    • Mitra, V.1    Nam, H.2    Espy-Wilson, C.3    Saltzman, E.4    Goldstein, L.5
  • 23
    • 84858956763
    • Speaker identification on the SCOTUS corpus
    • J. Yuan and M. Liberman, "Speaker identification on the SCOTUS corpus", J. Acoust. Soc. of Am., 123(5), pp. 3878, 2008.
    • (2008) J. Acoust. Soc. of Am. , vol.123 , Issue.5 , pp. 3878
    • Yuan, J.1    Liberman, M.2
  • 25
    • 70349207706
    • TADA: An enhanced, portable task dynamics model in MATLAB
    • 2
    • H. Nam, L. Goldstein, E. Saltzman and D. Byrd, "TADA: An enhanced, portable task dynamics model in MATLAB", J. Acoust. Soc. of Am., 115(5), 2, pp. 2430, 2004.
    • (2004) J. Acoust. Soc. of Am. , vol.115 , Issue.5 , pp. 2430
    • Nam, H.1    Goldstein, L.2    Saltzman, E.3    Byrd, D.4
  • 29
    • 77955810460
    • A study on the generalization capability of acoustic models for robust speech recognition
    • X. Xiao, J. Li, E.S. Chng, H. Li and C. Lee, "A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition", IEEE Trans. Audio, Speech & Lang. Process, 18(6), pp. 1158-1169, 2010.
    • (2010) IEEE Trans. Audio, Speech & Lang. Process , vol.18 , Issue.6 , pp. 1158-1169
    • Xiao, X.1    Li, J.2    Chng, E.S.3    Li, H.4    Lee, C.5
  • 31
    • 84858952822
    • http://portal.etsi.org/stq/kta/DSR/dsr.asp
  • 32
    • 27744539597
    • Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR
    • DOI 10.1109/TSA.2005.853002
    • X. Cui and A. Alwan, "Noise Robust Speech Recognition Using Feature Compensation Based on Polynomial Regression of Utterance SNR", IEEE Trans. on Speech and Audio Processing, Vol. 13(6), pp. 1161-1172, 2005. (Pubitemid 41605019)
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.6 , pp. 1161-1172
    • Cui, X.1    Alwan, A.2
  • 33
    • 33750368310
    • An audio-visual corpus for speech perception and automatic speech recognition
    • DOI 10.1121/1.2229005
    • M. Cooke, J. Barker, S. Cunningham and X. Shao, "An audio-visual corpus for speech perception and automatic speech recognition", Journal of the Acoustical Society of America, Vol. 120, pp. 2421-2424, 2006. (Pubitemid 44631681)
    • (2006) Journal of the Acoustical Society of America , vol.120 , Issue.5 , pp. 2421-2424
    • Cooke, M.1    Barker, J.2    Cunningham, S.3    Shao, X.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.