SCOPUS 정보 검색 플랫폼

2011 IEEE Workshop on Automatic Speech Recognition and Understanding, ASRU 2011, Proceedings

Volumn , Issue , 2011, Pages 131-136

Robust speech recognition using articulatory gestures in a dynamic Bayesian network framework

(3) Mitra, Vikramjit a Nam, Hosung b Espy Wilson, Carol Y c

a SRI INTERNATIONAL (United States)

b HASKINS LABORATORIES (United States)

c UNIVERSITY OF MARYLAND (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ACOUSTIC INFORMATION; ARTICULATORY GESTURES; DYNAMIC BAYESIAN NETWORK; HIDDEN VARIABLE; MICRO BEAMS; NOISY DATA; PHONE RECOGNITION; PROPOSED ARCHITECTURES; RECOGNITION SYSTEMS; ROBUST SPEECH RECOGNITION; SPATIO-TEMPORAL; SPEECH RECOGNITION ARCHITECTURES; UNIVERSITY OF WISCONSIN; VOCAL-TRACTS; WORD RECOGNITION;

EXPERIMENTS; SPEECH RECOGNITION;

INFORMATION USE;

EID: 84858964876 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ASRU.2011.6163918 Document Type: Conference Paper

Times cited : (8)

References (34)

1
- 0036642567
- Combining acoustic and articulatory feature information for robust speech recognition
- K. Kirchhoff, G. Fink, and G. Sagerer, "Combining acoustic and articulatory feature information for robust speech recognition", Speech Comm., vol. 37, pp. 303-319, 2000.
- (2000) Speech Comm. , vol.37 , pp. 303-319
- Kirchhoff, K.¹ Fink, G.² Sagerer, G.³

2
- 0003424928
- PhD Thesis, University of Bielefeld
- K. Kirchhoff, "Robust Speech Recognition Using Articulatory Information", PhD Thesis, University of Bielefeld, 1999.
- (1999) Robust Speech Recognition Using Articulatory Information
- Kirchhoff, K.¹

3
- 0037697284
- Hidden-articulator Markov models for speech recognition
- M. Richardson, J. Bilmes and C. Diorio, "Hidden-articulator Markov models for speech recognition", Speech Comm., 41(2-3), pp. 511-529, 2003.
- (2003) Speech Comm. , vol.41 , Issue.2-3 , pp. 511-529
- Richardson, M.¹ Bilmes, J.² Diorio, C.³

4
- 0024906981
- Robust statistic modelling of systematic variabilities in continuous speech incorporating acoustic-articulatory relations
- O. Schmidbauer, "Robust statistic modelling of systematic variabilities in continuous speech incorporating acoustic-articulatory relations", Proc. of ICASSP, pp. 616-619, 1989. (Pubitemid 20604192)
- (1989) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.1 , pp. 616-619
- Schmidbauer Otto¹

5
- 0026854213
- A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
- L. Deng, "A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal", Sig. Proc., 27(1), pp. 65-78, 1992.
- (1992) Sig. Proc. , vol.27 , Issue.1 , pp. 65-78
- Deng, L.¹

6
- 0028234947
- A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features
- DOI 10.1121/1.409839
- L. Deng and D. Sun, "A statistical approach to ASR using atomic units constructed from overlapping articulatory features", J. of Acoust. Soc. Am., 95, pp. 2702-2719, 1994. (Pubitemid 24152864)
- (1994) Journal of the Acoustical Society of America , vol.95 , Issue.5 , pp. 2702-2719
- Deng, L.¹ Sun, D.X.²

7
- 0027627252
- Hidden Markov model representation of quantized articulatory features for speech recognition
- DOI 10.1006/csla.1993.1014
- K. Erler and L. Deng, "Hidden Markov model representation of quantized articulatory features for speech recognition", Comp., Speech & Lang., Vol. 7, pp. 265-282, 1993. (Pubitemid 23705305)
- (1993) Computer Speech and Language , vol.7 , Issue.3 , pp. 265-282
- Erler, K.¹ Deng, L.²

8
- 58849145971
- ASR - Articulatory speech recognition
- Denmark
- J. Frankel and S. King, "ASR - Articulatory Speech Recognition", Proc. of Eurospeech, pp. 599-602, Denmark, 2001.
- (2001) Proc. of Eurospeech , pp. 599-602
- Frankel, J.¹ King, S.²

9
- 84994254645
- An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces
- J. Frankel, K. Richmond, S. King and P. Taylor, "An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces", Proc. of ICSLP, Vol. 4, pp. 254-257, 2000.
- (2000) Proc. of ICSLP , vol.4 , pp. 254-257
- Frankel, J.¹ Richmond, K.² King, S.³ Taylor, P.⁴

10
- 34547541459
- Articulatory feature-based methods for acoustic and audio-visual speech recognition: Summary from the 2006 JHU summer workshop
- K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss and K. Saenko, "Articulatory feature-based methods for acoustic and audio-visual speech recognition: Summary from the 2006 JHU Summer Workshop", Proc. of ICASSP, Vol. 4, pp. 621-624, 2007.
- (2007) Proc. of ICASSP , vol.4 , pp. 621-624
- Livescu, K.¹ Cetin, O.² Hasegawa-Johnson, M.³ King, S.⁴ Bartels, C.⁵ Borges, N.⁶ Kantor, A.⁷ Lal, P.⁸ Yung, L.⁹ Bezman, A.¹⁰ Dawson-Haggerty, S.¹¹ Woods, B.¹² Frankel, J.¹³ Magimai-Doss, M.¹⁴ Saenko, K.¹⁵

11
- 70450174439
- Articulatory phonological code for word classification
- UK
- X. Zhuang, H. Nam, M. Hasegawa-Johnson, L. Goldstein and E. Saltzman, "Articulatory Phonological Code for Word Classification", Proc. of Interspeech, pp. 2763-2766, UK, 2009.
- (2009) Proc. of Interspeech , pp. 2763-2766
- Zhuang, X.¹ Nam, H.² Hasegawa-Johnson, M.³ Goldstein, L.⁴ Saltzman, E.⁵

12
- 0001622923
- On defining coarticulation
- R. Daniloff and R. Hammarberg, "On defining coarticulation", J. of Phonetics, Vol. 1, pp. 239-248, 1973.
- (1973) J. of Phonetics , vol.1 , pp. 239-248
- Daniloff, R.¹ Hammarberg, R.²

13
- 84971737266
- Articulatory gestures as phonological units
- C. Browman and L. Goldstein, "Articulatory Gestures as Phonological Units", Phonology, 6: 201-251, 1989.
- (1989) Phonology , vol.6 , pp. 201-251
- Browman, C.¹ Goldstein, L.²

14
- 0027024362
- Articulatory phonology: An overview
- C. Browman and L. Goldstein, "Articulatory Phonology: An Overview", Phonetica, 49: 155-180, 1992.
- (1992) Phonetica , vol.49 , pp. 155-180
- Browman, C.¹ Goldstein, L.²

15
- 78649390043
- Retrieving tract variables from acoustics: A comparison of different machine learning strategies
- V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman and L. Goldstein, "Retrieving Tract Variables from Acoustics: a comparison of different Machine Learning strategies", IEEE J. of Selected Topics on Sig. Proc., Vol. 4(6), pp. 1027-1045, 2010.
- (2010) IEEE J. of Selected Topics on Sig. Proc. , vol.4 , Issue.6 , pp. 1027-1045
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³ Saltzman, E.⁴ Goldstein, L.⁵

16
- 0015613574
- Articulatory model for the study of speech production
- P. Mermelstein, "Articulatory model for the study of speech production", J. Acoust. Soc. of Am., 53(4), pp. 1070-1082, 1973.
- (1973) J. Acoust. Soc. of Am. , vol.53 , Issue.4 , pp. 1070-1082
- Mermelstein, P.¹

17
- 84955535347
- Gestural specification using dynamically-defined articulatory structures
- C. Browman and L. Goldstein, "Gestural specification using dynamically-defined articulatory structures", J. of Phonetics, Vol. 18, pp. 299-320, 1990.
- (1990) J. of Phonetics , vol.18 , pp. 299-320
- Browman, C.¹ Goldstein, L.²

18
- 79959813685
- Robust word recognition using articulatory trajectories and gestures
- Japan
- V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman and L. Goldstein, "Robust word recognition using articulatory trajectories and Gestures", Proc. of Interspeech, pp. 2038-2041, Japan, 2010.
- (2010) Proc. of Interspeech , pp. 2038-2041
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³ Saltzman, E.⁴ Goldstein, L.⁵

19
- 0038669544
- The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- Paris, France
- H.G. Hirsch and D. Pearce, "The Aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions", In Proc. ISCA ITRW ASR2000, pp. 181-188, Paris, France, 2000.
- (2000) Proc. ISCA ITRW ASR2000 , pp. 181-188
- Hirsch, H.G.¹ Pearce, D.²

20
- 80051649631
- Gesture-based dynamic Bayesian network for noise robust speech recognition
- V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman and L. Goldstein, "Gesture-based Dynamic Bayesian Network for Noise robust Speech Recognition", Proc. of ICASSP, pp. 5172-5175, 2011.
- (2011) Proc. of ICASSP , pp. 5172-5175
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³ Saltzman, E.⁴ Goldstein, L.⁵

21
- 0442317754
- ETSI ES 202 050 Ver. 1.1.5
- ETSI ES 202 050 Ver. 1.1.5, (2007). Speech processing, transmission and quality aspects (STQ); distributed speech recognition; advanced frontend feature extraction algorithm; compression algorithms.
- (2007) Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Advanced Frontend Feature Extraction Algorithm; Compression Algorithms

22
- 0003652255
- Univ. of Wisconsin
- Westbury "X-ray microbeam speech production database user's handbook", Univ. of Wisconsin, 1994.
- (1994) X-ray Microbeam Speech Production Database User's Handbook
- Westbury¹

23
- 84858956763
- Speaker identification on the SCOTUS corpus
- J. Yuan and M. Liberman, "Speaker identification on the SCOTUS corpus", J. Acoust. Soc. of Am., 123(5), pp. 3878, 2008.
- (2008) J. Acoust. Soc. of Am. , vol.123 , Issue.5 , pp. 3878
- Yuan, J.¹ Liberman, M.²

24
- 79959846806
- A procedure for estimating gestural scores from natural speech
- Japan
- H. Nam, V. Mitra, M. Tiede, E. Saltzman, L. Goldstein, C. Espy-Wilson and M. Hasegawa-Johnson, "A procedure for estimating gestural scores from natural speech", Proc. of Interspeech, pp. 30-33, Japan, 2010.
- (2010) Proc. of Interspeech , pp. 30-33
- Nam, H.¹ Mitra, V.² Tiede, M.³ Saltzman, E.⁴ Goldstein, L.⁵ Espy-Wilson, C.⁶ Hasegawa-Johnson, M.⁷

25
- 70349207706
- Tada: An enhanced, portable task dynamics model in matlab
- 2
- H. Nam, L. Goldstein, E. Saltzman and D. Byrd, "Tada: An enhanced, portable task dynamics model in matlab", J. Acoust. Soc. of Am., 115(5), 2, pp. 2430, 2004.
- (2004) J. Acoust. Soc. of Am. , vol.115 , Issue.5 , pp. 2430
- Nam, H.¹ Goldstein, L.² Saltzman, E.³ Byrd, D.⁴

26
- 33745221611
- SSLI Laboratory, Univ. of Washington, October
- J. Bilmes, "GMTK: The Graphical Models Toolkit", SSLI Laboratory, Univ. of Washington, October 2002.
- (2002) GMTK: The Graphical Models Toolkit
- Bilmes, J.¹

27
- 70349213974
- From acoustics to vocal tract time functions
- V. Mitra, I. Özbek, H. Nam, X. Zhou and C. Espy-Wilson, "From Acoustics to Vocal Tract Time Functions", Proc. of International Conference on Acoustics, Speech and Signal Processing, ICASSP, pp. 4497-4500, 2009.
- (2009) Proc. of International Conference on Acoustics, Speech and Signal Processing, ICASSP , pp. 4497-4500
- Mitra, V.¹ Özbek, I.² Nam, H.³ Zhou, X.⁴ Espy-Wilson, C.⁵

28
- 80051617129
- Speech inversion: Benefits of tract variables over pellet trajectories
- Prague, Czeck Rep.
- V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman and L. Goldstein, "Speech Inversion: Benefits of Tract Variables over Pellet Trajectories", Proc. of International Conference on Acoustics, Speech and Signal Processing, ICASSP, pp. 5188-5191, Prague, Czeck Rep., 2011.
- (2011) Proc. of International Conference on Acoustics, Speech and Signal Processing, ICASSP , pp. 5188-5191
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³ Saltzman, E.⁴ Goldstein, L.⁵

29
- 77955810460
- A study on the generalization capability of acoustic models for robust speech recognition
- X. Xiao, J. Li, E.S. Chng, H. Li and C. Lee, "A Study on the Generalization Capability of Acoustic Models for Robust Speech Recognition", IEEE Trans. Audio, Speech & Lang. Process, 18(6), pp. 1158-1169, 2010.
- (2010) IEEE Trans. Audio, Speech & Lang. Process , vol.18 , Issue.6 , pp. 1158-1169
- Xiao, X.¹ Li, J.² Chng, E.S.³ Li, H.⁴ Lee, C.⁵

30
- 42549139762
- MVA processing of speech features
- C. Chen and J. Bilmes, "MVA Processing of Speech Features", IEEE Trans. on Audio, Speech and Lang. Processing, 15(1), pp. 257-270, 2007.
- (2007) IEEE Trans. on Audio, Speech and Lang. Processing , vol.15 , Issue.1 , pp. 257-270
- Chen, C.¹ Bilmes, J.²

31
- 84858952822
- http://portal.etsi.org/stq/kta/DSR/dsr.asp

32
- 27744539597
- Noise robust speech recognition using feature compensation based on polynomial regression of utterance SNR
- DOI 10.1109/TSA.2005.853002
- X. Cui and A. Alwan, "Noise Robust Speech Recognition Using Feature Compensation Based on Polynomial Regression of Utterance SNR", IEEE Transs. on Speech and Audio Processing, Vol. 13(6), pp. 1161-1172, 2005. (Pubitemid 41605019)
- (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , Issue.6 , pp. 1161-1172
- Cui, X.¹ Alwan, A.²

33
- 33750368310
- An audio-visual corpus for speech perception and automatic speech recognition
- DOI 10.1121/1.2229005
- M. Cooke, J. Barker, S. Cunningham and X. Shao, "An audio-visual corpus for speech perception and automatic speech recognition", Journal of Acoustic Society of America, Vol. 120, pp 2421-2424, 2006. (Pubitemid 44631681)
- (2006) Journal of the Acoustical Society of America , vol.120 , Issue.5 , pp. 2421-2424
- Cooke, M.¹ Barker, J.² Cunningham, S.³ Shao, X.⁴

34
- 79960545035
- Tract variables for noise robust speech recognition
- V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman, L. Goldstein, "Tract variables for noise robust speech recognition", IEEE Trans. on Audio, Speech and Language Processing, pp. 1913-1924, 2011
- (2011) IEEE Trans. on Audio, Speech and Language Processing , pp. 1913-1924
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³ Saltzman, E.⁴ Goldstein, L.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.