메뉴 건너뛰기




Volumn , Issue , 2014, Pages 3017-3021

Articulatory features from deep neural networks and their role in speech recognition

Author keywords

articulatory trajectories; automatic speech recognition; deep neural networks; vocal tract variables

Indexed keywords

CONTINUOUS SPEECH RECOGNITION; SIGNAL PROCESSING; TRAJECTORIES;

EID: 84905234271     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6854154     Document Type: Conference Paper
Times cited : (45)

References (30)
  • 1
    • 0001622923 scopus 로고
    • On defining coarticulation
    • R. Daniloff and R. Hammarberg, "On defining coarticulation", J. of Phonetics, Vol.1, pp. 239-248, 1973.
    • (1973) J. of Phonetics , vol.1 , pp. 239-248
    • Daniloff, R.1    Hammarberg, R.2
  • 2
    • 84939672029 scopus 로고
    • Toward a model for speech recognition
    • K. N. Stevens, "Toward a model for speech recognition", J. of Acoust. Soc. Am., Vol.32, pp. 47-55, 1960.
    • (1960) J. of Acoust. Soc. Am. , vol.32 , pp. 47-55
    • Stevens, K.N.1
  • 4
    • 58849145971 scopus 로고    scopus 로고
    • ASR-articulatory speech recognition
    • Denmark
    • J. Frankel and S. King, "ASR-Articulatory Speech Recognition", Proc. of Eurospeech, pp. 599-602, Denmark, 2001.
    • (2001) Proc. of Eurospeech , pp. 599-602
    • Frankel, J.1    King, S.2
  • 5
    • 0028234947 scopus 로고
    • A statistical approach to automatic speech recognition using atomic units constructed from overlapping articulatory features
    • L. Deng and D. Sun, "A statistical approach to automatic speech recognition using atomic units constructed from overlapping articulatory features", J. of Acoust. Soc. Am., 95(5), pp. 2702-2719, 1994.
    • (1994) J. of Acoust. Soc. Am. , vol.95 , Issue.5 , pp. 2702-2719
    • Deng, L.1    Sun, D.2
  • 8
    • 0037697284 scopus 로고    scopus 로고
    • Hidden-articulator Markov models for speech recognition
    • M. Richardson, J. Bilmes and C. Diorio, "Hidden-articulator Markov models for speech recognition", Speech Comm., 41(2-3), pp. 511-529, 2003.
    • (2003) Speech Comm. , vol.41 , Issue.2-3 , pp. 511-529
    • Richardson, M.1    Bilmes, J.2    Diorio, C.3
  • 13
    • 84906219170 scopus 로고    scopus 로고
    • Relevanceweighted reconstruction of articulatory features in deep neural network-based acoustic-to-articulatory mapping
    • Canevari, C., Badino, L., Fadiga, L., Metta, G., "Relevanceweighted reconstruction of articulatory features in Deep Neural Network-based Acoustic-to-Articulatory Mapping", in Proc. of Interspeech, 2013.
    • (2013) Proc. of Interspeech
    • Canevari, C.1    Badino, L.2    Fadiga, L.3    Metta, G.4
  • 14
    • 80051649631 scopus 로고    scopus 로고
    • Gesture-based dynamic bayesian network for noise robust speech recognition
    • V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman, L. Goldstein, "Gesture-based Dynamic Bayesian Network for Noise robust Speech Recognition," in Proc. of ICASSP, pp. 5172-5175, 2011.
    • (2011) Proc. of ICASSP , pp. 5172-5175
    • Mitra, V.1    Nam, H.2    Espy-Wilson, C.3    Saltzman, E.4    Goldstein, L.5
  • 15
    • 0028234947 scopus 로고
    • A statistical approach to ASR using atomic units constructed from overlapping articulatory features
    • L. Deng and D. Sun, "A statistical approach to ASR using atomic units constructed from overlapping articulatory features", J. of Acoust. Soc. Am., 95, pp. 2702-2719, 1994.
    • (1994) J. of Acoust. Soc. Am. , vol.95 , pp. 2702-2719
    • Deng, L.1    Sun, D.2
  • 16
    • 0027627252 scopus 로고
    • Hidden Markov model representation of quantized articulatory features for speech recognition
    • K. Erler and L. Deng, "Hidden Markov model representation of quantized articulatory features for speech recognition", Comp., Speech & Lang., Vol. 7, pp. 265-282, 1993.
    • (1993) Comp., Speech & Lang. , vol.7 , pp. 265-282
    • Erler, K.1    Deng, L.2
  • 17
    • 70349207706 scopus 로고    scopus 로고
    • Tada: An enhanced, portable task dynamics model in Matlab
    • H. Nam, L. Goldstein, E. Saltzman and D. Byrd, "Tada: An enhanced, portable task dynamics model in Matlab", J. of Acoust. Soc. Am., 115(5), pp. 2430, 2004.
    • (2004) J. of Acoust. Soc. Am. , vol.115 , Issue.5 , pp. 2430
    • Nam, H.1    Goldstein, L.2    Saltzman, E.3    Byrd, D.4
  • 18
    • 0036711819 scopus 로고    scopus 로고
    • A quasiarticulatory approach to controlling acoustic source parameters in a Klatt-type formant synthesizer using HLsyn
    • H. M. Hanson and K. N. Stevens, "A quasiarticulatory approach to controlling acoustic source parameters in a Klatt-type formant synthesizer using HLsyn", J. of Acoust. Soc. Am., 112(3), pp. 1158-1182, 2002.
    • (2002) J. of Acoust. Soc. Am. , vol.112 , Issue.3 , pp. 1158-1182
    • Hanson, H.M.1    Stevens, K.N.2
  • 19
    • 84955548400 scopus 로고
    • Towards an articulatory phonology
    • C. P. Browman and L. Goldstein, "Towards an Articulatory Phonology", Phonology Yearbook, 85, pp. 219-252, 1986.
    • (1986) Phonology Yearbook , vol.85 , pp. 219-252
    • Browman, C.P.1    Goldstein, L.2
  • 20
    • 0027024362 scopus 로고
    • Articulatory phonology: An overview
    • C. P. Browman and L. Goldstein, "Articulatory Phonology: An Overview", Phonetica, 49, pp. 155-180, 1992.
    • (1992) Phonetica , vol.49 , pp. 155-180
    • Browman, C.P.1    Goldstein, L.2
  • 21
    • 77956779481 scopus 로고
    • A dynamical approach to gestural patterning in speech production
    • E. Saltzman and K. Munhall, "A Dynamical Approach to Gestural Patterning in Speech Production", Ecological Psychology, 1(4), pp. 332-382, 1989.
    • (1989) Ecological Psychology , vol.1 , Issue.4 , pp. 332-382
    • Saltzman, E.1    Munhall, K.2
  • 22
    • 84905246750 scopus 로고    scopus 로고
    • http://www.speech.cs.cmu.edu/cgi-bin/cmudict
  • 23
    • 33646677283 scopus 로고    scopus 로고
    • Experimental framework for the performance evaluation of speech recognition front-ends on a large vocabulary task
    • June 4
    • G. Hirsch, "Experimental framework for the performance evaluation of speech recognition front-ends on a large vocabulary task", ETSI STQ-Aurora DSR Working Group, June 4, 2001.
    • (2001) ETSI STQ-Aurora DSR Working Group
    • Hirsch, G.1
  • 25
    • 84867589420 scopus 로고    scopus 로고
    • Normalized amplitude modulation features for large vocabulary noise-robust speech recognition
    • V. Mitra, H. Franco, M. Graciarena and A. Mandal, "Normalized amplitude modulation features for large vocabulary noise-robust speech recognition", Proc. IEEE CASSP, pp. 4117-4120, 2012.
    • (2012) Proc. IEEE CASSP , pp. 4117-4120
    • Mitra, V.1    Franco, H.2    Graciarena, M.3    Mandal, A.4
  • 26
    • 84906260861 scopus 로고    scopus 로고
    • Damped oscillator cepstral coefficients for robust speech recognition
    • V. Mitra, H. Franco, M. Graciarena, "Damped Oscillator Cepstral Coefficients for Robust Speech Recognition," Proc. Interspeech, pp. 886-890, 2013.
    • (2013) Proc. Interspeech , pp. 886-890
    • Mitra, V.1    Franco, H.2    Graciarena, M.3
  • 30
    • 53849127143 scopus 로고    scopus 로고
    • Improving robustness of MLLR adaptation with speaker-clustered regression class trees
    • ISSN 0885-2308
    • A. Mandal, M. Ostendorf and Andreas Stolcke, "Improving robustness of MLLR adaptation with speaker-clustered regression class trees", Computer Speech & Language, 23, pp. 176 199 (2009). ISSN 0885-2308.
    • (2009) Computer Speech & Language , vol.23 , pp. 176-199
    • Mandal, A.1    Ostendorf, M.2    Stolcke, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.