메뉴 건너뛰기




Volumn 12, Issue 2, 2004, Pages 175-185

Estimation of Articulatory Movements From Speech Acoustics Using an HMM-Based Speech Production Model

Author keywords

Articulatory HMM; Articulatory to acoustic mapping; HMM based speech production model; Speech inversion

Indexed keywords

COMPUTER SIMULATION; KALMAN FILTERING; MATHEMATICAL MODELS; MATRIX ALGEBRA; NONLINEAR FILTERING; PROBABILITY; SPECTRUM ANALYSIS; SPEECH RECOGNITION;

EID: 2142659020     PISSN: 10636676     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2003.822636     Document Type: Article
Times cited : (125)

References (21)
  • 1
    • 0014077928 scopus 로고
    • Determination of the geometry of the human vocal tract by acoustic measurements
    • M. R. Schroeder, "Determination of the geometry of the human vocal tract by acoustic measurements," J. Acoust. Soc. Amer., vol. 41, pp. 1002-1010, 1967.
    • (1967) J. Acoust. Soc. Amer. , vol.41 , pp. 1002-1010
    • Schroeder, M.R.1
  • 2
    • 0017968519 scopus 로고
    • Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique
    • B. S. Atal, J. J. Chang, M. V. Mathews, and J. W. Tukey, "Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique," J. Acoust Soc. Amer., vol. 63, pp. 1535-1555, 1978.
    • (1978) J. Acoust Soc. Amer. , vol.63 , pp. 1535-1555
    • Atal, B.S.1    Chang, J.J.2    Mathews, M.V.3    Tukey, J.W.4
  • 3
    • 0001736204 scopus 로고
    • Speech coding based on physiological models of speech production
    • New York: Dekker
    • J. Schroeter and M. M. Sondhi, "Speech coding based on physiological models of speech production," in Advances in Speech Signal Processing. New York: Dekker, 1992, pp. 231-267.
    • (1992) Advances in Speech Signal Processing , pp. 231-267
    • Schroeter, J.1    Sondhi, M.M.2
  • 4
    • 0028259480 scopus 로고
    • Techniques for estimating vocal-tract shapes from the speech signal
    • _, "Techniques for estimating vocal-tract shapes from the speech signal," IEEE Trans. Speech Audio Processing, vol. 2, pp. 133-150, 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 133-150
  • 5
    • 0029843107 scopus 로고    scopus 로고
    • Accurate recovery of articulator positions from acoustics: New conclusions based on human data
    • J. Hogden, A. Lofqvist, V. Gracco, I. Zlokarnik, P. Rubin, and E. Saltzman, "Accurate recovery of articulator positions from acoustics: New conclusions based on human data," J. Acoust. Soc. Amer., vol. 100, pp. 1819-1834, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.100 , pp. 1819-1834
    • Hogden, J.1    Lofqvist, A.2    Gracco, V.3    Zlokarnik, I.4    Rubin, P.5    Saltzman, E.6
  • 6
    • 0026491198 scopus 로고
    • Electromagnetic midsagittal articulometer system for transducing speech articulatory movements
    • J. Perkell, M. Cohen, M. Svirsky, M. Mathies, I. Garabieta, and M. Jackson, "Electromagnetic midsagittal articulometer system for transducing speech articulatory movements," J. Acoust. Soc. Amer., vol. 92, pp. 3078-3096, 1992.
    • (1992) J. Acoust. Soc. Amer. , vol.92 , pp. 3078-3096
    • Perkell, J.1    Cohen, M.2    Svirsky, M.3    Mathies, M.4    Garabieta, I.5    Jackson, M.6
  • 7
    • 0003026847 scopus 로고    scopus 로고
    • Determination of articulatory positions from speech acoustics by applying dynamic articulatory constraints
    • S. Suzuki, T. Okadome, and M. Honda, "Determination of articulatory positions from speech acoustics by applying dynamic articulatory constraints," in Proc. ICSLP, 1998, pp. 2251-2254.
    • (1998) Proc. ICSLP , pp. 2251-2254
    • Suzuki, S.1    Okadome, T.2    Honda, M.3
  • 8
    • 0010505818 scopus 로고    scopus 로고
    • Recovery of articulatory movements from acoustics with phonemic information
    • T. Okadome, S. Suzuki, and M. Honda, "Recovery of articulatory movements from acoustics with phonemic information," in Proc. Seminar on Speech Production, 2000, pp. 229-233.
    • (2000) Proc. Seminar on Speech Production , pp. 229-233
    • Okadome, T.1    Suzuki, S.2    Honda, M.3
  • 9
    • 0010424152 scopus 로고    scopus 로고
    • Acoustic-to-articulatory inversion using dynamical and phonological constraints
    • S. Dusan and L. Deng, "Acoustic-to-articulatory inversion using dynamical and phonological constraints," in Proc. Seminar on Speech Production, 2000, pp. 237-240.
    • (2000) Proc. Seminar on Speech Production , pp. 237-240
    • Dusan, S.1    Deng, L.2
  • 10
    • 0035337891 scopus 로고    scopus 로고
    • Parameter estimation of a target-directed dynamic system model with switching states
    • R. Togneri, J. Ma, and L. Deng, "Parameter estimation of a target-directed dynamic system model with switching states," Signal Processing, vol. 81, pp. 975-987, 2001.
    • (2001) Signal Processing , vol.81 , pp. 975-987
    • Togneri, R.1    Ma, J.2    Deng, L.3
  • 11
    • 0033623527 scopus 로고    scopus 로고
    • Spontaneous speech recognition using a statistical coarticulatory model for the hidden vocal-tract-resonance dynamics
    • L. Deng and J. Ma, "Spontaneous speech recognition using a statistical coarticulatory model for the hidden vocal-tract-resonance dynamics," J. Acoust. Soc. Amer., vol. 108, pp. 3036-3048, 2000.
    • (2000) J. Acoust. Soc. Amer. , vol.108 , pp. 3036-3048
    • Deng, L.1    Ma, J.2
  • 12
    • 0031198059 scopus 로고    scopus 로고
    • Production models as a structural basis for automatic speech recognition
    • L. Deng, G. Ramsay, and D. Sun, "Production models as a structural basis for automatic speech recognition," Speech Communication, vol. 22, pp. 93-112, 1997.
    • (1997) Speech Communication , vol.22 , pp. 93-112
    • Deng, L.1    Ramsay, G.2    Sun, D.3
  • 13
    • 0032119268 scopus 로고    scopus 로고
    • A dynamic feature-based approach to the interface between phonology and phonetics for speech modeling and recognition
    • L. Deng, "A dynamic feature-based approach to the interface between phonology and phonetics for speech modeling and recognition," Speech Communication, vol. 24, pp. 299-323, 1998.
    • (1998) Speech Communication , vol.24 , pp. 299-323
    • Deng, L.1
  • 14
    • 0033884177 scopus 로고    scopus 로고
    • Maximum likelihood and minimum classification error factor analysis for automatic speech recognition
    • L. K. Saul and M. G. Rahim, "Maximum likelihood and minimum classification error factor analysis for automatic speech recognition," IEEE Trans. Speech Audio Processing, vol. 8, pp. 115-125, 2000.
    • (2000) IEEE Trans. Speech Audio Processing , vol.8 , pp. 115-125
    • Saul, L.K.1    Rahim, M.G.2
  • 15
    • 85009243663 scopus 로고    scopus 로고
    • Acoustic-to-articulatory inverse mapping using an HMM-based speech production model
    • S. Hiroya and M. Honda, "Acoustic-to-articulatory inverse mapping using an HMM-based speech production model," in Proc. ICSLP, 2002, pp. 2305-2308.
    • (2002) Proc. ICSLP , pp. 2305-2308
    • Hiroya, S.1    Honda, M.2
  • 16
    • 0027965617 scopus 로고
    • Determination of sagittal tongue shape from the positions of points on the tongue surface
    • T. Kaburagi and M. Honda, "Determination of sagittal tongue shape from the positions of points on the tongue surface," J. Acoust. Soc. Amer., vol. 96, pp. 1356-1366, 1994.
    • (1994) J. Acoust. Soc. Amer. , vol.96 , pp. 1356-1366
    • Kaburagi, T.1    Honda, M.2
  • 17
    • 85031628788 scopus 로고
    • An algorithm for speech parameter generation from continuous mixture HMM's with dynamic features
    • K. Tokuda, T. Masuko, T. Yamada, T. Kobayashi, and S. Imai, "An algorithm for speech parameter generation from continuous mixture HMM's with dynamic features," in Proc. EUROSPEECH, 1995, pp. 757-760.
    • (1995) Proc. EUROSPEECH , pp. 757-760
    • Tokuda, K.1    Masuko, T.2    Yamada, T.3    Kobayashi, T.4    Imai, S.5
  • 18
    • 0030263447 scopus 로고    scopus 로고
    • Mean and variance adaptation within the MLLR framework
    • M. J. F. Gales and P. C. Woodland, "Mean and variance adaptation within the MLLR framework," Computer Speech and Language, vol. 10, pp. 249-264, 1996.
    • (1996) Computer Speech and Language , vol.10 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 19
    • 0000353178 scopus 로고
    • A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
    • L. E. Baum, T. Petrie, G. Soules, and N. Weiss, "A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains," Ann. Math. Stat., vol. 41, pp. 164-171, 1970.
    • (1970) Ann. Math. Stat. , vol.41 , pp. 164-171
    • Baum, L.E.1    Petrie, T.2    Soules, G.3    Weiss, N.4
  • 20
    • 85016140477 scopus 로고
    • An adaptive algorithm for mel-cepstral analysis of speech
    • T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech," in Proc. ICASSP, 1992, pp. 137-140.
    • (1992) Proc. ICASSP , pp. 137-140
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.