SCOPUS Information Search Platform

IEEE Transactions on Speech and Audio Processing

Volume 12, Issue 2, 2004, Pages 175-185

Estimation of Articulatory Movements From Speech Acoustics Using an HMM-Based Speech Production Model

(2) Hiroya, Sadao (a); Honda, Masaaki (a, b)

a NTT Communication Science Laboratories (Japan)

b WASEDA UNIVERSITY (Japan)

Author keywords

Articulatory HMM; Articulatory to acoustic mapping; HMM based speech production model; Speech inversion

Indexed keywords

COMPUTER SIMULATION; KALMAN FILTERING; MATHEMATICAL MODELS; MATRIX ALGEBRA; NONLINEAR FILTERING; PROBABILITY; SPECTRUM ANALYSIS; SPEECH RECOGNITION;

ARTICULATORY HMM; ARTICULATORY-TO-ACOUSTIC MAPPING; HMM-BASED SPEECH PRODUCTION MODEL; SPEECH INVERSION;

SPEECH PROCESSING;

EID: 2142659020 PISSN: 10636676 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2003.822636 Document Type: Article

Times cited : (125)

References (21)

1
- 0014077928
- Determination of the geometry of the human vocal tract by acoustic measurements
- M. R. Schroeder, "Determination of the geometry of the human vocal tract by acoustic measurements," J. Acoust. Soc. Amer., vol. 41, pp. 1002-1010, 1967.
- (1967) J. Acoust. Soc. Amer. , vol.41 , pp. 1002-1010
- Schroeder, M.R.¹

2
- 0017968519
- Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique
- B. S. Atal, J. J. Chang, M. V. Mathews, and J. W. Tukey, "Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique," J. Acoust. Soc. Amer., vol. 63, pp. 1535-1555, 1978.
- (1978) J. Acoust. Soc. Amer. , vol.63 , pp. 1535-1555
- Atal, B.S.¹ Chang, J.J.² Mathews, M.V.³ Tukey, J.W.⁴

3
- 0001736204
- Speech coding based on physiological models of speech production
- New York: Dekker
- J. Schroeter and M. M. Sondhi, "Speech coding based on physiological models of speech production," in Advances in Speech Signal Processing. New York: Dekker, 1992, pp. 231-267.
- (1992) Advances in Speech Signal Processing , pp. 231-267
- Schroeter, J.¹ Sondhi, M.M.²

4
- 0028259480
- Techniques for estimating vocal-tract shapes from the speech signal
- J. Schroeter and M. M. Sondhi, "Techniques for estimating vocal-tract shapes from the speech signal," IEEE Trans. Speech Audio Processing, vol. 2, pp. 133-150, 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 133-150

5
- 0029843107
- Accurate recovery of articulator positions from acoustics: New conclusions based on human data
- J. Hogden, A. Lofqvist, V. Gracco, I. Zlokarnik, P. Rubin, and E. Saltzman, "Accurate recovery of articulator positions from acoustics: New conclusions based on human data," J. Acoust. Soc. Amer., vol. 100, pp. 1819-1834, 1996.
- (1996) J. Acoust. Soc. Amer. , vol.100 , pp. 1819-1834
- Hogden, J.¹ Lofqvist, A.² Gracco, V.³ Zlokarnik, I.⁴ Rubin, P.⁵ Saltzman, E.⁶

6
- 0026491198
- Electromagnetic midsagittal articulometer system for transducing speech articulatory movements
- J. Perkell, M. Cohen, M. Svirsky, M. Mathies, I. Garabieta, and M. Jackson, "Electromagnetic midsagittal articulometer system for transducing speech articulatory movements," J. Acoust. Soc. Amer., vol. 92, pp. 3078-3096, 1992.
- (1992) J. Acoust. Soc. Amer. , vol.92 , pp. 3078-3096
- Perkell, J.¹ Cohen, M.² Svirsky, M.³ Mathies, M.⁴ Garabieta, I.⁵ Jackson, M.⁶

7
- 0003026847
- Determination of articulatory positions from speech acoustics by applying dynamic articulatory constraints
- S. Suzuki, T. Okadome, and M. Honda, "Determination of articulatory positions from speech acoustics by applying dynamic articulatory constraints," in Proc. ICSLP, 1998, pp. 2251-2254.
- (1998) Proc. ICSLP , pp. 2251-2254
- Suzuki, S.¹ Okadome, T.² Honda, M.³

8
- 0010505818
- Recovery of articulatory movements from acoustics with phonemic information
- T. Okadome, S. Suzuki, and M. Honda, "Recovery of articulatory movements from acoustics with phonemic information," in Proc. Seminar on Speech Production, 2000, pp. 229-233.
- (2000) Proc. Seminar on Speech Production , pp. 229-233
- Okadome, T.¹ Suzuki, S.² Honda, M.³

9
- 0010424152
- Acoustic-to-articulatory inversion using dynamical and phonological constraints
- S. Dusan and L. Deng, "Acoustic-to-articulatory inversion using dynamical and phonological constraints," in Proc. Seminar on Speech Production, 2000, pp. 237-240.
- (2000) Proc. Seminar on Speech Production , pp. 237-240
- Dusan, S.¹ Deng, L.²

10
- 0035337891
- Parameter estimation of a target-directed dynamic system model with switching states
- R. Togneri, J. Ma, and L. Deng, "Parameter estimation of a target-directed dynamic system model with switching states," Signal Processing, vol. 81, pp. 975-987, 2001.
- (2001) Signal Processing , vol.81 , pp. 975-987
- Togneri, R.¹ Ma, J.² Deng, L.³

11
- 0033623527
- Spontaneous speech recognition using a statistical coarticulatory model for the hidden vocal-tract-resonance dynamics
- L. Deng and J. Ma, "Spontaneous speech recognition using a statistical coarticulatory model for the hidden vocal-tract-resonance dynamics," J. Acoust. Soc. Amer., vol. 108, pp. 3036-3048, 2000.
- (2000) J. Acoust. Soc. Amer. , vol.108 , pp. 3036-3048
- Deng, L.¹ Ma, J.²

12
- 0031198059
- Production models as a structural basis for automatic speech recognition
- L. Deng, G. Ramsay, and D. Sun, "Production models as a structural basis for automatic speech recognition," Speech Communication, vol. 22, pp. 93-112, 1997.
- (1997) Speech Communication , vol.22 , pp. 93-112
- Deng, L.¹ Ramsay, G.² Sun, D.³

13
- 0032119268
- A dynamic feature-based approach to the interface between phonology and phonetics for speech modeling and recognition
- L. Deng, "A dynamic feature-based approach to the interface between phonology and phonetics for speech modeling and recognition," Speech Communication, vol. 24, pp. 299-323, 1998.
- (1998) Speech Communication , vol.24 , pp. 299-323
- Deng, L.¹

14
- 0033884177
- Maximum likelihood and minimum classification error factor analysis for automatic speech recognition
- L. K. Saul and M. G. Rahim, "Maximum likelihood and minimum classification error factor analysis for automatic speech recognition," IEEE Trans. Speech Audio Processing, vol. 8, pp. 115-125, 2000.
- (2000) IEEE Trans. Speech Audio Processing , vol.8 , pp. 115-125
- Saul, L.K.¹ Rahim, M.G.²

15
- 85009243663
- Acoustic-to-articulatory inverse mapping using an HMM-based speech production model
- S. Hiroya and M. Honda, "Acoustic-to-articulatory inverse mapping using an HMM-based speech production model," in Proc. ICSLP, 2002, pp. 2305-2308.
- (2002) Proc. ICSLP , pp. 2305-2308
- Hiroya, S.¹ Honda, M.²

16
- 0027965617
- Determination of sagittal tongue shape from the positions of points on the tongue surface
- T. Kaburagi and M. Honda, "Determination of sagittal tongue shape from the positions of points on the tongue surface," J. Acoust. Soc. Amer., vol. 96, pp. 1356-1366, 1994.
- (1994) J. Acoust. Soc. Amer. , vol.96 , pp. 1356-1366
- Kaburagi, T.¹ Honda, M.²

17
- 85031628788
- An algorithm for speech parameter generation from continuous mixture HMM's with dynamic features
- K. Tokuda, T. Masuko, T. Yamada, T. Kobayashi, and S. Imai, "An algorithm for speech parameter generation from continuous mixture HMM's with dynamic features," in Proc. EUROSPEECH, 1995, pp. 757-760.
- (1995) Proc. EUROSPEECH , pp. 757-760
- Tokuda, K.¹ Masuko, T.² Yamada, T.³ Kobayashi, T.⁴ Imai, S.⁵

18
- 0030263447
- Mean and variance adaptation within the MLLR framework
- M. J. F. Gales and P. C. Woodland, "Mean and variance adaptation within the MLLR framework," Computer Speech and Language, vol. 10, pp. 249-264, 1996.
- (1996) Computer Speech and Language , vol.10 , pp. 249-264
- Gales, M.J.F.¹ Woodland, P.C.²

19
- 0000353178
- A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains
- L. E. Baum, T. Petrie, G. Soules, and N. Weiss, "A maximization technique occurring in the statistical analysis of probabilistic functions of Markov chains," Ann. Math. Stat., vol. 41, pp. 164-171, 1970.
- (1970) Ann. Math. Stat. , vol.41 , pp. 164-171
- Baum, L.E.¹ Petrie, T.² Soules, G.³ Weiss, N.⁴

20
- 85016140477
- An adaptive algorithm for mel-cepstral analysis of speech
- T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech," in Proc. ICASSP, 1992, pp. 137-140.
- (1992) Proc. ICASSP , pp. 137-140
- Fukada, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

21
- 0002144369
- Tree-based state tying for high accuracy acoustic modeling
- S. J. Young, J. Odell, and P. Woodland, "Tree-based state tying for high accuracy acoustic modeling," in Proc. ARPA Human Language Technology Workshop, 1994, pp. 307-312.
- (1994) Proc. ARPA Human Language Technology Workshop , pp. 307-312
- Young, S.J.¹ Odell, J.² Woodland, P.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.