SCOPUS 정보 검색 플랫폼

Speech Communication

Volumn 26, Issue 1-2, 1998, Pages 23-43

Quantitative association of vocal-tract and facial behavior

(3) Yehia, Hani a Rubin, Philip b Vatikiotis Bateson, Eric c

a FEDERAL UNIVERSITY OF MINAS GERAIS (Brazil)

b YALE UNIVERSITY (United States)

c ADVANCED TELECOMMUNICATIONS RESEARCH INSTITUTE INTERNATIONAL (Japan)

Author keywords

Dynamic time warping (DTW); Facial motion; Line spectrum pair (LSP); Linear estimator; Principal component analysis; Singular value decomposition; Vocal tract motion

Indexed keywords

CORRELATION METHODS; SPEECH ANALYSIS;

DYNAMIC TIME WARPING (DTW); LINE SPECTRUM PAIR (LSP); LINEAR ESTIMATORS; PRINCIPAL COMPONENT ANALYSIS (PCA); SINGULAR VALUE DECOMPOSITION (SVD); SPEECH ACOUSTICS; VOCAL-TRACT MOTION;

SPEECH COMMUNICATION;

EID: 0032178592 PISSN: 01676393 EISSN: None Source Type: Journal
DOI: 10.1016/S0167-6393(98)00048-X Document Type: Article

Times cited : (381)

References (44)

1
- 0017968519
- Inversion of articulatory-to-acoustic transformation in the vocal-tract by a computing sorting technique
- Atal, B.S., Chang, J.J., Tukey, J.W., 1978. Inversion of articulatory-to-acoustic transformation in the vocal-tract by a computing sorting technique. J. Acoust. Soc. Amer. 63 (5), 1535-1555.
- (1978) J. Acoust. Soc. Amer. , vol.63 , Issue.5 , pp. 1535-1555
- Atal, B.S.¹ Chang, J.J.² Tukey, J.W.³

2
- 0001762548
- Recovery of vocal tract geometry from formants for vowels and fricative consonants using a midsagittal-to-area function conversion model
- Badin, P., Beautemps, D., Laboissiere, R., Schwartz, J.L., 1995. Recovery of vocal tract geometry from formants for vowels and fricative consonants using a midsagittal-to-area function conversion model. Journal of Phonetics 23, 221-229.
- (1995) Journal of Phonetics , vol.23 , pp. 221-229
- Badin, P.¹ Beautemps, D.² Laboissiere, R.³ Schwartz, J.L.⁴

3
- 0010423709
- On the use of structured light in speech research
- Carter, J.N., Shadle, C.H., Davis, C.J., 1996. On the use of structured light in speech research. In: Proceedings of The First ESCA Tutorial and Research Workshop on Speech Production Modeling and Fourth Speech Production Seminar, pp. 229-232.
- (1996) Proceedings of the First ESCA Tutorial and Research Workshop on Speech Production Modeling and Fourth Speech Production Seminar , pp. 229-232
- Carter, J.N.¹ Shadle, C.H.² Davis, C.J.³

4
- 0003418124
- The Hague
- Fant, G., 1960. Acoustic Theory of Speech Production. The Hague.
- (1960) Acoustic Theory of Speech Production
- Fant, G.¹

5
- 0039821028
- An unsupervised method for learning to track tongue position from ana acoustic signal
- Haskins Laboratories, New Haven, CT, USA
- Hogden, J., 1993. An unsupervised method for learning to track tongue position from ana acoustic signal. Status Report on Speech Research SR-115/116, Haskins Laboratories, New Haven, CT, USA.
- (1993) Status Report on Speech Research SR-115/116
- Hogden, J.¹

6
- 0004151494
- Cambridge
- Horn, R., Johnson, C., 1985. Matrix Analysis. Cambridge.
- (1985) Matrix Analysis
- Horn, R.¹ Johnson, C.²

7
- 0001810975
- Line spectrum representation of linear predictive coefficients of speech signals
- Itakura, F., 1975. Line spectrum representation of linear predictive coefficients of speech signals. J. Acoust. Soc. Amer. 57, 535.
- (1975) J. Acoust. Soc. Amer. , vol.57 , pp. 535
- Itakura, F.¹

8
- 0027965617
- Determination of sagittal tongue shape from the positions of points on the tongue surface
- Kaburagi, T., Honda, M., 1994. Determination of sagittal tongue shape from the positions of points on the tongue surface. J. Acoust. Soc. Amer. 56, 1356-1366.
- (1994) J. Acoust. Soc. Amer. , vol.56 , pp. 1356-1366
- Kaburagi, T.¹ Honda, M.²

9
- 0021451739
- Converging evidence in support of common dynamic principles for speech and movement coordination
- Kelso, J.A.S., Tuller, B., 1984. Converging evidence in support of common dynamic principles for speech and movement coordination. Amer. J. Psychol. 15, R928-R935.
- (1984) Amer. J. Psychol. , vol.15
- Kelso, J.A.S.¹ Tuller, B.²

10
- 84930562624
- Dissertatie, Royal Institute of Technology (KTH), Stockholm
- Lin, Q., 1990. Speech production theory and articulatory speech synthesis. Dissertatie, Royal Institute of Technology (KTH), Stockholm.
- (1990) Speech Production Theory and Articulatory Speech Synthesis
- Lin, Q.¹

11
- 0020281396
- A digital simulation method of the vocal-tract system
- Maeda, S., 1982. A digital simulation method of the vocal-tract system. Speech Communication 1 (3,4), 199-229.
- (1982) Speech Communication , vol.1 , Issue.3-4 , pp. 199-229
- Maeda, S.¹

12
- 0028375762
- Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary model tests
- McGowan, R.S., 1994. Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary model tests. Speech Communication 14, 19-48.
- (1994) Speech Communication , vol.14 , pp. 19-48
- McGowan, R.S.¹

13
- 0014092133
- Determination of vocal-tract shape from measured formant frequencies
- Mermelstein, P., 1967. Determination of vocal-tract shape from measured formant frequencies. J. Acoust. Soc. Amer. 41 (5), 1283-1294.
- (1967) J. Acoust. Soc. Amer. , vol.41 , Issue.5 , pp. 1283-1294
- Mermelstein, P.¹

14
- 0015613574
- Articulatory model for the study of speech production
- Mermelstein, P., 1973. Articulatory model for the study of speech production. J. Acoust. Soc. Amer. 53 (4), 1070-1082.
- (1973) J. Acoust. Soc. Amer. , vol.53 , Issue.4 , pp. 1070-1082
- Mermelstein, P.¹

15
- 0022342333
- An examination of intra-articulator relative timing
- Munhall, K.G., 1985. An examination of intra-articulator relative timing. J. Acoust. Soc. Amer. 78, 1548-1553.
- (1985) J. Acoust. Soc. Amer. , vol.78 , pp. 1548-1553
- Munhall, K.G.¹

16
- 0023739204
- Patterns of interarticulator phasing and their relation to linguistic structure
- Nittrouer, S., Munhall, K.G., Kelso, J.A.S., Tuller, B., Harris, K.S., 1988. Patterns of interarticulator phasing and their relation to linguistic structure. J. Acoust. Soc. Amer. 84, 1653-1661.
- (1988) J. Acoust. Soc. Amer. , vol.84 , pp. 1653-1661
- Nittrouer, S.¹ Munhall, K.G.² Kelso, J.A.S.³ Tuller, B.⁴ Harris, K.S.⁵

17
- 0028288484
- Control of jaw orientation and position in mastication and speech
- Ostry, D.J., Munhall, K.G., 1994. Control of jaw orientation and position in mastication and speech. Journal of Neurophysiology 71, 1528-1545.
- (1994) Journal of Neurophysiology , vol.71 , pp. 1528-1545
- Ostry, D.J.¹ Munhall, K.G.²

18
- 0026491198
- Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements
- Perkell, J.S., Cohen, M.H., Svirsky, M.A., Matthies, M.L., Garabieta, I., Jackson, M.T.T., 1992. Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements. J. Acoust. Soc. Amer. 92 (6), 3078-3096.
- (1992) J. Acoust. Soc. Amer. , vol.92 , Issue.6 , pp. 3078-3096
- Perkell, J.S.¹ Cohen, M.H.² Svirsky, M.A.³ Matthies, M.L.⁴ Garabieta, I.⁵ Jackson, M.T.T.⁶

19
- 0004244302
- Prentice-Hall, Englewood Cliffs, NJ
- Rabiner, L., Juang, B.W., 1993. Fundamentals of Speech Recognition. Prentice-Hall, Englewood Cliffs, NJ.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.W.²

20
- 0019606728
- An articulatory synthesizer for perceptual research
- Rubin, P.E., Baer, T., Mermelstein, P., 1981. An articulatory synthesizer for perceptual research. J. Acoust. Soc. Amer. 70, 321-328.
- (1981) J. Acoust. Soc. Amer. , vol.70 , pp. 321-328
- Rubin, P.E.¹ Baer, T.² Mermelstein, P.³

21
- 77956779481
- A dynamical approach to gestural patterning in speech production
- Saltzman, E., Munhall, K.G., 1989. A dynamical approach to gestural patterning in speech production. Ecological Psychology 1, 333-382.
- (1989) Ecological Psychology , vol.1 , pp. 333-382
- Saltzman, E.¹ Munhall, K.G.²

22
- 0014077928
- Determination of the geometry of the human vocal-tract by acoustical measurements
- Schroeder, M.R., 1967. Determination of the geometry of the human vocal-tract by acoustical measurements. J. Acoust. Soc. Amer. 41 (4), 1002-1010.
- (1967) J. Acoust. Soc. Amer. , vol.41 , Issue.4 , pp. 1002-1010
- Schroeder, M.R.¹

23
- 0001736204
- Speech coding based on physiological models of speech production
- Sondhi, M.M., Furui, S. (Eds.), Marcel Dekker, New York
- Schroeter, J., Sondhi, M., 1991. Speech coding based on physiological models of speech production. In: Sondhi, M.M., Furui, S. (Eds.), Advances in Speech Processing. Marcel Dekker, New York, pp. 231-268.
- (1991) Advances in Speech Processing , pp. 231-268
- Schroeter, J.¹ Sondhi, M.²

24
- 0028259480
- Techniques for estimating vocal-tract shapes from the speech signal
- Schroeter, J., Sondhi, M.M., 1994. Techniques for estimating vocal-tract shapes from the speech signal. IEEE Trans. Speech Audio Process. 2 (1), 133-150.
- (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.1 , pp. 133-150
- Schroeter, J.¹ Sondhi, M.M.²

25
- 0007558272
- Articulatory synthesis
- Hardcastle, W.J., Marchal, A. (Eds.), Kluwer Academic Publishers, Dordrecht
- Scully, C., 1990. Articulatory synthesis. In: Hardcastle, W.J., Marchal, A. (Eds.), Speech Production and Speech Modelling. Kluwer Academic Publishers, Dordrecht, pp. 151-186.
- (1990) Speech Production and Speech Modelling , pp. 151-186
- Scully, C.¹

26
- 0039229118
- Estimation and generation of articulatory motion using neural networks
- Shirai, K., 1993. Estimation and generation of articulatory motion using neural networks. Speech Communication 13, 45-51.
- (1993) Speech Communication , vol.13 , pp. 45-51
- Shirai, K.¹

27
- 0023165217
- A hybrid time-frequency domain articulatory speech synthesizer
- Sondhi, M.M., Schroeter, J., 1987. A hybrid time-frequency domain articulatory speech synthesizer. IEEE Trans. Acoust. Speech Signal Process. 35 (7), 955-967.
- (1987) IEEE Trans. Acoust. Speech Signal Process , vol.35 , Issue.7 , pp. 955-967
- Sondhi, M.M.¹ Schroeter, J.²

28
- 84955022381
- Development of a quantitative description of vowel articulation
- Stevens, K., House, A., 1955. Development of a quantitative description of vowel articulation. J. Acoust. Soc. Amer. 27 (3), 484-493.
- (1955) J. Acoust. Soc. Amer. , vol.27 , Issue.3 , pp. 484-493
- Stevens, K.¹ House, A.²

29
- 0025316435
- A three-dimensional model of tongue movement based on ultrasound and X-ray microbeam data
- Stone, M., 1990. A three-dimensional model of tongue movement based on ultrasound and X-ray microbeam data. J. Acoust. Soc. Amer. 87 (5), 2207-2217.
- (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.5 , pp. 2207-2217
- Stone, M.¹

30
- 0001390793
- Speech analysis and synthesis methods developed at ECL in NTT - From LPC to LSP
- Sugamura, N., Itakura, F., 1986. Speech analysis and synthesis methods developed at ECL in NTT - From LPC to LSP -. Speech Communication 5, 199-215.
- (1986) Speech Communication , vol.5 , pp. 199-215
- Sugamura, N.¹ Itakura, F.²

31
- 28244482533
- Extracting articulator movement parameters from a videodisc-based cineradiographic database
- Tiede, M.K., Vatikiotis-Bateson, E., 1994. Extracting articulator movement parameters from a videodisc-based cineradiographic database. In: Proc. International Conference on Spoken Language Processing, pp. S02-4.1-S02-4.4.
- (1994) Proc. International Conference on Spoken Language Processing
- Tiede, M.K.¹ Vatikiotis-Bateson, E.²

32
- 0000078906
- An analysis of the dimensionality of jaw motion in speech
- Vatikiotis-Bateson, E., Ostry, D.J., 1995. An analysis of the dimensionality of jaw motion in speech. Journal of Phonetics 23, 101-117.
- (1995) Journal of Phonetics , vol.23 , pp. 101-117
- Vatikiotis-Bateson, E.¹ Ostry, D.J.²

33
- 0002560289
- H-96 65, The Acoustical Society of Japan
- Vatikiotis-Bateson, E., Yehia, H., 1996. Physiological modeling of facial motion during speech. H-96 65, The Acoustical Society of Japan.
- (1996) Physiological Modeling of Facial Motion during Speech
- Vatikiotis-Bateson, E.¹ Yehia, H.²

34
- 85032406625
- Unified physiological model of audible-visible speech production
- Vatikiotis-Bateson, E., Yehia, H.C., 1997. Unified physiological model of audible-visible speech production. In: Fifth European Conference on Speech Communication and Technology.
- (1997) Fifth European Conference on Speech Communication and Technology
- Vatikiotis-Bateson, E.¹ Yehia, H.C.²

35
- 0030355346
- Characterizing audiovisual information during speech
- Vatikiotis-Bateson, E., Munhall, K.G., Kasahara Y., Garcia, F., Yehia, H., 1996a. Characterizing audiovisual information during speech. In: Proceedings of the International Conference on Spoken Language Processing, pp. 1485-1488.
- (1996) Proceedings of the International Conference on Spoken Language Processing , pp. 1485-1488
- Vatikiotis-Bateson, E.¹ Munhall, K.G.² Kasahara, Y.³ Garcia, F.⁴ Yehia, H.⁵

36
- 0010605203
- The dynamics of audiovisual behavior in speech
- Stork, D., Hennecke, M. (Eds.), NATO-ASI Series, Series F, Computers and Systems Sciences. Springer, Berlin
- Vatikiotis-Bateson, E., Munhall, K.G., Hirayama, M., Lee, Y.C., Terzopoulos, D., 1996b. The dynamics of audiovisual behavior in speech. In: Stork, D., Hennecke, M. (Eds.), Speech Reading by Humans and Machines, Vol. 150, NATO-ASI Series, Series F, Computers and Systems Sciences. Springer, Berlin, pp. 221-232.
- (1996) Speech Reading by Humans and Machines , vol.150 , pp. 221-232
- Vatikiotis-Bateson, E.¹ Munhall, K.G.² Hirayama, M.³ Lee, Y.C.⁴ Terzopoulos, D.⁵

37
- 0002906060
- Technical Report TR-H-237, ATR-HIP
- Vatikiotis-Bateson, E., Kuratate, T., Tiede, M.K., Yehia, H.C., 1998. Kinematics-based synthesis of realistic talking faces. Technical Report TR-H-237, ATR-HIP.
- (1998) Kinematics-based Synthesis of Realistic Talking Faces
- Vatikiotis-Bateson, E.¹ Kuratate, T.² Tiede, M.K.³ Yehia, H.C.⁴

38
- 0038594488
- Determination of human vocaltract dynamic geometry from formant trajectories using spatial and temporal Fourier analysis
- Yehia, H., Itakura, F., 1994. Determination of human vocaltract dynamic geometry from formant trajectories using spatial and temporal Fourier analysis. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 477-480.
- (1994) Proc. IEEE International Conference on Acoustics, Speech and Signal Processing , pp. 477-480
- Yehia, H.¹ Itakura, F.²

39
- 0030121505
- A method to combine acoustical and morphological constraints in the speech production inverse problem
- Yehia, H., Itakura, F., 1996. A method to combine acoustical and morphological constraints in the speech production inverse problem. Speech Communication 18 (2), 151-174.
- (1996) Speech Communication , vol.18 , Issue.2 , pp. 151-174
- Yehia, H.¹ Itakura, F.²

40
- 0030218668
- An acoustically oriented vocal-tract model
- Yehia, H., Takeda, K., Itakura, F., 1996. An acoustically oriented vocal-tract model. IEICE Transactions on Information and Systems E79 (D-8), 1198-1208.
- (1996) IEICE Transactions on Information and Systems E79 (D-8) , pp. 1198-1208
- Yehia, H.¹ Takeda, K.² Itakura, F.³

41
- 0041007692
- An analysis of the acoustic-to-articulatory mapping during speech under morphological and continuity constraints
- in review
- Yehia, H., Takeda, K., Itakura, F., in review. An analysis of the acoustic-to-articulatory mapping during speech under morphological and continuity constraints. Speech Communication.
- Speech Communication
- Yehia, H.¹ Takeda, K.² Itakura, F.³

42
- 0040413488
- Quantitative association of orofacial and vocal-tract shapes
- Yehia, H.C., Rubin, P., Vatikiotis-Bateson, E., 1997. Quantitative association of orofacial and vocal-tract shapes. In: European Tutorial and Research Workshop on Audio-Visual Speech Processing: Computational and Cognitive Science Approaches.
- (1997) European Tutorial and Research Workshop on Audio-visual Speech Processing: Computational and Cognitive Science Approaches
- Yehia, H.C.¹ Rubin, P.² Vatikiotis-Bateson, E.³

43
- 0004138931
- Wiley, New York
- Zacks, S., 1971. The Theory of Statistical Inference. Wiley, New York.
- (1971) The Theory of Statistical Inference
- Zacks, S.¹

44
- 0028464701
- A new neural network for articulatory speech recognition and its application to vowel identification
- Zacks, S., Thomas, T.R., 1994. A new neural network for articulatory speech recognition and its application to vowel identification. Computer Speech and Language 8, 189-209.
- (1994) Computer Speech and Language , vol.8 , pp. 189-209
- Zacks, S.¹ Thomas, T.R.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.