



Volume 26, Issue 1-2, 1998, Pages 23-43

Quantitative association of vocal-tract and facial behavior

Author keywords

Dynamic time warping (DTW); Facial motion; Line spectrum pair (LSP); Linear estimator; Principal component analysis; Singular value decomposition; Vocal tract motion

Indexed keywords

CORRELATION METHODS; SPEECH ANALYSIS

EID: 0032178592     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/S0167-6393(98)00048-X     Document Type: Article
Times cited : (378)

References (44)
  • 1
    • 0017968519
    • Inversion of articulatory-to-acoustic transformation in the vocal-tract by a computer-sorting technique
    • Atal, B.S., Chang, J.J., Tukey, J.W., 1978. Inversion of articulatory-to-acoustic transformation in the vocal-tract by a computer-sorting technique. J. Acoust. Soc. Amer. 63 (5), 1535-1555.
    • (1978) J. Acoust. Soc. Amer. , vol.63 , Issue.5 , pp. 1535-1555
    • Atal, B.S.1    Chang, J.J.2    Tukey, J.W.3
  • 2
    • 0001762548
    • Recovery of vocal tract geometry from formants for vowels and fricative consonants using a midsagittal-to-area function conversion model
    • Badin, P., Beautemps, D., Laboissiere, R., Schwartz, J.L., 1995. Recovery of vocal tract geometry from formants for vowels and fricative consonants using a midsagittal-to-area function conversion model. Journal of Phonetics 23, 221-229.
    • (1995) Journal of Phonetics , vol.23 , pp. 221-229
    • Badin, P.1    Beautemps, D.2    Laboissiere, R.3    Schwartz, J.L.4
  • 5
    • 0039821028
    • An unsupervised method for learning to track tongue position from an acoustic signal
    • Haskins Laboratories, New Haven, CT, USA
    • Hogden, J., 1993. An unsupervised method for learning to track tongue position from an acoustic signal. Status Report on Speech Research SR-115/116, Haskins Laboratories, New Haven, CT, USA.
    • (1993) Status Report on Speech Research SR-115/116
    • Hogden, J.1
  • 7
    • 0001810975
    • Line spectrum representation of linear predictive coefficients of speech signals
    • Itakura, F., 1975. Line spectrum representation of linear predictive coefficients of speech signals. J. Acoust. Soc. Amer. 57, 535.
    • (1975) J. Acoust. Soc. Amer. , vol.57 , pp. 535
    • Itakura, F.1
  • 8
    • 0027965617
    • Determination of sagittal tongue shape from the positions of points on the tongue surface
    • Kaburagi, T., Honda, M., 1994. Determination of sagittal tongue shape from the positions of points on the tongue surface. J. Acoust. Soc. Amer. 96, 1356-1366.
    • (1994) J. Acoust. Soc. Amer. , vol.96 , pp. 1356-1366
    • Kaburagi, T.1    Honda, M.2
  • 9
    • 0021451739
    • Converging evidence in support of common dynamic principles for speech and movement coordination
    • Kelso, J.A.S., Tuller, B., 1984. Converging evidence in support of common dynamic principles for speech and movement coordination. Amer. J. Physiol. 15, R928-R935.
    • (1984) Amer. J. Physiol. , vol.15
    • Kelso, J.A.S.1    Tuller, B.2
  • 11
    • 0020281396
    • A digital simulation method of the vocal-tract system
    • Maeda, S., 1982. A digital simulation method of the vocal-tract system. Speech Communication 1 (3,4), 199-229.
    • (1982) Speech Communication , vol.1 , Issue.3-4 , pp. 199-229
    • Maeda, S.1
  • 12
    • 0028375762
    • Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary model tests
    • McGowan, R.S., 1994. Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary model tests. Speech Communication 14, 19-48.
    • (1994) Speech Communication , vol.14 , pp. 19-48
    • McGowan, R.S.1
  • 13
    • 0014092133
    • Determination of vocal-tract shape from measured formant frequencies
    • Mermelstein, P., 1967. Determination of vocal-tract shape from measured formant frequencies. J. Acoust. Soc. Amer. 41 (5), 1283-1294.
    • (1967) J. Acoust. Soc. Amer. , vol.41 , Issue.5 , pp. 1283-1294
    • Mermelstein, P.1
  • 14
    • 0015613574
    • Articulatory model for the study of speech production
    • Mermelstein, P., 1973. Articulatory model for the study of speech production. J. Acoust. Soc. Amer. 53 (4), 1070-1082.
    • (1973) J. Acoust. Soc. Amer. , vol.53 , Issue.4 , pp. 1070-1082
    • Mermelstein, P.1
  • 15
    • 0022342333
    • An examination of intra-articulator relative timing
    • Munhall, K.G., 1985. An examination of intra-articulator relative timing. J. Acoust. Soc. Amer. 78, 1548-1553.
    • (1985) J. Acoust. Soc. Amer. , vol.78 , pp. 1548-1553
    • Munhall, K.G.1
  • 17
    • 0028288484
    • Control of jaw orientation and position in mastication and speech
    • Ostry, D.J., Munhall, K.G., 1994. Control of jaw orientation and position in mastication and speech. Journal of Neurophysiology 71, 1528-1545.
    • (1994) Journal of Neurophysiology , vol.71 , pp. 1528-1545
    • Ostry, D.J.1    Munhall, K.G.2
  • 20
    • 0019606728
    • An articulatory synthesizer for perceptual research
    • Rubin, P.E., Baer, T., Mermelstein, P., 1981. An articulatory synthesizer for perceptual research. J. Acoust. Soc. Amer. 70, 321-328.
    • (1981) J. Acoust. Soc. Amer. , vol.70 , pp. 321-328
    • Rubin, P.E.1    Baer, T.2    Mermelstein, P.3
  • 21
    • 77956779481
    • A dynamical approach to gestural patterning in speech production
    • Saltzman, E., Munhall, K.G., 1989. A dynamical approach to gestural patterning in speech production. Ecological Psychology 1, 333-382.
    • (1989) Ecological Psychology , vol.1 , pp. 333-382
    • Saltzman, E.1    Munhall, K.G.2
  • 22
    • 0014077928
    • Determination of the geometry of the human vocal-tract by acoustical measurements
    • Schroeder, M.R., 1967. Determination of the geometry of the human vocal-tract by acoustical measurements. J. Acoust. Soc. Amer. 41 (4), 1002-1010.
    • (1967) J. Acoust. Soc. Amer. , vol.41 , Issue.4 , pp. 1002-1010
    • Schroeder, M.R.1
  • 23
    • 0001736204
    • Speech coding based on physiological models of speech production
    • Sondhi, M.M., Furui, S. (Eds.), Marcel Dekker, New York
    • Schroeter, J., Sondhi, M., 1991. Speech coding based on physiological models of speech production. In: Sondhi, M.M., Furui, S. (Eds.), Advances in Speech Processing. Marcel Dekker, New York, pp. 231-268.
    • (1991) Advances in Speech Processing , pp. 231-268
    • Schroeter, J.1    Sondhi, M.2
  • 24
    • 0028259480
    • Techniques for estimating vocal-tract shapes from the speech signal
    • Schroeter, J., Sondhi, M.M., 1994. Techniques for estimating vocal-tract shapes from the speech signal. IEEE Trans. Speech Audio Process. 2 (1), 133-150.
    • (1994) IEEE Trans. Speech Audio Process , vol.2 , Issue.1 , pp. 133-150
    • Schroeter, J.1    Sondhi, M.M.2
  • 25
    • 0007558272
    • Articulatory synthesis
    • Hardcastle, W.J., Marchal, A. (Eds.), Kluwer Academic Publishers, Dordrecht
    • Scully, C., 1990. Articulatory synthesis. In: Hardcastle, W.J., Marchal, A. (Eds.), Speech Production and Speech Modelling. Kluwer Academic Publishers, Dordrecht, pp. 151-186.
    • (1990) Speech Production and Speech Modelling , pp. 151-186
    • Scully, C.1
  • 26
    • 0039229118
    • Estimation and generation of articulatory motion using neural networks
    • Shirai, K., 1993. Estimation and generation of articulatory motion using neural networks. Speech Communication 13, 45-51.
    • (1993) Speech Communication , vol.13 , pp. 45-51
    • Shirai, K.1
  • 27
    • 0023165217
    • A hybrid time-frequency domain articulatory speech synthesizer
    • Sondhi, M.M., Schroeter, J., 1987. A hybrid time-frequency domain articulatory speech synthesizer. IEEE Trans. Acoust. Speech Signal Process. 35 (7), 955-967.
    • (1987) IEEE Trans. Acoust. Speech Signal Process , vol.35 , Issue.7 , pp. 955-967
    • Sondhi, M.M.1    Schroeter, J.2
  • 28
    • 84955022381
    • Development of a quantitative description of vowel articulation
    • Stevens, K., House, A., 1955. Development of a quantitative description of vowel articulation. J. Acoust. Soc. Amer. 27 (3), 484-493.
    • (1955) J. Acoust. Soc. Amer. , vol.27 , Issue.3 , pp. 484-493
    • Stevens, K.1    House, A.2
  • 29
    • 0025316435
    • A three-dimensional model of tongue movement based on ultrasound and X-ray microbeam data
    • Stone, M., 1990. A three-dimensional model of tongue movement based on ultrasound and X-ray microbeam data. J. Acoust. Soc. Amer. 87 (5), 2207-2217.
    • (1990) J. Acoust. Soc. Amer. , vol.87 , Issue.5 , pp. 2207-2217
    • Stone, M.1
  • 30
    • 0001390793
    • Speech analysis and synthesis methods developed at ECL in NTT - From LPC to LSP
    • Sugamura, N., Itakura, F., 1986. Speech analysis and synthesis methods developed at ECL in NTT - From LPC to LSP -. Speech Communication 5, 199-215.
    • (1986) Speech Communication , vol.5 , pp. 199-215
    • Sugamura, N.1    Itakura, F.2
  • 32
    • 0000078906
    • An analysis of the dimensionality of jaw motion in speech
    • Vatikiotis-Bateson, E., Ostry, D.J., 1995. An analysis of the dimensionality of jaw motion in speech. Journal of Phonetics 23, 101-117.
    • (1995) Journal of Phonetics , vol.23 , pp. 101-117
    • Vatikiotis-Bateson, E.1    Ostry, D.J.2
  • 36
    • 0010605203
    • The dynamics of audiovisual behavior in speech
    • Stork, D., Hennecke, M. (Eds.), NATO-ASI Series, Series F, Computers and Systems Sciences. Springer, Berlin
    • Vatikiotis-Bateson, E., Munhall, K.G., Hirayama, M., Lee, Y.C., Terzopoulos, D., 1996b. The dynamics of audiovisual behavior in speech. In: Stork, D., Hennecke, M. (Eds.), Speech Reading by Humans and Machines, Vol. 150, NATO-ASI Series, Series F, Computers and Systems Sciences. Springer, Berlin, pp. 221-232.
    • (1996) Speech Reading by Humans and Machines , vol.150 , pp. 221-232
    • Vatikiotis-Bateson, E.1    Munhall, K.G.2    Hirayama, M.3    Lee, Y.C.4    Terzopoulos, D.5
  • 38
    • 0038594488
    • Determination of human vocal-tract dynamic geometry from formant trajectories using spatial and temporal Fourier analysis
    • Yehia, H., Itakura, F., 1994. Determination of human vocal-tract dynamic geometry from formant trajectories using spatial and temporal Fourier analysis. In: Proc. IEEE International Conference on Acoustics, Speech and Signal Processing, pp. 477-480.
    • (1994) Proc. IEEE International Conference on Acoustics, Speech and Signal Processing , pp. 477-480
    • Yehia, H.1    Itakura, F.2
  • 39
    • 0030121505
    • A method to combine acoustical and morphological constraints in the speech production inverse problem
    • Yehia, H., Itakura, F., 1996. A method to combine acoustical and morphological constraints in the speech production inverse problem. Speech Communication 18 (2), 151-174.
    • (1996) Speech Communication , vol.18 , Issue.2 , pp. 151-174
    • Yehia, H.1    Itakura, F.2
  • 41
    • 0041007692
    • An analysis of the acoustic-to-articulatory mapping during speech under morphological and continuity constraints
    • in review
    • Yehia, H., Takeda, K., Itakura, F., in review. An analysis of the acoustic-to-articulatory mapping during speech under morphological and continuity constraints. Speech Communication.
    • Speech Communication
    • Yehia, H.1    Takeda, K.2    Itakura, F.3
  • 44
    • 0028464701
    • A new neural network for articulatory speech recognition and its application to vowel identification
    • Zacks, S., Thomas, T.R., 1994. A new neural network for articulatory speech recognition and its application to vowel identification. Computer Speech and Language 8, 189-209.
    • (1994) Computer Speech and Language , vol.8 , pp. 189-209
    • Zacks, S.1    Thomas, T.R.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.