메뉴 건너뛰기




Volumn 20, Issue 10, 2012, Pages 2672-2682

Exploring the predictability of non-unique acoustic-to-articulatory mappings

Author keywords

Acoustic to articulatory inversion; entropy of GMM (Gaussian mixture model); many to one mapping

Indexed keywords

ACOUSTIC VECTORS; ACOUSTIC-TO-ARTICULATORY INVERSION; ACOUSTIC-TO-ARTICULATORY MAPPING; ARTICULATORY DATA; CONDITIONAL ENTROPY; ELECTROMAGNETIC ARTICULOGRAPHY; GAUSSIAN MIXTURE MODEL; INVERSE MAPPING; MANY-TO-ONE-MAPPING; PROBABILISTIC ESTIMATES; SPEECH ACOUSTICS; STATISTICAL TOOLS; UPPER BOUND;

EID: 84867169172     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2210876     Document Type: Article
Times cited : (8)

References (28)
  • 1
    • 0014092133 scopus 로고
    • Determination of the Vocal-Tract shape from measured formant frequencies
    • P. Mermelstein, "Determination of the Vocal-Tract shape from measured formant frequencies," J. Acoust. Soc. Amer., vol. 41, no. 5, pp. 1283-1294, 1967.
    • (1967) J. Acoust. Soc. Amer. , vol.41 , Issue.5 , pp. 1283-1294
    • Mermelstein, P.1
  • 2
    • 0014077928 scopus 로고
    • Determination of the geometry of the human vocal tract by acoustic measurements
    • M. R. Schroeder, "Determination of the geometry of the human vocal tract by acoustic measurements," J. Acoust. Soc. Amer., vol. 41, no. 4B, pp. 1002-1010, 1967.
    • (1967) J. Acoust. Soc. Amer. , vol.41 , Issue.4 B , pp. 1002-1010
    • Schroeder, M.R.1
  • 3
    • 0017968519 scopus 로고
    • Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique
    • B. S. Atal, J. J. Chang, M. V. Mathews, and J. W. Tukey, "Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique," J. Acoust. Soc. Amer., vol. 63, no. 5, pp. 1535-1555, 1978.
    • (1978) J. Acoust. Soc. Amer. , vol.63 , Issue.5 , pp. 1535-1555
    • Atal, B.S.1    Chang, J.J.2    Mathews, M.V.3    Tukey, J.W.4
  • 4
    • 0002539638 scopus 로고
    • Formant frequencies of some fixed-mandible vowels and a model of speech motor programming by predictive simulation
    • B. Lindblom, J. Lubker, and T. Gay, "Formant frequencies of some fixed-mandible vowels and a model of speech motor programming by predictive simulation," J. Phonetics, vol. 7, pp. 147-161, 1979.
    • (1979) J. Phonetics , vol.7 , pp. 147-161
    • Lindblom, B.1    Lubker, J.2    Gay, T.3
  • 5
    • 0000352807 scopus 로고
    • Trade-offs in tongue, jaw, and palate contributions to speech production
    • M. Stone and E. Vatikiotis-Bateson, "Trade-offs in tongue, jaw, and palate contributions to speech production," J. Phonetics, vol. 23, no. 1-2, pp. 81-100, 1995.
    • (1995) J. Phonetics , vol.23 , Issue.1-2 , pp. 81-100
    • Stone, M.1    Vatikiotis-Bateson, E.2
  • 6
    • 84867186671 scopus 로고    scopus 로고
    • Tongue-jaw trade-offs and naturally occurring perturbation
    • C. Kroos, A. Geumann, and P. Hoole, "Tongue-jaw trade-offs and naturally occurring perturbation," J. Acoust. Soc. Amer., vol. 105, pp. 1355-1355, 1999.
    • (1999) J. Acoust. Soc. Amer. , vol.105 , pp. 1355-1355
    • Kroos, C.1    Geumann, A.2    Hoole, P.3
  • 8
    • 18744382512 scopus 로고    scopus 로고
    • A modeling investigation of articulatory variability and acoustic stability during American English production
    • A. Nieto-Castanon, F. H. Guenther, J. S. Perkell, and H. D. Curtin, "A modeling investigation of articulatory variability and acoustic stability during American English production," J. Acoust. Soc. Amer., vol. 117, no. 5, pp. 3196-3212, 2005.
    • (2005) J. Acoust. Soc. Amer. , vol.117 , Issue.5 , pp. 3196-3212
    • Nieto-Castanon, A.1    Guenther, F.H.2    Perkell, J.S.3    Curtin, H.D.4
  • 9
    • 51449098747 scopus 로고    scopus 로고
    • An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping
    • C. Qin and M. Á. Carreira-Perpiñán, "An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping," in Proc. Interspeech, 2007, pp. 74-77.
    • (2007) Proc. Interspeech , pp. 74-77
    • Qin, C.1    Carreira-Perpiñán, M.Á.2
  • 11
    • 77953853877 scopus 로고    scopus 로고
    • The geometry of the articulatory region that produces a speech sound
    • Pacific Grove, CA
    • C. Qin and M. Carreira-Perpiñán, "The geometry of the articulatory region that produces a speech sound," in Proc. Asilomar Conf. Signals, Syst. Comput., Pacific Grove, CA, 2009, pp. 1742-1746.
    • (2009) Proc. Asilomar Conf. Signals, Syst. Comput. , pp. 1742-1746
    • Qin, C.1    Carreira-Perpiñán, M.2
  • 12
    • 70450192172 scopus 로고    scopus 로고
    • In search of nonuniqueness in the acoustic-to-articulatory mapping
    • Brighton, U.K.
    • G. Ananthakrishnan, D. Neiberg, and O. Engwall, "In search of nonuniqueness in the acoustic-to-articulatory mapping," in Proc. Interspeech, Brighton, U.K., 2009, pp. 2799-2802.
    • (2009) Proc. Interspeech , pp. 2799-2802
    • Ananthakrishnan, G.1    Neiberg, D.2    Engwall, O.3
  • 13
    • 67349084720 scopus 로고    scopus 로고
    • Maximum entropy autoregressive conditional heteroskedasticity model
    • S. Park and A. Bera, "Maximum entropy autoregressive conditional heteroskedasticity model," J. Econometrics, vol. 150, no. 2, pp. 219-230, 2009.
    • (2009) J. Econometrics , vol.150 , Issue.2 , pp. 219-230
    • Park, S.1    Bera, A.2
  • 14
    • 38649140222 scopus 로고    scopus 로고
    • Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
    • T. Toda, A. W. Black, and K. Tokuda, "Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model," Speech Commun., vol. 50, pp. 215-227, 2008.
    • (2008) Speech Commun. , vol.50 , pp. 215-227
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 16
    • 36849078684 scopus 로고    scopus 로고
    • Kullback-Leibler approach to Gaussian mixture reduction
    • A. Runnalls, "Kullback-Leibler approach to Gaussian mixture reduction," IEEE Trans. Aerosp. Electron. Syst., vol. 43, no. 3, pp. 989-999, 2007.
    • (2007) IEEE Trans. Aerosp. Electron. Syst. , vol.43 , Issue.3 , pp. 989-999
    • Runnalls, A.1
  • 17
    • 84863732214 scopus 로고    scopus 로고
    • M.S. thesis, The Graduate School of the Univ. of Florida, Gainesville
    • E. S.Youn, "Feature selection in support vector machines," M.S. thesis, The Graduate School of the Univ. of Florida, Gainesville, 2002.
    • (2002) Feature Selection in Support Vector Machines
    • Youn, E.S.1
  • 19
    • 38649140222 scopus 로고    scopus 로고
    • Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model
    • T. Toda, A. W. Black, and K. Tokuda, "Statistical mapping between articulatory movements and acoustic spectrum using a Gaussian mixture model," Speech Commun., vol. 50, no. 3, pp. 215-227, 2008.
    • (2008) Speech Commun. , vol.50 , Issue.3 , pp. 215-227
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 22
    • 0003857778 scopus 로고    scopus 로고
    • A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models
    • Tech. Rep
    • J. A. Bilmes, "A gentle tutorial of the em algorithm and its application to parameter estimation for Gaussian mixture and hidden Markov models," Int. Comput. Sci. Inst., 1998, Tech. Rep.
    • (1998) Int. Comput. Sci. Inst.
    • Bilmes, J.A.1
  • 23
    • 33747060681 scopus 로고    scopus 로고
    • Coordination of lingual and mandibular gestures for different manners of articulation
    • C. Mooshammer, A. Geumann, P. Hoole, P. Alfonso, P. van Lieshout, and S. Fucks, "Coordination of lingual and mandibular gestures for different manners of articulation," in Proc. ICPhS, 2003, pp. 81-84.
    • (2003) Proc. ICPhS , pp. 81-84
    • Mooshammer, C.1    Geumann, A.2    Hoole, P.3    Alfonso, P.4    Van Lieshout, P.5    Fucks, S.6
  • 24
    • 33745183789 scopus 로고    scopus 로고
    • Oldenburg logatome speech corpus (OLLO) for speech recognition experiments with humans and machines
    • Lisbon, Portugal
    • T. Wesker, B. Meyer, K. Wagener, J. Anemüller, A. Mertins, and B. Kollmeier, "Oldenburg logatome speech corpus (OLLO) for speech recognition experiments with humans and machines," in Proc. Interspeech, Lisbon, Portugal, 2005, pp. 1273-1276.
    • (2005) Proc. Interspeech , pp. 1273-1276
    • Wesker, T.1    Meyer, B.2    Wagener, K.3    Anemüller, J.4    Mertins, A.5    Kollmeier, B.6
  • 28
    • 0002378392 scopus 로고
    • The quantal nature of speech: Evidence from articulatory-acoustic data
    • E. E. David and P. B. Denes, Eds. New York: McGraw-Hill
    • K. N. Stevens, "The quantal nature of speech: Evidence from articulatory-acoustic data," in Human Communication: A Unified View, E. E. David and P. B. Denes, Eds. New York: McGraw-Hill, 1972, pp. 51-66.
    • (1972) Human Communication: A Unified View , pp. 51-66
    • Stevens, K.N.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.