메뉴 건너뛰기




Volumn 49, Issue 5, 2007, Pages 361-383

Inverting mappings from smooth paths through Rn to paths through Rm: A technique applied to recovering articulation from acoustics

Author keywords

Channel normalization; Dimensionality reduction; Speech inverse problem

Indexed keywords

ACOUSTICS; CONFORMAL MAPPING; INVERSE PROBLEMS; SIGNAL ANALYSIS; SPEECH RECOGNITION; TIME SERIES ANALYSIS;

EID: 34247647975     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2007.02.008     Document Type: Article
Times cited : (12)

References (101)
  • 1
    • 0025536990 scopus 로고
    • Harmonic and intermodulation distortion in carbon microphones
    • Abuelma'atti M. Harmonic and intermodulation distortion in carbon microphones. Appl. Acoust. 31 (1990) 233-243
    • (1990) Appl. Acoust. , vol.31 , pp. 233-243
    • Abuelma'atti, M.1
  • 2
    • 0004120747 scopus 로고
    • Rota G.-C. (Ed), Cambridge University Press, Cambridge
    • Aczel J., and Dhombres J. In: Rota G.-C. (Ed). Functional Equations in Several Variables with Applications to Mathematics, Information Theory, and the Natural and Social Sciences. Encyclopedia of Mathematics and its Applications Series Vol. 31 (1989), Cambridge University Press, Cambridge
    • (1989) Encyclopedia of Mathematics and its Applications Series , vol.31
    • Aczel, J.1    Dhombres, J.2
  • 3
    • 0025225150 scopus 로고
    • Competitive learning algorithms for vector quantization
    • Ahalt S., Krishnamurthy A., Chen P., and Melton D. Competitive learning algorithms for vector quantization. Neural Networks 3 (1990) 277-290
    • (1990) Neural Networks , vol.3 , pp. 277-290
    • Ahalt, S.1    Krishnamurthy, A.2    Chen, P.3    Melton, D.4
  • 4
    • 0017968519 scopus 로고
    • Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique
    • Atal B.S., Chang J.J., Mathews M.V., and Tukey J.W. Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique. J. Acoust. Soc. Amer. 63 5 (1978) 1535-1555
    • (1978) J. Acoust. Soc. Amer. , vol.63 , Issue.5 , pp. 1535-1555
    • Atal, B.S.1    Chang, J.J.2    Mathews, M.V.3    Tukey, J.W.4
  • 5
    • 0001762548 scopus 로고
    • Recovery of vocal tract geometry from formants for vowels and fricative consonants using a midsagittal-to-area function conversion method
    • Badin P., Beautemps D., Laboissière R., and Schwartz J.-L. Recovery of vocal tract geometry from formants for vowels and fricative consonants using a midsagittal-to-area function conversion method. J. Phonetics 23 (1995) 221-229
    • (1995) J. Phonetics , vol.23 , pp. 221-229
    • Badin, P.1    Beautemps, D.2    Laboissière, R.3    Schwartz, J.-L.4
  • 6
    • 84892157236 scopus 로고    scopus 로고
    • Non-parametric estimation and correction of non-linear distortion in speech systems
    • Balchandran R., and Mammone R. Non-parametric estimation and correction of non-linear distortion in speech systems. Proc. Internat. Conf. Acoust. Speech Signal Process. Vol. 2 (1998) 749-752
    • (1998) Proc. Internat. Conf. Acoust. Speech Signal Process. , vol.2 , pp. 749-752
    • Balchandran, R.1    Mammone, R.2
  • 7
    • 0035025894 scopus 로고    scopus 로고
    • Linear degrees of freedom in speech production: analysis of cineradio- and labio-film data and articulatory-acoustic modeling
    • Beautemps D., Badin P., and Bailly G. Linear degrees of freedom in speech production: analysis of cineradio- and labio-film data and articulatory-acoustic modeling. J. Acoust. Soc. Amer. 109 5 (2001) 2165-2180
    • (2001) J. Acoust. Soc. Amer. , vol.109 , Issue.5 , pp. 2165-2180
    • Beautemps, D.1    Badin, P.2    Bailly, G.3
  • 10
    • 0035412933 scopus 로고    scopus 로고
    • Enhanced speech recognition using an articulatory production model trained on X-ray data
    • Blackburn C.S., and Young S. Enhanced speech recognition using an articulatory production model trained on X-ray data. Comput. Speech Language 15 (2001) 195-215
    • (2001) Comput. Speech Language , vol.15 , pp. 195-215
    • Blackburn, C.S.1    Young, S.2
  • 11
    • 0015307807 scopus 로고
    • Speech perception under conditions of spectral transformation: I. Phonetic characteristics
    • Blesser B. Speech perception under conditions of spectral transformation: I. Phonetic characteristics. J. Speech Hear. Res. 15 (1972) 5-41
    • (1972) J. Speech Hear. Res. , vol.15 , pp. 5-41
    • Blesser, B.1
  • 12
    • 85048901870 scopus 로고
    • The geometric vocal tract variables controlled for vowel production: proposals for constraining acoustic-to-articulatory conversion
    • Boe L.J., Perrier P., and Bailly G. The geometric vocal tract variables controlled for vowel production: proposals for constraining acoustic-to-articulatory conversion. J. Phonetics 20 (1992) 27-38
    • (1992) J. Phonetics , vol.20 , pp. 27-38
    • Boe, L.J.1    Perrier, P.2    Bailly, G.3
  • 13
    • 34247561112 scopus 로고    scopus 로고
    • Carreira-Perpinan, M.A., 2001. Continuous latent variable models for dimensionality reduction and sequential data reconstruction. Unpublished Ph.D. Dissertation, University of Sheffield, Sheffield, UK.
  • 14
    • 0016940126 scopus 로고
    • A model of articulatory dynamics and control
    • Coker C. A model of articulatory dynamics and control. Proc. IEEE 64 4 (1976) 452-460
    • (1976) Proc. IEEE , vol.64 , Issue.4 , pp. 452-460
    • Coker, C.1
  • 15
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm (with discussion)
    • Dempster A., Laird N., and Rubin D. Maximum likelihood from incomplete data via the EM algorithm (with discussion). J. Roy. Statist. Soc. Ser. B 39 (1977) 1-38
    • (1977) J. Roy. Statist. Soc. Ser. B , vol.39 , pp. 1-38
    • Dempster, A.1    Laird, N.2    Rubin, D.3
  • 16
    • 0032119268 scopus 로고    scopus 로고
    • A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition
    • Deng L. A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition. Speech Comm. 24 (1998) 299-323
    • (1998) Speech Comm. , vol.24 , pp. 299-323
    • Deng, L.1
  • 17
    • 0031198059 scopus 로고    scopus 로고
    • Production models as a structural basis for automatic speech recognition
    • Deng L., Ramsay G., and Sun D. Production models as a structural basis for automatic speech recognition. Speech Comm. 22 (1997) 93-111
    • (1997) Speech Comm. , vol.22 , pp. 93-111
    • Deng, L.1    Ramsay, G.2    Sun, D.3
  • 19
    • 34247556885 scopus 로고    scopus 로고
    • Dusan, S., Deng, L., 2000. Acoustic-to-articulator inversion using dynamical and phonological constraints. In: Proc. 5th Seminar on Speech Production, pp. 237-240.
  • 20
    • 0032216898 scopus 로고    scopus 로고
    • The geometry of algorithms with orthogonality constraints
    • Edelman A., Arias T., and Smith S. The geometry of algorithms with orthogonality constraints. SIAM J. Matrix Anal. Appl. 20 2 (1998) 303-353
    • (1998) SIAM J. Matrix Anal. Appl. , vol.20 , Issue.2 , pp. 303-353
    • Edelman, A.1    Arias, T.2    Smith, S.3
  • 23
    • 0019261519 scopus 로고
    • Immediate compensation in bite-block speech
    • Fowler C., and Turvey M. Immediate compensation in bite-block speech. Phonetica 37 (1980) 306-326
    • (1980) Phonetica , vol.37 , pp. 306-326
    • Fowler, C.1    Turvey, M.2
  • 24
    • 58849145971 scopus 로고    scopus 로고
    • ASR - articulatory speech recognition
    • Frankel J., and King S. ASR - articulatory speech recognition. Proc. Eurospeech (2001) 599-602
    • (2001) Proc. Eurospeech , pp. 599-602
    • Frankel, J.1    King, S.2
  • 25
    • 34247573334 scopus 로고    scopus 로고
    • Frankel, J., King, S., 2001b. Speech recognition in the articulatory domain: Investigating an alternative to acoustic HMMs. In: Proc. Workshop for Innovations in Speech Processing.
  • 27
    • 0032192891 scopus 로고    scopus 로고
    • A theoretical investigation of reference frames for the planning of speech movements
    • Guenther F., Hampson M., and Johnson D. A theoretical investigation of reference frames for the planning of speech movements. Psychol. Rev. 105 4 (1998) 611-633
    • (1998) Psychol. Rev. , vol.105 , Issue.4 , pp. 611-633
    • Guenther, F.1    Hampson, M.2    Johnson, D.3
  • 28
    • 0027494671 scopus 로고
    • Pitch-synchronous frame-by-frame and segment-based articulatory analysis by synthesis
    • Gupta S., and Schroeter J. Pitch-synchronous frame-by-frame and segment-based articulatory analysis by synthesis. J. Acoust. Soc. Amer. 94 5 (1993) 2517-2530
    • (1993) J. Acoust. Soc. Amer. , vol.94 , Issue.5 , pp. 2517-2530
    • Gupta, S.1    Schroeter, J.2
  • 29
    • 34247633959 scopus 로고    scopus 로고
    • Hirayama, M., Vatikiotis-Bateson, E., Honda, K., Koike, Y., Kawato, M., 1992. Physiologically based speech synthesis. Poster presented at Neural Information Processing Systems (92).
  • 30
    • 2142659020 scopus 로고    scopus 로고
    • Estimation of articulatory movements from speech acoustics using an HMM-based speech production model
    • Hiroya S., and Honda M. Estimation of articulatory movements from speech acoustics using an HMM-based speech production model. IEEE Trans. Speech Audio Process. 12 2 (2004) 175-185
    • (2004) IEEE Trans. Speech Audio Process. , vol.12 , Issue.2 , pp. 175-185
    • Hiroya, S.1    Honda, M.2
  • 31
    • 34247647902 scopus 로고    scopus 로고
    • Hogden, J., 1991. Low-dimensional phoneme mapping using a continuity constraint. Unpublished Doctoral Dissertation, Stanford University, Stanford, CA.
  • 32
    • 34247613885 scopus 로고    scopus 로고
    • Hogden, J., 1995. A maximum likelihood approach to estimating speech articulator positions from speech acoustics. In: Neural Information Processing Systems (95), Workshop on Neural Networks for Speech Processing, Vail, CO.
  • 33
    • 34247633423 scopus 로고    scopus 로고
    • Hogden, J., 2000. USA Patent # 6,052,662.
  • 34
    • 34247640585 scopus 로고    scopus 로고
    • Hogden, J., Valdez, P., 2000. Bridging the gap between speech production and speech recognition. In: Proc. 5th Seminar on Speech Production: Models and Data, Kloster Seeon, Bavaria, May 1st-4th.
  • 35
    • 34247616659 scopus 로고
    • An unsupervised method for learning to track tongue position from an acoustic signal
    • (A)
    • Hogden J., Rubin P., and Saltzman E. An unsupervised method for learning to track tongue position from an acoustic signal. J. Acoust. Soc. Amer. 91 4 (1992) 2443 (A)
    • (1992) J. Acoust. Soc. Amer. , vol.91 , Issue.4 , pp. 2443
    • Hogden, J.1    Rubin, P.2    Saltzman, E.3
  • 36
    • 34247566960 scopus 로고    scopus 로고
    • Hogden, J., Saltzman, E., Rubin, P., 1993. Tracking moving objects with unsupervised neural networks. Paper presented at the World Conference on Neural Networks, Portland, OR.
  • 37
    • 84937295891 scopus 로고    scopus 로고
    • An unsupervised method for learning to track tongue position from an acoustic signal
    • Hogden J., Rubin P., and Saltzman E. An unsupervised method for learning to track tongue position from an acoustic signal. Bull. Communication Parlee 3 (1996) 101-116
    • (1996) Bull. Communication Parlee , vol.3 , pp. 101-116
    • Hogden, J.1    Rubin, P.2    Saltzman, E.3
  • 38
  • 39
    • 34247592285 scopus 로고    scopus 로고
    • Hogden, J., Nix, D., Valdez, P., 1998. Maximum likelihood continuity mapping: bridging the gap between production and recognition. Paper presented at the 9th Hub-5 Conversational Speech Recognition Workshop, Linthicum Heights, MD, USA, September 24th-25th.
  • 40
  • 41
    • 0034940788 scopus 로고    scopus 로고
    • Dynamic articulatory model based on multidimensional invariant-feature task representation
    • Kaburagi T., and Honda M. Dynamic articulatory model based on multidimensional invariant-feature task representation. J. Acoust. Soc. Amer. 110 1 (2001) 441-452
    • (2001) J. Acoust. Soc. Amer. , vol.110 , Issue.1 , pp. 441-452
    • Kaburagi, T.1    Honda, M.2
  • 42
    • 0001724240 scopus 로고    scopus 로고
    • Dimension reduction by local PCA
    • Kambhatle N., and Leen T. Dimension reduction by local PCA. Neural Computation 9 (1997) 1-18
    • (1997) Neural Computation , vol.9 , pp. 1-18
    • Kambhatle, N.1    Leen, T.2
  • 43
    • 34247592812 scopus 로고    scopus 로고
    • Kimber, D., 1994. Geometric methods for nonparametric modeling of dynamical systems. Unpublished Ph.D. Dissertation, Stanford University, Stanford, CA.
  • 45
    • 0022222203 scopus 로고    scopus 로고
    • Kuc, R., Tutuer, F., Vaisnys, J.R., 1985. Determining vocal tract shape by applying dynamic constraints. In: Proc. Internat. Conf. on Acoustics Speech and Signal Processing, Tampa, FL.
  • 46
    • 0036091061 scopus 로고    scopus 로고
    • Representations of sound that are insensitive to spectral filtering and parameterization procedures
    • Levin D. Representations of sound that are insensitive to spectral filtering and parameterization procedures. J. Acoust. Soc. Amer. 111 5 (2002) 2257-2271
    • (2002) J. Acoust. Soc. Amer. , vol.111 , Issue.5 , pp. 2257-2271
    • Levin, D.1
  • 48
    • 0022143866 scopus 로고
    • The motor theory of speech perception revised
    • Liberman A., and Mattingly I. The motor theory of speech perception revised. Cognition 21 (1985) 1-36
    • (1985) Cognition , vol.21 , pp. 1-36
    • Liberman, A.1    Mattingly, I.2
  • 49
    • 0012778209 scopus 로고
    • Effects of differentiation, integration, and infinite peak clipping upon the intelligibility of speech
    • Licklider J., and Pollack I. Effects of differentiation, integration, and infinite peak clipping upon the intelligibility of speech. J. Acoust. Soc. Amer. 20 (1948) 42-51
    • (1948) J. Acoust. Soc. Amer. , vol.20 , pp. 42-51
    • Licklider, J.1    Pollack, I.2
  • 50
    • 0029916854 scopus 로고    scopus 로고
    • Role of articulation in speech perception: clues from production
    • Lindblom B. Role of articulation in speech perception: clues from production. J. Acoust. Soc. Amer. 99 3 (1996) 1683-1692
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.3 , pp. 1683-1692
    • Lindblom, B.1
  • 51
    • 0002539638 scopus 로고
    • Formant frequencies of some fixed-mandible vowels and a model of speech motor programming by predictive simulation
    • Lindblom B., Lubker J., and Gay T. Formant frequencies of some fixed-mandible vowels and a model of speech motor programming by predictive simulation. J. Phonetics 7 (1979) 146-161
    • (1979) J. Phonetics , vol.7 , pp. 146-161
    • Lindblom, B.1    Lubker, J.2    Gay, T.3
  • 53
    • 34247634965 scopus 로고
    • An articulatory model of the tongue based on a statistical analysis
    • Maeda S. An articulatory model of the tongue based on a statistical analysis. J. Acoust. Soc. Amer. 65 (1979) S22
    • (1979) J. Acoust. Soc. Amer. , vol.65
    • Maeda, S.1
  • 55
    • 0035534411 scopus 로고    scopus 로고
    • How many subspaces force linearity?
    • Maxson C., and Meyer J. How many subspaces force linearity?. Amer. Math. Monthly 108 6 (2001) 531-536
    • (2001) Amer. Math. Monthly , vol.108 , Issue.6 , pp. 531-536
    • Maxson, C.1    Meyer, J.2
  • 57
    • 34247619212 scopus 로고    scopus 로고
    • McGowan, R., 1987. Articulatory synthesis: numerical solution of a hyperbolic differential equation. Haskins Laboratories Status Report on Speech Research, SR-89/90.
  • 58
    • 0029867514 scopus 로고    scopus 로고
    • Introduction to papers on speech recognition and perception from an articulatory view
    • McGowan R., and Faber A. Introduction to papers on speech recognition and perception from an articulatory view. J. Acoust. Soc. Amer. 99 3 (1996) 1680-1682
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.3 , pp. 1680-1682
    • McGowan, R.1    Faber, A.2
  • 59
    • 0030045846 scopus 로고    scopus 로고
    • Task dynamic and articulatory recovery of lip and velar approximations under model mismatch conditions
    • McGowan R., and Lee M. Task dynamic and articulatory recovery of lip and velar approximations under model mismatch conditions. J. Acoust. Soc. Amer. 99 1 (1996) 595-608
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.1 , pp. 595-608
    • McGowan, R.1    Lee, M.2
  • 60
    • 0015613574 scopus 로고
    • Articulatory model for the study of speech production
    • Mermelstein P. Articulatory model for the study of speech production. J. Acoust. Soc. Amer. 53 4 (1973) 1070-1082
    • (1973) J. Acoust. Soc. Amer. , vol.53 , Issue.4 , pp. 1070-1082
    • Mermelstein, P.1
  • 61
    • 34247622969 scopus 로고    scopus 로고
    • Moody, J., 1999. Visualizing speech with a recurrent neural network trained on human acoustic-articulator data. Unpublished Ph.D. Dissertation, University of California, San Diego, CA.
  • 62
    • 0034854661 scopus 로고    scopus 로고
    • Morris, R., Clements, M., 2001. Maximum-likelihood compensation of zero-memory nonlinearities in speech signals. In: Proc. Internat. Conf. on Acoustics, Speech, and Signal Processing, pp. 289-292.
  • 63
    • 12444325470 scopus 로고
    • Perioral biomechanics and its relation to labial motor control
    • Muller E., and McLeod G. Perioral biomechanics and its relation to labial motor control. J. Acoust. Soc. Amer. 78 Suppl. 1 (1982) S38
    • (1982) J. Acoust. Soc. Amer. , vol.78 , Issue.SUPPL. 1
    • Muller, E.1    McLeod, G.2
  • 65
    • 34247589068 scopus 로고    scopus 로고
    • Nix, D., 1998. Machine learning methods for inferring vocal-tract articulation from speech acoustics. Unpublished Ph.D. Thesis, University of Colorado, Boulder, CO.
  • 66
    • 0014469991 scopus 로고
    • Speech analysis-synthesis system based on homomorphic filtering
    • Oppenheim A. Speech analysis-synthesis system based on homomorphic filtering. J. Acoust. Soc. Amer. 45 2 (1969) 458-465
    • (1969) J. Acoust. Soc. Amer. , vol.45 , Issue.2 , pp. 458-465
    • Oppenheim, A.1
  • 67
    • 22144465830 scopus 로고    scopus 로고
    • Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion
    • Ouni S., and Laprie Y. Modeling the articulatory space using a hypercube codebook for acoustic-to-articulatory inversion. J. Acoust. Soc. Amer. 118 1 (2005) 444-460
    • (2005) J. Acoust. Soc. Amer. , vol.118 , Issue.1 , pp. 444-460
    • Ouni, S.1    Laprie, Y.2
  • 68
    • 13544259544 scopus 로고    scopus 로고
    • On the usefulness of STFT phase spectrum in human listening tests
    • Paliwal K., and Alsteris L. On the usefulness of STFT phase spectrum in human listening tests. Speech Comm. 45 (2005) 153-170
    • (2005) Speech Comm. , vol.45 , pp. 153-170
    • Paliwal, K.1    Alsteris, L.2
  • 69
    • 0026675669 scopus 로고
    • Inferring articulation and recognizing gestures from acoustics with a neural network trained on X-ray microbeam data
    • Papcun G., Hotchberg J., Thomas T., Laroche F., Zacks J., and Levy S. Inferring articulation and recognizing gestures from acoustics with a neural network trained on X-ray microbeam data. J. Acoust. Soc. Amer. 92 2 (1992) 688-700
    • (1992) J. Acoust. Soc. Amer. , vol.92 , Issue.2 , pp. 688-700
    • Papcun, G.1    Hotchberg, J.2    Thomas, T.3    Laroche, F.4    Zacks, J.5    Levy, S.6
  • 70
    • 0026491198 scopus 로고
    • Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements
    • Perkell J., Cohen M., Svirsky M., Matthies M., Garabieta I., and Jackson M. Electromagnetic midsagittal articulometer systems for transducing speech articulatory movements. J. Acoust. Soc. Amer. 92 6 (1992) 3078-3096
    • (1992) J. Acoust. Soc. Amer. , vol.92 , Issue.6 , pp. 3078-3096
    • Perkell, J.1    Cohen, M.2    Svirsky, M.3    Matthies, M.4    Garabieta, I.5    Jackson, M.6
  • 71
    • 0027221510 scopus 로고
    • Trading relations between tongue-body raising and lip rounding in production of the vowel /u/: A pilot 'motor equivalence' study
    • Perkell J., Mathies M., Svirsky M., and Jordan M. Trading relations between tongue-body raising and lip rounding in production of the vowel /u/: A pilot 'motor equivalence' study. J. Acoust. Soc. Amer. 93 5 (1993) 2948-2961
    • (1993) J. Acoust. Soc. Amer. , vol.93 , Issue.5 , pp. 2948-2961
    • Perkell, J.1    Mathies, M.2    Svirsky, M.3    Jordan, M.4
  • 73
    • 0031188967 scopus 로고    scopus 로고
    • A GCD method for blind channel identification
    • Qiu W., and Hua Y. A GCD method for blind channel identification. Digital Signal Processing 7 (1997) 199-205
    • (1997) Digital Signal Processing , vol.7 , pp. 199-205
    • Qiu, W.1    Hua, Y.2
  • 74
    • 0034274733 scopus 로고    scopus 로고
    • Estimation of handset nonlinearity with application to speaker recognition
    • Quatieri T.F., Reynolds D.A., and O'Leary G.C. Estimation of handset nonlinearity with application to speaker recognition. IEEE Trans. Speech Audio Process. 8 5 (2000) 567-584
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.5 , pp. 567-584
    • Quatieri, T.F.1    Reynolds, D.A.2    O'Leary, G.C.3
  • 75
    • 0019508260 scopus 로고
    • Speech perception without traditional speech cues
    • Remez R.E., Rubin P.E., Pisoni D.B., and Carrell T.D. Speech perception without traditional speech cues. Science 212 (1981) 947-950
    • (1981) Science , vol.212 , pp. 947-950
    • Remez, R.E.1    Rubin, P.E.2    Pisoni, D.B.3    Carrell, T.D.4
  • 76
    • 0029725601 scopus 로고    scopus 로고
    • Reynolds, D., 1996. The effects of handset variability on speaker recognition performance: experiments on the Switchboard corpus. In: Proc. IEEE Internat. Conf. on Acoustics, Speech, and Signal Processing, pp. 113-116.
  • 78
    • 34247571284 scopus 로고    scopus 로고
    • Dynamic constraint weighting in the context of articulatory parameter estimation
    • Richards H., Bridle J., Hunt M., and Mason J. Dynamic constraint weighting in the context of articulatory parameter estimation. Proc. Eurospeech 97 (1997)
    • (1997) Proc. Eurospeech , vol.97
    • Richards, H.1    Bridle, J.2    Hunt, M.3    Mason, J.4
  • 79
    • 0030008004 scopus 로고    scopus 로고
    • The potential role of speech production models in automatic speech recognition
    • Rose R., Schroeter J., and Sondhi M. The potential role of speech production models in automatic speech recognition. J. Acoust. Soc. Amer. 99 3 (1996) 1699-1709
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.3 , pp. 1699-1709
    • Rose, R.1    Schroeter, J.2    Sondhi, M.3
  • 80
    • 84890640150 scopus 로고    scopus 로고
    • The primacy of multimodal speech perception
    • Pisoni D., and Remez R. (Eds), Blackwell, Malden, MA
    • Rosenblum L.D. The primacy of multimodal speech perception. In: Pisoni D., and Remez R. (Eds). Handbook of Speech Perception (2005), Blackwell, Malden, MA 51-78
    • (2005) Handbook of Speech Perception , pp. 51-78
    • Rosenblum, L.D.1
  • 81
    • 34247620153 scopus 로고    scopus 로고
    • Roweis, S., 1999. Data driven production models for speech processing. Unpublished Ph.D. Thesis, California Institute of Technology, Pasadena, CA.
  • 82
    • 34247607091 scopus 로고    scopus 로고
    • Towards articulatory speech recognition: learning smooth maps to recover articulator information
    • Roweis S., and Alwan A. Towards articulatory speech recognition: learning smooth maps to recover articulator information. Proc. Eurospeech 3 (1997) 1227-1230
    • (1997) Proc. Eurospeech , vol.3 , pp. 1227-1230
    • Roweis, S.1    Alwan, A.2
  • 83
    • 0034704222 scopus 로고    scopus 로고
    • Nonlinear dimensionality reduction by locally linear embedding
    • Roweis S., and Saul L. Nonlinear dimensionality reduction by locally linear embedding. Science 290 (2000) 2323-2326
    • (2000) Science , vol.290 , pp. 2323-2326
    • Roweis, S.1    Saul, L.2
  • 84
    • 0019606728 scopus 로고
    • An articulatory synthesizer for perceptual research
    • Rubin P., Baer T., and Mermelstein P. An articulatory synthesizer for perceptual research. J. Acoust. Soc. Amer. 70 2 (1981) 321-328
    • (1981) J. Acoust. Soc. Amer. , vol.70 , Issue.2 , pp. 321-328
    • Rubin, P.1    Baer, T.2    Mermelstein, P.3
  • 85
    • 0033614397 scopus 로고    scopus 로고
    • Cognitive restoration of reversed speech
    • Saberi, and Perrott. Cognitive restoration of reversed speech. Nature 398 (1999) 760
    • (1999) Nature , vol.398 , pp. 760
    • Saberi1    Perrott2
  • 86
    • 77956779481 scopus 로고
    • A dynamical approach to gestural patterning in speech production
    • Saltzman E., and Munhall K. A dynamical approach to gestural patterning in speech production. Ecol. Psychol. 1 4 (1989) 333-382
    • (1989) Ecol. Psychol. , vol.1 , Issue.4 , pp. 333-382
    • Saltzman, E.1    Munhall, K.2
  • 87
    • 0014077928 scopus 로고
    • Determination of the geometry of the human vocal tract by acoustic measurements
    • Schroeder M. Determination of the geometry of the human vocal tract by acoustic measurements. J. Acoust. Soc. Amer. 41 4 (1967) 1002-1010
    • (1967) J. Acoust. Soc. Amer. , vol.41 , Issue.4 , pp. 1002-1010
    • Schroeder, M.1
  • 88
    • 0028259480 scopus 로고
    • Techniques for estimating vocal-tract shapes from the speech signal
    • Schroeter J., and Sondhi M. Techniques for estimating vocal-tract shapes from the speech signal. IEEE Trans. Speech Audio Process. 2 1 (1994) 133-150
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.1 , pp. 133-150
    • Schroeter, J.1    Sondhi, M.2
  • 89
    • 34247572820 scopus 로고
    • Model prediction and real speech: fricative dynamics
    • Lindblom B., and Ohman S. (Eds), Academic Press, New York
    • Scully C. Model prediction and real speech: fricative dynamics. In: Lindblom B., and Ohman S. (Eds). Frontiers of Speech Comm. Research (1979), Academic Press, New York 35-48
    • (1979) Frontiers of Speech Comm. Research , pp. 35-48
    • Scully, C.1
  • 90
    • 0008499181 scopus 로고
    • Estimating articulatory motion from speech wave
    • Shirai K., and Kobayashi T. Estimating articulatory motion from speech wave. Speech Comm. 5 (1986) 159-170
    • (1986) Speech Comm. , vol.5 , pp. 159-170
    • Shirai, K.1    Kobayashi, T.2
  • 91
    • 0020722409 scopus 로고
    • The inverse problem for the vocal tract: numerical methods, acoustical experiments, and speech synthesis
    • Sondhi M., and Resnick J.R. The inverse problem for the vocal tract: numerical methods, acoustical experiments, and speech synthesis. J. Acoust. Soc. Amer. 73 3 (1983) 985-1002
    • (1983) J. Acoust. Soc. Amer. , vol.73 , Issue.3 , pp. 985-1002
    • Sondhi, M.1    Resnick, J.R.2
  • 92
    • 0030205050 scopus 로고    scopus 로고
    • Articulatory-to-acoustic mapping for inverse problem
    • Sorokin V., and Trushkin A.V. Articulatory-to-acoustic mapping for inverse problem. Speech Comm. 19 (1996) 105-118
    • (1996) Speech Comm. , vol.19 , pp. 105-118
    • Sorokin, V.1    Trushkin, A.V.2
  • 95
    • 34247638880 scopus 로고    scopus 로고
    • Suzuki, S., Okadome, T., Honda, M., 1998. Determination of articulatory positions from speech acoustics by applying dynamic articulatory constraints. In: Proc. Internat. Conf. on Spoken Language Perception, pp. 2251-2254.
  • 96
    • 0034704229 scopus 로고    scopus 로고
    • A global geometric framework for nonlinear dimensionality reduction
    • Tenenbaum J., Silva V.d., and Langford J. A global geometric framework for nonlinear dimensionality reduction. Science 290 (2000) 2319-2323
    • (2000) Science , vol.290 , pp. 2319-2323
    • Tenenbaum, J.1    Silva, V.d.2    Langford, J.3
  • 97
    • 34247562150 scopus 로고    scopus 로고
    • Tsimbinos, J., 1995. Identification and compensation of nonlinear distortion. Unpublished Ph.D. Dissertation, University of South Australia, The Levels, Australia.
  • 98
    • 0015677434 scopus 로고
    • Direct estimation of the vocal tract shape by inverse filtering of acoustic speech waveforms
    • Wakita H. Direct estimation of the vocal tract shape by inverse filtering of acoustic speech waveforms. IEEE Trans. Audio Electroacoust. AU-21 5 (1973) 417-427
    • (1973) IEEE Trans. Audio Electroacoust. , vol.AU-21 , Issue.5 , pp. 417-427
    • Wakita, H.1
  • 100
    • 0032178592 scopus 로고    scopus 로고
    • Quantitative association of vocal-tract and facial behavior
    • Yehia H., Rubin P., and Vatikiotis-Bateson E. Quantitative association of vocal-tract and facial behavior. Speech Comm. 22 1-2 (1998) 23-43
    • (1998) Speech Comm. , vol.22 , Issue.1-2 , pp. 23-43
    • Yehia, H.1    Rubin, P.2    Vatikiotis-Bateson, E.3
  • 101
    • 33744637086 scopus 로고
    • Adding articulatory features to acoustic features for automatic speech recognition
    • (A)
    • Zlokarnik I. Adding articulatory features to acoustic features for automatic speech recognition. J. Acoust. Soc. Amer. 97 5 pt. 2 (1995) 3246 (A)
    • (1995) J. Acoust. Soc. Amer. , vol.97 , Issue.5 PART 2 , pp. 3246
    • Zlokarnik, I.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.