메뉴 건너뛰기




Volumn 116, Issue 4 I, 2004, Pages 2354-2364

A neural network model of the articulatory-acoustic forward mapping trained on recordings of articulatory parameters

Author keywords

[No Author keywords available]

Indexed keywords

DATABASE SYSTEMS; INFORMATION ANALYSIS; MATHEMATICAL MODELS; SIGNAL PROCESSING; SPEECH;

EID: 6344254321     PISSN: 00014966     EISSN: None     Source Type: Journal    
DOI: 10.1121/1.1715112     Document Type: Article
Times cited : (60)

References (39)
  • 1
    • 0036656541 scopus 로고    scopus 로고
    • Three-dimensional linear articulatory modeling of tongue, lips, and face, based on MRI and video images
    • Badin, P., Bailly, G., Reveret, L., Baciu, M., Segebarth, C., and Savariaux, C. (2002). "Three-dimensional linear articulatory modeling of tongue, lips, and face, based on MRI and video images," J. Phonetics 30, 533-553.
    • (2002) J. Phonetics , vol.30 , pp. 533-553
    • Badin, P.1    Bailly, G.2    Reveret, L.3    Baciu, M.4    Segebarth, C.5    Savariaux, C.6
  • 2
    • 0025739174 scopus 로고
    • Analysis of vocal-tract shape and dimensions using magnetic-resonance- imaging-vowels
    • Baer, T., Gore, J. C., Gracco, L. C., and Nye, P. W. (1991). "Analysis of vocal-tract shape and dimensions using magnetic-resonance- imaging-vowels," J. Acoust. Soc. Am. 90, 799-828.
    • (1991) J. Acoust. Soc. Am. , vol.90 , pp. 799-828
    • Baer, T.1    Gore, J.C.2    Gracco, L.C.3    Nye, P.W.4
  • 3
    • 0031198820 scopus 로고    scopus 로고
    • Learning to speak. Sensori-motor control of speech movements
    • Bailly, G. (1997). "Learning to speak. Sensori-motor control of speech movements," Speech Commun. 22, 251-267.
    • (1997) Speech Commun. , vol.22 , pp. 251-267
    • Bailly, G.1
  • 4
    • 0035025894 scopus 로고    scopus 로고
    • Linear degrees of freedom in speech production: Analysis of cineradio- and labio-film data and articulatory-acoustic modeling
    • Beautemps, D., Badin, P., and Bailly, G. (2001). "Linear degrees of freedom in speech production: Analysis of cineradio- and labio-film data and articulatory-acoustic modeling," J. Acoust. Soc. Am. 109, 2165-2180.
    • (2001) J. Acoust. Soc. Am. , vol.109 , pp. 2165-2180
    • Beautemps, D.1    Badin, P.2    Bailly, G.3
  • 5
    • 0029230714 scopus 로고
    • Deriving vocal-tract area functions from midsagittal profiles and formant frequencies - A new model for vowels and fricative consonants based on experimental-data
    • Beautemps, D., Badin, P., and Laboissiere, R. (1995). "Deriving vocal-tract area functions from midsagittal profiles and formant frequencies - A new model for vowels and fricative consonants based on experimental-data," Speech Commun. 16, 27-47.
    • (1995) Speech Commun. , vol.16 , pp. 27-47
    • Beautemps, D.1    Badin, P.2    Laboissiere, R.3
  • 8
    • 0034092076 scopus 로고    scopus 로고
    • A self-learning predictive model of articulator movements during speech production
    • Blackburn, C. S., and Young, S. J. (2000). "A self-learning predictive model of articulator movements during speech production," J. Acoust. Soc. Am. 107, 1659-1670.
    • (2000) J. Acoust. Soc. Am. , vol.107 , pp. 1659-1670
    • Blackburn, C.S.1    Young, S.J.2
  • 9
    • 0035412933 scopus 로고    scopus 로고
    • Enhanced speech recognition using an articulatory production model trained on X-ray data
    • Blackburn, C. S., and Young, S. (2000a). "Enhanced speech recognition using an articulatory production model trained on X-ray data," Comput. Speech Lang. 15, 195-215.
    • (2000) Comput. Speech Lang. , vol.15 , pp. 195-215
    • Blackburn, C.S.1    Young, S.2
  • 10
    • 0034092076 scopus 로고    scopus 로고
    • A self-learning predictive model of articulator movements during speech production
    • Blackburn, C. S., and Young, S. (2000b). "A self-learning predictive model of articulator movements during speech production," J. Acoust. Soc. Am. 107, 1659-1670.
    • (2000) J. Acoust. Soc. Am. , vol.107 , pp. 1659-1670
    • Blackburn, C.S.1    Young, S.2
  • 11
    • 0036497601 scopus 로고    scopus 로고
    • A comparison of spectral smoothing methods for segment concatenation based speech synthesis
    • Chappell, D. T., and Hansen, J. H. L. (2002). "A comparison of spectral smoothing methods for segment concatenation based speech synthesis," Speech Commun. 36, 343-374.
    • (2002) Speech Commun. , vol.36 , pp. 343-374
    • Chappell, D.T.1    Hansen, J.H.L.2
  • 12
    • 0024861871 scopus 로고
    • Approximation by superpositions of a sigmoid function
    • Cybenko, G. (1989). "Approximation by superpositions of a sigmoid function," Math. Control, Signals, Syst. 2, 303-314.
    • (1989) Math. Control, Signals, Syst. , vol.2 , pp. 303-314
    • Cybenko, G.1
  • 13
    • 0034226802 scopus 로고    scopus 로고
    • Incorporating lip protrusion and larynx lowering into a time domain model for articulatory speech synthesis
    • Goodyear, C. C. (2000). "Incorporating lip protrusion and larynx lowering into a time domain model for articulatory speech synthesis," Comput. Speech Lang. 14, 211-226.
    • (2000) Comput. Speech Lang. , vol.14 , pp. 211-226
    • Goodyear, C.C.1
  • 15
    • 0028719136 scopus 로고
    • A neural-network model of speech acquisition and motor equivalent speech production
    • Guenther, F. H. (1994). "A neural-network model of speech acquisition and motor equivalent speech production," Biol. Cybern. 72, 43-53.
    • (1994) Biol. Cybern. , vol.72 , pp. 43-53
    • Guenther, F.H.1
  • 16
    • 0029338245 scopus 로고
    • Speech sound acquisition, coarticulation, and rate effects in a neural-network model of speech production
    • Guenther, F. H. (1995). "Speech sound acquisition, coarticulation, and rate effects in a neural-network model of speech production," Psychol. Rev. 102, 594-621.
    • (1995) Psychol. Rev. , vol.102 , pp. 594-621
    • Guenther, F.H.1
  • 17
    • 0032192891 scopus 로고    scopus 로고
    • A theoretical investigation of reference frames for the planning of speech movements
    • Guenther, F. H., Hampson, M., and Johnson, D. (1998). "A theoretical investigation of reference frames for the planning of speech movements," Psychol. Rev. 105, 611-633.
    • (1998) Psychol. Rev. , vol.105 , pp. 611-633
    • Guenther, F.H.1    Hampson, M.2    Johnson, D.3
  • 18
    • 0035348355 scopus 로고    scopus 로고
    • Functional anatomy of speech perception and speech production: Psycholinguistic implications
    • Hickok, G. (2001). "Functional anatomy of speech perception and speech production: Psycholinguistic implications," J. Psycholinguist. Res. 30, 225-235.
    • (2001) J. Psycholinguist. Res. , vol.30 , pp. 225-235
    • Hickok, G.1
  • 19
    • 0028996871 scopus 로고
    • Noise estimation techniques for robust speech recognition
    • Paper presented
    • Hirsch, H. G., and Ehrlicher, C. (1995). "Noise estimation techniques for robust speech recognition," Paper presented at the Proc. ICASSP.
    • (1995) Proc. ICASSP
    • Hirsch, H.G.1    Ehrlicher, C.2
  • 20
    • 0024137490 scopus 로고
    • Increased rates of convergence through learning rate adaptation
    • Jacobs, R. A. (1988). "Increased rates of convergence through learning rate adaptation," Neural Networks 1, 295-307.
    • (1988) Neural Networks , vol.1 , pp. 295-307
    • Jacobs, R.A.1
  • 22
    • 44049116478 scopus 로고
    • Forward models - Supervised learning with a distal teacher
    • Jordan, M. I., and Rumelhart, D. E. (1992). "Forward models - Supervised learning with a distal teacher," Cogn. Sci. 16, 307-354.
    • (1992) Cogn. Sci. , vol.16 , pp. 307-354
    • Jordan, M.I.1    Rumelhart, D.E.2
  • 24
    • 0034940788 scopus 로고    scopus 로고
    • Dynamic articulatory model based on multidimensional invariant-feature task representation
    • Kaburagi, T., and Honda, M. (2001). "Dynamic articulatory model based on multidimensional invariant-feature task representation," J. Acoust. Soc. Am. 110, 441-452.
    • (2001) J. Acoust. Soc. Am. , vol.110 , pp. 441-452
    • Kaburagi, T.1    Honda, M.2
  • 25
    • 0002023092 scopus 로고
    • Speech Synthesis
    • Proceedings of the Fourth International Congress of Acoustics, paper G42, 1-4, edited by J. L. Flanagan and L. R. Rabiner (Dowden, Hutchinson & Ross, Stroudsburg, PA)
    • Kelly, J. L., and Lochbaum, C. C. (1962). "Speech Synthesis," Proceedings of the Fourth International Congress of Acoustics, paper G42, 1-4, in Speech Synthesis, edited by J. L. Flanagan and L. R. Rabiner (Dowden, Hutchinson & Ross, Stroudsburg, PA), pp. 127-130.
    • (1962) Speech Synthesis , pp. 127-130
    • Kelly, J.L.1    Lochbaum, C.C.2
  • 27
    • 0015613574 scopus 로고
    • Articulatory model for the study of speech production
    • Mermelstein, P. (1973). "Articulatory model for the study of speech production," J. Acoust. Soc. Am. 53, 1070-1082.
    • (1973) J. Acoust. Soc. Am. , vol.53 , pp. 1070-1082
    • Mermelstein, P.1
  • 29
    • 0031200496 scopus 로고    scopus 로고
    • Speech motor control: Acoustic goals, saturation effects, auditory feedback, and internal models
    • Perkell, J. Matthies, M., Lane, H., Guenther, F., Wilhelms-Tricarico, R., Wozniak, J., et al. (1997). "Speech motor control: Acoustic goals, saturation effects, auditory feedback, and internal models," Speech Commun. 22, 227-250.
    • (1997) Speech Commun. , vol.22 , pp. 227-250
    • Perkell, J.1    Matthies, M.2    Lane, H.3    Guenther, F.4    Wilhelms-Tricarico, R.5    Wozniak, J.6
  • 30
    • 0000678652 scopus 로고    scopus 로고
    • A theory of speech motor control and supporting data from speakers with normal hearing and with profound hearing loss
    • Perkell, J. S., Guenther, F. H., Lane, H., Matthies, M. L., Perrier, P., Vick, J. et al. (2000). "A theory of speech motor control and supporting data from speakers with normal hearing and with profound hearing loss," J. Phonetics 28, 233-272.
    • (2000) J. Phonetics , vol.28 , pp. 233-272
    • Perkell, J.S.1    Guenther, F.H.2    Lane, H.3    Matthies, M.L.4    Perrier, P.5    Vick, J.6
  • 31
    • 0019606145 scopus 로고
    • Some current theoretical issues in speech perception
    • Pisoni, D. B. (1981). "Some current theoretical issues in speech perception," Cognition 10, 249-259.
    • (1981) Cognition , vol.10 , pp. 249-259
    • Pisoni, D.B.1
  • 32
    • 0002075963 scopus 로고    scopus 로고
    • The emergence of phonology from the interplay of speech comprehension and production: A distributed connectionist approach
    • edited by B. MacWhinney (Erlbaum, Mahweh, NJ)
    • Plaut, D. C., and Kello, C. T. (1999). "The emergence of phonology from the interplay of speech comprehension and production: A distributed connectionist approach," in The Emergence of Language, edited by B. MacWhinney (Erlbaum, Mahweh, NJ), pp. 381-415.
    • (1999) The Emergence of Language , pp. 381-415
    • Plaut, D.C.1    Kello, C.T.2
  • 33
    • 0029691655 scopus 로고    scopus 로고
    • Understanding normal and impaired word reading: Computational principles in quasi-regular domains
    • Plaut, D. C., McClelland, J. L., Seidenberg, M. S., and Patterson, K. (1996). "Understanding normal and impaired word reading: Computational principles in quasi-regular domains," Psychol. Rev. 103, 56-115.
    • (1996) Psychol. Rev. , vol.103 , pp. 56-115
    • Plaut, D.C.1    McClelland, J.L.2    Seidenberg, M.S.3    Patterson, K.4
  • 35
    • 0019606728 scopus 로고
    • An articulatory synthesizer for perceptual research
    • Rubin, P., Baer, T., and Mermelstein, P. (1981). "An articulatory synthesizer for perceptual research," J. Acoust. Soc. Am. 70, 321-328.
    • (1981) J. Acoust. Soc. Am. , vol.70 , pp. 321-328
    • Rubin, P.1    Baer, T.2    Mermelstein, P.3
  • 37
    • 0022471098 scopus 로고
    • Learning representations by back-propagating errors
    • Rumelhart, D. E., Hinton, G. E., and Williams, R. J. (1986). "Learning representations by back-propagating errors," Nature (London) 323, 533-536.
    • (1986) Nature (London) , vol.323 , pp. 533-536
    • Rumelhart, D.E.1    Hinton, G.E.2    Williams, R.J.3
  • 38
    • 85009123170 scopus 로고    scopus 로고
    • Estimation of voice source and vocal tract characteristics based on multi-frame analysis
    • Paper presented
    • Shiga, Y., and King, S. (2003). "Estimation of voice source and vocal tract characteristics based on multi-frame analysis." Paper presented at Eurospeech.
    • (2003) Eurospeech
    • Shiga, Y.1    King, S.2
  • 39
    • 0037503670 scopus 로고    scopus 로고
    • A multichannel articulatory speech database and its application for automatic speech recognition
    • Paper presented
    • Wrench, A., and Hardcastle, W. (2000). "A multichannel articulatory speech database and its application for automatic speech recognition," Paper presented at the Proceedings of the 5th Seminar on Speech Production.
    • (2000) Proceedings of the 5th Seminar on Speech Production
    • Wrench, A.1    Hardcastle, W.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.