SCOPUS 정보 검색 플랫폼

Journal of the Acoustical Society of America

Volumn 116, Issue 4 I, 2004, Pages 2354-2364

A neural network model of the articulatory-acoustic forward mapping trained on recordings of articulatory parameters

(2) Kello, Christopher T a Plaut, David C b

a GEORGE MASON UNIVERSITY (United States)

b CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

DATABASE SYSTEMS; INFORMATION ANALYSIS; MATHEMATICAL MODELS; SIGNAL PROCESSING; SPEECH;

ARTICULATORY SYNTHESIZERS; PHONETIC INFORMATION; SPEECH DATABASE;

NEURAL NETWORKS;

ACOUSTICS; ARTICLE; ARTIFICIAL NEURAL NETWORK; CONTROLLED STUDY; FEMALE; HUMAN; HUMAN EXPERIMENT; INTELLIGENCE TEST; MOLECULAR MODEL; NORMAL HUMAN; PHONETICS; PRIORITY JOURNAL; SPEECH; SPEECH ARTICULATION; SYNTHESIS;

ADULT; COMPUTER SIMULATION; HUMANS; MODELS, NEUROLOGICAL; NEURAL NETWORKS (COMPUTER); PHONATION; SOUND SPECTROGRAPHY; SPEECH; SPEECH ACOUSTICS; SPEECH INTELLIGIBILITY; SPEECH PRODUCTION MEASUREMENT;

DATA; MATHEMATICAL MODELS; NEURAL NETWORKS;

EID: 6344254321 PISSN: 00014966 EISSN: None Source Type: Journal
DOI: 10.1121/1.1715112 Document Type: Article

Times cited : (60)

References (39)

1
- 0036656541
- Three-dimensional linear articulatory modeling of tongue, lips, and face, based on MRI and video images
- Badin, P., Bailly, G., Reveret, L., Baciu, M., Segebarth, C., and Savariaux, C. (2002). "Three-dimensional linear articulatory modeling of tongue, lips, and face, based on MRI and video images," J. Phonetics 30, 533-553.
- (2002) J. Phonetics , vol.30 , pp. 533-553
- Badin, P.¹ Bailly, G.² Reveret, L.³ Baciu, M.⁴ Segebarth, C.⁵ Savariaux, C.⁶

2
- 0025739174
- Analysis of vocal-tract shape and dimensions using magnetic-resonance- imaging-vowels
- Baer, T., Gore, J. C., Gracco, L. C., and Nye, P. W. (1991). "Analysis of vocal-tract shape and dimensions using magnetic-resonance- imaging-vowels," J. Acoust. Soc. Am. 90, 799-828.
- (1991) J. Acoust. Soc. Am. , vol.90 , pp. 799-828
- Baer, T.¹ Gore, J.C.² Gracco, L.C.³ Nye, P.W.⁴

3
- 0031198820
- Learning to speak. Sensori-motor control of speech movements
- Bailly, G. (1997). "Learning to speak. Sensori-motor control of speech movements," Speech Commun. 22, 251-267.
- (1997) Speech Commun. , vol.22 , pp. 251-267
- Bailly, G.¹

4
- 0035025894
- Linear degrees of freedom in speech production: Analysis of cineradio- and labio-film data and articulatory-acoustic modeling
- Beautemps, D., Badin, P., and Bailly, G. (2001). "Linear degrees of freedom in speech production: Analysis of cineradio- and labio-film data and articulatory-acoustic modeling," J. Acoust. Soc. Am. 109, 2165-2180.
- (2001) J. Acoust. Soc. Am. , vol.109 , pp. 2165-2180
- Beautemps, D.¹ Badin, P.² Bailly, G.³

5
- 0029230714
- Deriving vocal-tract area functions from midsagittal profiles and formant frequencies - A new model for vowels and fricative consonants based on experimental-data
- Beautemps, D., Badin, P., and Laboissiere, R. (1995). "Deriving vocal-tract area functions from midsagittal profiles and formant frequencies - A new model for vowels and fricative consonants based on experimental-data," Speech Commun. 16, 27-47.
- (1995) Speech Commun. , vol.16 , pp. 27-47
- Beautemps, D.¹ Badin, P.² Laboissiere, R.³

6
- 0003708584
- Academic, San Diego
- Bernhardt, B. H., and Stemberger, J. P. (1998). Handbook of Phonological Development: From the Perspective of Constraint-based Nonlinear Phonology (Academic, San Diego).
- (1998) Handbook of Phonological Development: From the Perspective of Constraint-based Nonlinear Phonology
- Bernhardt, B.H.¹ Stemberger, J.P.²

7
- 0034132106
- Speech perception without hearing
- Bernstein, L. E., Demorest, M. E., and Tucker, P. E. (2000). "Speech perception without hearing," Percept. Psychophys. 62, 233-252.
- (2000) Percept. Psychophys. , vol.62 , pp. 233-252
- Bernstein, L.E.¹ Demorest, M.E.² Tucker, P.E.³

8
- 0034092076
- A self-learning predictive model of articulator movements during speech production
- Blackburn, C. S., and Young, S. J. (2000). "A self-learning predictive model of articulator movements during speech production," J. Acoust. Soc. Am. 107, 1659-1670.
- (2000) J. Acoust. Soc. Am. , vol.107 , pp. 1659-1670
- Blackburn, C.S.¹ Young, S.J.²

9
- 0035412933
- Enhanced speech recognition using an articulatory production model trained on X-ray data
- Blackburn, C. S., and Young, S. (2000a). "Enhanced speech recognition using an articulatory production model trained on X-ray data," Comput. Speech Lang. 15, 195-215.
- (2000) Comput. Speech Lang. , vol.15 , pp. 195-215
- Blackburn, C.S.¹ Young, S.²

10
- 0034092076
- A self-learning predictive model of articulator movements during speech production
- Blackburn, C. S., and Young, S. (2000b). "A self-learning predictive model of articulator movements during speech production," J. Acoust. Soc. Am. 107, 1659-1670.
- (2000) J. Acoust. Soc. Am. , vol.107 , pp. 1659-1670
- Blackburn, C.S.¹ Young, S.²

11
- 0036497601
- A comparison of spectral smoothing methods for segment concatenation based speech synthesis
- Chappell, D. T., and Hansen, J. H. L. (2002). "A comparison of spectral smoothing methods for segment concatenation based speech synthesis," Speech Commun. 36, 343-374.
- (2002) Speech Commun. , vol.36 , pp. 343-374
- Chappell, D.T.¹ Hansen, J.H.L.²

12
- 0024861871
- Approximation by superpositions of a sigmoid function
- Cybenko, G. (1989). "Approximation by superpositions of a sigmoid function," Math. Control, Signals, Syst. 2, 303-314.
- (1989) Math. Control, Signals, Syst. , vol.2 , pp. 303-314
- Cybenko, G.¹

13
- 0034226802
- Incorporating lip protrusion and larynx lowering into a time domain model for articulatory speech synthesis
- Goodyear, C. C. (2000). "Incorporating lip protrusion and larynx lowering into a time domain model for articulatory speech synthesis," Comput. Speech Lang. 14, 211-226.
- (2000) Comput. Speech Lang. , vol.14 , pp. 211-226
- Goodyear, C.C.¹

14
- 0027000966
- Measurements of vocal-tract shapes using magnetic-resonance-imaging
- Greenwood, A. R., Goodyear, C. C., and Martin, P. A. (1992). "Measurements of vocal-tract shapes using magnetic-resonance-imaging," IEEE Proc.-I: Commun. Speech Vision 139, 553-560.
- (1992) IEEE Proc.-I: Commun. Speech Vision , vol.139 , pp. 553-560
- Greenwood, A.R.¹ Goodyear, C.C.² Martin, P.A.³

15
- 0028719136
- A neural-network model of speech acquisition and motor equivalent speech production
- Guenther, F. H. (1994). "A neural-network model of speech acquisition and motor equivalent speech production," Biol. Cybern. 72, 43-53.
- (1994) Biol. Cybern. , vol.72 , pp. 43-53
- Guenther, F.H.¹

16
- 0029338245
- Speech sound acquisition, coarticulation, and rate effects in a neural-network model of speech production
- Guenther, F. H. (1995). "Speech sound acquisition, coarticulation, and rate effects in a neural-network model of speech production," Psychol. Rev. 102, 594-621.
- (1995) Psychol. Rev. , vol.102 , pp. 594-621
- Guenther, F.H.¹

17
- 0032192891
- A theoretical investigation of reference frames for the planning of speech movements
- Guenther, F. H., Hampson, M., and Johnson, D. (1998). "A theoretical investigation of reference frames for the planning of speech movements," Psychol. Rev. 105, 611-633.
- (1998) Psychol. Rev. , vol.105 , pp. 611-633
- Guenther, F.H.¹ Hampson, M.² Johnson, D.³

18
- 0035348355
- Functional anatomy of speech perception and speech production: Psycholinguistic implications
- Hickok, G. (2001). "Functional anatomy of speech perception and speech production: Psycholinguistic implications," J. Psycholinguist. Res. 30, 225-235.
- (2001) J. Psycholinguist. Res. , vol.30 , pp. 225-235
- Hickok, G.¹

19
- 0028996871
- Noise estimation techniques for robust speech recognition
- Paper presented
- Hirsch, H. G., and Ehrlicher, C. (1995). "Noise estimation techniques for robust speech recognition," Paper presented at the Proc. ICASSP.
- (1995) Proc. ICASSP
- Hirsch, H.G.¹ Ehrlicher, C.²

20
- 0024137490
- Increased rates of convergence through learning rate adaptation
- Jacobs, R. A. (1988). "Increased rates of convergence through learning rate adaptation," Neural Networks 1, 295-307.
- (1988) Neural Networks , vol.1 , pp. 295-307
- Jacobs, R.A.¹

21
- 0036874551
- On the relationship between face movements, tongue movements, and speech acoustics
- Jiang, J. T., Alwan, A., Keating, P. A., Auer, E. T., and Bernstein, L. E. (2002). "On the relationship between face movements, tongue movements, and speech acoustics," Eurasip J. Appl. Signal Process. 2002, 1174-1188.
- (2002) Eurasip J. Appl. Signal Process. , vol.2002 , pp. 1174-1188
- Jiang, J.T.¹ Alwan, A.² Keating, P.A.³ Auer, E.T.⁴ Bernstein, L.E.⁵

22
- 44049116478
- Forward models - Supervised learning with a distal teacher
- Jordan, M. I., and Rumelhart, D. E. (1992). "Forward models - Supervised learning with a distal teacher," Cogn. Sci. 16, 307-354.
- (1992) Cogn. Sci. , vol.16 , pp. 307-354
- Jordan, M.I.¹ Rumelhart, D.E.²

23
- 0003914808
- MIT, Cambridge
- Jusczyk, P. W. (1997). The Discovery of Spoken Language (MIT, Cambridge).
- (1997) The Discovery of Spoken Language
- Jusczyk, P.W.¹

24
- 0034940788
- Dynamic articulatory model based on multidimensional invariant-feature task representation
- Kaburagi, T., and Honda, M. (2001). "Dynamic articulatory model based on multidimensional invariant-feature task representation," J. Acoust. Soc. Am. 110, 441-452.
- (2001) J. Acoust. Soc. Am. , vol.110 , pp. 441-452
- Kaburagi, T.¹ Honda, M.²

25
- 0002023092
- Speech Synthesis
- Proceedings of the Fourth International Congress of Acoustics, paper G42, 1-4, edited by J. L. Flanagan and L. R. Rabiner (Dowden, Hutchinson & Ross, Stroudsburg, PA)
- Kelly, J. L., and Lochbaum, C. C. (1962). "Speech Synthesis," Proceedings of the Fourth International Congress of Acoustics, paper G42, 1-4, in Speech Synthesis, edited by J. L. Flanagan and L. R. Rabiner (Dowden, Hutchinson & Ross, Stroudsburg, PA), pp. 127-130.
- (1962) Speech Synthesis , pp. 127-130
- Kelly, J.L.¹ Lochbaum, C.C.²

26
- 0004145667
- Harcourt Brace, Orlando, FL
- Ladefoged, P. (1993). A Course in Phonetics (Harcourt Brace, Orlando, FL).
- (1993) A Course in Phonetics
- Ladefoged, P.¹

27
- 0015613574
- Articulatory model for the study of speech production
- Mermelstein, P. (1973). "Articulatory model for the study of speech production," J. Acoust. Soc. Am. 53, 1070-1082.
- (1973) J. Acoust. Soc. Am. , vol.53 , pp. 1070-1082
- Mermelstein, P.¹

28
- 0037584177
- Lawrence Erlbaum, Hillsdale, NJ
- Perkell, J., and Klatt, D. (Eds.). (1986). Invariance and Variability in Speech Processes. (Lawrence Erlbaum, Hillsdale, NJ).
- (1986) Invariance and Variability in Speech Processes
- Perkell, J.¹ Klatt, D.²

29
- 0031200496
- Speech motor control: Acoustic goals, saturation effects, auditory feedback, and internal models
- Perkell, J. Matthies, M., Lane, H., Guenther, F., Wilhelms-Tricarico, R., Wozniak, J., et al. (1997). "Speech motor control: Acoustic goals, saturation effects, auditory feedback, and internal models," Speech Commun. 22, 227-250.
- (1997) Speech Commun. , vol.22 , pp. 227-250
- Perkell, J.¹ Matthies, M.² Lane, H.³ Guenther, F.⁴ Wilhelms-Tricarico, R.⁵ Wozniak, J.⁶

30
- 0000678652
- A theory of speech motor control and supporting data from speakers with normal hearing and with profound hearing loss
- Perkell, J. S., Guenther, F. H., Lane, H., Matthies, M. L., Perrier, P., Vick, J. et al. (2000). "A theory of speech motor control and supporting data from speakers with normal hearing and with profound hearing loss," J. Phonetics 28, 233-272.
- (2000) J. Phonetics , vol.28 , pp. 233-272
- Perkell, J.S.¹ Guenther, F.H.² Lane, H.³ Matthies, M.L.⁴ Perrier, P.⁵ Vick, J.⁶

31
- 0019606145
- Some current theoretical issues in speech perception
- Pisoni, D. B. (1981). "Some current theoretical issues in speech perception," Cognition 10, 249-259.
- (1981) Cognition , vol.10 , pp. 249-259
- Pisoni, D.B.¹

32
- 0002075963
- The emergence of phonology from the interplay of speech comprehension and production: A distributed connectionist approach
- edited by B. MacWhinney (Erlbaum, Mahweh, NJ)
- Plaut, D. C., and Kello, C. T. (1999). "The emergence of phonology from the interplay of speech comprehension and production: A distributed connectionist approach," in The Emergence of Language, edited by B. MacWhinney (Erlbaum, Mahweh, NJ), pp. 381-415.
- (1999) The Emergence of Language , pp. 381-415
- Plaut, D.C.¹ Kello, C.T.²

33
- 0029691655
- Understanding normal and impaired word reading: Computational principles in quasi-regular domains
- Plaut, D. C., McClelland, J. L., Seidenberg, M. S., and Patterson, K. (1996). "Understanding normal and impaired word reading: Computational principles in quasi-regular domains," Psychol. Rev. 103, 56-115.
- (1996) Psychol. Rev. , vol.103 , pp. 56-115
- Plaut, D.C.¹ McClelland, J.L.² Seidenberg, M.S.³ Patterson, K.⁴

34
- 4243661296
- Ph.D. thesis, University of Toronto, Toronto
- Roweis, S. (1999). "Data driven production models for speech processing," Ph.D. thesis, University of Toronto, Toronto.
- (1999) Data Driven Production Models for Speech Processing
- Roweis, S.¹

35
- 0019606728
- An articulatory synthesizer for perceptual research
- Rubin, P., Baer, T., and Mermelstein, P. (1981). "An articulatory synthesizer for perceptual research," J. Acoust. Soc. Am. 70, 321-328.
- (1981) J. Acoust. Soc. Am. , vol.70 , pp. 321-328
- Rubin, P.¹ Baer, T.² Mermelstein, P.³

36
- 0002530434
- Backpropagation: The basic theory
- edited by Y. Chauvin and D. E. Rumelhart (Erlbaum, Hillsdale, NJ)
- Rumelhart, D. E., Durbin, R., Golden, R., and Chauvin, Y. (1995). "Backpropagation: The basic theory," in Backpropagation: Theory, Architectures, and Applications Developments in Connectionist Theory, edited by Y. Chauvin and D. E. Rumelhart (Erlbaum, Hillsdale, NJ), pp. 1-34.
- (1995) Backpropagation: Theory, Architectures, and Applications Developments in Connectionist Theory , pp. 1-34
- Rumelhart, D.E.¹ Durbin, R.² Golden, R.³ Chauvin, Y.⁴

37
- 0022471098
- Learning representations by back-propagating errors
- Rumelhart, D. E., Hinton, G. E., and Williams, R. J. (1986). "Learning representations by back-propagating errors," Nature (London) 323, 533-536.
- (1986) Nature (London) , vol.323 , pp. 533-536
- Rumelhart, D.E.¹ Hinton, G.E.² Williams, R.J.³

38
- 85009123170
- Estimation of voice source and vocal tract characteristics based on multi-frame analysis
- Paper presented
- Shiga, Y., and King, S. (2003). "Estimation of voice source and vocal tract characteristics based on multi-frame analysis." Paper presented at Eurospeech.
- (2003) Eurospeech
- Shiga, Y.¹ King, S.²

39
- 0037503670
- A multichannel articulatory speech database and its application for automatic speech recognition
- Paper presented
- Wrench, A., and Hardcastle, W. (2000). "A multichannel articulatory speech database and its application for automatic speech recognition," Paper presented at the Proceedings of the 5th Seminar on Speech Production.
- (2000) Proceedings of the 5th Seminar on Speech Production
- Wrench, A.¹ Hardcastle, W.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.