SCOPUS 정보 검색 플랫폼

IEEE Journal on Selected Topics in Signal Processing

Volumn 4, Issue 6, 2010, Pages 1027-1045

Retrieving tract variables from acoustics: A comparison of different machine learning strategies

(5) Mitra, Vikramjit a Nam, Hosung b Espy Wilson, Carol Y a Saltzman, Elliot b,c Goldstein, Louis b,d

a UNIVERSITY OF MARYLAND (United States)

b HASKINS LABORATORIES (United States)

c BOSTON UNIVERSITY (United States)

d UNIVERSITY OF SOUTHERN CALIFORNIA (United States)

Author keywords

Articulatory phonology; articulatory speech recognition (ASR); artificial neural networks (ANNs); coarticulation; distal supervised learning; mixture density networks; speech inversion; task dynamic and applications model; vocal tract variables

Indexed keywords

ARTICULATORY PHONOLOGY; ARTICULATORY SPEECH RECOGNITION (ASR); ARTIFICIAL NEURAL NETWORKS; CO-ARTICULATION; DISTAL SUPERVISED LEARNING; MIXTURE DENSITY NETWORKS; SPEECH INVERSION; TASK DYNAMIC AND APPLICATIONS MODEL; VOCAL-TRACTS;

FEEDFORWARD NEURAL NETWORKS; MIXTURES; SUPERVISED LEARNING;

SPEECH RECOGNITION;

EID: 78649390043 PISSN: 19324553 EISSN: None Source Type: Journal
DOI: 10.1109/JSTSP.2010.2076013 Document Type: Article

Times cited : (49)

References (123)

1
- 0017968519
- Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer sorting technique
- B. S. Atal, J. J. Chang, M. V. Mathews, and J. W. Tukey, "Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer sorting technique,"J. Acoust. Soc. Amer., vol. 63, pp. 1535-1555, 1978.
- (1978) J. Acoust. Soc. Amer. , vol.63 , pp. 1535-1555
- Atal, B.S.¹ Chang, J.J.² Mathews, M.V.³ Tukey, J.W.⁴

2
- 0020602364
- Efficient coding of LPC parameters by temporal decomposition
- B. S. Atal, "Efficient coding of LPC parameters by temporal decomposition," in Proc. ICASSP, 1983, pp. 81-84.
- (1983) Proc. ICASSP , pp. 81-84
- Atal, B.S.¹

3
- 0004113976
- Mixture density networks
- Dept., Comput. Sci., Aston Univ., Birmingham, U.K., Tech. Rep. NCRG/4288
- C. Bishop, "Mixture density networks," Neural Computing Research Group, Dept., Comput. Sci., Aston Univ., Birmingham, U.K., Tech. Rep. NCRG/4288.
- Neural Computing Research Group
- Bishop, C.¹

4
- 0000523613
- Towards an articulatory phonology
- C. P. Browman and L. Goldstein, "Towards an articulatory phonology," Phonol. Yearbook, vol. 85, pp. 219-252, 1986.
- (1986) Phonol. Yearbook , vol.85 , pp. 219-252
- Browman, C.P.¹ Goldstein, L.²

5
- 0024150474
- Some notes on syllable structure in articulatory phonology
- C. P. Browman and L. Goldstein, "Some notes on syllable structure in articulatory phonology," Phonetica, vol. 45, pp. 140-155, 1988.
- (1988) Phonetica , vol.45 , pp. 140-155
- Browman, C.P.¹ Goldstein, L.²

6
- 84971737266
- Articulatory gestures as phonological units
- C. P. Browman and L. Goldstein, "Articulatory gestures as phonological units," Phonol., vol. 6, pp. 201-251, 1989.
- (1989) Phonol. , vol.6 , pp. 201-251
- Browman, C.P.¹ Goldstein, L.²

7
- 84955535347
- Gestural specification using dynamically-defined articulatory structures
- C. P. Browman and L. Goldstein, "Gestural specification using dynamically-defined articulatory structures," J. Phonetics, vol. 18, no. 3, pp. 299-320, 1990.
- (1990) J. Phonetics , vol.18 , Issue.3 , pp. 299-320
- Browman, C.P.¹ Goldstein, L.²

8
- 0006080506
- Representation and reality: Physical systems and phonological structure
- C. P. Browman and L. Goldstein, "Representation and reality: Physical systems and phonological structure," J. Phonetics, vol. 18, pp. 411-424, 1990.
- (1990) J. Phonetics , vol.18 , pp. 411-424
- Browman, C.P.¹ Goldstein, L.²

9
- 0001577222
- Tiers in articulatory phonology, with some implications for casual speech
- J. Kingston and M. E. Beckman, Eds. Cambridge, U.K.: Cambridge Univ. Press
- C. P.Browman and L. Goldstein, "Tiers in articulatory phonology, with some implications for casual speech," in Papers in Lab. Phon. I: Between the Grammar and the Physics of Speech, J. Kingston and M. E. Beckman, Eds. Cambridge, U.K.: Cambridge Univ. Press, 1991, pp. 341-376.
- (1991) Papers in Lab. Phon. I: Between the Grammar and the Physics of Speech , pp. 341-376
- Browman, C.P.¹ Goldstein, L.²

10
- 0027024362
- Articulatory phonology: An overview
- C. P. Browman and L. Goldstein, "Articulatory phonology: An overview," Phonetica, vol. 49, pp. 155-180, 1992.
- (1992) Phonetica , vol.49 , pp. 155-180
- Browman, C.P.¹ Goldstein, L.²

11
- 0037949203
- The elastic phrase: Modeling the dynamics of boundary-adjacent lengthening
- D. Byrd and E. Saltzman, "The elastic phrase: Modeling the dynamics of boundary-adjacent lengthening," J. Phonetics, vol. 31, no. 2, pp. 149-180, 2003.
- (2003) J. Phonetics , vol.31 , Issue.2 , pp. 149-180
- Byrd, D.¹ Saltzman, E.²

12
- 34547497796
- An articulatory feature-based tandem approach and factored observation modeling
- O. Cetin, A. Kantor, S. King, C. Bartels, M. Magimai-Doss, J. Frankel, and K. Livescu, "An articulatory feature-based tandem approach and factored observation modeling," in Proc. ICASSP, 2007, vol. 4, pp. 645-648.
- (2007) Proc. ICASSP , vol.4 , pp. 645-648
- Cetin, O.¹ Kantor, A.² King, S.³ Bartels, C.⁴ Magimai-Doss, M.⁵ Frankel, J.⁶ Livescu, K.⁷

13
- 26444619785
- An elitist approach to automatic articulatory-acoustic feature classification for phonetic characterization of spoken language
- Nov.
- S. Chang, M. Wester, and S. Greenberg, "An elitist approach to automatic articulatory-acoustic feature classification for phonetic characterization of spoken language," Speech Commun., vol. 47, no. 3, pp. 290-311, Nov. 2005.
- (2005) Speech Commun. , vol.47 , Issue.3 , pp. 290-311
- Chang, S.¹ Wester, M.² Greenberg, S.³

14
- 85009064164
- Place of articulation cues for voiced and voiceless plosives and fricatives in syllable-initial position
- S. Chen and A. Alwan, "Place of articulation cues for voiced and voiceless plosives and fricatives in syllable-initial position," in Proc. ICSLP, 2000, vol. 4, pp. 113-116.
- (2000) Proc. ICSLP , vol.4 , pp. 113-116
- Chen, S.¹ Alwan, A.²

15
- 0004119259
- New York: Harper & Row
- N. Chomsky and M. Halle, The Sound Pattern of English. New York: Harper & Row, 1968.
- (1968) The Sound Pattern of English
- Chomsky, N.¹ Halle, M.²

16
- 0004147298
- Oxford, U.K.: Blackwell
- J. Clark and C. Yallop, An introduction to Phonetics and Phonology, 2nd ed. Oxford, U.K.: Blackwell, 1995.
- (1995) An Introduction to Phonetics and Phonology, 2nd Ed
- Clark, J.¹ Yallop, C.²

17
- 0000707529
- The internal organization of speech sounds
- J. A. Goldsmith, Ed. Cambridge, U.K.: Blackwell
- G. N. Clements and E. V. Hume, "The internal organization of speech sounds," in Handbook of Phonological Theory, J. A. Goldsmith, Ed. Cambridge, U.K.: Blackwell, 1995.
- (1995) Handbook of Phonological Theory
- Clements, G.N.¹ Hume, E.V.²

18
- 0001887625
- Performing fine phonetic distinctions: Templates versus features
- J. S. Perkell and D. Klatt, Eds. Hillsdale, NJ: Lawrence, Erlbaum Assoc. ch. 15
- R. Cole, R. M. Stern, and M. J. Lasry, "Performing fine phonetic distinctions: Templates versus features," in Invariance and Variability of Speech Processes, J. S. Perkell and D. Klatt, Eds. Hillsdale, NJ: Lawrence Erlbaum Assoc., 1986, ch. 15, pp. 325-345.
- (1986) Invariance and Variability of Speech Processes , pp. 325-345
- Cole, R.¹ Stern, R.M.² Lasry, M.J.³

19
- 85135196323
- New telephone speech corpora at CSLU
- R. Cole, M. Noel, T. Lander, and T. Durham, "New telephone speech corpora at CSLU," in Proc. 4th Euro. Conf. Speech Commun. Technol., 1995, vol. 1, pp. 821-824.
- (1995) Proc. 4th Euro. Conf. Speech Commun. Technol. , vol.1 , pp. 821-824
- Cole, R.¹ Noel, M.² Lander, T.³ Durham, T.⁴

20
- 0026372938
- Microstructural speech units and their HMM representations for discrete utterance speech recognition
- L. Deng and K. Erler, "Microstructural speech units and their HMM representations for discrete utterance speech recognition," in Proc. ICASSP, 1991, pp. 193-196.
- (1991) Proc. ICASSP , pp. 193-196
- Deng, L.¹ Erler, K.²

21
- 0026854213
- A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
- L. Deng, "A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal," Signal Process., vol. 27, no. 1, pp. 65-78, 1992.
- (1992) Signal Process. , vol.27 , Issue.1 , pp. 65-78
- Deng, L.¹

22
- 0028234947
- A statistical approach to ASR using atomic units constructed from overlapping articulatory features
- L. Deng and D. Sun, "A statistical approach to ASR using atomic units constructed from overlapping articulatory features," J. Acoust. Soc. Amer., vol. 95, pp. 2702-2719, 1994.
- (1994) J. Acoust. Soc. Amer. , vol.95 , pp. 2702-2719
- Deng, L.¹ Sun, D.²

23
- 85079090910
- Phonetic classification and recognition using HMM representation of overlapping articulator features for all classes of English sounds
- L. Deng and D. Sun, "Phonetic classification and recognition using HMM representation of overlapping articulator features for all classes of English sounds," in Proc. ICASSP, 1994, pp. 45-47.
- (1994) Proc. ICASSP , pp. 45-47
- Deng, L.¹ Sun, D.²

24
- 0031198059
- Production models as a structural basis for automatic speech recognition
- L. Deng, G. Ramsay, and D. Sun, "Production models as a structural basis for automatic speech recognition," Spec. Iss. Speech Prod. Modeling, Speech Commun., vol. 22, no. 2, pp. 93-112, 1997.
- (1997) Spec. Iss. Speech Prod. Modeling, Speech Commun. , vol.22 , Issue.2 , pp. 93-112
- Deng, L.¹ Ramsay, G.² Sun, D.³

25
- 0032119268
- A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition
- L. Deng, "A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition,"Speech Commun., vol. 24, no. 4, pp. 299-323, 1998.
- (1998) Speech Commun. , vol.24 , Issue.4 , pp. 299-323
- Deng, L.¹

26
- 0033623527
- Spontaneous speech recognition using a statistical coarticulatory model for the hidden vocal-tract-resonance dynamics
- L. Deng and J. Ma, "Spontaneous speech recognition using a statistical coarticulatory model for the hidden vocal-tract-resonance dynamics," J. Acoust. Soc. Amer., vol. 108, no. 6, pp. 3036-3048, 2000.
- (2000) J. Acoust. Soc. Amer. , vol.108 , Issue.6 , pp. 3036-3048
- Deng, L.¹ Ma, J.²

27
- 4544323815
- A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances
- L. Deng, L. Lee, H. Attias, and A. Acero, "A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances," in Proc. ICASSP, 2004, pp. I557-I560.
- (2004) Proc. ICASSP
- Deng, L.¹ Lee, L.² Attias, H.³ Acero, A.⁴

28
- 34047266395
- Structured speech modeling
- Sep.
- L. Deng, D. Yu, and A. Acero, "Structured speech modeling," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 5, pp. 1492-1504, Sep. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.5 , pp. 1492-1504
- Deng, L.¹ Yu, D.² Acero, A.³

29
- 27644525945
- Use of temporal information: Detection of the periodicity and aperiodicity profile of speech
- Sep.
- O.Deshmukh, C. Espy-Wilson, A.Salomon, and J. Singh, "Use of temporal information: Detection of the periodicity and aperiodicity profile of speech," IEEE Trans. Speech Audio Process., vol. 13, no. 5, pp. 776-786, Sep. 2005.
- (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 776-786
- Deshmukh, O.¹ Espy-Wilson, C.² Salomon, A.³ Singh, J.⁴

30
- 84937322060
- Ph.D., Univ. of Waterloo, Dept. of Elect. Comput. Eng., Waterloo, ON, Canada
- S. Dusan, "Statistical estimation of articulatory trajectories from the speech signal using dynamical and phonological constraints," Ph.D., Univ. of Waterloo, Dept. of Elect. Comput. Eng., Waterloo, ON, Canada, 2000.
- (2000) Statistical Estimation of Articulatory Trajectories from the Speech Signal Using Dynamical and Phonological Constraints
- Dusan, S.¹

31
- 0006036234
- Phoneme recognition with an artificial neural network
- K. Elenius and G. Tacacs, "Phoneme recognition with an artificial neural network," in Proc. Eurospeech, 1991, pp. 121-124.
- (1991) Proc. Eurospeech , pp. 121-124
- Elenius, K.¹ Tacacs, G.²

32
- 0037518143
- Comparing phoneme and feature based speech recognition using artificial neural networks
- K. Elenius and M. Blomberg, "Comparing phoneme and feature based speech recognition using artificial neural networks," in Proc. ICSLP, 1992, pp. 1279-1282.
- (1992) Proc. ICSLP , pp. 1279-1282
- Elenius, K.¹ Blomberg, M.²

33
- 0027627252
- Hidden Markov model representation of quantized articulatory features for speech recognition
- K. Erler and L. Deng, "Hidden Markov model representation of quantized articulatory features for speech recognition," Comput., Speech, Lang., vol. 7, pp. 265-282, 1993.
- (1993) Comput., Speech, Lang. , vol.7 , pp. 265-282
- Erler, K.¹ Deng, L.²

34
- 33646663971
- The relevance of F4 in distinguishing between different articulatory configurations of American English/r/
- C. Y. Espy-Wilson and S. E. Boyce, "The relevance of F4 in distinguishing between different articulatory configurations of American English/r/," J. Acoust. Soc. Amer., vol. 105, no. 2, p. 1400, 1999.
- (1999) J. Acoust. Soc. Amer. , vol.105 , Issue.2 , pp. 1400
- Espy-Wilson, C.Y.¹ Boyce, S.E.²

35
- 0033927512
- Acoustic modeling of American English/r/
- C. Y. Espy-Wilson, S. E. Boyce, M. Jackson, S. Narayanan, and A. Alwan, "Acoustic modeling of American English/r/," J. Acoust. Soc. Amer., vol. 108, no. 1, pp. 343-356, 2000.
- (2000) J. Acoust. Soc. Amer. , vol.108 , Issue.1 , pp. 343-356
- Espy-Wilson, C.Y.¹ Boyce, S.E.² Jackson, M.³ Narayanan, S.⁴ Alwan, A.⁵

36
- 0013631878
- Coordination and coarticulation in speech production
- C. A. Fowler and E. Saltzman, "Coordination and coarticulation in speech production," Lang. Speech, vol. 36, pp. 171-195, 1993.
- (1993) Lang. Speech , vol.36 , pp. 171-195
- Fowler, C.A.¹ Saltzman, E.²

37
- 0002441991
- Coarticulation resistance of American English consonants and its effects on transconsonantal vowel-to-vowel coarticulation
- C. A. Fowler and L. Brancazio, "Coarticulation resistance of American English consonants and its effects on transconsonantal vowel-to-vowel coarticulation," Lang. Speech, vol. 43, pp. 1-42, 2000.
- (2000) Lang. Speech , vol.43 , pp. 1-42
- Fowler, C.A.¹ Brancazio, L.²

38
- 20444400371
- Speech production and perception
- A. Healy and R. Proctor, Eds. New York: Wiley, Experimental Psychology
- C. A. Fowler, "Speech production and perception," in Handbook of Psychology, A. Healy and R. Proctor, Eds. New York: Wiley, 2003, vol. 4, Experimental Psychology, pp. 237-266.
- (2003) Handbook of Psychology , vol.4 , pp. 237-266
- Fowler, C.A.¹

39
- 84994254645
- An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces
- J. Frankel, K. Richmond, S. King, and P. Taylor, "An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces," in Proc. ICSLP, 2000, vol. 4, pp. 254-257.
- (2000) Proc. ICSLP , vol.4 , pp. 254-257
- Frankel, J.¹ Richmond, K.² King, S.³ Taylor, P.⁴

40
- 58849145971
- ASR\Articulatory speech recognition
- J. Frankel and S. King, "ASR\Articulatory speech recognition," in Proc. Eurospeech, Denmark, 2001, pp. 599-602.
- (2001) Proc. Eurospeech, Denmark , pp. 599-602
- Frankel, J.¹ King, S.²

41
- 85009088992
- Articulatory feature recognition using dynamic Bayesian networks
- Korea
- J. Frankel, M. Wester, and S. King, "Articulatory feature recognition using dynamic Bayesian networks," in Proc. Int. Conf. Spoken Lang. Process., Korea, 2004, pp. 1202-1205.
- (2004) Proc. Int. Conf. Spoken Lang. Process. , pp. 1202-1205
- Frankel, J.¹ Wester, M.² King, S.³

42
- 33745225408
- A hybrid ANN/DBN approach to articula-tory feature recognition
- J. Frankel and S. King, "A hybrid ANN/DBN approach to articula-tory feature recognition," in Proc. Eurospeech, Interspeech, 2005, pp. 3045-3048.
- (2005) Proc. Eurospeech, Interspeech , pp. 3045-3048
- Frankel, J.¹ King, S.²

43
- 0015712358
- Computer controlled radiography for observation of movements of articulatory and other human organs
- O. Fujimura, S. Kiritani, and H. Ishida, "Computer controlled radiography for observation of movements of articulatory and other human organs," Comput. Biol. Med., vol. 3, pp. 371-384, 1973.
- (1973) Comput. Biol. Med. , vol.3 , pp. 371-384
- Fujimura, O.¹ Kiritani, S.² Ishida, H.³

44
- 0000154329
- Relative invariance of articulatory movements: Aniceberg model
- J. S. Perkell and D. Klatt, Eds. Mahwah, NJ: Lawrence Erlbaum Assoc. ch. 11
- O. Fujimura, "Relative invariance of articulatory movements: An iceberg model," in Invariance & Variability of Speech Processes, J. S. Perkell and D. Klatt, Eds. Mahwah, NJ: Lawrence Erlbaum Assoc., 1986, ch. 11, pp. 226-242.
- (1986) Invariance & Variability of Speech Processes , pp. 226-242
- Fujimura, O.¹

45
- 0032627247
- Development of rules for controlling the HLsyn speech synthesizer
- H. M. Hanson, R. S. McGowan, K. N. Stevens, and R. E. Beaudoin, "Development of rules for controlling the HLsyn speech synthesizer," in Proc. ICASSP, 1999, vol. 1, pp. 85-88.
- (1999) Proc. ICASSP , vol.1 , pp. 85-88
- Hanson, H.M.¹ McGowan, R.S.² Stevens, K.N.³ Beaudoin, R.E.⁴

46
- 0036711819
- A quasiarticulatory approach to controlling acoustic source parameters in a Klatt-type formant synthesizer using HLsyn
- H. M. Hanson and K. N. Stevens, "A quasiarticulatory approach to controlling acoustic source parameters in a Klatt-type formant synthesizer using HLsyn," J. Acoust. Soc. Amer., vol. 112, no. 3, pp. 1158-1182, 2002.
- (2002) J. Acoust. Soc. Amer. , vol.112 , Issue.3 , pp. 1158-1182
- Hanson, H.M.¹ Stevens, K.N.²

47
- 0003948389
- Oxford U.K.: Blackwell
- J. Harris, English Sound Structure. Oxford, U.K.: Blackwell, 1994.
- (1994) English Sound Structure
- Harris, J.¹

48
- 78649376063
- Audiovisual speech recognition with articulator positions as hidden variables
- Germany
- M. Hasegawa-Johnson, K. Livescu, P. Lal, and K. Saenko, "Audiovisual speech recognition with articulator positions as hidden variables," in Proc. ICPhS, Saarbrucken, Germany, 2007, pp. 297-302.
- (2007) Proc. ICPhS, Saarbrucken , pp. 297-302
- Hasegawa-Johnson, M.¹ Livescu, K.² Lal, P.³ Saenko, K.⁴

49
- 78049402623
- G. H. Juang, Ed. San Mateo, CA: Morgan & Claypool
- X. He and L. Deng, Discriminative Learning for Speech Processing, G. H. Juang, Ed. San Mateo, CA: Morgan & Claypool, 2008.
- (2008) Discriminative Learning for Speech Processing
- He, X.¹ Deng, L.²

50
- 33745805403
- A fast learning algorithm for deep belief nets
- G. E. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets," Neural Comput., vol. 18, pp. 1527-1554, 2006.
- (2006) Neural Comput. , vol.18 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.³

51
- 0029843107
- Accurate recovery of articulator positions from acoustics: New conclusions based on human data
- J. Hogden, A. Löfqvist, V. Gracco, I. Zlokarnik, P. Rubin, and E. Saltzman, "Accurate recovery of articulator positions from acoustics: New conclusions based on human data," J. Acoust. Soc. Amer., vol. 100, no. 3, pp. 1819-1834, 1996.
- (1996) J. Acoust. Soc. Amer. , vol.100 , Issue.3 , pp. 1819-1834
- Hogden, J.¹ Löfqvist, A.² Gracco, V.³ Zlokarnik, I.⁴ Rubin, P.⁵ Saltzman, E.⁶

52
- 33846700692
- An articulatorily constrained, maximum likelihood approach to speech recognition
- Tech. Rep. LA-UR-
- J. Hogden, D. Nix, and P. Valdez, "An articulatorily constrained, maximum likelihood approach to speech recognition," Los Alamos National Laboratory, Los Alamos, NM, 1998, Tech. Rep. LA-UR-96-3945.
- (1998) Los Alamos National Laboratory, Los Alamos, NM , pp. 96-3945
- Hogden, J.¹ Nix, D.² Valdez, P.³

53
- 34247647975
- Inverting mappings from smooth paths through Rn to paths through Rm. A technique applied to recovering articulation from acoustics
- J. Hogden, P. Rubin, E. McDermott, S. Katagiri, and L. Goldstein, "Inverting mappings from smooth paths through Rn to paths through Rm. A technique applied to recovering articulation from acoustics," Speech Commun., vol. 49, no. 5, pp. 361-383, 2007.
- (2007) Speech Commun. , vol.49 , Issue.5 , pp. 361-383
- Hogden, J.¹ Rubin, P.² McDermott, E.³ Katagiri, S.⁴ Goldstein, L.⁵

54
- 0036289950
- Triphone based unit selection for concatenative visual speech synthesis
- F. J. Huang, E. Cosatto, and H. P. Graf, "Triphone based unit selection for concatenative visual speech synthesis," in Proc. ICASSP, Orlando, FL, 2002, vol. 2, pp. 2037-2040.
- (2002) Proc. ICASSP, Orlando, FL , vol.2 , pp. 2037-2040
- Huang, F.J.¹ Cosatto, E.² Graf, H.P.³

55
- 44049116478
- Forward models\Supervised learning with a distal teacher
- M. I. Jordan and D. E. Rumelhart, "Forward models\Supervised learning with a distal teacher," Cogn. Sci., vol. 16, pp. 307-354, 1992.
- (1992) Cogn. Sci. , vol.16 , pp. 307-354
- Jordan, M.I.¹ Rumelhart, D.E.²

56
- 33750725541
- Ph.D. dissertation Univ. of MD, College Park
- A. Juneja, "Speech recognition based on phonetic features and acoustic landmarks," Ph.D. dissertation, Univ. of MD, College Park, 2004.
- (2004) Speech Recognition Based on Phonetic Features and Acoustic Landmarks
- Juneja, A.¹

57
- 0029753859
- Deriving gestural scores from articulator-movement records using weighted temporal decomposition
- T. P. Jung, A. K. Krishnamurthy, S. C. Ahalt, M. E. Beckman, and S. H. Lee, "Deriving gestural scores from articulator-movement records using weighted temporal decomposition," IEEE Trans. Speech Audio Process., vol. 4, no. 1, pp. 2-18, 1996.
- (1996) IEEE Trans. Speech Audio Process , vol.4 , Issue.1 , pp. 2-18
- Jung, T.P.¹ Krishnamurthy, A.K.² Ahalt, S.C.³ Beckman, M.E.⁴ Lee, S.H.⁵

58
- 0034853397
- What kind of pronunciation variation is hard for triphones to model?
- D. Jurafsky, W. Ward, Z. Jianping, K. Herold, Y. Xiuyang, and Z. Sen, "What kind of pronunciation variation is hard for triphones to model?," in Proc. ICASSP, 2001, vol. 1, pp. 577-580.
- (2001) Proc. ICASSP , vol.1 , pp. 577-580
- Jurafsky, D.¹ Ward, W.² Jianping, Z.³ Herold, K.⁴ Xiuyang, Y.⁵ Sen, Z.⁶

59
- 70350574658
- Face active appearance modeling and speech acoustic information to recover articulation
- Mar.
- A. Katsamanis, G. Papandreou, and P. Maragos, "Face active appearance modeling and speech acoustic information to recover articulation," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 3, pp. 411-422, Mar. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.3 , pp. 411-422
- Katsamanis, A.¹ Papandreou, G.² Maragos, P.³

60
- 0034297586
- Detection of phonological features in continuous speech using neural networks
- S. King and P. Taylor, "Detection of phonological features in continuous speech using neural networks," Comput., Speech, Lang., vol. 14, no. 4, pp. 333-353, 2000.
- (2000) Comput., Speech, Lang. , vol.14 , Issue.4 , pp. 333-353
- King, S.¹ Taylor, P.²

61
- 33745198184
- SVitchboard 1: Small vocabulary tasks from Switchboard 1
- S. King, C. Bartels, and J. Bilmes, "SVitchboard 1: Small vocabulary tasks from Switchboard 1," in Proc. Interspeech, 2005, pp. 3385-3388.
- (2005) Proc. Interspeech , pp. 3385-3388
- King, S.¹ Bartels, C.² Bilmes, J.³

62
- 0003424928
- Ph.D. dissertation Univ. of Bielefeld, Bielefeld, Germany
- K. Kirchhoff, "Robust speech recognition using articulatory information," Ph.D. dissertation, Univ. of Bielefeld, Bielefeld, Germany, 1999.
- (1999) Robust Speech Recognition Using Articulatory Information
- Kirchhoff, K.¹

63
- 0036642567
- Combining acoustic and articulatory feature information for robust speech recognition
- K. Kirchhoff, G. A. Fink, and G. Sagerer, "Combining acoustic and articulatory feature information for robust speech recognition," Speech Commun., vol. 37, pp. 303-319, 2002.
- (2002) Speech Commun. , vol.37 , pp. 303-319
- Kirchhoff, K.¹ Fink, G.A.² Sagerer, G.³

64
- 78649382951
- Application of Neural networks to articulatory motion estimation
- T. Kobayashi, M. Yagyu, and K. Shirai, "Application of Neural networks to articulatory motion estimation," in Proc. ICASSP, 1985, pp. 1001-1104.
- (1985) Proc. ICASSP , pp. 1001-1104
- Kobayashi, T.¹ Yagyu, M.² Shirai, K.³

65
- 0018116027
- Generating vocal tract shapes from formant frequencies
- P. Ladefoged, R. Harshman, L. Goldstein, and L. Rice, "Generating vocal tract shapes from formant frequencies," J. Acoust. Soc. Amer., vol. 64, pp. 1027-1035, 1978.
- (1978) J. Acoust. Soc. Amer. , vol.64 , pp. 1027-1035
- Ladefoged, P.¹ Harshman, R.² Goldstein, L.³ Rice, L.⁴

66
- 80054370614
- Oxford, U.K.: Oxford Univ. Press.
- J. Laver, Principles of Phonetics. Oxford, U.K.: Oxford Univ. Press., 1994.
- (1994) Principles of Phonetics
- Laver, J.¹

67
- 0020300423
- Acoustic-phonetic analysis based on an articulatory model
- J. P. Hayton, Ed. Dordrecht, The Netherlands: D. Reidel
- B. Lochschmidt, "Acoustic-phonetic analysis based on an articulatory model," in Automatic Speech Analysis and Recognition, J. P. Hayton, Ed. Dordrecht, The Netherlands: D. Reidel, 1982, pp. 139-152.
- (1982) Automatic Speech Analysis and Recognition , pp. 139-152
- Lochschmidt, B.¹

68
- 34547541459
- Articula-tory feature-based methods for acoustic and audio-visual speech recognition: Summary from the 2006 JHU Summer Workshop
- K. Livescu, O. Cetin, M. Hasegawa-Johnson, S. King, C. Bartels, N. Borges, A. Kantor, P. Lal, L. Yung, A. Bezman, S. Dawson-Haggerty, B. Woods, J. Frankel, M. Magimai-Doss, and K. Saenko, "Articula-tory feature-based methods for acoustic and audio-visual speech recognition: Summary from the 2006 JHU Summer Workshop," in Proc. ICASSP, 2007, vol. 4, pp. 621-624.
- (2007) Proc. ICASSP , vol.4 , pp. 621-624
- Livescu, K.¹ Cetin, O.² Hasegawa-Johnson, M.³ King, S.⁴ Bartels, C.⁵ Borges, N.⁶ Kantor, A.⁷ Lal, P.⁸ Yung, L.⁹ Bezman, A.¹⁰ Dawson-Haggerty, S.¹¹ Woods, B.¹² Frankel, J.¹³ Magimai-Doss, M.¹⁴ Saenko, K.¹⁵

69
- 0001523807
- A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamic model of speech
- J. Ma and L. Deng, "A path-stack algorithm for optimizing dynamic regimes in a statistical hidden dynamic model of speech," Comput., Speech, Lang., vol. 14, pp. 101-104, 2000.
- (2000) Comput. Speech, Lang. , vol.14 , pp. 101-104
- Ma, J.¹ Deng, L.²

70
- 84916524147
- Universal and language particular aspects of vowel-to-vowel coarticulation
- Star. Rep, Speech Res. SR-77/78
- S. Y. Manuel and R. A. Krakow, "Universal and language particular aspects of vowel-to-vowel coarticulation," Haskins Lab. Star. Rep, Speech Res. SR-77/78, pp. 69-78, 1984.
- (1984) Haskins Lab. , pp. 69-78
- Manuel, S.Y.¹ Krakow, R.A.²

71
- 0025162662
- The role of contrast in limiting vowel-to-vowel coar-ticulation in different languages
- S. Y. Manuel, "The role of contrast in limiting vowel-to-vowel coar-ticulation in different languages," J. Acoust. Soc. Amer., vol. 88, pp. 1286-1298, 1990.
- (1990) J. Acoust. Soc. Amer. , vol.88 , pp. 1286-1298
- Manuel, S.Y.¹

72
- 29444436962
- Integration of articulatory and spectrum features based on the hybrid HMM/BN modeling framework
- DOI 10.1016/j.specom.2005.07.003, PII S0167639305001731
- K. Markov, J. Dang, and S. Nakamura, "Integration of articulatory and spectrum features based on the hybrid HMM/BN modeling framework," Speech Commun., vol. 48, pp. 161-175, 2006. (Pubitemid 43012029)
- (2006) Speech Communication , vol.48 , Issue.2 , pp. 161-175
- Markov, K.¹ Dang, J.² Nakamura, S.³

73
- 63149189029
- Phonetics and linguistic evolution
- B. Malmberg, Ed Amsterdam, The Netherlands: North-Holland
- A. Martinet, "Phonetics and linguistic evolution," in Manual of Phon., B. Malmberg, Ed. Amsterdam, The Netherlands: North-Holland, 1957, pp. 252-272.
- (1957) Manual of Phon. , pp. 252-272
- Martinet, A.¹

74
- 0028375762
- Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary model tests
- R. S. McGowan, "Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary model tests," Speech Commun., vol. 14, no. 1, pp. 19-48, 1994.
- (1994) Speech Commun. , vol.14 , Issue.1 , pp. 19-48
- McGowan, R.S.¹

75
- 85009240321
- A flexible stream architecture for ASR using articulatory features
- F. Metze and A. Waibel, "A flexible stream architecture for ASR using articulatory features," in Proc. ICSLP, 2002, pp. 2133-2136.
- (2002) Proc. ICSLP , pp. 2133-2136
- Metze, F.¹ Waibel, A.²

76
- 0031624622
- Improved phone recognition using Bayesian Triphone Models
- J. Ming and F. J. Smith, "Improved phone recognition using Bayesian Triphone Models," in Proc. ICASSP, 1998, pp. 409-412.
- (1998) Proc. ICASSP , pp. 409-412
- Ming, J.¹ Smith, F.J.²

77
- 70349213974
- From acoustics to vocal tract time functions
- V. Mitra, I. Özbek, H. Nam, X. Zhou, and C. Espy-Wilson, "From acoustics to vocal tract time functions," in Proc. ICASSP, 2009, pp. 4497-4500.
- (2009) Proc. ICASSP , pp. 4497-4500
- Mitra, V.¹ Özbek, I.² Nam, H.³ Zhou, X.⁴ Espy-Wilson, C.⁵

78
- 70450200298
- Noise robustness of Tract variables and their application to speech recognition
- V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman, and L. Goldstein, "Noise robustness of Tract variables and their application to speech recognition," in Proc. Interspeech, U.K., 2009, pp. 2759-2762.
- (2009) Proc. Interspeech, U.K. , pp. 2759-2762
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³ Saltzman, E.⁴ Goldstein, L.⁵

79
- 78649390028
- A step in the realization of a speech recognition system based on gestural phonology and landmarks
- Portland J. Acoust. Soc. Amer.
- V. Mitra, H. Nam, and C. Espy-Wilson, "A step in the realization of a speech recognition system based on gestural phonology and landmarks," in Proc. 157th Meeting ASA, Portland, 2009, vol. 125, J. Acoust. Soc. Amer., p. 2530.
- (2009) Proc. 157th Meeting ASA , vol.125 , pp. 2530
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³

80
- 78649357089
- Recovering speech gestures using a cascaded neural network
- submitted for publication
- V. Mitra, H. Nam,C.Espy-Wilson, E.Saltzman, and L. Goldstein, "Recovering speech gestures using a cascaded neural network," J. Acoust. Soc. Amer., submitted for publication.
- J. Acoust. Soc. Amer.
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³ Saltzman, E.⁴ Goldstein, L.⁵

81
- 78649297301
- Deep belief networks for phone recognition
- A. Mohamed, G. Dahl, and G. Hinton, "Deep belief networks for phone recognition," in Proc. NIPS Workshop on Deep Learning for Speech Recognition and Related Applications, 2009.
- (2009) Proc. NIPS Workshop on Deep Learning for Speech Recognition and Related Applications
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

82
- 0027205884
- A scaled conjugate gradient algorithm for fast supervised learning
- M. F. Moller, "A scaled conjugate gradient algorithm for fast supervised learning," Neural Netw., vol. 6, pp. 525-533, 1993.
- (1993) Neural Netw. , vol.6 , pp. 525-533
- Moller, M.F.¹

83
- 0017007706
- Automatic detection and description of syllabic features in continuous speech
- Oct.
- R. D. Mori, P. Laface, and E. Piccolo, "Automatic detection and description of syllabic features in continuous speech," IEEE Trans. Acoust., Speech Signal Process., vol. 24, no. 5, pp. 365-379, Oct. 1976.
- (1976) IEEE Trans. Acoust., Speech Signal Process. , vol.24 , Issue.5 , pp. 365-379
- Mori, R.D.¹ Laface, P.² Piccolo, E.³

84
- 70349207706
- Tada: An enhanced, portable task dynamics model in Matlab
- 2
- H. Nam, L. Goldstein, E. Saltzman, and D. Byrd, "Tada: An enhanced, portable task dynamics model in Matlab," J. Acoust. Soc. Amer., vol. 115, no. 5-2, p. 2430, 2004.
- (2004) J. Acoust. Soc. Amer. , vol.115 , Issue.5 , pp. 2430
- Nam, H.¹ Goldstein, L.² Saltzman, E.³ Byrd, D.⁴

85
- 84867222549
- The acoustic to articulation mapping: Non-linear or Non-unique?
- D. Neiberg, G. Ananthakrishnan, and O. Engwall, "The acoustic to articulation mapping: Non-linear or Non-unique?," in Proc. Interspeech, 2008, pp. 1485-1488.
- (2008) Proc. Interspeech , pp. 1485-1488
- Neiberg, D.¹ Ananthakrishnan, G.² Engwall, O.³

86
- 0013871855
- Coarticulation in VCV utterances: Spectrographic measurements
- S. E. G. Ohman, "Coarticulation in VCV utterances: Spectrographic measurements," J. Acoust. Soc. Amer., vol. 39, pp. 151-168, 1966.
- (1966) J. Acoust. Soc. Amer. , vol.39 , pp. 151-168
- Ohman, S.E.G.¹

87
- 0010505818
- Recovery of articulatory movements from acoustics with phonemic information
- Bavaria, Germany
- T. Okadome, S. Suzuki, and M. Honda, "Recovery of articulatory movements from acoustics with phonemic information," in Proc. 5th Seminar Speech Production, Bavaria, Germany, 2000, pp. 229-232.
- (2000) Proc. 5th Seminar Speech Production , pp. 229-232
- Okadome, T.¹ Suzuki, S.² Honda, M.³

88
- 0036298107
- Maximum mutual information based acoustic features representation of phonological features for speech recognition
- M. K. Omar and M. Hasegawa-Johnson, "Maximum mutual information based acoustic features representation of phonological features for speech recognition," in Proc. ICASSP, 2002, vol. 1, pp. 81-84.
- (2002) Proc. ICASSP , vol.1 , pp. 81-84
- Omar, M.K.¹ Hasegawa-Johnson, M.²

89
- 4544293504
- Moving beyond the 'beads-on-a-string' model of speech
- M. Ostendorf, "Moving beyond the 'beads-on-a-string' model of speech," in Proc. IEEE Auto. Speech Recog. Understanding Workshop, 1999, vol. 1, pp. 79-83.
- (1999) Proc. IEEE Auto. Speech Recog. Understanding Workshop , vol.1 , pp. 79-83
- Ostendorf, M.¹

90
- 0026675669
- Inferring articulation and recognizing gestures from acoustics with a neural network trained on X-ray microbeam data
- G. Papcun, J. Hochberg, T. R. Thomas, F. Laroche, J. Zachs, and S. Levy, "Inferring articulation and recognizing gestures from acoustics with a neural network trained on X-ray microbeam data," J. Acoust. Soc. Amer., vol. 92, no. 2, pp. 688-700, 1992.
- (1992) J. Acoust. Soc. Amer. , vol.92 , Issue.2 , pp. 688-700
- Papcun, G.¹ Hochberg, J.² Thomas, T.R.³ Laroche, F.⁴ Zachs, J.⁵ Levy, S.⁶

91
- 51449098747
- An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping
- C. Qin and M. Á. Carreira-Perpiñán, "An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping," in Proc. Interspeech, 2007, pp. 74-77.
- (2007) Proc. Interspeech , pp. 74-77
- Qin, C.¹ Carreira-Perpiñán, M.Á.²

92
- 0026396339
- Acoustic-to-articulatory parameter mapping using an assembly of neural networks
- M. G. Rahim, W. B. Kleijn, J. Schroeter, and C. C. Goodyear, "Acoustic-to-articulatory parameter mapping using an assembly of neural networks," in Proc. ICASSP, 1991, pp. 485-488.
- (1991) Proc. ICASSP , pp. 485-488
- Rahim, M.G.¹ Kleijn, W.B.² Schroeter, J.³ Goodyear, C.C.⁴

93
- 0027499166
- On the use of neural networks in articulatory speech synthesis
- M. G. Rahim, C. C. Goodyear, W. B. Kleijn, J. Schroeter, and M. Sondhi, "On the use of neural networks in articulatory speech synthesis," J. Acoust. Soc. Amer., vol. 93, no. 2, pp. 1109-1121, 1993.
- (1993) J. Acoust. Soc. Amer. , vol.93 , Issue.2 , pp. 1109-1121
- Rahim, M.G.¹ Goodyear, C.C.² Kleijn, W.B.³ Schroeter, J.⁴ Sondhi, M.⁵

94
- 0021687548
- Timing constraints and coarticulation: Alveolo-palatals and sequences of alveolar + [j] in Catalan
- D. Recasens, "Timing constraints and coarticulation: Alveolo-palatals and sequences of alveolar + [j] in Catalan," Phonetica, vol. 41, pp. 125-139, 1984.
- (1984) Phonetica , vol.41 , pp. 125-139
- Recasens, D.¹

95
- 4243714433
- Ph.D., Univ. of Edinburgh, Edinburgh, U.K.
- K. Richmond, "Estimating articulatory parameters from the acoustic speech signal," Ph.D., Univ. of Edinburgh, Edinburgh, U.K., 2001.
- (2001) Estimating Articulatory Parameters from the Acoustic Speech Signal
- Richmond, K.¹

96
- 38549178971
- Trajectory mixture density networks with multiple mixtures for acoustic-articulatory inversion
- K.Richmond, "Trajectory mixture density networks with multiple mixtures for acoustic-articulatory inversion," Lecture Notes in Comput. Sci., vol. 4885/2007, pp. 263-272, 2007.
- (2007) Lecture Notes in Comput. Sci. , vol.2007-4885 , pp. 263-272
- Richmond, K.¹

97
- 70349215624
- Boston MA: Allyn & Bacon
- J. Ryalls and S. J. Behrens, Introduction to Speech Science: From Basic Theories to Clinical Applications. Boston, MA: Allyn & Bacon, 2000.
- (2000) Introduction to Speech Science: From Basic Theories to Clinical Applications
- Ryalls, J.¹ Behrens, S.J.²

98
- 77956779481
- A dynamical approach to gestural patterning in speech production
- E. Saltzman and K. Munhall, "A dynamical approach to gestural patterning in speech production," Ecol. Psychol., vol. 1, no. 4, pp. 332-382, 1989.
- (1989) Ecol. Psychol. , vol.1 , Issue.4 , pp. 332-382
- Saltzman, E.¹ Munhall, K.²

99
- 0024906981
- Robust statistic modelling of systematic variabilities in continuous speech incorporating acoustic-articulatory relations
- O. Schmidbauer, "Robust statistic modelling of systematic variabilities in continuous speech incorporating acoustic-articulatory relations," in Proc. ICASSP, 1989, pp. 616-619.
- (1989) Proc. ICASSP , pp. 616-619
- Schmidbauer, O.¹

100
- 84885860927
- A hierarchy of recurrent networks for speech recognition
- B. Schrauwen and L. Buesing, "A hierarchy of recurrent networks for speech recognition," in Proc. NIPS Workshop on Deep Learning for Speech Recognition and Related Applications, 2009.
- (2009) Proc. NIPS Workshop on Deep Learning for Speech Recognition and Related Applications
- Schrauwen, B.¹ Buesing, L.²

101
- 0008499181
- Estimating articulatory motion from speech wave
- K. Shirai and T. Kobayashi, "Estimating articulatory motion from speech wave," Speech Commun., vol. 5, pp. 159-170, 1986.
- (1986) Speech Commun. , vol.5 , pp. 159-170
- Shirai, K.¹ Kobayashi, T.²

102
- 4043137356
- A tutorial on support vector regression
- A. Smola and B. Scholkhopf, "A tutorial on support vector regression," Statist. Comput., vol. 14, no. 3, pp. 199-222, 2004.
- (2004) Statist. Comput. , vol.14 , Issue.3 , pp. 199-222
- Smola, A.¹ Scholkhopf, B.²

103
- 84939672029
- Toward a model for speech recognition
- K. N. Stevens, "Toward a model for speech recognition," J. Acoust. Soc. Amer., vol. 32, pp. 47-55, 1960.
- (1960) J. Acoust. Soc. Amer. , vol.32 , pp. 47-55
- Stevens, K.N.¹

104
- 0008796094
- Revisiting place of articulation measures for stop consonants: Implications for models of consonant production
- K. N. Stevens, S. Manuel, and M. Matthies, "Revisiting place of articulation measures for stop consonants: Implications for models of consonant production," in Proc. Int. Cong. Phon. Sci., 1999, vol. 2, pp. 1117-1120.
- (1999) Proc. Int. Cong. Phon. Sci. , vol.2 , pp. 1117-1120
- Stevens, K.N.¹ Manuel, S.² Matthies, M.³

105
- 34047251076
- Cambridge MA: MIT Press
- K. N. Stevens, Acoustic Phonetics (Current Studies in Linguistics). Cambridge, MA: MIT Press, 2000.
- (2000) Acoustic Phonetics (Current Studies in Linguistics)
- Stevens, K.N.¹

106
- 0036219864
- Toward a model for lexical access based on acoustic landmarks and distinctive features
- K. N. Stevens, "Toward a model for lexical access based on acoustic landmarks and distinctive features," J. Acoust. Soc. Amer., vol. 111, no. 4, pp. 1872-1891, 2002.
- (2002) J. Acoust. Soc. Amer. , vol.111 , Issue.4 , pp. 1872-1891
- Stevens, K.N.¹

107
- 78649348268
- Annotation and use of speech production corpus for building language-universal speech recognizers
- Beijing, China Oct.
- J. Sun and L. Deng, "Annotation and use of speech production corpus for building language-universal speech recognizers," in Proc. 2nd Int. Symp. Chinese Spoken Lang. Processi. ISCSLP, Beijing, China, Oct. 2000, vol. 3, pp. 31-34.
- (2000) Proc. 2nd Int. Symp. Chinese Spoken Lang. Processi. ISCSLP , vol.3 , pp. 31-34
- Sun, J.¹ Deng, L.²

108
- 0036165806
- An overlapping-feature-based phonological model incorporating linguistic constraints: Applications to speech recognition
- J. Sun and L. Deng, "An overlapping-feature-based phonological model incorporating linguistic constraints: Applications to speech recognition," J. Acoust. Soc. Amer., vol. 111, no. 2, pp. 1086-1101, 2002.
- (2002) J. Acoust. Soc. Amer. , vol.111 , Issue.2 , pp. 1086-1101
- Sun, J.¹ Deng, L.²

109
- 57749193836
- Voice conversion based on maximum likelihood estimation of speech parameter trajectory
- Nov.
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of speech parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

110
- 0344443787
- Joint state and parameter estimation for a target-directed nonlinear dynamic system model
- Dec.
- R. Togneri and L. Deng, "Joint state and parameter estimation for a target-directed nonlinear dynamic system model," IEEE Trans. Signal Process., vol. 51, no. 12, pp. 3061-3070, Dec. 2003.
- (2003) IEEE Trans. Signal Process. , vol.51 , Issue.12 , pp. 3061-3070
- Togneri, R.¹ Deng, L.²

111
- 0033708106
- Speech parameter generation algorithms for HMM-based speech synthesis
- Jun.
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis," in Proc. ICASSP, Jun. 2000, vol. 3, pp. 1315-1318.
- (2000) Proc. ICASSP , vol.3 , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

112
- 33745288610
- A support vector approach to the acoustic-to-articulatory mapping
- A. Toutios and K. Margaritis, "A support vector approach to the acoustic-to-articulatory mapping," in Proc. Interspeech, 2005, pp. 3221-3224.
- (2005) Proc. Interspeech , pp. 3221-3224
- Toutios, A.¹ Margaritis, K.²

113
- 78649366518
- Learning articulation from cepstral coefficients
- A. Toutios and K. Margaritis, "Learning articulation from cepstral coefficients," in Proc. SPECOM, 2005.
- (2005) Proc. SPECOM
- Toutios, A.¹ Margaritis, K.²

114
- 0003991806
- New York: Wiley
- V. Vapnik, Statistical Learning Theory. New York: Wiley, 1998.
- (1998) Statistical Learning Theory
- Vapnik, V.¹

115
- 0003652255
- Madison: Univ. of Wisconsin
- J. Westbury, X-Ray Microbeam Speech Production Database User's Handbook. Madison: Univ. of Wisconsin, 1994.
- (1994) X-Ray Microbeam Speech Production Database User's Handbook
- Westbury, J.¹

116
- 26444603266
- A dutch treatment of an elitist approach to articulatory-acoustic feature classification
- M. Wester, S. Greenberg, and S. Chang, "A dutch treatment of an elitist approach to articulatory-acoustic feature classification," in Proc. Eu-rospeech, 2001, pp. 1729-1732.
- (2001) Proc. Eu-rospeech , pp. 1729-1732
- Wester, M.¹ Greenberg, S.² Chang, S.³

117
- 33745222753
- Asynchronous articulatory feature recognition using dynamic Bayesian networks
- SP2004-81-95
- M. Wester, J. Frankel, and S. King, "Asynchronous articulatory feature recognition using dynamic Bayesian networks," in Proc. Inst. Electronics, Info, Commun. Engi. Beyond HMM Workshop, 2004, vol. 104, pp. 37-42, SP2004-81-95.
- (2004) Proc. Inst. Electronics, Info, Commun. Engi. beyond HMM Workshop , vol.104 , pp. 37-42
- Wester, M.¹ Frankel, J.² King, S.³

118
- 33745270665
- Tuebingen, Germany: Machine Learning Summer School
- J. Weston, A. Gretton, and A. Elisseeff, SVM Practical Session-How to Get Good Results Without Cheating. Tuebingen, Germany: Machine Learning Summer School, 2003.
- (2003) SVM Practical Session-How to Get Good Results Without Cheating
- Weston, J.¹ Gretton, A.² Elisseeff, A.³

119
- 78649366959
- A probabilistic framework for word recognition using phonetic features
- C. Windheuser, F. Bimbot, and P. Haffner, "A probabilistic framework for word recognition using phonetic features," in Proc. ICSLP, 1994, pp. 287-290.
- (1994) Proc. ICSLP , pp. 287-290
- Windheuser, C.¹ Bimbot, F.² Haffner, P.³

120
- 33646815712
- Online Available
- A. Wrench, The MOCHA-TIMIT Articulatory Database 1999 [Online]. Available: http://www.cstr.ed.ac.uk/artic/mocha.html
- (1999) The MOCHA-TIMIT Articulatory Database
- Wrench, A.¹

121
- 0028464701
- A new neural network for articula-tory speech recognition and its application to vowel identification
- J. Zachs and T. R. Thomas, "A new neural network for articula-tory speech recognition and its application to vowel identification," Comput., Speech, Lang., vol. 8, pp. 189-209, 1994.
- (1994) Comput. Speech, Lang. , vol.8 , pp. 189-209
- Zachs, J.¹ Thomas, T.R.²

122
- 84867193584
- The entropy of articulatory phonological code: Recognizing gestures from tract variables
- X. Zhuang, H. Nam, M. Hasegawa-Johnson, L. Goldstein, and E. Saltzman, "The entropy of articulatory phonological code: Recognizing gestures from tract variables," in Proc. Interspeech, 2008, pp. 1489-1492.
- (2008) Proc. Interspeech , pp. 1489-1492
- Zhuang, X.¹ Nam, H.² Hasegawa-Johnson, M.³ Goldstein, L.⁴ Saltzman, E.⁵

123
- 70450174439
- Articulatory phonological code for word classification
- X. Zhuang, H. Nam, M. Hasegawa-Johnson, L. Goldstein, and E. Saltzman, "Articulatory phonological code for word classification," in Proc. Interspeech, 2009, pp. 2763-2766
- (2009) Proc. Interspeech , pp. 2763-2766
- Zhuang, X.¹ Nam, H.² Hasegawa-Johnson, M.³ Goldstein, L.⁴ Saltzman, E.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.