SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 19, Issue 7, 2011, Pages 1913-1924

Articulatory information for noise robust speech recognition

(5) Mitra, Vikramjit a Nam, Hosung b Espy Wilson, Carol Y a Saltzman, Elliot b,c Goldstein, Louis b,d

a UNIVERSITY OF MARYLAND (United States)

b HASKINS LABORATORIES (United States)

c BOSTON UNIVERSITY (United States)

d UNIVERSITY OF SOUTHERN CALIFORNIA (United States)

Author keywords

Articulatory phonology; articulatory speech recognition; artificial neural networks (ANNs); noise robust speech recognition; speech inversion; task dynamic model; vocal tract variables

Indexed keywords

ARTICULATORY PHONOLOGY; ARTICULATORY SPEECH RECOGNITION; ARTIFICIAL NEURAL NETWORKS (ANNS); NOISE-ROBUST SPEECH RECOGNITION; SPEECH INVERSION; TASK DYNAMIC MODEL; VOCAL-TRACTS;

ACOUSTIC NOISE; FEATURE EXTRACTION; NEURAL NETWORKS; RESEARCH; VOCABULARY CONTROL;

SPEECH RECOGNITION;

EID: 79960545035 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2010.2103058 Document Type: Article

Times cited : (56)

References (67)

1
- 0345843991
- Experiments with a non linear spectral subtractor (NSS), hidden Markov models and the projection, for robust speech recognition in cars
- P. Lockwood and J. Boudy, "Experiments with a non linear spectral subtractor (NSS), hidden Markov models and the projection, for robust speech recognition in cars," in Proc. Eurospeech, 1991, pp. 79-82.
- (1991) Proc. Eurospeech , pp. 79-82
- Lockwood, P.¹ Boudy, J.²

2
- 56249136428
- Transforming binary uncertainties for robust speech recognition
- Sep.
- S. Srinivasan and D. L. Wang, "Transforming binary uncertainties for robust speech recognition," IEEE Trans Audio, Speech, Lang. Process., vol. 15, no. 7, pp. 2130-2140, Sep. 2007.
- (2007) IEEE Trans Audio, Speech, Lang. Process. , vol.15 , Issue.7 , pp. 2130-2140
- Srinivasan, S.¹ Wang, D.L.²

3
- 34249884500
- Speech enhancement using the modified phase-opponency model
- DOI 10.1121/1.2714913
- O. Deshmukh, C. Espy-Wilson, and L. H. Carney, "Speech enhancement using the modified phase opponency model," J. Acoust. Soc. Amer., vol. 121, no. 6, pp. 3886-3898, 2007. (Pubitemid 46872142)
- (2007) Journal of the Acoustical Society of America , vol.121 , Issue.6 , pp. 3886-3898
- Deshmukh, O.D.¹ Espy-Wilson, C.Y.² Carney, L.H.³

4
- 52949093125
- Combined speech enhancement and auditory modelling for robust distributed speech recognition
- R. Flynn and E. Jones, "Combined speech enhancement and auditory modelling for robust distributed speech recognition," Speech Commun., vol. 50, pp. 797-809, 2008.
- (2008) Speech Commun. , vol.50 , pp. 797-809
- Flynn, R.¹ Jones, E.²

5
- 0028517164
- RASTA processing of speech
- Oct.
- H. Hermansky and N. Morgan, "RASTA processing of speech," IEEE Trans. Speech Audio Process., vol. 2, no. 4, pp. 578-589, Oct. 1994.
- (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.4 , pp. 578-589
- Hermansky, H.¹ Morgan, N.²

6
- 42549139762
- MVA processing of speech features
- Jan.
- C. Chen and J. Bilmes, "MVA processing of speech features," IEEE Trans Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 257-270, Jan. 2007.
- (2007) IEEE Trans Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 257-270
- Chen, C.¹ Bilmes, J.²

7
- 0009578471
- Ph.D. dissertation, Carnegie Mellon Univ., Pittsburgh, PA
- T. M. Sullivan, "Multi-Microphone correlation-based processing for robust automatic speech recognition," Ph.D. dissertation, Carnegie Mellon Univ., Pittsburgh, PA, 1996.
- (1996) Multi-Microphone Correlation-based Processing for Robust Automatic Speech Recognition
- Sullivan, T.M.¹

8
- 4544286862
- Entropy-based variable frame rate analysis of speech signals and its application to ASR
- H. You, Q. Zhu, and A. Alwan, "Entropy-based variable frame rate analysis of speech signals and its application to ASR," in Proc. ICASSP, 2004, pp. 549-552.
- (2004) Proc. ICASSP , pp. 549-552
- You, H.¹ Zhu, Q.² Alwan, A.³

9
- 0031238095
- A model of dynamic auditory perception and its application to Robust Word recognition
- PII S1063667697063906
- B. Strope and A. Alwan, "A model of dynamic auditory perception and its application to robust word recognition," IEEE Trans. Speech Audio Process., vol. 5, no. 5, pp. 451-464, Sep. 1997. (Pubitemid 127746017)
- (1997) IEEE Transactions on Speech and Audio Processing , vol.5 , Issue.5 , pp. 451-464
- Strope, B.¹ Alwan, A.²

10
- 0009589650
- ETSI ES 201 108 Ver. 1.1.3
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms, ETSI ES 201 108 Ver. 1.1.3, 2003.
- (2003) Speech Processing Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Front-End Feature Extraction Algorithm; Compression Algorithms

11
- 0442317754
- ETSI ES 202 050 Ver. 1.1.5
- Speech Processing, Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Adv. Frontend Feature Extraction Algorithm; Compression Algorithms, ETSI ES 202 050 Ver. 1.1.5, 2007.
- (2007) Speech Processing Transmission and Quality Aspects (STQ); Distributed Speech Recognition; Adv. Frontend Feature Extraction Algorithm; Compression Algorithms

12
- 17344389852
- Robust speech recognition in noisy environments: The 2001 IBM spin evaluation system
- B. Kingsbury, G. Saon, L. Mangu, M. Padmanabhan, and R. Sarikaya, "Robust speech recognition in noisy environments: The 2001 IBM spin evaluation system," in Proc. ICASSP, 2002, vol. 1, pp. I-53-I-56.
- (2002) Proc. ICASSP , vol.1
- Kingsbury, B.¹ Saon, G.² Mangu, L.³ Padmanabhan, M.⁴ Sarikaya, R.⁵

13
- 0030245128
- Robust continuous speech recognition using parallel model combination
- PII S1063667696067120
- M. J. F. Gales and S. J. Young, "Robust continuous speech recognition using parallel model combination," IEEE Trans. Speech Audio Process., vol. 4, no. 5, pp. 352-359, Sep. 1996. (Pubitemid 126753023)
- (1996) IEEE Transactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 352-359
- Gales, M.J.F.¹ Young, S.J.²

14
- 0029288633
- Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models
- C. Leggetter and P.Woodland, "Maximum likelihood linear regression for speaker adaptation of continuous density hidden Markov models," Comput., Speech Lang., vol. 9, pp. 171-185, 1995.
- (1995) Comput., Speech Lang. , vol.9 , pp. 171-185
- Leggetter, C.¹ Woodland, P.²

15
- 0347899508
- Piecewise-linear transformation-based HMM adaptation for noisy speech
- Jan.
- Z. Zhang and S. Furui, "Piecewise-linear transformation-based HMM adaptation for noisy speech," Speech Commun., vol. 42, no. 1, pp. 43-58, Jan. 2004.
- (2004) Speech Commun. , vol.42 , Issue.1 , pp. 43-58
- Zhang, Z.¹ Furui, S.²

16
- 0035342414
- Robust automatic speech recognition with missing and unreliable acoustic data
- DOI 10.1016/S0167-6393(00)00034-0, PII S0167639300000340
- M. Cooke, P. Green, L. Josifovski, and A. Vizinho, "Robust automatic speech recognition with missing and uncertain acoustic data," Speech Commun., vol. 34, pp. 267-285, 2001. (Pubitemid 32284867)
- (2001) Speech Communication , vol.34 , Issue.3 , pp. 267-285
- Cooke, M.¹ Green, P.² Josifovski, L.³ Vizinho, A.⁴

17
- 85009063707
- Soft decisions in missing data techniques for robust automatic speech recognition
- J. Barker, L. Josifovski, M. P. Cooke, and P. D. Green, "Soft decisions in missing data techniques for robust automatic speech recognition," in Proc. Int. Conf. Spoken Lang. Process., 2000, pp. 373-376.
- (2000) Proc. Int. Conf. Spoken Lang. Process. , pp. 373-376
- Barker, J.¹ Josifovski, L.² Cooke, M.P.³ Green, P.D.⁴

18
- 0037841203
- State based imputation of missing data for robust speech recognition and speech enhancement
- L. Josifovski, M. Cooke, P. Green, and A. Vizinho, "State based imputation of missing data for robust speech recognition and speech enhancement," in Proc. Eurospeech, 1999, vol. 6, pp. 2833-2836.
- (1999) Proc. Eurospeech , vol.6 , pp. 2833-2836
- Josifovski, L.¹ Cooke, M.² Green, P.³ Vizinho, A.⁴

19
- 4544293504
- Moving beyond the 'beads-on-a-string' model of speech
- CO
- M. Ostendorf, "Moving beyond the 'beads-on-a-string' model of speech," in Proc. IEEE Autom. Speech Recognition Understanding Workshop, CO, 1999, vol. 1, pp. 79-83.
- (1999) Proc. IEEE Autom. Speech Recognition Understanding Workshop , vol.1 , pp. 79-83
- Ostendorf, M.¹

20
- 0036165806
- An overlapping-feature-based phonological model incorporating linguistic constraints: Applications to speech recognition
- DOI 10.1121/1.1420380
- J. Sun and L. Deng, "An overlapping-feature-based phonological model incorporating linguistic constraints: Applications to speech recognition," J. Acoust. Soc. Amer., vol. 111, no. 2, pp. 1086-1101, Feb. 2002. (Pubitemid 34127489)
- (2002) Journal of the Acoustical Society of America , vol.111 , Issue.2 , pp. 1086-1101
- Sun, J.¹ Deng, L.²

21
- 0034853397
- What kind of pronunciation variation is hard for triphones to model?
- D. Jurafsky, W.Ward, Z. Jianping, K. Herold, Y. Xiuyang, and Z. Sen, "What kind of pronunciation variation is hard for triphones to model?," in Proc. ICASSP, 2001, vol. 1, pp. 577-580. (Pubitemid 32839316)
- (2001) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.1 , pp. 577-580
- Jurafsky, D.¹ Ward, W.² Zhang, J.³ Herold, K.⁴ Yu, X.⁵ Zhang, S.⁶

22
- 84939672029
- Toward a model for speech recognition
- K. N. Stevens, "Toward a model for speech recognition," J. Acoust. Soc. Amer., vol. 32, pp. 47-55, 1960.
- (1960) J. Acoust. Soc. Amer. , vol.32 , pp. 47-55
- Stevens, K.N.¹

23
- 0001887625
- Performing fine phonetic distinctions: Templates versus features
- Hillsdale, NJ: Lawrence Erlbaum Assoc., ch. 15
- R. Cole, R. M. Stern, and M. J. Lasry, , J. S. Perkell and D. Klatt, Eds., "Performing fine phonetic distinctions: Templates versus features," in Invariance and Variability of Speech Processes. Hillsdale, NJ: Lawrence Erlbaum Assoc., 1986, ch. 15, pp. 325-345.
- (1986) Invariance and Variability of Speech Processes , pp. 325-345
- Cole, R.¹ Stern, R.M.² Lasry, M.J.³ Perkell, J.S.⁴ Klatt, D.⁵

24
- 0020300423
- Acoustic-phonetic analysis based on an articulatory model
- J. P. Hayton, Ed. Dordrecht, The Netherlands: D. Reidel
- B. Lochschmidt, "Acoustic-phonetic analysis based on an articulatory model," in Automatic Speech Analysis and Recognition, J. P. Hayton, Ed. Dordrecht, The Netherlands: D. Reidel, 1982, pp. 139-152.
- (1982) Automatic Speech Analysis and Recognition , pp. 139-152
- Lochschmidt, B.¹

25
- 0017007706
- Automatic detection and description of syllabic features in continuous speech
- Oct.
- R. D. Mori, P. Laface, and E. Piccolo, "Automatic detection and description of syllabic features in continuous speech," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-24, no. 5, pp. 365-379, Oct. 1976.
- (1976) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-24 , Issue.5 , pp. 365-379
- Mori, R.D.¹ Laface, P.² Piccolo, E.³

26
- 0024906981
- Robust statistic modelling of systematic variabilities in continuous speech incorporating acoustic-articulatory relations
- O. Schmidbauer, "Robust statistic modelling of systematic variabilities in continuous speech incorporating acoustic-articulatory relations," in Proc. ICASSP, 1989, pp. 616-619. (Pubitemid 20604192)
- (1989) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.1 , pp. 616-619
- Schmidbauer Otto¹

27
- 0026854213
- A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal
- L. Deng, "A generalized hidden Markov model with state-conditioned trend functions of time for the speech signal," Signal Process., vol. 27, no. 1, pp. 65-78, 1992.
- (1992) Signal Process. , vol.27 , Issue.1 , pp. 65-78
- Deng, L.¹

28
- 0028234947
- A statistical approach to automatic speech recognition using the atomic speech units constructed from overlapping articulatory features
- DOI 10.1121/1.409839
- L. Deng and D. Sun, "A statistical approach to ASR using atomic units constructed from overlapping articulatory features," J. Acoust. Soc. Amer., vol. 95, pp. 2702-2719, 1994. (Pubitemid 24152864)
- (1994) Journal of the Acoustical Society of America , vol.95 , Issue.5 , pp. 2702-2719
- Deng, L.¹ Sun, D.X.²

29
- 0027627252
- Hidden Markov model representation of quantized articulatory features for speech recognition
- DOI 10.1006/csla.1993.1014
- K. Erler and L. Deng, "Hidden Markov model representation of quantized articulatory features for speech recognition," Comput., Speech Lang., vol. 7, pp. 265-282, 1993. (Pubitemid 23705305)
- (1993) Computer Speech and Language , vol.7 , Issue.3 , pp. 265-282
- Erler Kevin¹ Deng, L.²

30
- 0034297586
- Detection of phonological features in continuous speech using neural networks
- Oct.
- S. King and P. Taylor, "Detection of phonological features in continuous speech using neural networks," Comput. Speech Lang., vol. 14, no. 4, pp. 333-353, Oct. 2000.
- (2000) Comput. Speech Lang. , vol.14 , Issue.4 , pp. 333-353
- King, S.¹ Taylor, P.²

31
- 0004119259
- New York: Harper & Row
- N. Chomsky and M. Halle, The Sound Pattern of English. New York: Harper & Row, 1968.
- (1968) The Sound Pattern of English
- Chomsky, N.¹ Halle, M.²

32
- 0003948389
- Oxford, U.K.: Wiley-Blackwell
- J. Harris, English Sound Structure. Oxford, U.K.: Wiley-Blackwell, 1994.
- (1994) English Sound Structure
- Harris, J.¹

33
- 33846680938
- Speech production knowledge in automatic speech recognition
- DOI 10.1121/1.2404622
- S. King, J. Frankel, K. Livescu, E. McDermott, K. Richmond, and M. Wester, "Speech production knowledge in automatic speech recognition," J. Acoust. Soc. Amer., vol. 121, no. 2, pp. 723-742, 2007. (Pubitemid 46192674)
- (2007) Journal of the Acoustical Society of America , vol.121 , Issue.2 , pp. 723-742
- King, S.¹ Frankel, J.² Livescu, K.³ McDermott, E.⁴ Richmond, K.⁵ Wester, M.⁶

34
- 4243714433
- Ph.D. dissertation, Univ. of Edinburgh, Edinburgh, U.K.
- K. Richmond, "Estimating articulatory parameters from the acoustic speech signal," Ph.D. dissertation, Univ. of Edinburgh, Edinburgh, U.K., 2001.
- (2001) Estimating Articulatory Parameters from the Acoustic Speech Signal
- Richmond, K.¹

35
- 0004129646
- Cambridge, MA: MIT Press
- K. Stevens, Acoustic Phonetics. Cambridge, MA: MIT Press, 2000.
- (2000) Acoustic Phonetics
- Stevens, K.¹

36
- 0017968519
- Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer-sorting technique
- B. S. Atal, J. J. Chang, M. V. Mathews, and J. W. Tukey, "Inversion of articulatory-to-acoustic transformation in the vocal tract by a computer sorting technique," J. Acoust. Soc. Amer., vol. 63, pp. 1535-1555, 1978. (Pubitemid 8346208)
- (1978) Journal of the Acoustical Society of America , vol.63 , Issue.5 , pp. 1535-1555
- Atal, B.S.¹ Chang, J.J.² Mathews, M.V.³ Tukey, J.W.⁴

37
- 0027499166
- On the use of neural networks in articulatory speech synthesis
- DOI 10.1121/1.405559
- M. G. Rahim, C. C. Goodyear, W. B. Kleijn, J. Schroeter, and M. Sondhi, "On the use of neural networks in articulatory speech synthesis," J. Acoust. Soc. Amer., vol. 93, no. 2, pp. 1109-1121, 1993. (Pubitemid 23059838)
- (1993) Journal of the Acoustical Society of America , vol.93 , Issue.2 , pp. 1109-1121
- Rahim, M.G.¹ Goodyear, C.C.² Kleijn, W.B.³ Schroeter, J.⁴ Sondhi, M.M.⁵

38
- 0026675669
- Inferring articulation and recognizing gestures from acoustics with a neural network trained on x-ray microbeam data
- G. Papcun, J. Hochberg, T. R. Thomas, F. Laroche, J. Zachs, and S. Levy, "Inferring articulation and recognizing gestures from acoustics with a neural network trained on x-ray microbeam data," J. Acoust. Soc. Amer., vol. 92, no. 2, pp. 688-700, 1992.
- (1992) J. Acoust. Soc. Amer. , vol.92 , Issue.2 , pp. 688-700
- Papcun, G.¹ Hochberg, J.² Thomas, T.R.³ Laroche, F.⁴ Zachs, J.⁵ Levy, S.⁶

39
- 0018116027
- Generating vocal tract shapes from formant frequencies
- P. Ladefoged, R. Harshman, L. Goldstein, and L. Rice, "Generating vocal tract shapes from formant frequencies," J. Acoust. Soc. Amer., vol. 64, no. 4, pp. 1027-1035, 1978. (Pubitemid 9053326)
- (1978) Journal of the Acoustical Society of America , vol.64 , Issue.4 , pp. 1027-1035
- Ladefoged, P.¹ Harshman, R.² Goldstein, L.³ Rice, L.⁴

40
- 0029843107
- Accurate recovery of articulator positions from acoustics: New conclusions based on human data
- J. Hogden, A. Lofqvist, V. Gracco, I. Zlokarnik, P. Rubin, and E. Saltzman, "Accurate recovery of articulator positions from acoustics: New conclusions based on human data," J. Acoust. Soc. Amer., vol. 100, no. 3, pp. 1819-1834, 1996. (Pubitemid 26307570)
- (1996) Journal of the Acoustical Society of America , vol.100 , Issue.3 , pp. 1819-1834
- Hogden, J.¹ Lofqvist, A.² Gracco, V.³ Zlokarnik, I.⁴ Rubin, P.⁵ Saltzman, E.⁶

41
- 0010505818
- Recovery of articulatory movements from acoustics with phonemic information
- Bavaria, Germany
- T. Okadome, S. Suzuki, and M. Honda, "Recovery of articulatory movements from acoustics with phonemic information," in Proc. 5th Seminar Speech Prod., Bavaria, Germany, 2000, pp. 229-232.
- (2000) Proc. 5th Seminar Speech Prod. , pp. 229-232
- Okadome, T.¹ Suzuki, S.² Honda, M.³

42
- 51449098747
- An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping
- C. Qin and M. Á. Carreira-Perpiñán, "An empirical investigation of the nonuniqueness in the acoustic-to-articulatory mapping," in Proc. Interspeech, 2007, pp. 74-77.
- (2007) Proc. Interspeech , pp. 74-77
- Qin, C.¹ Carreira-Perpinán, M.A.²

43
- 84867222549
- The acoustic to articulation mapping: Non-Linear or non-unique?
- D. Neiberg, G. Ananthakrishnan, and O. Engwall, "The acoustic to articulation mapping: Non-Linear or non-unique?," in Proc. Interspeech, 2008, pp. 1485-1488.
- (2008) Proc. Interspeech , pp. 1485-1488
- Neiberg, D.¹ Ananthakrishnan, G.² Engwall, O.³

44
- 58849145971
- ASR-Articulatory speech recognition
- J. Frankel and S. King, "ASR-Articulatory speech recognition," in Proc. Eurospeech, 2001, pp. 599-602.
- (2001) Proc. Eurospeech , pp. 599-602
- Frankel, J.¹ King, S.²

45
- 84994254645
- An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces
- J. Frankel, K. Richmond, S. King, and P. Taylor, "An automatic speech recognition system using neural networks and linear dynamic models to recover and model articulatory traces," in Proc. ICSLP, 2000, vol. 4, pp. 254-257.
- (2000) Proc. ICSLP , vol.4 , pp. 254-257
- Frankel, J.¹ Richmond, K.² King, S.³ Taylor, P.⁴

46
- 0001622923
- On defining coarticulation
- R. Daniloff and R. Hammarberg, "On defining coarticulation," J. Phon., vol. 1, pp. 239-248, 1973.
- (1973) J. Phon. , vol.1 , pp. 239-248
- Daniloff, R.¹ Hammarberg, R.²

47
- 0000523613
- Towards an articulatory phonology
- C. P. Browman and L. Goldstein, "Towards an articulatory phonology," Phonol. Yearbook, vol. 85, pp. 219-252, 1986.
- (1986) Phonol. Yearbook , vol.85 , pp. 219-252
- Browman, C.P.¹ Goldstein, L.²

48
- 0027024362
- Articulatory phonology: An overview
- C. P. Browman and L. Goldstein, "Articulatory phonology: An overview," Gynecol. Obstet. Invest., vol. 49, pp. 155-180, 1992.
- (1992) Gynecol. Obstet. Invest. , vol.49 , pp. 155-180
- Browman, C.P.¹ Goldstein, L.²

49
- 77956779481
- A dynamical approach to gestural patterning in speech production
- E. Saltzman and K. Munhall, "A dynamical approach to gestural patterning in speech production," Ecol. Psychol., vol. 1, no. 4, pp. 332-382, 1989.
- (1989) Ecol. Psychol. , vol.1 , Issue.4 , pp. 332-382
- Saltzman, E.¹ Munhall, K.²

50
- 70349207706
- TADA: An enhanced, portable task dynamics model in Matlab
- H. Nam, L. Goldstein, E. Saltzman, and D. Byrd, "TADA: An enhanced, portable task dynamics model in Matlab," J. Acoust. Soc. Amer., vol. 115, no. 5, p. 2430, 2004.
- (2004) J. Acoust. Soc. Amer. , vol.115 , Issue.5 , pp. 2430
- Nam, H.¹ Goldstein, L.² Saltzman, E.³ Byrd, D.⁴

51
- 0003652255
- Madison, WI: Univ. of Wisconsin
- J. Westbury, X-ray Microbeam Speech Production Database User's Handbook. Madison, WI: Univ. of Wisconsin, 1994.
- (1994) X-ray Microbeam Speech Production Database User's Handbook
- Westbury, J.¹

52
- 0028375762
- Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary model tests
- Feb.
- R. S. McGowan, "Recovering articulatory movement from formant frequency trajectories using task dynamics and a genetic algorithm: Preliminary model tests," Speech Commun., vol. 14, no. 1, pp. 19-48, Feb. 1994.
- (1994) Speech Commun. , vol.14 , Issue.1 , pp. 19-48
- McGowan, R.S.¹

53
- 78649390043
- Retrieving tract variables from acoustics: A comparison of different machine learning strategies
- Dec.
- V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman, and L. Goldstein, "Retrieving tract variables from acoustics: A comparison of different machine learning strategies," IEEE J. Sel. Topics Signal Process., vol. 4, no. 6, pp. 1027-1045, Dec. 2010.
- (2010) IEEE J. Sel. Topics Signal Process. , vol.4 , Issue.6 , pp. 1027-1045
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³ Saltzman, E.⁴ Goldstein, L.⁵

54
- 0003424928
- Ph.D. dissertation, Univ. of Bielefeld, Bielefeld, Germany
- K. Kirchhoff, "Robust speech recognition using articulatory information," Ph.D. dissertation, Univ. of Bielefeld, Bielefeld, Germany, 1999.
- (1999) Robust Speech Recognition using Articulatory Information
- Kirchhoff, K.¹

55
- 0036642567
- Combining acoustic and articulatory feature information for robust speech recognition
- DOI 10.1016/S0167-6393(01)00020-6, PII S0167639301000206
- K. Kirchhoff, G. A. Fink, and G. Sagerer, "Combining acoustic and articulatory feature information for robust speech recognition," Speech Commun., vol. 37, no. 3-4, pp. 303-319, Jul. 2002. (Pubitemid 34524845)
- (2002) Speech Communication , vol.37 , Issue.3-4 , pp. 303-319
- Kirchhoff, K.¹ Fink, G.A.² Sagerer, G.³

56
- 0037697284
- Hidden-articulator Markov models for speech recognition
- Oct.
- M. Richardson, J. Bilmes, and C. Diorio, "Hidden-articulator Markov models for speech recognition," Speech Commun., vol. 41, no. 2-3, pp. 511-529, Oct. 2003.
- (2003) Speech Commun. , vol.41 , Issue.2-3 , pp. 511-529
- Richardson, M.¹ Bilmes, J.² Diorio, C.³

57
- 70450200298
- Noise robustness of tract variables and their application to speech recognition
- V. Mitra, H. Nam, C. Espy-Wilson, E. Saltzman, and L. Goldstein, "Noise robustness of tract variables and their application to speech recognition," in Proc. Interspeech, 2009, pp. 2759-2762.
- (2009) Proc. Interspeech , pp. 2759-2762
- Mitra, V.¹ Nam, H.² Espy-Wilson, C.³ Saltzman, E.⁴ Goldstein, L.⁵

58
- 0036711819
- A quasiarticulatory approach to controlling acoustic source parameters in a Klatt-type formant synthesizer using HLsyn
- DOI 10.1121/1.1498851
- H. M. Hanson and K. N. Stevens, "A quasiarticulatory approach to controlling acoustic source parameters in a Klatt-type formant synthesizer using HLsyn," J. Acoust. Soc. Amer., vol. 112, no. 3, pp. 1158-1182, 2002. (Pubitemid 35006671)
- (2002) Journal of the Acoustical Society of America , vol.112 , Issue.3 , pp. 1158-1182
- Hanson, H.M.¹ Stevens, K.N.²

59
- 0038669544
- The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions
- Paris, France
- D. Pearce and H. G. Hirsch, "The aurora experimental framework for the performance evaluation of speech recognition systems under noisy conditions," in Proc. Autom. Speech Recognition: Challenges For New Millenium, ASR-2000, Paris, France, 2000, pp. 181-188.
- (2000) Proc. Autom. Speech Recognition: Challenges For New Millenium, ASR-2000 , pp. 181-188
- Pearce, D.¹ Hirsch, H.G.²

60
- 79959846806
- A procedure for estimating gestural scores from natural speech
- H. Nam, V. Mitra, M. Tiede, E. Saltzman, L. Goldstein, C. Espy-Wilson, and M. Hasegawa-Johnson, "A procedure for estimating gestural scores from natural speech," in Proc. Interspeech, 2010, pp. 30-33.
- (2010) Proc. Interspeech , pp. 30-33
- Nam, H.¹ Mitra, V.² Tiede, M.³ Saltzman, E.⁴ Goldstein, L.⁵ Espy-Wilson, C.⁶ Hasegawa-Johnson, M.⁷

61
- 70349213974
- From acoustics to vocal tract time functions
- V. Mitra, I. Özbek, H. Nam, X. Zhou, and C. Espy-Wilson, "From acoustics to vocal tract time functions," in Proc. ICASSP, 2009, pp. 4497-4500.
- (2009) Proc. ICASSP , pp. 4497-4500
- Mitra, V.¹ Özbek, I.² Nam, H.³ Zhou, X.⁴ Espy-Wilson, C.⁵

62
- 33846700692
- Los Alamos, NM, Tech. Rep., LA-UR-96-3945
- J. Hogden, D. Nix, and P. Valdez, An articulatorily constrained, maximum likelihood approach to speech recognition Los Alamos National Laboratory, Los Alamos, NM, Tech. Rep., LA-UR-96-3945, 1998.
- (1998) An Articulatorily Constrained, Maximum Likelihood Approach to Speech Recognition Los Alamos National Laboratory
- Hogden, J.¹ Nix, D.² Valdez, P.³

63
- 0024909979
- Some statistical issues in the comparison of speech recognition algorithms
- L. Gillick and S. Cox, "Some statistical issues in the comparison of speech recognition algorithms," in Proc. ICASSP, 1989, vol. 1, pp. 532-535. (Pubitemid 20604171)
- (1989) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.1 , pp. 532-535
- Gillick, L.¹ Cox, S.J.²

64
- 70450212952
- A noisetype and level-dependent MPO-based speech enhancement architecture with variable frame analysis for noise-robust speech recognition
- Brighton, U.K.
- V. Mitra, B. J. Borgstrom, C. Espy-Wilson, and A. Alwan, "A noisetype and level-dependent MPO-based speech enhancement architecture with variable frame analysis for noise-robust speech recognition," in Proc. Interspeech, Brighton, U.K., 2009, pp. 2751-2754.
- (2009) Proc. Interspeech , pp. 2751-2754
- Mitra, V.¹ Borgstrom, B.J.² Espy-Wilson, C.³ Alwan, A.⁴

65
- 0021892216
- Speech enhancement using a minimum mean square log-spectral amplitude estimator
- Apr.
- Y. Ephraim and D. Malah, "Speech enhancement using a minimum mean square log-spectral amplitude estimator," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-33, no. 2, pp. 443-445, Apr. 1985.
- (1985) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-33 , Issue.2 , pp. 443-445
- Ephraim, Y.¹ Malah, D.²

66
- 77955810460
- A study on the generalization capability of acoustic models for robust speech recognition
- Aug.
- X. Xiao, J. Li, E. S. Chng, H. Li, and C. Lee, "A study on the generalization capability of acoustic models for robust speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 6, pp. 1158-1169, Aug. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.6 , pp. 1158-1169
- Xiao, X.¹ Li, J.² Chng, E.S.³ Li, H.⁴ Lee, C.⁵

67
- 51849099743
- A study of variable-parameter gaussian mixture hidden Markov modeling for noisy speech recognition
- May
- X. Cui and Y. Gong, "A study of variable-parameter gaussian mixture hidden Markov modeling for noisy speech recognition," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 4, pp. 1366-1376, May 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.4 , pp. 1366-1376
- Cui, X.¹ Gong, Y.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.