SCOPUS Information Retrieval Platform

Computer Speech and Language

Volume 21, Issue 1, 2007, Pages 72-87

Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation

(3) Yu, Dong (a); Deng, Li (a); Acero, Alex (a)

(a) MICROSOFT RESEARCH (United States)

Author keywords

[No Author keywords available]

Indexed keywords

HIDDEN TRAJECTORY MODEL; SPEAKER ADAPTIVE LEARNING; SPEAKER INDEPENDENT; VOCAL TRACT RESONANCE (VTR);

ACOUSTIC PROPERTIES; ADAPTIVE SYSTEMS; LEARNING ALGORITHMS; LEARNING SYSTEMS; RESONANCE; SPEECH RECOGNITION;

SPEECH ANALYSIS;

EID: 33749541517 PISSN: 08852308 EISSN: 10958363 Source Type: Journal
DOI: 10.1016/j.csl.2005.12.002 Document Type: Article

Times cited : (13)

References (31)

1
- 33749582201
- Bakis, R., 1991. Coarticulation modeling with continuous-state HMMs. In: Proceedings of the IEEE Workshop Automatic Speech Recognition, Harriman, New York, pp. 20-21.

2
- 0037841402
- Graphical models and automatic speech recognition
- Johnson M., Ostendorf M., Khudanpur S., and Rosenfeld R. (Eds), Springer, New York
- Bilmes J. Graphical models and automatic speech recognition. In: Johnson M., Ostendorf M., Khudanpur S., and Rosenfeld R. (Eds). Mathematical Foundations of Speech and Language Processing (2004), Springer, New York 135-186
- (2004) Mathematical Foundations of Speech and Language Processing , pp. 135-186
- Bilmes, J.¹

3
- 33749551679
- Bridle, J., Deng, L., Picone, J., et al., 1998. An investigation of segmental hidden dynamic models of speech coarticulation for automatic speech recognition. Final Report for the 1998 Workshop on Language Engineering, Center for Language and Speech Processing at Johns Hopkins University, pp. 1-61.

4
- 0034295822
- Structured language modeling
- Chelba C., and Jelinek F. Structured language modeling. Computer Speech Lang. October (2000) 283-332
- (2000) Computer Speech Lang. , Issue.October , pp. 283-332
- Chelba, C.¹ Jelinek, F.²

5
- 0032119268
- A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition
- Deng L. A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition. Speech Commun. 24 4 (1998) 299-323
- (1998) Speech Commun. , vol.24 , Issue.4 , pp. 299-323
- Deng, L.¹

6
- 33744966595
- Switching dynamic system models for speech articulation and acoustics
- Johnson M., Ostendorf M., Khudanpur S., and Rosenfeld R. (Eds), Springer, New York
- Deng L. Switching dynamic system models for speech articulation and acoustics. In: Johnson M., Ostendorf M., Khudanpur S., and Rosenfeld R. (Eds). Mathematical Foundations of Speech and Language Processing (2004), Springer, New York 115-134
- (2004) Mathematical Foundations of Speech and Language Processing , pp. 115-134
- Deng, L.¹

7
- 0028088646
- Context-dependent Markov model structured by locus equations: applications to phonetic classification
- Deng L., and Braam D. Context-dependent Markov model structured by locus equations: applications to phonetic classification. J. Acoust. Soc. Am. 96 (1994) 2008-2025
- (1994) J. Acoust. Soc. Am. , vol.96 , pp. 2008-2025
- Deng, L.¹ Braam, D.²

8
- 4243117872
- Marcel Dekker, New York, NY
- Deng L., and O'Shaughnessy D. Speech Processing - A Dynamic and Optimization-Oriented Approach (2003), Marcel Dekker, New York, NY
- (2003) Speech Processing - A Dynamic and Optimization-Oriented Approach
- Deng, L.¹ O'Shaughnessy, D.²

9
- 4544323815
- Deng, L., Lee, L., Attias, H., Acero, A., 2004a. A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances. In: IEEE Proceedings of ICASSP, May 2004, vol. I, pp. 557-560.

10
- 84876465692
- Deng, L., Yu, D., Acero, A., 2004b. A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech. ICSLP 2004, Jeju, Korea.

11
- 33746456716
- Deng, L., Acero, A., Bazzi, I., 2006a. Tracking vocal tract resonances using a quantized nonlinear function embedded in a temporal constraint. IEEE Trans. Speech Audio Process 14 (2), in press.

12
- 33744966561
- A bi-directional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition
- Deng L., Yu D., and Acero A. A bi-directional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition. IEEE Trans. Speech Audio Process 14 1 (2006) 256-265
- (2006) IEEE Trans. Speech Audio Process , vol.14 , Issue.1 , pp. 256-265
- Deng, L.¹ Yu, D.² Acero, A.³

13
- 0029725604
- Eide, E., Gish, H., 1996. A parametric approach to vocal tract length normalization. In: IEEE Proceedings of ICASSP, pp. 346-348.

14
- 85009110670
- Gao, Y., Bakis, R., Huang, J., Zhang, B., 2000. Multistage coarticulation model combining articulatory, formant and cepstral features. In: Proceedings of ICSLP, vol. 1, pp. 25-28.

15
- 0032673963
- Probabilistic-trajectory segmental HMMs
- Holmes W., and Russell M. Probabilistic-trajectory segmental HMMs. Computer Speech Lang. 13 (1999) 3-37
- (1999) Computer Speech Lang. , vol.13 , pp. 3-37
- Holmes, W.¹ Russell, M.²

16
- 0003919964
- Vocal tract normalization in speech recognition: compensating for systematic speaker variability
- CLSP, Johns Hopkins University, Baltimore, MD
- Kamm T., Andreou G., and Cohen J. Vocal tract normalization in speech recognition: compensating for systematic speaker variability. Proceedings of the 15th Annual Speech Research Symposium (1995), CLSP, Johns Hopkins University, Baltimore, MD 161-167
- (1995) Proceedings of the 15th Annual Speech Research Symposium , pp. 161-167
- Kamm, T.¹ Andreou, G.² Cohen, J.³

17
- 0018986665
- Software for a cascade/parallel formant synthesizer
- Klatt D. Software for a cascade/parallel formant synthesizer. J. Acoust. Soc. Am. 67 3 (1980) 971-995
- (1980) J. Acoust. Soc. Am. , vol.67 , Issue.3 , pp. 971-995
- Klatt, D.¹

18
- 0031647824
- A frequency warping approach to speaker normalization
- Lee L., and Rose R. A frequency warping approach to speaker normalization. IEEE Trans. Speech Audio Process. 6 (1998) 49-60
- (1998) IEEE Trans. Speech Audio Process. , vol.6 , pp. 49-60
- Lee, L.¹ Rose, R.²

19
- 0347968275
- Efficient decoding strategies for conversational speech recognition using a constrained nonlinear state-space model for vocal-tract-resonance dynamics
- Ma J., and Deng L. Efficient decoding strategies for conversational speech recognition using a constrained nonlinear state-space model for vocal-tract-resonance dynamics. IEEE Trans. Speech Audio Process. 11 (2003) 590-602
- (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 590-602
- Ma, J.¹ Deng, L.²

20
- 33749546670
- McDonough, J., Byrne, W., Luo, X., 1998. Speaker normalization with all-pass transforms. In: Proceedings of ICSLP, vol. 6, pp. 2307-2310.

21
- 0036497667
- Speaker clustering for speech recognition using vocal-tract parameters
- Naito M., Deng L., and Sagisaka Y. Speaker clustering for speech recognition using vocal-tract parameters. Speech Commun. 36 3-4 (2002) 305-315
- (2002) Speech Commun. , vol.36 , Issue.3-4 , pp. 305-315
- Naito, M.¹ Deng, L.² Sagisaka, Y.³

22
- 0030245363
- From HMMs to segment models: a unified view of stochastic modeling for speech recognition
- Ostendorf M., Digalakis V., and Rohlicek J. From HMMs to segment models: a unified view of stochastic modeling for speech recognition. IEEE Trans. Speech Audio Process. 4 (1996) 360-378
- (1996) IEEE Trans. Speech Audio Process. , vol.4 , pp. 360-378
- Ostendorf, M.¹ Digalakis, V.² Rohlicek, J.³

23
- 0030672082
- Pye, D., Woodland, P.C., 1997. Experiments in speaker normalisation and adaptation for large vocabulary speech recognition. In: IEEE Proceedings of ICASSP, pp. 1047-1050.

24
- 0030008004
- The potential role of speech production models in automatic speech recognition
- Rose R., Schroeter J., and Sondhi M. The potential role of speech production models in automatic speech recognition. J. Acoust. Soc. Am. 99 (1996) 1699-1709
- (1996) J. Acoust. Soc. Am. , vol.99 , pp. 1699-1709
- Rose, R.¹ Schroeter, J.² Sondhi, M.³

25
- 0036165806
- An overlapping-feature based phonological model incorporating linguistic constraints: applications to speech recognition
- Sun J., and Deng L. An overlapping-feature based phonological model incorporating linguistic constraints: applications to speech recognition. J. Acoust. Soc. Am. 111 2 (2002) 1086-1101
- (2002) J. Acoust. Soc. Am. , vol.111 , Issue.2 , pp. 1086-1101
- Sun, J.¹ Deng, L.²

26
- 4544383109
- Wang, W., Stolcke, A., Harper, M., 2004. The use of a linguistically motivated language model in conversational speech recognition. In: IEEE Proceedings of ICASSP, May 2004.

27
- 0029764708
- Wegmann, S., McAllaster, D., Orloff, J., Peskin, B., 1996. Speaker normalization on conversational telephone speech. In: IEEE Proceedings of ICASSP, pp. 339-341.

28
- 0001390960
- Welling, L., Haeb-Umbach, R., Aubert, X., Haberland, N., 1998. A study on speaker normalization using vocal tract normalization and speaker adaptive training. In: IEEE Proceedings of ICASSP, Seattle, WA, May 1998, vol. 2, pp. 797-800.

29
- 33749560872
- Zhan, P., Waibel, A., 1997. Vocal tract length normalization for large vocabulary continuous speech recognition. CMU-CS-97-148, Carnegie Mellon University, Pittsburgh, PA, May 1997.

30
- 0030705337
- Zhan, P., Westphal, M., 1997. Speaker normalization based on frequency warping. In: IEEE Proceedings of ICASSP, pp. 1039-1042.

31
- 0141702226
- Zhou, J., Seide, F., Deng, L., 2003. Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM. In: IEEE Proceedings of ICASSP, April 2003, vol. I, pp. 744-747.

* This information was extracted and analyzed by KISTI from Elsevier's SCOPUS database.