메뉴 건너뛰기




Volumn 21, Issue 1, 2007, Pages 72-87

Speaker-adaptive learning of resonance targets in a hidden trajectory model of speech coarticulation

Author keywords

[No Author keywords available]

Indexed keywords

HIDDEN TRAJECTORY MODEL; SPEAKER ADAPTIVE LEARNING; SPEAKER INDEPENDENT; VOCAL TRACT RESONANCE (VTR);

EID: 33749541517     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2005.12.002     Document Type: Article
Times cited : (13)

References (31)
  • 1
    • 33749582201 scopus 로고    scopus 로고
    • Bakis, R., 1991. Coarticulation modeling with continuous-state HMMs. In: Proceedings of the IEEE Workshop Automatic Speech Recognition, Harriman, New York, pp. 20-21.
  • 2
    • 0037841402 scopus 로고    scopus 로고
    • Graphical models and automatic speech recognition
    • Johnson M., Ostendorf M., Khudanpur S., and Rosenfeld R. (Eds), Springer, New York
    • Bilmes J. Graphical models and automatic speech recognition. In: Johnson M., Ostendorf M., Khudanpur S., and Rosenfeld R. (Eds). Mathematical Foundations of Speech and Language Processing (2004), Springer, New York 135-186
    • (2004) Mathematical Foundations of Speech and Language Processing , pp. 135-186
    • Bilmes, J.1
  • 3
    • 33749551679 scopus 로고    scopus 로고
    • Bridle, J., Deng, L., Picone, J., et al., 1998. An investigation of segmental hidden dynamic models of speech coarticulation for automatic speech recognition. Final Report for the 1998 Workshop on Language Engineering, Center for Language and Speech Processing at Johns Hopkins University, pp. 1-61.
  • 4
    • 0034295822 scopus 로고    scopus 로고
    • Structured language modeling
    • Chelba C., and Jelinek F. Structured language modeling. Computer Speech Lang. October (2000) 283-332
    • (2000) Computer Speech Lang. , Issue.October , pp. 283-332
    • Chelba, C.1    Jelinek, F.2
  • 5
    • 0032119268 scopus 로고    scopus 로고
    • A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition
    • Deng L. A dynamic, feature-based approach to the interface between phonology and phonetics for speech modeling and recognition. Speech Commun. 24 4 (1998) 299-323
    • (1998) Speech Commun. , vol.24 , Issue.4 , pp. 299-323
    • Deng, L.1
  • 6
    • 33744966595 scopus 로고    scopus 로고
    • Switching dynamic system models for speech articulation and acoustics
    • Johnson M., Ostendorf M., Khudanpur S., and Rosenfeld R. (Eds), Springer, New York
    • Deng L. Switching dynamic system models for speech articulation and acoustics. In: Johnson M., Ostendorf M., Khudanpur S., and Rosenfeld R. (Eds). Mathematical Foundations of Speech and Language Processing (2004), Springer, New York 115-134
    • (2004) Mathematical Foundations of Speech and Language Processing , pp. 115-134
    • Deng, L.1
  • 7
    • 0028088646 scopus 로고
    • Context-dependent Markov model structured by locus equations: applications to phonetic classification
    • Deng L., and Braam D. Context-dependent Markov model structured by locus equations: applications to phonetic classification. J. Acoust. Soc. Am. 96 (1994) 2008-2025
    • (1994) J. Acoust. Soc. Am. , vol.96 , pp. 2008-2025
    • Deng, L.1    Braam, D.2
  • 9
    • 4544323815 scopus 로고    scopus 로고
    • Deng, L., Lee, L., Attias, H., Acero, A., 2004a. A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances. In: IEEE Proceedings of ICASSP, May 2004, vol. I, pp. 557-560.
  • 10
    • 84876465692 scopus 로고    scopus 로고
    • Deng, L., Yu, D., Acero, A., 2004b. A quantitative model for formant dynamics and contextually assimilated reduction in fluent speech. ICSLP 2004, Jeju, Korea.
  • 11
    • 33746456716 scopus 로고    scopus 로고
    • Deng, L., Acero, A., Bazzi, I., 2006a. Tracking vocal tract resonances using a quantized nonlinear function embedded in a temporal constraint. IEEE Trans. Speech Audio Process 14 (2), in press.
  • 12
    • 33744966561 scopus 로고    scopus 로고
    • A bi-directional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition
    • Deng L., Yu D., and Acero A. A bi-directional target-filtering model of speech coarticulation and reduction: two-stage implementation for phonetic recognition. IEEE Trans. Speech Audio Process 14 1 (2006) 256-265
    • (2006) IEEE Trans. Speech Audio Process , vol.14 , Issue.1 , pp. 256-265
    • Deng, L.1    Yu, D.2    Acero, A.3
  • 13
    • 0029725604 scopus 로고    scopus 로고
    • Eide, E., Gish, H., 1996. A parametric approach to vocal tract length normalization. In: IEEE Proceedings of ICASSP, pp. 346-348.
  • 14
    • 85009110670 scopus 로고    scopus 로고
    • Gao, Y., Bakis, R., Huang, J., Zhang, B., 2000. Multistage coarticulation model combining articulatory, formant and cepstral features. In: Proceedings of ICSLP, vol. 1, pp. 25-28.
  • 15
    • 0032673963 scopus 로고    scopus 로고
    • Probabilistic-trajectory segmental HMMs
    • Holmes W., and Russell M. Probabilistic-trajectory segmental HMMs. Computer Speech Lang. 13 (1999) 3-37
    • (1999) Computer Speech Lang. , vol.13 , pp. 3-37
    • Holmes, W.1    Russell, M.2
  • 16
    • 0003919964 scopus 로고
    • Vocal tract normalization in speech recognition: compensating for systematic speaker variability
    • CLSP, Johns Hopkins University, Baltimore, MD
    • Kamm T., Andreou G., and Cohen J. Vocal tract normalization in speech recognition: compensating for systematic speaker variability. Proceedings of the 15th Annual Speech Research Symposium (1995), CLSP, Johns Hopkins University, Baltimore, MD 161-167
    • (1995) Proceedings of the 15th Annual Speech Research Symposium , pp. 161-167
    • Kamm, T.1    Andreou, G.2    Cohen, J.3
  • 17
    • 0018986665 scopus 로고
    • Software for a cascade/parallel formant synthesizer
    • Klatt D. Software for a cascade/parallel formant synthesizer. J. Acoust. Soc. Am. 99 3 (1980) 971-995
    • (1980) J. Acoust. Soc. Am. , vol.99 , Issue.3 , pp. 971-995
    • Klatt, D.1
  • 18
    • 0031647824 scopus 로고    scopus 로고
    • A frequency warping approach to speaker normalization
    • Lee L., and Rose R. A frequency warping approach to speaker normalization. IEEE Trans. Speech Audio Process. 6 (1998) 49-60
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , pp. 49-60
    • Lee, L.1    Rose, R.2
  • 19
    • 0347968275 scopus 로고    scopus 로고
    • Efficient decoding strategies for conversational speech recognition using a constrained nonlinear state-space model for vocal-tract-resonance dynamics
    • Ma J., and Deng L. Efficient decoding strategies for conversational speech recognition using a constrained nonlinear state-space model for vocal-tract-resonance dynamics. IEEE Trans. Speech Audio Process. 11 (2003) 590-602
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , pp. 590-602
    • Ma, J.1    Deng, L.2
  • 20
    • 33749546670 scopus 로고    scopus 로고
    • McDonough, J., Byrne, W., Luo, X., 1998. Speaker normalization with all-pass transforms. In: Proceedings of ICSLP, vol. 6, pp. 2307-2310.
  • 21
    • 0036497667 scopus 로고    scopus 로고
    • Speaker clustering for speech recognition using vocal-tract parameters
    • Naito M., Deng L., and Sagisaka Y. Speaker clustering for speech recognition using vocal-tract parameters. Speech Commun. 36 3-4 (2002) 305-315
    • (2002) Speech Commun. , vol.36 , Issue.3-4 , pp. 305-315
    • Naito, M.1    Deng, L.2    Sagisaka, Y.3
  • 22
    • 0030245363 scopus 로고    scopus 로고
    • From HMMs to segment models: a unified view of stochastic modeling for speech recognition
    • Ostendorf M., Digalakis V., and Rohlicek J. From HMMs to segment models: a unified view of stochastic modeling for speech recognition. IEEE Trans. Speech Audio Process. 4 (1996) 360-378
    • (1996) IEEE Trans. Speech Audio Process. , vol.4 , pp. 360-378
    • Ostendorf, M.1    Digalakis, V.2    Rohlicek, J.3
  • 23
    • 0030672082 scopus 로고    scopus 로고
    • Pye, D., Woodland, P.C., 1997. Experiments in speaker normalisation and adaptation for large vocabulary speech recognition. In: IEEE Proceedings of ICASSP, pp. 1047-1050.
  • 24
    • 0030008004 scopus 로고    scopus 로고
    • The potential role of speech production models in automatic speech recognition
    • Rose R., Schroeter J., and Sondhi M. The potential role of speech production models in automatic speech recognition. J. Acoust. Soc. Am. 99 (1996) 1699-1709
    • (1996) J. Acoust. Soc. Am. , vol.99 , pp. 1699-1709
    • Rose, R.1    Schroeter, J.2    Sondhi, M.3
  • 25
    • 0036165806 scopus 로고    scopus 로고
    • An overlapping-feature based phonological model incorporating linguistic constraints: applications to speech recognition
    • Sun J., and Deng L. An overlapping-feature based phonological model incorporating linguistic constraints: applications to speech recognition. J. Acoust. Soc. Am. 111 2 (2002) 1086-1101
    • (2002) J. Acoust. Soc. Am. , vol.111 , Issue.2 , pp. 1086-1101
    • Sun, J.1    Deng, L.2
  • 26
    • 4544383109 scopus 로고    scopus 로고
    • Wang, W., Stolcke, A., Harper, M., 2004. The use of a linguistically motivated language model in conversational speech recognition. In: IEEE Proceedings of ICASSP, May 2004.
  • 27
    • 0029764708 scopus 로고    scopus 로고
    • Wegmann, S., McAllaster, D., Orloff, J., Peskin, B., 1996. Speaker normalization on conversational telephone speech. In: IEEE Proceedings of ICASSP, pp. 339-341.
  • 28
    • 0001390960 scopus 로고    scopus 로고
    • Welling, L., Haeb-Umbach, R., Aubert, X., Haberland, N., 1998. A study on speaker normalization using vocal tract normalization and speaker adaptive training. In: IEEE Proceedings of ICASSP, Seattle, WA, May 1998, vol. 2, pp. 797-800.
  • 29
    • 33749560872 scopus 로고    scopus 로고
    • Zhan, P., Waibel, A., 1997. Vocal tract length normalization for large vocabulary continuous speech recognition. CMU-CS-97-148, Carnegie Mellon University, Pittsburgh, PA, May 1997.
  • 30
    • 0030705337 scopus 로고    scopus 로고
    • Zhan, P., Westphal, M., 1997. Speaker normalization based on frequency warping. In: IEEE Proceedings of ICASSP, pp. 1039-1042.
  • 31
    • 0141702226 scopus 로고    scopus 로고
    • Zhou, J., Seide, F., Deng. L., 2003. Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM. In: IEEE Proceedings of ICASSP, April 2003, vol. I, pp. 744-747.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.