메뉴 건너뛰기




Volumn 14, Issue 2, 2006, Pages 425-434

Tracking vocal tract resonances using a quantized nonlinear function embedded in a temporal constraint

Author keywords

Cepstrum; Continuity constraint; Dynamic programming; Expectation maximization (EM) optimization; Formant; Greedy search, linear predictive coding (LPC); Nonlinear prediction, prediction residual; Quantization; Vocal tract resonance (VTR)

Indexed keywords

CEPSTRUM; EXPECTATION MAXIMIZATION (EM) OPTIMIZATIONS; GAUSSIAN VECTORS; GREEDY SEARCH; LINEAR PREDICTIVE CODING (LPC); RESONANCE TRACKING; VOCAL TRACT RESONANCE (VTR);

EID: 33746456716     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSA.2005.855841     Document Type: Article
Times cited : (36)

References (30)
  • 1
    • 85135264071 scopus 로고    scopus 로고
    • Formant analysis and synthesis using hidden Markov models
    • Budapest, Hungary, Sep
    • A. Acero, "Formant analysis and synthesis using hidden Markov models," in Proc. Enrospeech, Budapest, Hungary, Sep. 1999.
    • (1999) Proc. Enrospeech
    • Acero, A.1
  • 3
    • 0141814630 scopus 로고    scopus 로고
    • An expectation-maximization approach for formant tracking using a parameter-free nonlinear predictor
    • Hong Kong, Apr
    • I. Bazzi, A. Acero, and L. Deng, "An expectation-maximization approach for formant tracking using a parameter-free nonlinear predictor," in Proc. ICASSP, Hong Kong, Apr. 2003.
    • (2003) Proc. ICASSP
    • Bazzi, I.1    Acero, A.2    Deng, L.3
  • 4
    • 0037567933 scopus 로고
    • Formant estimation by linear transformation of the LPC cepstrum
    • D. Broad and F. Clermont, "Formant estimation by linear transformation of the LPC cepstrum," J. Acoust. Soc. Amer, vol. 86, pp. 2013-2017, 1989.
    • (1989) J. Acoust. Soc. Amer , vol.86 , pp. 2013-2017
    • Broad, D.1    Clermont, F.2
  • 5
    • 17344378368 scopus 로고    scopus 로고
    • Robust formant tracking in noise
    • Orlando, FL
    • I. Bruce, N. Karkhanis, E. Young, and M. Sachs, "Robust formant tracking in noise," in Proc. ICASSP, Orlando, FL, 2002, pp. 281-284.
    • (2002) Proc. ICASSP , pp. 281-284
    • Bruce, I.1    Karkhanis, N.2    Young, E.3    Sachs, M.4
  • 6
    • 0002629270 scopus 로고
    • Maximum likelihood from incomplete data via the EM algorithm
    • A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. 1, pp. 1-38, 1977.
    • (1977) J. R. Statist. Soc , vol.39 , Issue.1 , pp. 1-38
    • Dempster, A.P.1    Laird, N.M.2    Rubin, D.B.3
  • 8
    • 85009211881 scopus 로고    scopus 로고
    • Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint
    • L. Deng, I. Bazzi, and A. Acero, 'Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint," in Proc. Eurospeech, vol. I, 2003, pp. 73-76.
    • (2003) Proc. Eurospeech , vol.1 , pp. 73-76
    • Deng, L.1    Bazzi, I.2    Acero, A.3
  • 9
    • 0023516708 scopus 로고
    • A composite auditory model for processing speech sounds
    • Dec
    • L. Deng and D. Geisler, "A composite auditory model for processing speech sounds," J. Acoust. Soc. Amer., vol. 82, pp. 2001-2012, Dec. 1987.
    • (1987) J. Acoust. Soc. Amer , vol.82 , pp. 2001-2012
    • Deng, L.1    Geisler, D.2
  • 10
    • 0033623527 scopus 로고    scopus 로고
    • Spontaneous speech recognition using a statistical coarticulatory model for vocal-tract-resonance dynamics
    • L. Deng and J. Ma, "Spontaneous speech recognition using a statistical coarticulatory model for vocal-tract-resonance dynamics," J. Acoust. Soc. Amer., vol. 108, pp. 3036-3048, 2000.
    • (2000) J. Acoust. Soc. Amer , vol.108 , pp. 3036-3048
    • Deng, L.1    Ma, J.2
  • 11
    • 56149108822 scopus 로고    scopus 로고
    • Recovering vocal tract shapes from MFCC parameters
    • S. Dusan and L. Deng, "Recovering vocal tract shapes from MFCC parameters," in Proc. ICSLP, 1998, pp. 3087-3090.
    • (1998) Proc. ICSLP , pp. 3087-3090
    • Dusan, S.1    Deng, L.2
  • 13
    • 85009110670 scopus 로고    scopus 로고
    • Multistage coarticulation model combining articulatory, formant, and cepstral features
    • Y. Gao, R. Bakis, J. Huang, and B. Zhang, "Multistage coarticulation model combining articulatory, formant, and cepstral features," in Proc. ICSLP, vol. 1, 2000, pp. 25-28.
    • (2000) Proc. ICSLP , vol.1 , pp. 25-28
    • Gao, Y.1    Bakis, R.2    Huang, J.3    Zhang, B.4
  • 14
    • 85016587886 scopus 로고
    • Switchboard: Telephone speech corpus for research and development
    • J. Godfrey, E. Holliman, and J. McDaniel, "Switchboard: Telephone speech corpus for research and development," in Proc. ICASSP, 1992, pp. 517-520.
    • (1992) Proc. ICASSP , pp. 517-520
    • Godfrey, J.1    Holliman, E.2    McDaniel, J.3
  • 15
    • 0024879199 scopus 로고
    • The effective second formant F2 and the vocal tract front-cavity
    • H. Hermansky and D. Broad, "The effective second formant F2 and the vocal tract front-cavity," in Proc. ICASSP, vol. 1, 1989, pp. 480-183.
    • (1989) Proc. ICASSP , vol.1 , pp. 480-183
    • Hermansky, H.1    Broad, D.2
  • 16
    • 33947096168 scopus 로고    scopus 로고
    • J. Hogberg, Prediction of formant frequencies from linear combinations of filterbank and cepstral coefficients, Royal Inst. Technol., Stockholm, Sweden, KTH-STL Quarterly Progress Rep., 1997.
    • J. Hogberg, "Prediction of formant frequencies from linear combinations of filterbank and cepstral coefficients," Royal Inst. Technol., Stockholm, Sweden, KTH-STL Quarterly Progress Rep., 1997.
  • 17
    • 85032644657 scopus 로고    scopus 로고
    • Using formant frequencies in speech recognition
    • Rhodes, Greece, Sep
    • J. Holmes, W. Holmes, and P. Garner, "Using formant frequencies in speech recognition," in Proc. Eurospeech, Rhodes, Greece, Sep. 1997, pp. 2083-2086.
    • (1997) Proc. Eurospeech , pp. 2083-2086
    • Holmes, J.1    Holmes, W.2    Garner, P.3
  • 18
    • 0037410755 scopus 로고    scopus 로고
    • Bandwidth-adjusted LPC analysis for robust speech recognition
    • C. S. Huang and H. C. Wang, "Bandwidth-adjusted LPC analysis for robust speech recognition," Pattern Recognit. Lett., vol. 24, pp. 1583-1587, 2003.
    • (2003) Pattern Recognit. Lett , vol.24 , pp. 1583-1587
    • Huang, C.S.1    Wang, H.C.2
  • 19
    • 0018986665 scopus 로고
    • Software for a cascade/parallel formant synthesizer
    • D. Klatt, "Software for a cascade/parallel formant synthesizer," J. Acoust. Soc. Amer., vol. 67, pp. 971-995, 1980.
    • (1980) J. Acoust. Soc. Amer , vol.67 , pp. 971-995
    • Klatt, D.1
  • 20
    • 4544367684 scopus 로고
    • Formant tracking using HMM's and vector quantization
    • G. Kopec, "Formant tracking using HMM's and vector quantization," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 709-729, 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , pp. 709-729
    • Kopec, G.1
  • 21
    • 0016049328 scopus 로고
    • An algorithm for automatic formant extraction using linear prediction spectra
    • S. McCandless, "An algorithm for automatic formant extraction using linear prediction spectra," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-22, pp. 135-141, 1974.
    • (1974) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-22 , pp. 135-141
    • McCandless, S.1
  • 22
    • 0038359547 scopus 로고    scopus 로고
    • Modeling uncertainty in recovering articulation from acoustics
    • K. Richmond, S. King, and P. Taylor, "Modeling uncertainty in recovering articulation from acoustics," Comput. Speech Lang., vol. 17, pp. 153-172, 2003.
    • (2003) Comput. Speech Lang , vol.17 , pp. 153-172
    • Richmond, K.1    King, S.2    Taylor, P.3
  • 23
    • 0141702226 scopus 로고    scopus 로고
    • Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM-MAP decoding and evaluation
    • F. Seide, J. Zhou, and L. Deng, "Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM-MAP decoding and evaluation," in Proc. ICASSP, 2003, pp. 748-751.
    • (2003) Proc. ICASSP , pp. 748-751
    • Seide, F.1    Zhou, J.2    Deng, L.3
  • 25
    • 84912906590 scopus 로고
    • Constraints among parameters simplify control of Klatt formant synthesizer
    • K. Stevens and C. Bickley, "Constraints among parameters simplify control of Klatt formant synthesizer," J. Phonetics, vol. 19, pp. 161-174, 1991.
    • (1991) J. Phonetics , vol.19 , pp. 161-174
    • Stevens, K.1    Bickley, C.2
  • 26
    • 85009067878 scopus 로고    scopus 로고
    • Data-driven model construction for continuous speech recognition using overlapping articulatory features
    • J. Sun, L. Deng, and X. Jing, "Data-driven model construction for continuous speech recognition using overlapping articulatory features," in Proc. ICSLP, vol. 1, 2000, pp. 437-440.
    • (2000) Proc. ICSLP , vol.1 , pp. 437-440
    • Sun, J.1    Deng, L.2    Jing, X.3
  • 27
    • 33947157387 scopus 로고    scopus 로고
    • D. Talkin, Speech formant trajectory estimation using dynamic programming with modulated transition costs, J. Acoust. Soc. Amer., S1, p. S55, 1987.
    • D. Talkin, "Speech formant trajectory estimation using dynamic programming with modulated transition costs," J. Acoust. Soc. Amer., vol. S1, p. S55, 1987.
  • 29
    • 4544278205 scopus 로고    scopus 로고
    • Formant tracking by mixture state particle filter
    • Y. Zheng and M. Hasegawa-Johnson, "Formant tracking by mixture state particle filter," in Proc. ICASSP, vol. 1, 2004, pp. 565-568.
    • (2004) Proc. ICASSP , vol.1 , pp. 565-568
    • Zheng, Y.1    Hasegawa-Johnson, M.2
  • 30
    • 33947175283 scopus 로고    scopus 로고
    • Formant analysis using mixtures of Gaussians
    • Rhodes, Greece
    • P. Zolfaghari and T. Robinson, "Formant analysis using mixtures of Gaussians," in Proc. Eurospeech, Rhodes, Greece, 1997, pp. 2539-2542.
    • (1997) Proc. Eurospeech , pp. 2539-2542
    • Zolfaghari, P.1    Robinson, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.