SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 14, Issue 2, 2006, Pages 425-434

Tracking vocal tract resonances using a quantized nonlinear function embedded in a temporal constraint

(3) Deng, Li a,b Acero, Alex a,b Bazzi, Issam b

Author keywords

Cepstrum; Continuity constraint; Dynamic programming; Expectation maximization (EM) optimization; Formant; Greedy search, linear predictive coding (LPC); Nonlinear prediction, prediction residual; Quantization; Vocal tract resonance (VTR)

Indexed keywords

CEPSTRUM; EXPECTATION MAXIMIZATION (EM) OPTIMIZATIONS; GAUSSIAN VECTORS; GREEDY SEARCH; LINEAR PREDICTIVE CODING (LPC); RESONANCE TRACKING; VOCAL TRACT RESONANCE (VTR);

ALGORITHMS; BANDWIDTH; DYNAMIC PROGRAMMING; NATURAL FREQUENCIES; NONLINEAR SYSTEMS; SPEECH CODING; TEMPORAL LOGIC; VECTORS;

SPEECH COMMUNICATION;

EID: 33746456716 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TSA.2005.855841 Document Type: Article

Times cited : (36)

References (30)

1
- 85135264071
- Formant analysis and synthesis using hidden Markov models
- Budapest, Hungary, Sep
- A. Acero, "Formant analysis and synthesis using hidden Markov models," in Proc. Enrospeech, Budapest, Hungary, Sep. 1999.
- (1999) Proc. Enrospeech
- Acero, A.¹

2
- 0003724033
- Cambridge, U.K, Cambridge Univ. Press
- J. Allen, M. S. Hunnicutt, and D. Klatt, From Text to Speech: The MITalk System. Cambridge, U.K.: Cambridge Univ. Press, 1987.
- (1987) From Text to Speech: The MITalk System
- Allen, J.¹ Hunnicutt, M.S.² Klatt, D.³

3
- 0141814630
- An expectation-maximization approach for formant tracking using a parameter-free nonlinear predictor
- Hong Kong, Apr
- I. Bazzi, A. Acero, and L. Deng, "An expectation-maximization approach for formant tracking using a parameter-free nonlinear predictor," in Proc. ICASSP, Hong Kong, Apr. 2003.
- (2003) Proc. ICASSP
- Bazzi, I.¹ Acero, A.² Deng, L.³

4
- 0037567933
- Formant estimation by linear transformation of the LPC cepstrum
- D. Broad and F. Clermont, "Formant estimation by linear transformation of the LPC cepstrum," J. Acoust. Soc. Amer, vol. 86, pp. 2013-2017, 1989.
- (1989) J. Acoust. Soc. Amer , vol.86 , pp. 2013-2017
- Broad, D.¹ Clermont, F.²

5
- 17344378368
- Robust formant tracking in noise
- Orlando, FL
- I. Bruce, N. Karkhanis, E. Young, and M. Sachs, "Robust formant tracking in noise," in Proc. ICASSP, Orlando, FL, 2002, pp. 281-284.
- (2002) Proc. ICASSP , pp. 281-284
- Bruce, I.¹ Karkhanis, N.² Young, E.³ Sachs, M.⁴

6
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. P. Dempster, N. M. Laird, and D. B. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," J. R. Statist. Soc., vol. 39, no. 1, pp. 1-38, 1977.
- (1977) J. R. Statist. Soc , vol.39 , Issue.1 , pp. 1-38
- Dempster, A.P.¹ Laird, N.M.² Rubin, D.B.³

7
- 4243117872
- New York: Marcel Dekker
- L. Deng and D. O'Shaughnessy, Speech Processing-A Dynamic and Optimization-Oriented Approach. New York: Marcel Dekker, 2003.
- (2003) Speech Processing-A Dynamic and Optimization-Oriented Approach
- Deng, L.¹ O'Shaughnessy, D.²

8
- 85009211881
- Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint
- L. Deng, I. Bazzi, and A. Acero, 'Tracking vocal tract resonances using an analytical nonlinear predictor and a target-guided temporal constraint," in Proc. Eurospeech, vol. I, 2003, pp. 73-76.
- (2003) Proc. Eurospeech , vol.1 , pp. 73-76
- Deng, L.¹ Bazzi, I.² Acero, A.³

9
- 0023516708
- A composite auditory model for processing speech sounds
- Dec
- L. Deng and D. Geisler, "A composite auditory model for processing speech sounds," J. Acoust. Soc. Amer., vol. 82, pp. 2001-2012, Dec. 1987.
- (1987) J. Acoust. Soc. Amer , vol.82 , pp. 2001-2012
- Deng, L.¹ Geisler, D.²

10
- 0033623527
- Spontaneous speech recognition using a statistical coarticulatory model for vocal-tract-resonance dynamics
- L. Deng and J. Ma, "Spontaneous speech recognition using a statistical coarticulatory model for vocal-tract-resonance dynamics," J. Acoust. Soc. Amer., vol. 108, pp. 3036-3048, 2000.
- (2000) J. Acoust. Soc. Amer , vol.108 , pp. 3036-3048
- Deng, L.¹ Ma, J.²

11
- 56149108822
- Recovering vocal tract shapes from MFCC parameters
- S. Dusan and L. Deng, "Recovering vocal tract shapes from MFCC parameters," in Proc. ICSLP, 1998, pp. 3087-3090.
- (1998) Proc. ICSLP , pp. 3087-3090
- Dusan, S.¹ Deng, L.²

12
- 0003418124
- The Hague, The Netherlands: Mouton
- G. Fant, Acoustic Theory of Speech Production. The Hague, The Netherlands: Mouton, 1960.
- (1960) Acoustic Theory of Speech Production
- Fant, G.¹

13
- 85009110670
- Multistage coarticulation model combining articulatory, formant, and cepstral features
- Y. Gao, R. Bakis, J. Huang, and B. Zhang, "Multistage coarticulation model combining articulatory, formant, and cepstral features," in Proc. ICSLP, vol. 1, 2000, pp. 25-28.
- (2000) Proc. ICSLP , vol.1 , pp. 25-28
- Gao, Y.¹ Bakis, R.² Huang, J.³ Zhang, B.⁴

14
- 85016587886
- Switchboard: Telephone speech corpus for research and development
- J. Godfrey, E. Holliman, and J. McDaniel, "Switchboard: Telephone speech corpus for research and development," in Proc. ICASSP, 1992, pp. 517-520.
- (1992) Proc. ICASSP , pp. 517-520
- Godfrey, J.¹ Holliman, E.² McDaniel, J.³

15
- 0024879199
- The effective second formant F2 and the vocal tract front-cavity
- H. Hermansky and D. Broad, "The effective second formant F2 and the vocal tract front-cavity," in Proc. ICASSP, vol. 1, 1989, pp. 480-183.
- (1989) Proc. ICASSP , vol.1 , pp. 480-183
- Hermansky, H.¹ Broad, D.²

16
- 33947096168
- J. Hogberg, Prediction of formant frequencies from linear combinations of filterbank and cepstral coefficients, Royal Inst. Technol., Stockholm, Sweden, KTH-STL Quarterly Progress Rep., 1997.
- J. Hogberg, "Prediction of formant frequencies from linear combinations of filterbank and cepstral coefficients," Royal Inst. Technol., Stockholm, Sweden, KTH-STL Quarterly Progress Rep., 1997.

17
- 85032644657
- Using formant frequencies in speech recognition
- Rhodes, Greece, Sep
- J. Holmes, W. Holmes, and P. Garner, "Using formant frequencies in speech recognition," in Proc. Eurospeech, Rhodes, Greece, Sep. 1997, pp. 2083-2086.
- (1997) Proc. Eurospeech , pp. 2083-2086
- Holmes, J.¹ Holmes, W.² Garner, P.³

18
- 0037410755
- Bandwidth-adjusted LPC analysis for robust speech recognition
- C. S. Huang and H. C. Wang, "Bandwidth-adjusted LPC analysis for robust speech recognition," Pattern Recognit. Lett., vol. 24, pp. 1583-1587, 2003.
- (2003) Pattern Recognit. Lett , vol.24 , pp. 1583-1587
- Huang, C.S.¹ Wang, H.C.²

19
- 0018986665
- Software for a cascade/parallel formant synthesizer
- D. Klatt, "Software for a cascade/parallel formant synthesizer," J. Acoust. Soc. Amer., vol. 67, pp. 971-995, 1980.
- (1980) J. Acoust. Soc. Amer , vol.67 , pp. 971-995
- Klatt, D.¹

20
- 4544367684
- Formant tracking using HMM's and vector quantization
- G. Kopec, "Formant tracking using HMM's and vector quantization," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-34, pp. 709-729, 1986.
- (1986) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-34 , pp. 709-729
- Kopec, G.¹

21
- 0016049328
- An algorithm for automatic formant extraction using linear prediction spectra
- S. McCandless, "An algorithm for automatic formant extraction using linear prediction spectra," IEEE Trans. Acoust., Speech, Signal Processing, vol. ASSP-22, pp. 135-141, 1974.
- (1974) IEEE Trans. Acoust., Speech, Signal Processing , vol.ASSP-22 , pp. 135-141
- McCandless, S.¹

22
- 0038359547
- Modeling uncertainty in recovering articulation from acoustics
- K. Richmond, S. King, and P. Taylor, "Modeling uncertainty in recovering articulation from acoustics," Comput. Speech Lang., vol. 17, pp. 153-172, 2003.
- (2003) Comput. Speech Lang , vol.17 , pp. 153-172
- Richmond, K.¹ King, S.² Taylor, P.³

23
- 0141702226
- Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM-MAP decoding and evaluation
- F. Seide, J. Zhou, and L. Deng, "Coarticulation modeling by embedding a target-directed hidden trajectory model into HMM-MAP decoding and evaluation," in Proc. ICASSP, 2003, pp. 748-751.
- (2003) Proc. ICASSP , pp. 748-751
- Seide, F.¹ Zhou, J.² Deng, L.³

24
- 0004129646
- Cambridge, MA: MIT Press
- K. Stevens, Acoustic Phonetics. Cambridge, MA: MIT Press, 1998.
- (1998) Acoustic Phonetics
- Stevens, K.¹

25
- 84912906590
- Constraints among parameters simplify control of Klatt formant synthesizer
- K. Stevens and C. Bickley, "Constraints among parameters simplify control of Klatt formant synthesizer," J. Phonetics, vol. 19, pp. 161-174, 1991.
- (1991) J. Phonetics , vol.19 , pp. 161-174
- Stevens, K.¹ Bickley, C.²

26
- 85009067878
- Data-driven model construction for continuous speech recognition using overlapping articulatory features
- J. Sun, L. Deng, and X. Jing, "Data-driven model construction for continuous speech recognition using overlapping articulatory features," in Proc. ICSLP, vol. 1, 2000, pp. 437-440.
- (2000) Proc. ICSLP , vol.1 , pp. 437-440
- Sun, J.¹ Deng, L.² Jing, X.³

27
- 33947157387
- D. Talkin, Speech formant trajectory estimation using dynamic programming with modulated transition costs, J. Acoust. Soc. Amer., S1, p. S55, 1987.
- D. Talkin, "Speech formant trajectory estimation using dynamic programming with modulated transition costs," J. Acoust. Soc. Amer., vol. S1, p. S55, 1987.

28
- 0031647965
- Formant tracking for speech recognition
- L. Welling and H. Ney, "Formant tracking for speech recognition," IEEE Trans. Speech Audio Processing, vol. 6, pp. 36-48, 1998.
- (1998) IEEE Trans. Speech Audio Processing , vol.6 , pp. 36-48
- Welling, L.¹ Ney, H.²

29
- 4544278205
- Formant tracking by mixture state particle filter
- Y. Zheng and M. Hasegawa-Johnson, "Formant tracking by mixture state particle filter," in Proc. ICASSP, vol. 1, 2004, pp. 565-568.
- (2004) Proc. ICASSP , vol.1 , pp. 565-568
- Zheng, Y.¹ Hasegawa-Johnson, M.²

30
- 33947175283
- Formant analysis using mixtures of Gaussians
- Rhodes, Greece
- P. Zolfaghari and T. Robinson, "Formant analysis using mixtures of Gaussians," in Proc. Eurospeech, Rhodes, Greece, 1997, pp. 2539-2542.
- (1997) Proc. Eurospeech , pp. 2539-2542
- Zolfaghari, P.¹ Robinson, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.