



Volume 17, Issue 8, 2009, Pages 1518-1532

Dynamic speech spectrum representation and tracking variable number of vocal tract resonance frequencies with time-varying Dirichlet process mixture models

Author keywords

Dirichlet process; Formant tracking; Particle filter; Spectral representation; Spectrum estimation; Vocal tract resonance (VTR)

Indexed keywords

DIRICHLET PROCESS; FORMANT TRACKING; PARTICLE FILTER; SPECTRAL REPRESENTATION; SPECTRUM ESTIMATION; VOCAL TRACT RESONANCE (VTR);

EID: 69249099357     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2009.2022198     Document Type: Article
Times cited: 22

References (54)
  • 2
  • 3
    • 0014272107
    • Digital-formant synthesizer for speech-synthesis studies
    • L. R. Rabiner, "Digital-formant synthesizer for speech-synthesis studies," J. Acoust. Soc. Amer., vol.43, no.4, pp. 822-828, 1968.
    • (1968) J. Acoust. Soc. Amer. , vol.43 , Issue.4 , pp. 822-828
    • Rabiner, L.R.1
  • 4
    • 0018986665
    • Software for a cascade/parallel formant synthesizer
    • D. Klatt, "Software for a cascade/parallel formant synthesizer," J. Acoust. Soc. Amer., vol.67, no.3, pp. 971-995, 1980.
    • (1980) J. Acoust. Soc. Amer. , vol.67 , Issue.3 , pp. 971-995
    • Klatt, D.1
  • 6
    • 33846261409
    • A global, boundary-centric framework for unit selection text-to-speech synthesis
    • May
    • J. Bellegarda, "A global, boundary-centric framework for unit selection text-to-speech synthesis," IEEE Trans. Speech Audio Process., vol.14, no.3, pp. 990-997, May 2006.
    • (2006) IEEE Trans. Speech Audio Process. , vol.14 , Issue.3 , pp. 990-997
    • Bellegarda, J.1
  • 7
    • 0031647965
    • Formant estimation for speech recognition
    • Jan.
    • L. Welling and H. Ney, "Formant estimation for speech recognition," IEEE Trans. Speech Audio Process., vol.6, no.1, pp. 36-48, Jan. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.1 , pp. 36-48
    • Welling, L.1    Ney, H.2
  • 8
    • 0031632620
    • On the robust incorporation of formant features into hidden Markov models for automatic speech recognition
    • P. Garner and W. Holmes, "On the robust incorporation of formant features into hidden Markov models for automatic speech recognition," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1998, pp. 1-4.
    • (1998) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 1-4
    • Garner, P.1    Holmes, W.2
  • 9
    • 33744930096
    • Parametric formant modeling and transformation in voice conversion
    • Springer
    • D. Rentzos, S. Vaseghi, Q. Yan, and C.-H. Ho, "Parametric formant modeling and transformation in voice conversion," Int. J. Speech Technol. Springer, vol.8, pp. 227-245, 2005.
    • (2005) Int. J. Speech Technol. , vol.8 , pp. 227-245
    • Rentzos, D.1    Vaseghi, S.2    Yan, Q.3    Ho, C.-H.4
  • 10
    • 0032204117
    • A novel feature transformation for vocal tract length normalization in automatic speech recognition
    • Nov.
    • T. Claes, I. Dologlou, L. ten Bosch, and D. Van Compernolle, "A novel feature transformation for vocal tract length normalization in automatic speech recognition," IEEE Trans. Speech Audio Process., vol.6, no.6, pp. 549-557, Nov. 1998.
    • (1998) IEEE Trans. Speech Audio Process. , vol.6 , Issue.6 , pp. 549-557
    • Claes, T.1    Dologlou, I.2    Ten Bosch, L.3    Van Compernolle, D.4
  • 11
    • 63049121231
    • A study of filter bank smoothing in MFCC features for recognition of children's speech
    • Nov.
    • R. S. S. Umesh, "A study of filter bank smoothing in MFCC features for recognition of children's speech," IEEE Trans. Speech Audio Process., vol.15, no.8, pp. 2418-2430, Nov. 2007.
    • (2007) IEEE Trans. Speech Audio Process. , vol.15 , Issue.8 , pp. 2418-2430
    • Umesh, R.S.S.1
  • 12
    • 34047257348
    • Reliable methods for estimating relative vocal tract lengths from formant trajectories of common words
    • Jul.
    • A. Watanabe and T. Sakata, "Reliable methods for estimating relative vocal tract lengths from formant trajectories of common words," IEEE Trans. Speech Audio Process., vol.14, no.4, pp. 1193-1204, Jul. 2006.
    • (2006) IEEE Trans. Speech Audio Process. , vol.14 , Issue.4 , pp. 1193-1204
    • Watanabe, A.1    Sakata, T.2
  • 13
    • 64349124465
    • Analysis and synthesis of formant spaces of British, Australian, and American accents
    • Feb.
    • Q. Yan, S. Vaseghi, D. Rentzos, and C.-H. Ho, "Analysis and synthesis of formant spaces of British, Australian, and American accents," IEEE Trans. Speech Audio Process., vol.15, no.2, pp. 676-689, Feb. 2007.
    • (2007) IEEE Trans. Speech Audio Process. , vol.15 , Issue.2 , pp. 676-689
    • Yan, Q.1    Vaseghi, S.2    Rentzos, D.3    Ho, C.-H.4
  • 14
    • 33847659290
    • Formant tracking linear prediction model using HMMs and Kalman filters for noisy speech processing
    • Q. Yan, S. Vaseghi, E. Zavarehei, B. Milner, J. Darch, P. White, and I. Andrianakis, "Formant tracking linear prediction model using HMMs and Kalman filters for noisy speech processing," Comput. Speech Lang., vol.21, no.3, pp. 543-561, 2007.
    • (2007) Comput. Speech Lang. , vol.21 , Issue.3 , pp. 543-561
    • Yan, Q.1    Vaseghi, S.2    Zavarehei, E.3    Milner, B.4    Darch, J.5    White, P.6    Andrianakis, I.7
  • 15
    • 0030008906
    • Speech formant frequency and bandwidth tracking using multiband energy demodulation
    • A. Potamianos and P. Maragos, "Speech formant frequency and bandwidth tracking using multiband energy demodulation," J. Acoust. Soc. Amer., vol.99, no.6, pp. 3795-3806, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.6 , pp. 3795-3806
    • Potamianos, A.1    Maragos, P.2
  • 16
    • 0035340569
    • Formant estimation method using inverse-filter control
    • May
    • A. Watanabe, "Formant estimation method using inverse-filter control," IEEE Trans. Speech Audio Process., vol.9, no.4, pp. 317-326, May 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.4 , pp. 317-326
    • Watanabe, A.1
  • 17
    • 0000330384
    • On decomposing speech into modulated components
    • May
    • A. Rao and R. Kumaresan, "On decomposing speech into modulated components," IEEE Trans. Speech Audio Process., vol.8, no.3, pp. 240-254, May 2000.
    • (2000) IEEE Trans. Speech Audio Process. , vol.8 , Issue.3 , pp. 240-254
    • Rao, A.1    Kumaresan, R.2
  • 18
    • 33947155741
    • Robust formant tracking for continuous speech with speaker variability
    • Mar.
    • K. Mustafa and I. C. Bruce, "Robust formant tracking for continuous speech with speaker variability," IEEE Trans. Speech Audio Process., vol.14, no.2, pp. 435-444, Mar. 2006.
    • (2006) IEEE Trans. Speech Audio Process. , vol.14 , Issue.2 , pp. 435-444
    • Mustafa, K.1    Bruce, I.C.2
  • 19
    • 64849106911
    • Cascade prediction filters with adaptive zeros to track the time-varying resonances of the vocal tract
    • Jan.
    • J. Vargas and S. McLaughlin, "Cascade prediction filters with adaptive zeros to track the time-varying resonances of the vocal tract," IEEE Trans. Speech Audio Process., vol.16, no.1, pp. 1-7, Jan. 2008.
    • (2008) IEEE Trans. Speech Audio Process. , vol.16 , Issue.1 , pp. 1-7
    • Vargas, J.1    McLaughlin, S.2
  • 21
    • 0016049328
    • An algorithm for automatic formant extraction using linear prediction spectra
    • Apr.
    • S. McCandless, "An algorithm for automatic formant extraction using linear prediction spectra," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-22, no.2, pp. 135-141, Apr. 1974.
    • (1974) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-22 , Issue.2 , pp. 135-141
    • McCandless, S.1
  • 22
    • 4544367684
    • Formant tracking using hidden Markov models and vector quantization
    • Aug.
    • G. Kopec, "Formant tracking using hidden Markov models and vector quantization," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-34, no.4, pp. 709-729, Aug. 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-34 , Issue.4 , pp. 709-729
    • Kopec, G.1
  • 23
    • 0012720145
    • Speech formant trajectory estimation using dynamic programming with modulated transition costs
    • D. Talkin, "Speech formant trajectory estimation using dynamic programming with modulated transition costs," J. Acoust. Soc. Amer., vol.1, no.6, p. 55, 1987.
    • (1987) J. Acoust. Soc. Amer. , vol.1 , Issue.6 , p. 55
    • Talkin, D.1
  • 25
    • 27644433406
    • Formant tracking using context-dependent phonemic information
    • Sep.
    • M. Lee, J. van Santen, B. Mobius, and J. Olive, "Formant tracking using context-dependent phonemic information," IEEE Trans. Speech Audio Process., vol.13, no.5, pp. 240-254, Sep. 2005.
    • (2005) IEEE Trans. Speech Audio Process. , vol.13 , Issue.5 , pp. 240-254
    • Lee, M.1    Van Santen, J.2    Mobius, B.3    Olive, J.4
  • 28
    • 33846336071
    • Dynamic assignment of Gaussian components in modeling speech spectra
    • P. Zolfaghari, H. Kato, Y. Minami, A. Nakamura, and S. Katagiri, "Dynamic assignment of Gaussian components in modeling speech spectra," J. VLSI Signal Process., vol.45, no.1-2, pp. 7-19, 2006.
    • (2006) J. VLSI Signal Process. , vol.45 , Issue.1-2 , pp. 7-19
    • Zolfaghari, P.1    Kato, H.2    Minami, Y.3    Nakamura, A.4    Katagiri, S.5
  • 29
    • 33947120106
    • Initialization, training, and context-dependency in HMM-based formant tracking
    • Mar.
    • D. T. Toledano, J. G. Villardebó, and L. H. Gómez, "Initialization, training, and context-dependency in HMM-based formant tracking," IEEE Trans. Speech Audio Process., vol.14, no.2, pp. 511-523, Mar. 2006.
    • (2006) IEEE Trans. Speech Audio Process. , vol.14 , Issue.2 , pp. 511-523
    • Toledano, D.T.1    Villardebó, J.G.2    Gómez, L.H.3
  • 30
    • 0022859239
    • A new algorithm for estimation of formant trajectories directly from the speech signal based on an extended Kalman-filter
    • G. Rigoll, "A new algorithm for estimation of formant trajectories directly from the speech signal based on an extended Kalman-filter," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 1986, pp. 1229-1232.
    • (1986) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 1229-1232
    • Rigoll, G.1
  • 33
    • 33745373922
    • A state-space model with neural-network prediction for recovering vocal tract resonances in fluent speech from mel-cepstral coefficients
    • R. Togneri and L. Deng, "A state-space model with neural-network prediction for recovering vocal tract resonances in fluent speech from mel-cepstral coefficients," Speech Commun., vol.48, pp. 971-988, 2006.
    • (2006) Speech Commun. , vol.48 , pp. 971-988
    • Togneri, R.1    Deng, L.2
  • 34
    • 33746456716
    • Tracking vocal tract resonances using a quantized nonlinear function embedded in a temporal constraint
    • L. Deng, A. Acero, and I. Bazzi, "Tracking vocal tract resonances using a quantized nonlinear function embedded in a temporal constraint," IEEE Trans. Speech Audio Process., vol.14, pp. 425-434, 2006.
    • (2006) IEEE Trans. Speech Audio Process. , vol.14 , pp. 425-434
    • Deng, L.1    Acero, A.2    Bazzi, I.3
  • 36
    • 34547517867
    • Adaptive Kalman filtering and smoothing for tracking vocal tract resonances using a continuous-valued hidden dynamic model
    • Jan.
    • L. Deng, L. J. Lee, H. Attias, and A. Acero, "Adaptive Kalman filtering and smoothing for tracking vocal tract resonances using a continuous-valued hidden dynamic model," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.1, pp. 13-23, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 13-23
    • Deng, L.1    Lee, L.J.2    Attias, H.3    Acero, A.4
  • 37
    • 69249139982
    • Conditionally linear Gaussian models for tracking of vocal tract resonances
    • D. D. Rudoy and P. J. Wolfe, "Conditionally linear Gaussian models for tracking of vocal tract resonances," in Proc. Interspeech Conf., 2007, pp. 526-529.
    • (2007) Proc. Interspeech Conf. , pp. 526-529
    • Rudoy, D.D.1    Wolfe, P.J.2
  • 39
    • 44949198554
    • Tracking of visible vocal tract resonances (VVTR) based on Kalman filtering
    • I. Y. Özbek and M. Demirekler, "Tracking of visible vocal tract resonances (VVTR) based on Kalman filtering," in Proc. Interspeech Conf., 2006.
    • (2006) Proc. Interspeech Conf.
    • Özbek, I.Y.1    Demirekler, M.2
  • 40
    • 33847659290
    • Formant tracking linear prediction model using HMMs and Kalman filters for noisy speech processing
    • Q. Yan, S. Vaseghi, E. Zavarehei, B. Milner, J. Darch, P. White, and I. Andrianakis, "Formant tracking linear prediction model using HMMs and Kalman filters for noisy speech processing," Comput. Speech Lang., vol.21, no.3, pp. 543-561, 2007.
    • (2007) Comput. Speech Lang. , vol.21 , Issue.3 , pp. 543-561
    • Yan, Q.1    Vaseghi, S.2    Zavarehei, E.3    Milner, B.4    Darch, J.5    White, P.6    Andrianakis, I.7
  • 41
    • 51549093980
    • Vocal tract resonances tracking based on voiced and unvoiced speech classification using dynamic programming and fixed interval Kalman smoother
    • I. Y. Özbek and M. Demirekler, "Vocal tract resonances tracking based on voiced and unvoiced speech classification using dynamic programming and fixed interval Kalman smoother," in Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2008, pp. 4217-4220.
    • (2008) Proc. Int. Conf. Acoust., Speech, Signal Process. (ICASSP) , pp. 4217-4220
    • Özbek, I.Y.1    Demirekler, M.2
  • 42
    • 0000708831
    • Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems
    • C. Antoniak, "Mixtures of Dirichlet processes with applications to Bayesian nonparametric problems," Ann. Statist., vol.2, pp. 1152-1174, 1974.
    • (1974) Ann. Statist. , vol.2 , pp. 1152-1174
    • Antoniak, C.1
  • 43
    • 80053193804
    • Generalized Polya urn for time-varying Dirichlet process mixtures
    • Vancouver, BC, Canada
    • F. Caron, M. Davy, and A. Doucet, "Generalized Polya urn for time-varying Dirichlet process mixtures," in Proc. Int. Conf. Uncertainty Artif. Intell., Vancouver, BC, Canada, 2007.
    • (2007) Proc. Int. Conf. Uncertainty Artif. Intell.
    • Caron, F.1    Davy, M.2    Doucet, A.3
  • 45
    • 0034970393
    • Verifying a vocal tract model with a closed side-branch
    • M. T.-T. Jackson, C. Espy-Wilson, and S. E. Boyce, "Verifying a vocal tract model with a closed side-branch," J. Acoust. Soc. Amer., vol.109, no.6, pp. 2983-2987, 2001.
    • (2001) J. Acoust. Soc. Amer. , vol.109 , Issue.6 , pp. 2983-2987
    • Jackson, M.T.-T.1    Espy-Wilson, C.2    Boyce, S.E.3
  • 47
    • 84943458946
    • On the properties of voiceless fricative consonants
    • J. M. Heinz and K. N. Stevens, "On the properties of voiceless fricative consonants," J. Acoust. Soc. Amer., vol.33, no.5, pp. 589-596, 1961.
    • (1961) J. Acoust. Soc. Amer. , vol.33 , Issue.5 , pp. 589-596
    • Heinz, J.M.1    Stevens, K.N.2
  • 48
    • 33751003537
    • Rao-Blackwellized particle filter for multiple target tracking
    • S. Sarkka, A. Vehtari, and J. Lampinen, "Rao-Blackwellized particle filter for multiple target tracking," Inf. Fusion, vol.8, pp. 2-15, 2007.
    • (2007) Inf. Fusion , vol.8 , pp. 2-15
    • Sarkka, S.1    Vehtari, A.2    Lampinen, J.3
  • 50
    • 84858417776
    • Dirichlet processes, Chinese restaurant processes and all that
    • M. I. Jordan, "Dirichlet processes, Chinese restaurant processes and all that," in Tutorial presentation at NIPS Conf., 2005.
    • (2005) Tutorial Presentation at NIPS Conf.
    • Jordan, M.I.1
  • 51
    • 77950032550
    • Markov chain sampling methods for Dirichlet process mixture models
    • R. Neal, "Markov chain sampling methods for Dirichlet process mixture models," J. Comput. Graph. Statist., vol.9, pp. 249-265, 2000.
    • (2000) J. Comput. Graph. Statist. , vol.9 , pp. 249-265
    • Neal, R.1
  • 52
    • 0002617436
    • Ferguson distributions via Polya urn schemes
    • D. Blackwell and J. MacQueen, "Ferguson distributions via Polya urn schemes," Ann. Statist., vol.1, pp. 353-355, 1973.
    • (1973) Ann. Statist. , vol.1 , pp. 353-355
    • Blackwell, D.1    MacQueen, J.2
  • 53
    • 0000720609
    • A constructive definition of Dirichlet priors
    • J. Sethuraman, "A constructive definition of Dirichlet priors," Statist. Sinica, vol.4, pp. 639-650, 1994.
    • (1994) Statist. Sinica , vol.4 , pp. 639-650
    • Sethuraman, J.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.