Volume 20, Issue 10, 2012, Pages 2626-2636

A mixture model approach for formant tracking and the robustness of Student's-t distribution

Author keywords

Formant tracking; Gaussian mixture model (GMM); multimodal density estimation; statistical mixture modeling; Student's t mixture model (tMM)

Indexed keywords

DENSITY ESTIMATION; FORMANT TRACKING; GAUSSIAN MIXTURE MODEL; MIXTURE MODEL; MIXTURE MODELING;

EID: 84867168402     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2012.2209418     Document Type: Article
Times cited : (12)

References (41)
  • 1
    • 0016049328
    • An algorithm for automatic formant extraction using linear prediction spectra
    • Apr.
    • S. McCandless, "An algorithm for automatic formant extraction using linear prediction spectra," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-22, no. 2, pp. 135-141, Apr. 1974.
    • (1974) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-22 , Issue.2 , pp. 135-141
    • McCandless, S.1
  • 2
    • 33744645721
    • Adaptive formant estimation with compensation for gross spectral shape
    • T. Toyoshima, N. Miki, and N. Nagai, "Adaptive formant estimation with compensation for gross spectral shape," Electron. Comm. Jpn. Pt. III, pp. 58-68, 1991.
    • (1991) Electron. Comm. Jpn. Pt. III , pp. 58-68
    • Toyoshima, T.1    Miki, N.2    Nagai, N.3
  • 3
    • 0030008906
    • Speech formant frequency and bandwidth tracking using multiband energy demodulation
    • A. Potamianos and P. Maragos, "Speech formant frequency and bandwidth tracking using multiband energy demodulation," J. Acoust. Soc. Amer., vol. 99, no. 6, pp. 3795-3806, 1996.
    • (1996) J. Acoust. Soc. Amer. , vol.99 , Issue.6 , pp. 3795-3806
    • Potamianos, A.1    Maragos, P.2
  • 4
    • 0027576027
    • On amplitude and frequency demodulation using energy operators
    • Apr.
    • P. Maragos, J. F. Kaiser, and T. F. Quatieri, "On amplitude and frequency demodulation using energy operators," IEEE Trans. Signal Process., vol. 41, no. 4, pp. 1532-1550, Apr. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.4 , pp. 1532-1550
    • Maragos, P.1    Kaiser, J.F.2    Quatieri, T.F.3
  • 7
    • 4544323815
    • A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances
    • L. Deng, L. Lee, H. Attias, and A. Acero, "A structured speech model with continuous hidden dynamics and prediction-residual training for tracking vocal tract resonances," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2004, pp. 557-560.
    • (2004) Proc. Int. Conf. Acoust., Speech, Signal Process. , pp. 557-560
    • Deng, L.1    Lee, L.2    Attias, H.3    Acero, A.4
  • 8
    • 34547517867
    • Adaptive Kalman filtering and smoothing for tracking vocal-tract resonances using a continuous-valued hidden dynamic model
    • Jan.
    • L. Deng, L. Lee, H. Attias, and A. Acero, "Adaptive Kalman filtering and smoothing for tracking vocal-tract resonances using a continuous-valued hidden dynamic model," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 13-23, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 13-23
    • Deng, L.1    Lee, L.2    Attias, H.3    Acero, A.4
  • 10
  • 11
    • 51549093980
    • Vocal-tract resonances tracking based on voiced and unvoiced speech classification using dynamic programming and fixed-interval Kalman smoother
    • I. Y. Ozbek and M. Demirekler, "Vocal-tract resonances tracking based on voiced and unvoiced speech classification using dynamic programming and fixed-interval Kalman smoother," in Proc. Int. Conf. Acoust., Speech, Signal Process., 2008, pp. 4217-4220.
    • (2008) Proc. Int. Conf. Acoust., Speech, Signal Process. , pp. 4217-4220
    • Ozbek, I.Y.1    Demirekler, M.2
  • 13
    • 0141480064
    • Spectrogram-based formant tracking via particle filters
    • Y. Shi and E. Chang, "Spectrogram-based formant tracking via particle filters," in Proc. Int. Conf. Acoust. Speech Signal Process., 2003, vol. 1, pp. 168-171.
    • (2003) Proc. Int. Conf. Acoust. Speech Signal Process. , vol.1 , pp. 168-171
    • Shi, Y.1    Chang, E.2
  • 14
    • 69249099357
    • Dynamic speech spectrum representation and tracking variable number of vocal tract resonance frequencies with time-varying Dirichlet process mixture models
    • Nov.
    • E. Ozkan, I. Y. Ozbek, and M. Demirekler, "Dynamic speech spectrum representation and tracking variable number of vocal tract resonance frequencies with time-varying Dirichlet process mixture models," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 8, pp. 1518-1532, Nov. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.8 , pp. 1518-1532
    • Ozkan, E.1    Ozbek, I.Y.2    Demirekler, M.3
  • 15
    • 73049112030
    • Combining auditory preprocessing and Bayesian estimation for robust formant tracking
    • Feb.
    • C. Glaser, M. Heckmann, F. Joublin, and C. Goerick, "Combining auditory preprocessing and Bayesian estimation for robust formant tracking," IEEE Trans. Audio, Speech Lang. Process., vol. 18, no. 2, pp. 224-236, Feb. 2010.
    • (2010) IEEE Trans. Audio, Speech Lang. Process. , vol.18 , Issue.2 , pp. 224-236
    • Glaser, C.1    Heckmann, M.2    Joublin, F.3    Goerick, C.4
  • 16
    • 33947155741
    • Robust formant tracking for continuous speech with speaker variability
    • Mar.
    • K. Mustafa and I. C. Bruce, "Robust formant tracking for continuous speech with speaker variability," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 2, pp. 435-444, Mar. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.2 , pp. 435-444
    • Mustafa, K.1    Bruce, I.C.2
  • 17
    • 0027154415
    • Tracking of partials for additive sound synthesis using hidden Markov models
    • Apr.
    • G. Garcia, P. Depalle, and X. Rodet, "Tracking of partials for additive sound synthesis using hidden Markov models," in Proc. Int. Conf. Acoust. Speech, Signal Process., Apr. 1993, vol. 1, pp. 225-228.
    • (1993) Proc. Int. Conf. Acoust. Speech, Signal Process. , vol.1 , pp. 225-228
    • Garcia, G.1    Depalle, P.2    Rodet, X.3
  • 19
    • 33947120106
    • Initialization, training, and context-dependency in HMM-based formant tracking
    • Mar.
    • D. T. Toledano, J. G. Villardebo, and L. H. Gomez, "Initialization, training, and context-dependency in HMM-based formant tracking," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 2, pp. 511-523, Mar. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.2 , pp. 511-523
    • Toledano, D.T.1    Villardebo, J.G.2    Gomez, L.H.3
  • 20
    • 0033640657
    • On-line retrainable neural networks: Improving the performance of neural networks in image analysis problems
    • Jan.
    • A. D. Doulamis, N. D. Doulamis, and S. D. Kollias, "On-line retrainable neural networks: Improving the performance of neural networks in image analysis problems," IEEE Trans. Neural Netw., vol. 11, no. 1, pp. 137-155, Jan. 2000.
    • (2000) IEEE Trans. Neural Netw. , vol.11 , Issue.1 , pp. 137-155
    • Doulamis, A.D.1    Doulamis, N.D.2    Kollias, S.D.3
  • 21
    • 63449135803
    • Variational Bayesian sparse kernel-based blind image deconvolution with Student's-t priors
    • Apr.
    • D. G. Tzikas, A. C. Likas, and N. P. Galatsanos, "Variational Bayesian sparse kernel-based blind image deconvolution with Student's-t priors," IEEE Trans. Image Process., vol. 18, no. 4, pp. 753-764, Apr. 2009.
    • (2009) IEEE Trans. Image Process. , vol.18 , Issue.4 , pp. 753-764
    • Tzikas, D.G.1    Likas, A.C.2    Galatsanos, N.P.3
  • 22
    • 48149085386
    • Robust image segmentation with mixtures of Student's-t distributions
    • G. Sfikas, C. Nikou, and N. Galatsanos, "Robust image segmentation with mixtures of Student's-t distributions," in Proc. IEEE Int. Conf. Image Process., 2007, vol. 1, pp. 273-276.
    • (2007) Proc. IEEE Int. Conf. Image Process. , vol.1 , pp. 273-276
    • Sfikas, G.1    Nikou, C.2    Galatsanos, N.3
  • 23
    • 48649101762
    • Robust fuzzy clustering using mixtures of Student's-t distributions
    • S. Chatzis and T. Varvarigou, "Robust fuzzy clustering using mixtures of Student's-t distributions," Patt. Recog. Lett., vol. 29, no. 13, pp. 1901-1905, 2008.
    • (2008) Patt. Recog. Lett. , vol.29 , Issue.13 , pp. 1901-1905
    • Chatzis, S.1    Varvarigou, T.2
  • 24
    • 67651005607
    • Robust sequential data modeling using an outlier tolerant hidden markov model
    • Sep.
    • S. P. Chatzis, D. I. Kosmopoulos, and T. A. Varvarigou, "Robust sequential data modeling using an outlier tolerant hidden markov model," IEEE Trans. Pattern Anal. Mach. Intell., vol. 31, no. 9, pp. 1657-1669, Sep. 2009.
    • (2009) IEEE Trans. Pattern Anal. Mach. Intell. , vol.31 , Issue.9 , pp. 1657-1669
    • Chatzis, S.P.1    Kosmopoulos, D.I.2    Varvarigou, T.A.3
  • 25
    • 84855410923
    • Robust Student's-t mixture model with spatial constraints and its application in medical image segmentation
    • Jan.
    • T. M. Nguyen and Q. M. J. Wu, "Robust Student's-t mixture model with spatial constraints and its application in medical image segmentation," IEEE Trans. Med. Imag., vol. 31, no. 1, pp. 103-116, Jan. 2012.
    • (2012) IEEE Trans. Med. Imag. , vol.31 , Issue.1 , pp. 103-116
    • Nguyen, T.M.1    Wu, Q.M.J.2
  • 26
    • 84856295283
    • Robust density modelling using the Student's-t distribution for human action recognition
    • Z. Moghaddam and M. Piccardi, "Robust density modelling using the Student's-t distribution for human action recognition," in Proc. IEEE Int. Conf. Imag. Process., 2011, pp. 3261-3264.
    • (2011) Proc. IEEE Int. Conf. Imag. Process. , pp. 3261-3264
    • Moghaddam, Z.1    Piccardi, M.2
  • 28
    • 79959828556
    • Robust mixture modeling using t-distribution: Application to speaker ID
    • H. Sundar and T. V. Sreenivas, "Robust mixture modeling using t-distribution: Application to speaker ID," in Proc. Interspeech, 2010, pp. 2750-2753.
    • (2010) Proc. Interspeech , pp. 2750-2753
    • Sundar, H.1    Sreenivas, T.V.2
  • 29
    • 60349089170
    • A robust to outliers hidden Markov model with application in text-dependent speaker identification
    • Nov.
    • S. Chatzis and T. Varvarigou, "A robust to outliers hidden Markov model with application in text-dependent speaker identification," in Proc. IEEE Int. Conf. Signal Process. Commun., Nov. 2007, pp. 804-807.
    • (2007) Proc. IEEE Int. Conf. Signal Process. Commun. , pp. 804-807
    • Chatzis, S.1    Varvarigou, T.2
  • 30
    • 34447092407
    • Subjective evaluation and comparison of speech enhancement algorithms
    • Y. Hu and P. C. Loizou, "Subjective evaluation and comparison of speech enhancement algorithms," Speech Commun., vol. 49, pp. 588-601, 2007.
    • (2007) Speech Commun. , vol.49 , pp. 588-601
    • Hu, Y.1    Loizou, P.C.2
  • 32
    • 84867152920
    • [Online]. Available: https://sites.google.com/site/chandrasekharseelamantula/projects
  • 33
    • 0027874671
    • AM-FM energy detection and separation in noise using multiband energy operators
    • Dec.
    • A. C. Bovik, P. Maragos, and T. F. Quatieri, "AM-FM energy detection and separation in noise using multiband energy operators," IEEE Trans. Signal Process., vol. 41, no. 12, pp. 3245-3265, Dec. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.12 , pp. 3245-3265
    • Bovik, A.C.1    Maragos, P.2    Quatieri, T.F.3
  • 34
    • 0027676955
    • Energy separation in signal modulations with application to speech analysis
    • Oct.
    • P. Maragos, J. F. Kaiser, and T. F. Quatieri, "Energy separation in signal modulations with application to speech analysis," IEEE Trans. Signal Process., vol. 41, no. 10, pp. 3024-3051, Oct. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.10 , pp. 3024-3051
    • Maragos, P.1    Kaiser, J.F.2    Quatieri, T.F.3
  • 36
    • 0041407143
    • Robust mixture modelling using the t distribution
    • D. Peel and G. J. McLachlan, "Robust mixture modelling using the t distribution," Statist. Comput., vol. 10, no. 4, pp. 339-348, 2000.
    • (2000) Statist. Comput. , vol.10 , Issue.4 , pp. 339-348
    • Peel, D.1    McLachlan, G.J.2
  • 37
    • 84867152919
    • [Online]. Available: http://www.ece.mcmaster.ca/ibruce/mb-ftracker/mb-ftracker.htm
  • 38
    • 0016023546
    • Tribology: The friction, lubrication, and wear of moving parts
    • R. J. Wakelin, "Tribology: The friction, lubrication, and wear of moving parts," Ann. Rev. Mat. Sci., vol. 4, no. 1, pp. 221-253, 1974.
    • (1974) Ann. Rev. Mat. Sci. , vol.4 , Issue.1 , pp. 221-253
    • Wakelin, R.J.1
  • 39
    • 84867152918
    • [Online]. Available: http://www.torontostle.com/Tribology-101.html
  • 40
    • 79952822072
    • Editorial: Time-frequency approach to radar detection, imaging, and classification
    • T. Thayaparan, L. Stankovic, M. Amin, V. Chen, L. Cohen, and B. Boashash, "Editorial: Time-frequency approach to radar detection, imaging, and classification," IET Signal Process., vol. 4, no. 3, p. 197, 2010.
    • (2010) IET Signal Process. , vol.4 , Issue.3 , p. 197
    • Thayaparan, T.1    Stankovic, L.2    Amin, M.3    Chen, V.4    Cohen, L.5    Boashash, B.6


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.