메뉴 건너뛰기




Volumn 22, Issue 10, 2014, Pages 1467-1482

Event-based method for instantaneous fundamental frequency estimation from voiced speech based on eigenvalue decomposition of the Hankel matrix

Author keywords

Eigenvalue decomposition; Hankel matrix; Instantaneous fundamental frequency; Speech signal processing

Indexed keywords

EIGENVALUES AND EIGENFUNCTIONS; FREQUENCY ESTIMATION; ITERATIVE METHODS; MATRIX ALGEBRA; NATURAL FREQUENCIES; SIGNAL PROCESSING; SPEECH; SPEECH ANALYSIS; SPEECH COMMUNICATION;

EID: 84911369306     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASLP.2014.2335056     Document Type: Article
Times cited : (49)

References (52)
  • 2
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • Dec.
    • E. Moulines and F. Charpentier, "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun., vol. 9, no. 5, pp. 453-467, Dec. 1990.
    • (1990) Speech Commun. , vol.9 , Issue.5 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 4
    • 77950029338 scopus 로고    scopus 로고
    • Voice conversion by mapping the speaker-specific features using pitch synchronous approach
    • Jul.
    • K. S. Rao, "Voice conversion by mapping the speaker-specific features using pitch synchronous approach," Comput. Speech Lang., vol. 24, no. 3, pp. 474-494, Jul. 2010.
    • (2010) Comput. Speech Lang. , vol.24 , Issue.3 , pp. 474-494
    • Rao, K.S.1
  • 5
    • 21844454996 scopus 로고    scopus 로고
    • Modeling prosodic feature sequences for speaker recognition
    • Jul.
    • E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke, "Modeling prosodic feature sequences for speaker recognition," Speech Commun., vol. 46, no. 3-4, pp. 455-472, Jul. 2005.
    • (2005) Speech Commun. , vol.46 , Issue.3-4 , pp. 455-472
    • Shriberg, E.1    Ferrer, L.2    Kajarekar, S.3    Venkataraman, A.4    Stolcke, A.5
  • 7
    • 0032645823 scopus 로고    scopus 로고
    • An improvement of LPC based on noise reduction using pitch synchronous addition
    • Y. Kuroiwa and T. Shimamura, "An improvement of LPC based on noise reduction using pitch synchronous addition," in Proc. IEEE Int. Symp. Circuits Syst., Jul. 1999, vol. 3, pp. 122-125.
    • Proc. IEEE Int. Symp. Circuits Syst., Jul. 1999 , vol.3 , pp. 122-125
    • Kuroiwa, Y.1    Shimamura, T.2
  • 8
    • 0032630841 scopus 로고    scopus 로고
    • Harmonic sound stream segregation using localization and its application to speech stream segregation
    • Apr.
    • T. Nakatani and H. G. Okuno, "Harmonic sound stream segregation using localization and its application to speech stream segregation," Speech Commun., vol. 27, no. 3-4, pp. 209-222, Apr. 1999.
    • (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 209-222
    • Nakatani, T.1    Okuno, H.G.2
  • 10
    • 0034163034 scopus 로고    scopus 로고
    • A comparative analysis of fundamental frequency estimation methods with application to pathological voices
    • Mar.
    • C. Manfredi, M. D'Aniello, P. Bruscaglioni, and A. Ismaelli, "A comparative analysis of fundamental frequency estimation methods with application to pathological voices," Med. Eng. Phys., vol. 22, no. 2, pp. 135-147, Mar. 2000.
    • (2000) Med. Eng. Phys. , vol.22 , Issue.2 , pp. 135-147
    • Manfredi, C.1    D'Aniello, M.2    Bruscaglioni, P.3    Ismaelli, A.4
  • 13
    • 0036642776 scopus 로고    scopus 로고
    • Analysis, enhancement and evaluation of five pitch determination techniques
    • Jul.
    • P. Veprek and M. S. Scordilis, "Analysis, enhancement and evaluation of five pitch determination techniques," Speech Commun., vol. 37, no. 3-4, pp. 249-270, Jul. 2002.
    • (2002) Speech Commun. , vol.37 , Issue.3-4 , pp. 249-270
    • Veprek, P.1    Scordilis, M.S.2
  • 15
    • 0017367712 scopus 로고
    • On the use of autocorrelation analysis for pitch detection
    • Feb.
    • L. Rabiner, "On the use of autocorrelation analysis for pitch detection," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-25, no. 1, pp. 24-33, Feb. 1977.
    • (1977) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-25 , Issue.1 , pp. 24-33
    • Rabiner, L.1
  • 16
    • 0014055288 scopus 로고
    • Cepstrum pitch determination
    • Aug.
    • A. M. Noll, "Cepstrum pitch determination," J. Acoust. Soc. Amer., vol. 41, no. 2, pp. 293-309, Aug. 1967.
    • (1967) J. Acoust. Soc. Amer. , vol.41 , Issue.2 , pp. 293-309
    • Noll, A.M.1
  • 17
    • 0015488387 scopus 로고
    • The SIFT algorithm for fundamental frequency estimation
    • Dec.
    • J. Markel, "The SIFT algorithm for fundamental frequency estimation," IEEE Trans. Audio Electroacoust., vol. AE-20, no. 5, pp. 367-377, Dec. 1972.
    • (1972) IEEE Trans. Audio Electroacoust. , vol.AE-20 , Issue.5 , pp. 367-377
    • Markel, J.1
  • 19
    • 0023833270 scopus 로고
    • Measurement of pitch by subharmonic summation
    • Jan.
    • D. J. Hermes, "Measurement of pitch by subharmonic summation," J. Acoust. Soc. Amer., vol. 83, no. 1, pp. 257-264, Jan. 1988.
    • (1988) J. Acoust. Soc. Amer. , vol.83 , Issue.1 , pp. 257-264
    • Hermes, D.J.1
  • 20
    • 0035472923 scopus 로고    scopus 로고
    • Weighted autocorrelation for pitch extraction of noisy speech
    • Oct.
    • T. Shimamura and H. Kobayashi, "Weighted autocorrelation for pitch extraction of noisy speech," IEEE Trans. Speech Audio Process., vol. 9, no. 7, pp. 727-730, Oct. 2001.
    • (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.7 , pp. 727-730
    • Shimamura, T.1    Kobayashi, H.2
  • 21
    • 11144332020 scopus 로고    scopus 로고
    • Robust and accurate fundamental frequency estimation based on dominant harmonic components
    • Dec.
    • T. Nakatani and T. Irino, "Robust and accurate fundamental frequency estimation based on dominant harmonic components," J. Acoust. Soc. Amer., vol. 116, no. 6, pp. 3690-3700, Dec. 2004.
    • (2004) J. Acoust. Soc. Amer. , vol.116 , Issue.6 , pp. 3690-3700
    • Nakatani, T.1    Irino, T.2
  • 22
    • 81355122934 scopus 로고    scopus 로고
    • Pitch estimation based on a harmonic sinusoidal autocorrelation model and a time-domain matching scheme
    • Jan.
    • C. Shahnaz, W. P. Zhu, and M. O. Ahmad, "Pitch estimation based on a harmonic sinusoidal autocorrelation model and a time-domain matching scheme," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 322-335, Jan. 2012.
    • (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 322-335
    • Shahnaz, C.1    Zhu, W.P.2    Ahmad, M.O.3
  • 23
    • 0026103222 scopus 로고
    • An autocorrelation pitch detector and voicing decision with confidence measures developed for noisecorrupted speech
    • Feb.
    • D. Krubsack and R. J. Niederjohn, "An autocorrelation pitch detector and voicing decision with confidence measures developed for noisecorrupted speech," IEEE Trans. Signal Process., vol. 39, no. 2, pp. 319-329, Feb. 1991.
    • (1991) IEEE Trans. Signal Process. , vol.39 , Issue.2 , pp. 319-329
    • Krubsack, D.1    Niederjohn, R.J.2
  • 24
    • 0029326498 scopus 로고
    • Fundamental frequency determination based on instantaneous frequency estimation
    • Jun.
    • L. Qiu, H.Yang, and S.N.Koh, "Fundamental frequency determination based on instantaneous frequency estimation," Signal Process., vol. 44, no. 2, pp. 233-241, Jun. 1995.
    • (1995) Signal Process. , vol.44 , Issue.2 , pp. 233-241
    • Qiu, L.1    Yang, H.2    Koh, S.N.3
  • 26
    • 32644438199 scopus 로고    scopus 로고
    • Speech pitch determination based on Hilbert-Huang transform
    • Apr.
    • H. Huang and J. Pan, "Speech pitch determination based on Hilbert-Huang transform," Signal Process., vol. 86, no. 4, pp. 792-803, Apr. 2006.
    • (2006) Signal Process. , vol.86 , Issue.4 , pp. 792-803
    • Huang, H.1    Pan, J.2
  • 28
    • 0024924999 scopus 로고
    • Automatic and reliable estimation of glottal closure instant and period
    • Dec.
    • Y. M. Cheng and D. O'Shaughnessy, "Automatic and reliable estimation of glottal closure instant and period," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 12, pp. 1805-1815, Dec. 1989.
    • (1989) IEEE Trans. Acoust., Speech, Signal Process. , vol.37 , Issue.12 , pp. 1805-1815
    • Cheng, Y.M.1    O'Shaughnessy, D.2
  • 29
    • 0026727405 scopus 로고
    • Application of the wavelet transform for pitch detection of speech signals
    • Mar.
    • S. Kadambe and G. F. Boudreaux-Bartels, "Application of the wavelet transform for pitch detection of speech signals," IEEE Trans. Inf. Theory, vol. 38, no. 2, pp. 917-924, Mar. 1992.
    • (1992) IEEE Trans. Inf. Theory , vol.38 , Issue.2 , pp. 917-924
    • Kadambe, S.1    Boudreaux-Bartels, G.F.2
  • 30
    • 65249133648 scopus 로고    scopus 로고
    • Pitch period estimation using multipulse model and wavelet transformation
    • P. K. Ghosh, A. Ortega, and S. Narayanan, "Pitch period estimation using multipulse model and wavelet transformation," in Proc. Interspeech, Aug. 2007, pp. 2761-2764.
    • Proc. Interspeech, Aug. 2007 , pp. 2761-2764
    • Ghosh, P.K.1    Ortega, A.2    Narayanan, S.3
  • 31
    • 65249149180 scopus 로고    scopus 로고
    • Event-based instantaneous fundamental frequency estimation from speech signals
    • May
    • B. Yegnanarayana and K. S. R. Murty, "Event-based instantaneous fundamental frequency estimation from speech signals," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 4, pp. 614-624, May 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.4 , pp. 614-624
    • Yegnanarayana, B.1    Murty, K.S.R.2
  • 32
    • 70450198169 scopus 로고    scopus 로고
    • Glottal closure and opening instant detection from speech signals
    • T. Drugman and T. Dutoit, "Glottal closure and opening instant detection from speech signals," in Proc. Interspeech, Sep. 2009, pp. 2891-2894.
    • Proc. Interspeech, Sep. 2009 , pp. 2891-2894
    • Drugman, T.1    Dutoit, T.2
  • 34
    • 84860119809 scopus 로고    scopus 로고
    • Time-order representation based method for epoch detection
    • Feb.
    • P. Jain and R. B. Pachori, "Time-order representation based method for epoch detection," J. Intell. Syst., vol. 21, no. 1, pp. 79-95, Feb. 2012.
    • (2012) J. Intell. Syst. , vol.21 , Issue.1 , pp. 79-95
    • Jain, P.1    Pachori, R.B.2
  • 35
    • 84875532258 scopus 로고    scopus 로고
    • Marginal energy density over the low frequency range as a feature for voiced/non-voiced detection in noisy speech signals
    • May
    • P. Jain and R. B. Pachori, "Marginal energy density over the low frequency range as a feature for voiced/non-voiced detection in noisy speech signals," J. Franklin Inst., vol. 350, no. 4, pp. 698-716, May 2013.
    • (2013) J. Franklin Inst. , vol.350 , Issue.4 , pp. 698-716
    • Jain, P.1    Pachori, R.B.2
  • 36
    • 84911434649 scopus 로고    scopus 로고
    • [Online]. Available
    • [Online]. Available: http://www.ncvs.org/ncvs/tutorials/voiceprod/tutorial/influence.html
  • 37
    • 71649095504 scopus 로고    scopus 로고
    • Analysis of multicomponent AM-FM signals using FB-DESA method
    • Jan.
    • R. B. Pachori and P. Sircar, "Analysis of multicomponent AM-FM signals using FB-DESA method," Digital Signal Process., vol. 20, no. 1, pp. 42-62, Jan. 2010.
    • (2010) Digital Signal Process. , vol.20 , Issue.1 , pp. 42-62
    • Pachori, R.B.1    Sircar, P.2
  • 38
    • 0032628065 scopus 로고    scopus 로고
    • Acomparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion
    • May
    • K. Gopalan, T. R. Anderson, and E. Cupples, "Acomparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion," IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 289-294, May 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.3 , pp. 289-294
    • Gopalan, K.1    Anderson, T.R.2    Cupples, E.3
  • 39
    • 38249004577 scopus 로고
    • Signal processing via Fourier-Bessel series expansion
    • Apr.
    • J. Schroeder, "Signal processing via Fourier-Bessel series expansion," Digital Signal Process., vol. 3, no. 2, pp. 112-124, Apr. 1993.
    • (1993) Digital Signal Process. , vol.3 , Issue.2 , pp. 112-124
    • Schroeder, J.1
  • 40
    • 35248825924 scopus 로고    scopus 로고
    • EEG signal analysis using FB expansion and second-order linear TVAR process
    • Feb.
    • R. B. Pachori and P. Sircar, "EEG signal analysis using FB expansion and second-order linear TVAR process," Signal Process., vol. 88, no. 2, pp. 415-420, Feb. 2008.
    • (2008) Signal Process. , vol.88 , Issue.2 , pp. 415-420
    • Pachori, R.B.1    Sircar, P.2
  • 42
    • 0000330384 scopus 로고    scopus 로고
    • On decomposing speech into modulated components
    • May
    • A. Rao and R. Kumaresan, "On decomposing speech into modulated components," IEEE Trans. Speech, Audio Process., vol. 8, no. 3, pp. 240-254, May 2000.
    • (2000) IEEE Trans. Speech, Audio Process. , vol.8 , Issue.3 , pp. 240-254
    • Rao, A.1    Kumaresan, R.2
  • 45
    • 5444236478 scopus 로고    scopus 로고
    • The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis
    • N. E. Huang et al., "The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis," in Proc. R. Soc. London A, Mar. 1998, vol. 454, no. 1971, pp. 903-995.
    • Proc. R. Soc. London A, Mar. 1998 , vol.454 , Issue.1971 , pp. 903-995
    • Huang, N.E.1
  • 47
    • 85093707396 scopus 로고    scopus 로고
    • Enhanced pitch tracking and the processing of F0 contours for computer aided and intonation teaching
    • P. C. Bagshaw, S.M. Hiller, and M. A. Jack, "Enhanced pitch tracking and the processing of F0 contours for computer aided and intonation teaching," in Proc. Eur. Conf. Speech Commun., Sep. 1993, vol. 2, pp. 1003-1006.
    • Proc. Eur. Conf. Speech Commun., Sep. 1993 , vol.2 , pp. 1003-1006
    • Bagshaw, P.C.1    Hiller, S.M.2    Jack, M.A.3
  • 49
    • 84911363459 scopus 로고    scopus 로고
    • [Online]. Available
    • [Online]. Available: www.speech.cs.cmu.edu/comp.speech/Section1/Data/noisex.html
  • 50
    • 0036214787 scopus 로고    scopus 로고
    • YIN, a fundamental frequency estimator for speech and music
    • Apr.
    • A. de Cheveigne and H. Kawahara, "YIN, a fundamental frequency estimator for speech and music," J. Acoust. Soc. Amer., vol. 111, no. 4, pp. 1917-1930, Apr. 2002.
    • (2002) J. Acoust. Soc. Amer. , vol.111 , Issue.4 , pp. 1917-1930
    • De Cheveigne, A.1    Kawahara, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.