SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 22, Issue 10, 2014, Pages 1467-1482

Event-based method for instantaneous fundamental frequency estimation from voiced speech based on eigenvalue decomposition of the Hankel matrix

(2) Jain, Pooja a Pachori, Ram Bilas a

a INDIAN INSTITUTE OF TECHNOLOGY INDORE (India)

Author keywords

Eigenvalue decomposition; Hankel matrix; Instantaneous fundamental frequency; Speech signal processing

Indexed keywords

EIGENVALUES AND EIGENFUNCTIONS; FREQUENCY ESTIMATION; ITERATIVE METHODS; MATRIX ALGEBRA; NATURAL FREQUENCIES; SIGNAL PROCESSING; SPEECH; SPEECH ANALYSIS; SPEECH COMMUNICATION;

EIGENVALUE DECOMPOSITION; FREQUENCY MODULATED; FUNDAMENTAL FREQUENCIES; FUNDAMENTAL FREQUENCY ESTIMATION; HANKEL MATRIX; LOW FREQUENCY RANGE; SPEECH SIGNAL PROCESSING; STATE-OF-THE-ART METHODS;

SPEECH RECOGNITION;

EID: 84911369306 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASLP.2014.2335056 Document Type: Article

Times cited : (49)

References (52)

1
- 0003424145
- New Delhi, India: Wiley-India
- J. R. Deller, J. H. L. Hansen, and J. G. Proakis, Discrete-Time Processing of Speech Signals. New Delhi, India: Wiley-India, 2011.
- (2011) Discrete-Time Processing of Speech Signals
- Deller, J.R.¹ Hansen, J.H.L.² Proakis, J.G.³

2
- 0025543906
- Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
- Dec.
- E. Moulines and F. Charpentier, "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun., vol. 9, no. 5, pp. 453-467, Dec. 1990.
- (1990) Speech Commun. , vol.9 , Issue.5 , pp. 453-467
- Moulines, E.¹ Charpentier, F.²

3
- 0028996945
- Speech compression using pitch synchronous interpolation
- R. Taori, R. J. Sluijter, and E. Kathmann, "Speech compression using pitch synchronous interpolation," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process, May 1995, vol. 1, pp. 512-515.
- Proc. IEEE Int. Conf. Acoust. Speech, Signal Process, May 1995 , vol.1 , pp. 512-515
- Taori, R.¹ Sluijter, R.J.² Kathmann, E.³

4
- 77950029338
- Voice conversion by mapping the speaker-specific features using pitch synchronous approach
- Jul.
- K. S. Rao, "Voice conversion by mapping the speaker-specific features using pitch synchronous approach," Comput. Speech Lang., vol. 24, no. 3, pp. 474-494, Jul. 2010.
- (2010) Comput. Speech Lang. , vol.24 , Issue.3 , pp. 474-494
- Rao, K.S.¹

5
- 21844454996
- Modeling prosodic feature sequences for speaker recognition
- Jul.
- E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke, "Modeling prosodic feature sequences for speaker recognition," Speech Commun., vol. 46, no. 3-4, pp. 455-472, Jul. 2005.
- (2005) Speech Commun. , vol.46 , Issue.3-4 , pp. 455-472
- Shriberg, E.¹ Ferrer, L.² Kajarekar, S.³ Venkataraman, A.⁴ Stolcke, A.⁵

6
- 85009145332
- Prosody-based automatic detection of annoyance and frustration in human-computer dialog
- J. Ang, R. Dhillon, A. Krupski, E. Shriberg, and A. Stockle, "Prosody-based automatic detection of annoyance and frustration in human-computer dialog," in Proc. Int. Conf. Spoken Lang. Process., Sep. 2002, pp. 2037-2040.
- Proc. Int. Conf. Spoken Lang. Process., Sep. 2002 , pp. 2037-2040
- Ang, J.¹ Dhillon, R.² Krupski, A.³ Shriberg, E.⁴ Stockle, A.⁵

7
- 0032645823
- An improvement of LPC based on noise reduction using pitch synchronous addition
- Y. Kuroiwa and T. Shimamura, "An improvement of LPC based on noise reduction using pitch synchronous addition," in Proc. IEEE Int. Symp. Circuits Syst., Jul. 1999, vol. 3, pp. 122-125.
- Proc. IEEE Int. Symp. Circuits Syst., Jul. 1999 , vol.3 , pp. 122-125
- Kuroiwa, Y.¹ Shimamura, T.²

8
- 0032630841
- Harmonic sound stream segregation using localization and its application to speech stream segregation
- Apr.
- T. Nakatani and H. G. Okuno, "Harmonic sound stream segregation using localization and its application to speech stream segregation," Speech Commun., vol. 27, no. 3-4, pp. 209-222, Apr. 1999.
- (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 209-222
- Nakatani, T.¹ Okuno, H.G.²

9
- 78149484045
- Speech emotion analysis in noisy real-world environment
- A. Tawari and M. Trivedi, "Speech emotion analysis in noisy real-world environment," in Proc. Int. Conf. Pattern Recogn., Aug. 2010, pp. 4605-4608.
- Proc. Int. Conf. Pattern Recogn., Aug. 2010 , pp. 4605-4608
- Tawari, A.¹ Trivedi, M.²

10
- 0034163034
- A comparative analysis of fundamental frequency estimation methods with application to pathological voices
- Mar.
- C. Manfredi, M. D'Aniello, P. Bruscaglioni, and A. Ismaelli, "A comparative analysis of fundamental frequency estimation methods with application to pathological voices," Med. Eng. Phys., vol. 22, no. 2, pp. 135-147, Mar. 2000.
- (2000) Med. Eng. Phys. , vol.22 , Issue.2 , pp. 135-147
- Manfredi, C.¹ D'Aniello, M.² Bruscaglioni, P.³ Ismaelli, A.⁴

11
- 0017097478
- A comparative performance study of several pitch detection algorithms
- Oct.
- L. Rabiner, M. Cheng, A. E. Rosenberg, and C. McGonegal, "A comparative performance study of several pitch detection algorithms," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-24, no. 5, pp. 399-418, Oct. 1976.
- (1976) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-24 , Issue.5 , pp. 399-418
- Rabiner, L.¹ Cheng, M.² Rosenberg, A.E.³ McGonegal, C.⁴

12
- 0003391579
- Berlin, Germany: Springer-Verlag, Apr.
- W. Hess, Pitch Determination of Speech Signals: Algorithms and Devices. Berlin, Germany: Springer-Verlag, Apr. 1983.
- (1983) Pitch Determination of Speech Signals: Algorithms and Devices
- Hess, W.¹

13
- 0036642776
- Analysis, enhancement and evaluation of five pitch determination techniques
- Jul.
- P. Veprek and M. S. Scordilis, "Analysis, enhancement and evaluation of five pitch determination techniques," Speech Commun., vol. 37, no. 3-4, pp. 249-270, Jul. 2002.
- (2002) Speech Commun. , vol.37 , Issue.3-4 , pp. 249-270
- Veprek, P.¹ Scordilis, M.S.²

14
- 0016114130
- Average magnitude difference function pitch extractor
- Oct.
- M. Ross, H. Shaffer, A. Cohen, R. Freudberg, and H. Manley, "Average magnitude difference function pitch extractor," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-22, no. 5, pp. 353-362, Oct. 1974.
- (1974) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-22 , Issue.5 , pp. 353-362
- Ross, M.¹ Shaffer, H.² Cohen, A.³ Freudberg, R.⁴ Manley, H.⁵

15
- 0017367712
- On the use of autocorrelation analysis for pitch detection
- Feb.
- L. Rabiner, "On the use of autocorrelation analysis for pitch detection," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-25, no. 1, pp. 24-33, Feb. 1977.
- (1977) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-25 , Issue.1 , pp. 24-33
- Rabiner, L.¹

16
- 0014055288
- Cepstrum pitch determination
- Aug.
- A. M. Noll, "Cepstrum pitch determination," J. Acoust. Soc. Amer., vol. 41, no. 2, pp. 293-309, Aug. 1967.
- (1967) J. Acoust. Soc. Amer. , vol.41 , Issue.2 , pp. 293-309
- Noll, A.M.¹

17
- 0015488387
- The SIFT algorithm for fundamental frequency estimation
- Dec.
- J. Markel, "The SIFT algorithm for fundamental frequency estimation," IEEE Trans. Audio Electroacoust., vol. AE-20, no. 5, pp. 367-377, Dec. 1972.
- (1972) IEEE Trans. Audio Electroacoust. , vol.AE-20 , Issue.5 , pp. 367-377
- Markel, J.¹

18
- 32644434136
- Pitch estimation using a modulation model of speech
- K. Gopalan, "Pitch estimation using a modulation model of speech," in Proc. IEEE Int. Conf. Signal Process., Aug. 2000, vol. 2, pp. 786-791.
- Proc. IEEE Int. Conf. Signal Process., Aug. 2000 , vol.2 , pp. 786-791
- Gopalan, K.¹

19
- 0023833270
- Measurement of pitch by subharmonic summation
- Jan.
- D. J. Hermes, "Measurement of pitch by subharmonic summation," J. Acoust. Soc. Amer., vol. 83, no. 1, pp. 257-264, Jan. 1988.
- (1988) J. Acoust. Soc. Amer. , vol.83 , Issue.1 , pp. 257-264
- Hermes, D.J.¹

20
- 0035472923
- Weighted autocorrelation for pitch extraction of noisy speech
- Oct.
- T. Shimamura and H. Kobayashi, "Weighted autocorrelation for pitch extraction of noisy speech," IEEE Trans. Speech Audio Process., vol. 9, no. 7, pp. 727-730, Oct. 2001.
- (2001) IEEE Trans. Speech Audio Process. , vol.9 , Issue.7 , pp. 727-730
- Shimamura, T.¹ Kobayashi, H.²

21
- 11144332020
- Robust and accurate fundamental frequency estimation based on dominant harmonic components
- Dec.
- T. Nakatani and T. Irino, "Robust and accurate fundamental frequency estimation based on dominant harmonic components," J. Acoust. Soc. Amer., vol. 116, no. 6, pp. 3690-3700, Dec. 2004.
- (2004) J. Acoust. Soc. Amer. , vol.116 , Issue.6 , pp. 3690-3700
- Nakatani, T.¹ Irino, T.²

22
- 81355122934
- Pitch estimation based on a harmonic sinusoidal autocorrelation model and a time-domain matching scheme
- Jan.
- C. Shahnaz, W. P. Zhu, and M. O. Ahmad, "Pitch estimation based on a harmonic sinusoidal autocorrelation model and a time-domain matching scheme," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 322-335, Jan. 2012.
- (2012) IEEE Trans. Audio, Speech, Lang. Process. , vol.20 , Issue.1 , pp. 322-335
- Shahnaz, C.¹ Zhu, W.P.² Ahmad, M.O.³

23
- 0026103222
- An autocorrelation pitch detector and voicing decision with confidence measures developed for noisecorrupted speech
- Feb.
- D. Krubsack and R. J. Niederjohn, "An autocorrelation pitch detector and voicing decision with confidence measures developed for noisecorrupted speech," IEEE Trans. Signal Process., vol. 39, no. 2, pp. 319-329, Feb. 1991.
- (1991) IEEE Trans. Signal Process. , vol.39 , Issue.2 , pp. 319-329
- Krubsack, D.¹ Niederjohn, R.J.²

24
- 0029326498
- Fundamental frequency determination based on instantaneous frequency estimation
- Jun.
- L. Qiu, H.Yang, and S.N.Koh, "Fundamental frequency determination based on instantaneous frequency estimation," Signal Process., vol. 44, no. 2, pp. 233-241, Jun. 1995.
- (1995) Signal Process. , vol.44 , Issue.2 , pp. 233-241
- Qiu, L.¹ Yang, H.² Koh, S.N.³

25
- 37649002185
- Estimation of the instantaneous pitch of speech
- Mar.
- B. Resch, M. Nilsson, A. Ekman, and W. B. Kleijn, "Estimation of the instantaneous pitch of speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 813-822, Mar. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.3 , pp. 813-822
- Resch, B.¹ Nilsson, M.² Ekman, A.³ Kleijn, W.B.⁴

26
- 32644438199
- Speech pitch determination based on Hilbert-Huang transform
- Apr.
- H. Huang and J. Pan, "Speech pitch determination based on Hilbert-Huang transform," Signal Process., vol. 86, no. 4, pp. 792-803, Apr. 2006.
- (2006) Signal Process. , vol.86 , Issue.4 , pp. 792-803
- Huang, H.¹ Pan, J.²

27
- 77952083041
- A new algorithm for instantaneous F0 speech extraction based on ensemble empirical mode decomposition
- G. Schlotthauer, M. E. Torres, and H. L. Rufiner, "A new algorithm for instantaneous F0 speech extraction based on ensemble empirical mode decomposition," in Proc. 17th Eur. Signal Process. Conf., Aug. 2009, pp. 2347-2351.
- Proc. 17th Eur. Signal Process. Conf., Aug. 2009 , pp. 2347-2351
- Schlotthauer, G.¹ Torres, M.E.² Rufiner, H.L.³

28
- 0024924999
- Automatic and reliable estimation of glottal closure instant and period
- Dec.
- Y. M. Cheng and D. O'Shaughnessy, "Automatic and reliable estimation of glottal closure instant and period," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 12, pp. 1805-1815, Dec. 1989.
- (1989) IEEE Trans. Acoust., Speech, Signal Process. , vol.37 , Issue.12 , pp. 1805-1815
- Cheng, Y.M.¹ O'Shaughnessy, D.²

29
- 0026727405
- Application of the wavelet transform for pitch detection of speech signals
- Mar.
- S. Kadambe and G. F. Boudreaux-Bartels, "Application of the wavelet transform for pitch detection of speech signals," IEEE Trans. Inf. Theory, vol. 38, no. 2, pp. 917-924, Mar. 1992.
- (1992) IEEE Trans. Inf. Theory , vol.38 , Issue.2 , pp. 917-924
- Kadambe, S.¹ Boudreaux-Bartels, G.F.²

30
- 65249133648
- Pitch period estimation using multipulse model and wavelet transformation
- P. K. Ghosh, A. Ortega, and S. Narayanan, "Pitch period estimation using multipulse model and wavelet transformation," in Proc. Interspeech, Aug. 2007, pp. 2761-2764.
- Proc. Interspeech, Aug. 2007 , pp. 2761-2764
- Ghosh, P.K.¹ Ortega, A.² Narayanan, S.³

31
- 65249149180
- Event-based instantaneous fundamental frequency estimation from speech signals
- May
- B. Yegnanarayana and K. S. R. Murty, "Event-based instantaneous fundamental frequency estimation from speech signals," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 4, pp. 614-624, May 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.4 , pp. 614-624
- Yegnanarayana, B.¹ Murty, K.S.R.²

32
- 70450198169
- Glottal closure and opening instant detection from speech signals
- T. Drugman and T. Dutoit, "Glottal closure and opening instant detection from speech signals," in Proc. Interspeech, Sep. 2009, pp. 2891-2894.
- Proc. Interspeech, Sep. 2009 , pp. 2891-2894
- Drugman, T.¹ Dutoit, T.²

33
- 84896359998
- GCI identification from voiced speech using the eigen value decomposition of Hankel matrix
- P. Jain and R. B. Pachori, "GCI identification from voiced speech using the eigen value decomposition of Hankel matrix," in Proc. IEEE Int. Symp. Image Signal Process. Anal., Sep. 2013, pp. 371-376.
- Proc. IEEE Int. Symp. Image Signal Process. Anal., Sep. 2013 , pp. 371-376
- Jain, P.¹ Pachori, R.B.²

34
- 84860119809
- Time-order representation based method for epoch detection
- Feb.
- P. Jain and R. B. Pachori, "Time-order representation based method for epoch detection," J. Intell. Syst., vol. 21, no. 1, pp. 79-95, Feb. 2012.
- (2012) J. Intell. Syst. , vol.21 , Issue.1 , pp. 79-95
- Jain, P.¹ Pachori, R.B.²

35
- 84875532258
- Marginal energy density over the low frequency range as a feature for voiced/non-voiced detection in noisy speech signals
- May
- P. Jain and R. B. Pachori, "Marginal energy density over the low frequency range as a feature for voiced/non-voiced detection in noisy speech signals," J. Franklin Inst., vol. 350, no. 4, pp. 698-716, May 2013.
- (2013) J. Franklin Inst. , vol.350 , Issue.4 , pp. 698-716
- Jain, P.¹ Pachori, R.B.²

36
- 84911434649
- [Online]. Available
- [Online]. Available: http://www.ncvs.org/ncvs/tutorials/voiceprod/tutorial/influence.html

37
- 71649095504
- Analysis of multicomponent AM-FM signals using FB-DESA method
- Jan.
- R. B. Pachori and P. Sircar, "Analysis of multicomponent AM-FM signals using FB-DESA method," Digital Signal Process., vol. 20, no. 1, pp. 42-62, Jan. 2010.
- (2010) Digital Signal Process. , vol.20 , Issue.1 , pp. 42-62
- Pachori, R.B.¹ Sircar, P.²

38
- 0032628065
- Acomparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion
- May
- K. Gopalan, T. R. Anderson, and E. Cupples, "Acomparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion," IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 289-294, May 1999.
- (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.3 , pp. 289-294
- Gopalan, K.¹ Anderson, T.R.² Cupples, E.³

39
- 38249004577
- Signal processing via Fourier-Bessel series expansion
- Apr.
- J. Schroeder, "Signal processing via Fourier-Bessel series expansion," Digital Signal Process., vol. 3, no. 2, pp. 112-124, Apr. 1993.
- (1993) Digital Signal Process. , vol.3 , Issue.2 , pp. 112-124
- Schroeder, J.¹

40
- 35248825924
- EEG signal analysis using FB expansion and second-order linear TVAR process
- Feb.
- R. B. Pachori and P. Sircar, "EEG signal analysis using FB expansion and second-order linear TVAR process," Signal Process., vol. 88, no. 2, pp. 415-420, Feb. 2008.
- (2008) Signal Process. , vol.88 , Issue.2 , pp. 415-420
- Pachori, R.B.¹ Sircar, P.²

41
- 84896329380
- Saarbrucken, Germany: LAP Lambert Academic Publishing
- R. B. Pachori and P. Sircar, Non-stationary Signal Analysis: Methods based on Fourier-Bessel Representation. Saarbrucken, Germany: LAP Lambert Academic Publishing, 2010.
- (2010) Non-stationary Signal Analysis: Methods Based on Fourier-Bessel Representation
- Pachori, R.B.¹ Sircar, P.²

42
- 0000330384
- On decomposing speech into modulated components
- May
- A. Rao and R. Kumaresan, "On decomposing speech into modulated components," IEEE Trans. Speech, Audio Process., vol. 8, no. 3, pp. 240-254, May 2000.
- (2000) IEEE Trans. Speech, Audio Process. , vol.8 , Issue.3 , pp. 240-254
- Rao, A.¹ Kumaresan, R.²

43
- 0005632207
- New Dehli, India: Academic
- J. Gilbert and L. Gilbert, Linear Algebra and Matrix Theory. New Dehli, India: Academic, 2005.
- (2005) Linear Algebra and Matrix Theory
- Gilbert, J.¹ Gilbert, L.²

44
- 0025593242
- Tracking the frequencies of superimposed time-varying harmonics
- C. L. DiMonte and K. S. Arun, "Tracking the frequencies of superimposed time-varying harmonics," in Proc. Int. Conf. Acoust., Speech, Signal Process., Apr. 1990, pp. 2539-2542.
- Proc. Int. Conf. Acoust., Speech, Signal Process., Apr. 1990 , pp. 2539-2542
- DiMonte, C.L.¹ Arun, K.S.²

45
- 5444236478
- The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis
- N. E. Huang et al., "The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis," in Proc. R. Soc. London A, Mar. 1998, vol. 454, no. 1971, pp. 903-995.
- Proc. R. Soc. London A, Mar. 1998 , vol.454 , Issue.1971 , pp. 903-995
- Huang, N.E.¹

46
- 85090317334
- A pitch extraction reference database
- F. Plante, G. F. Meyer, and W. A. Ainsworth, "A pitch extraction reference database," in Proc. Eur. Conf. Speech Commun., Sep. 1995, pp. 837-840.
- Proc. Eur. Conf. Speech Commun., Sep. 1995 , pp. 837-840
- Plante, F.¹ Meyer, G.F.² Ainsworth, W.A.³

47
- 85093707396
- Enhanced pitch tracking and the processing of F0 contours for computer aided and intonation teaching
- P. C. Bagshaw, S.M. Hiller, and M. A. Jack, "Enhanced pitch tracking and the processing of F0 contours for computer aided and intonation teaching," in Proc. Eur. Conf. Speech Commun., Sep. 1993, vol. 2, pp. 1003-1006.
- Proc. Eur. Conf. Speech Commun., Sep. 1993 , vol.2 , pp. 1003-1006
- Bagshaw, P.C.¹ Hiller, S.M.² Jack, M.A.³

48
- 84896833162
- Available: Retrieved July 11, 2012
- P. C. Bagshaw, "Evaluating pitch determination algorithms," [Online]. Available: http://www.cstr.ed.ac.uk/research/projects/fda/ Retrieved July 11, 2012
- "Evaluating Pitch Determination Algorithms," [Online]
- Bagshaw, P.C.¹

49
- 84911363459
- [Online]. Available
- [Online]. Available: www.speech.cs.cmu.edu/comp.speech/Section1/Data/noisex.html

50
- 0036214787
- YIN, a fundamental frequency estimator for speech and music
- Apr.
- A. de Cheveigne and H. Kawahara, "YIN, a fundamental frequency estimator for speech and music," J. Acoust. Soc. Amer., vol. 111, no. 4, pp. 1917-1930, Apr. 2002.
- (2002) J. Acoust. Soc. Amer. , vol.111 , Issue.4 , pp. 1917-1930
- De Cheveigne, A.¹ Kawahara, H.²

51
- 77953352057
- Available: Retrieved July 11, 2012
- P. Boersma and D. Weenink, "Praat: Doing phonetics by computer (version: 5.3.21) [computer program]," [Online]. Available: http://www.fon.hum.uva.nl/praat/ Retrieved July 11, 2012
- "Praat: Doing Phonetics by Computer (Version: 5.3.21) [Computer Program]," [Online]
- Boersma, P.¹ Weenink, D.²

52
- 84911362960
- Burlington, MA, USA: Academic
- R. J. Freund, W. J. Wilson, and D. J. Mohr, Stastical Methods. Burlington, MA, USA: Academic, 2010.
- (2010) Stastical Methods
- Freund, R.J.¹ Wilson, W.J.² Mohr, D.J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.