-
2
-
-
0025543906
-
Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
-
Dec.
-
E. Moulines and F. Charpentier, "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun., vol. 9, no. 5, pp. 453-467, Dec. 1990.
-
(1990)
Speech Commun.
, vol.9
, Issue.5
, pp. 453-467
-
-
Moulines, E.1
Charpentier, F.2
-
3
-
-
0028996945
-
Speech compression using pitch synchronous interpolation
-
R. Taori, R. J. Sluijter, and E. Kathmann, "Speech compression using pitch synchronous interpolation," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process, May 1995, vol. 1, pp. 512-515.
-
Proc. IEEE Int. Conf. Acoust. Speech, Signal Process, May 1995
, vol.1
, pp. 512-515
-
-
Taori, R.1
Sluijter, R.J.2
Kathmann, E.3
-
4
-
-
77950029338
-
Voice conversion by mapping the speaker-specific features using pitch synchronous approach
-
Jul.
-
K. S. Rao, "Voice conversion by mapping the speaker-specific features using pitch synchronous approach," Comput. Speech Lang., vol. 24, no. 3, pp. 474-494, Jul. 2010.
-
(2010)
Comput. Speech Lang.
, vol.24
, Issue.3
, pp. 474-494
-
-
Rao, K.S.1
-
5
-
-
21844454996
-
Modeling prosodic feature sequences for speaker recognition
-
Jul.
-
E. Shriberg, L. Ferrer, S. Kajarekar, A. Venkataraman, and A. Stolcke, "Modeling prosodic feature sequences for speaker recognition," Speech Commun., vol. 46, no. 3-4, pp. 455-472, Jul. 2005.
-
(2005)
Speech Commun.
, vol.46
, Issue.3-4
, pp. 455-472
-
-
Shriberg, E.1
Ferrer, L.2
Kajarekar, S.3
Venkataraman, A.4
Stolcke, A.5
-
6
-
-
85009145332
-
Prosody-based automatic detection of annoyance and frustration in human-computer dialog
-
J. Ang, R. Dhillon, A. Krupski, E. Shriberg, and A. Stockle, "Prosody-based automatic detection of annoyance and frustration in human-computer dialog," in Proc. Int. Conf. Spoken Lang. Process., Sep. 2002, pp. 2037-2040.
-
Proc. Int. Conf. Spoken Lang. Process., Sep. 2002
, pp. 2037-2040
-
-
Ang, J.1
Dhillon, R.2
Krupski, A.3
Shriberg, E.4
Stockle, A.5
-
7
-
-
0032645823
-
An improvement of LPC based on noise reduction using pitch synchronous addition
-
Y. Kuroiwa and T. Shimamura, "An improvement of LPC based on noise reduction using pitch synchronous addition," in Proc. IEEE Int. Symp. Circuits Syst., Jul. 1999, vol. 3, pp. 122-125.
-
Proc. IEEE Int. Symp. Circuits Syst., Jul. 1999
, vol.3
, pp. 122-125
-
-
Kuroiwa, Y.1
Shimamura, T.2
-
8
-
-
0032630841
-
Harmonic sound stream segregation using localization and its application to speech stream segregation
-
Apr.
-
T. Nakatani and H. G. Okuno, "Harmonic sound stream segregation using localization and its application to speech stream segregation," Speech Commun., vol. 27, no. 3-4, pp. 209-222, Apr. 1999.
-
(1999)
Speech Commun.
, vol.27
, Issue.3-4
, pp. 209-222
-
-
Nakatani, T.1
Okuno, H.G.2
-
10
-
-
0034163034
-
A comparative analysis of fundamental frequency estimation methods with application to pathological voices
-
Mar.
-
C. Manfredi, M. D'Aniello, P. Bruscaglioni, and A. Ismaelli, "A comparative analysis of fundamental frequency estimation methods with application to pathological voices," Med. Eng. Phys., vol. 22, no. 2, pp. 135-147, Mar. 2000.
-
(2000)
Med. Eng. Phys.
, vol.22
, Issue.2
, pp. 135-147
-
-
Manfredi, C.1
D'Aniello, M.2
Bruscaglioni, P.3
Ismaelli, A.4
-
11
-
-
0017097478
-
A comparative performance study of several pitch detection algorithms
-
Oct.
-
L. Rabiner, M. Cheng, A. E. Rosenberg, and C. McGonegal, "A comparative performance study of several pitch detection algorithms," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-24, no. 5, pp. 399-418, Oct. 1976.
-
(1976)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-24
, Issue.5
, pp. 399-418
-
-
Rabiner, L.1
Cheng, M.2
Rosenberg, A.E.3
McGonegal, C.4
-
13
-
-
0036642776
-
Analysis, enhancement and evaluation of five pitch determination techniques
-
Jul.
-
P. Veprek and M. S. Scordilis, "Analysis, enhancement and evaluation of five pitch determination techniques," Speech Commun., vol. 37, no. 3-4, pp. 249-270, Jul. 2002.
-
(2002)
Speech Commun.
, vol.37
, Issue.3-4
, pp. 249-270
-
-
Veprek, P.1
Scordilis, M.S.2
-
14
-
-
0016114130
-
Average magnitude difference function pitch extractor
-
Oct.
-
M. Ross, H. Shaffer, A. Cohen, R. Freudberg, and H. Manley, "Average magnitude difference function pitch extractor," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-22, no. 5, pp. 353-362, Oct. 1974.
-
(1974)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-22
, Issue.5
, pp. 353-362
-
-
Ross, M.1
Shaffer, H.2
Cohen, A.3
Freudberg, R.4
Manley, H.5
-
15
-
-
0017367712
-
On the use of autocorrelation analysis for pitch detection
-
Feb.
-
L. Rabiner, "On the use of autocorrelation analysis for pitch detection," IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-25, no. 1, pp. 24-33, Feb. 1977.
-
(1977)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-25
, Issue.1
, pp. 24-33
-
-
Rabiner, L.1
-
16
-
-
0014055288
-
Cepstrum pitch determination
-
Aug.
-
A. M. Noll, "Cepstrum pitch determination," J. Acoust. Soc. Amer., vol. 41, no. 2, pp. 293-309, Aug. 1967.
-
(1967)
J. Acoust. Soc. Amer.
, vol.41
, Issue.2
, pp. 293-309
-
-
Noll, A.M.1
-
17
-
-
0015488387
-
The SIFT algorithm for fundamental frequency estimation
-
Dec.
-
J. Markel, "The SIFT algorithm for fundamental frequency estimation," IEEE Trans. Audio Electroacoust., vol. AE-20, no. 5, pp. 367-377, Dec. 1972.
-
(1972)
IEEE Trans. Audio Electroacoust.
, vol.AE-20
, Issue.5
, pp. 367-377
-
-
Markel, J.1
-
19
-
-
0023833270
-
Measurement of pitch by subharmonic summation
-
Jan.
-
D. J. Hermes, "Measurement of pitch by subharmonic summation," J. Acoust. Soc. Amer., vol. 83, no. 1, pp. 257-264, Jan. 1988.
-
(1988)
J. Acoust. Soc. Amer.
, vol.83
, Issue.1
, pp. 257-264
-
-
Hermes, D.J.1
-
20
-
-
0035472923
-
Weighted autocorrelation for pitch extraction of noisy speech
-
Oct.
-
T. Shimamura and H. Kobayashi, "Weighted autocorrelation for pitch extraction of noisy speech," IEEE Trans. Speech Audio Process., vol. 9, no. 7, pp. 727-730, Oct. 2001.
-
(2001)
IEEE Trans. Speech Audio Process.
, vol.9
, Issue.7
, pp. 727-730
-
-
Shimamura, T.1
Kobayashi, H.2
-
21
-
-
11144332020
-
Robust and accurate fundamental frequency estimation based on dominant harmonic components
-
Dec.
-
T. Nakatani and T. Irino, "Robust and accurate fundamental frequency estimation based on dominant harmonic components," J. Acoust. Soc. Amer., vol. 116, no. 6, pp. 3690-3700, Dec. 2004.
-
(2004)
J. Acoust. Soc. Amer.
, vol.116
, Issue.6
, pp. 3690-3700
-
-
Nakatani, T.1
Irino, T.2
-
22
-
-
81355122934
-
Pitch estimation based on a harmonic sinusoidal autocorrelation model and a time-domain matching scheme
-
Jan.
-
C. Shahnaz, W. P. Zhu, and M. O. Ahmad, "Pitch estimation based on a harmonic sinusoidal autocorrelation model and a time-domain matching scheme," IEEE Trans. Audio, Speech, Lang. Process., vol. 20, no. 1, pp. 322-335, Jan. 2012.
-
(2012)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.20
, Issue.1
, pp. 322-335
-
-
Shahnaz, C.1
Zhu, W.P.2
Ahmad, M.O.3
-
23
-
-
0026103222
-
An autocorrelation pitch detector and voicing decision with confidence measures developed for noisecorrupted speech
-
Feb.
-
D. Krubsack and R. J. Niederjohn, "An autocorrelation pitch detector and voicing decision with confidence measures developed for noisecorrupted speech," IEEE Trans. Signal Process., vol. 39, no. 2, pp. 319-329, Feb. 1991.
-
(1991)
IEEE Trans. Signal Process.
, vol.39
, Issue.2
, pp. 319-329
-
-
Krubsack, D.1
Niederjohn, R.J.2
-
24
-
-
0029326498
-
Fundamental frequency determination based on instantaneous frequency estimation
-
Jun.
-
L. Qiu, H.Yang, and S.N.Koh, "Fundamental frequency determination based on instantaneous frequency estimation," Signal Process., vol. 44, no. 2, pp. 233-241, Jun. 1995.
-
(1995)
Signal Process.
, vol.44
, Issue.2
, pp. 233-241
-
-
Qiu, L.1
Yang, H.2
Koh, S.N.3
-
25
-
-
37649002185
-
Estimation of the instantaneous pitch of speech
-
Mar.
-
B. Resch, M. Nilsson, A. Ekman, and W. B. Kleijn, "Estimation of the instantaneous pitch of speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 813-822, Mar. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.3
, pp. 813-822
-
-
Resch, B.1
Nilsson, M.2
Ekman, A.3
Kleijn, W.B.4
-
26
-
-
32644438199
-
Speech pitch determination based on Hilbert-Huang transform
-
Apr.
-
H. Huang and J. Pan, "Speech pitch determination based on Hilbert-Huang transform," Signal Process., vol. 86, no. 4, pp. 792-803, Apr. 2006.
-
(2006)
Signal Process.
, vol.86
, Issue.4
, pp. 792-803
-
-
Huang, H.1
Pan, J.2
-
27
-
-
77952083041
-
A new algorithm for instantaneous F0 speech extraction based on ensemble empirical mode decomposition
-
G. Schlotthauer, M. E. Torres, and H. L. Rufiner, "A new algorithm for instantaneous F0 speech extraction based on ensemble empirical mode decomposition," in Proc. 17th Eur. Signal Process. Conf., Aug. 2009, pp. 2347-2351.
-
Proc. 17th Eur. Signal Process. Conf., Aug. 2009
, pp. 2347-2351
-
-
Schlotthauer, G.1
Torres, M.E.2
Rufiner, H.L.3
-
28
-
-
0024924999
-
Automatic and reliable estimation of glottal closure instant and period
-
Dec.
-
Y. M. Cheng and D. O'Shaughnessy, "Automatic and reliable estimation of glottal closure instant and period," IEEE Trans. Acoust., Speech, Signal Process., vol. 37, no. 12, pp. 1805-1815, Dec. 1989.
-
(1989)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.37
, Issue.12
, pp. 1805-1815
-
-
Cheng, Y.M.1
O'Shaughnessy, D.2
-
29
-
-
0026727405
-
Application of the wavelet transform for pitch detection of speech signals
-
Mar.
-
S. Kadambe and G. F. Boudreaux-Bartels, "Application of the wavelet transform for pitch detection of speech signals," IEEE Trans. Inf. Theory, vol. 38, no. 2, pp. 917-924, Mar. 1992.
-
(1992)
IEEE Trans. Inf. Theory
, vol.38
, Issue.2
, pp. 917-924
-
-
Kadambe, S.1
Boudreaux-Bartels, G.F.2
-
30
-
-
65249133648
-
Pitch period estimation using multipulse model and wavelet transformation
-
P. K. Ghosh, A. Ortega, and S. Narayanan, "Pitch period estimation using multipulse model and wavelet transformation," in Proc. Interspeech, Aug. 2007, pp. 2761-2764.
-
Proc. Interspeech, Aug. 2007
, pp. 2761-2764
-
-
Ghosh, P.K.1
Ortega, A.2
Narayanan, S.3
-
31
-
-
65249149180
-
Event-based instantaneous fundamental frequency estimation from speech signals
-
May
-
B. Yegnanarayana and K. S. R. Murty, "Event-based instantaneous fundamental frequency estimation from speech signals," IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 4, pp. 614-624, May 2009.
-
(2009)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.17
, Issue.4
, pp. 614-624
-
-
Yegnanarayana, B.1
Murty, K.S.R.2
-
32
-
-
70450198169
-
Glottal closure and opening instant detection from speech signals
-
T. Drugman and T. Dutoit, "Glottal closure and opening instant detection from speech signals," in Proc. Interspeech, Sep. 2009, pp. 2891-2894.
-
Proc. Interspeech, Sep. 2009
, pp. 2891-2894
-
-
Drugman, T.1
Dutoit, T.2
-
34
-
-
84860119809
-
Time-order representation based method for epoch detection
-
Feb.
-
P. Jain and R. B. Pachori, "Time-order representation based method for epoch detection," J. Intell. Syst., vol. 21, no. 1, pp. 79-95, Feb. 2012.
-
(2012)
J. Intell. Syst.
, vol.21
, Issue.1
, pp. 79-95
-
-
Jain, P.1
Pachori, R.B.2
-
35
-
-
84875532258
-
Marginal energy density over the low frequency range as a feature for voiced/non-voiced detection in noisy speech signals
-
May
-
P. Jain and R. B. Pachori, "Marginal energy density over the low frequency range as a feature for voiced/non-voiced detection in noisy speech signals," J. Franklin Inst., vol. 350, no. 4, pp. 698-716, May 2013.
-
(2013)
J. Franklin Inst.
, vol.350
, Issue.4
, pp. 698-716
-
-
Jain, P.1
Pachori, R.B.2
-
36
-
-
84911434649
-
-
[Online]. Available
-
[Online]. Available: http://www.ncvs.org/ncvs/tutorials/voiceprod/tutorial/influence.html
-
-
-
-
37
-
-
71649095504
-
Analysis of multicomponent AM-FM signals using FB-DESA method
-
Jan.
-
R. B. Pachori and P. Sircar, "Analysis of multicomponent AM-FM signals using FB-DESA method," Digital Signal Process., vol. 20, no. 1, pp. 42-62, Jan. 2010.
-
(2010)
Digital Signal Process.
, vol.20
, Issue.1
, pp. 42-62
-
-
Pachori, R.B.1
Sircar, P.2
-
38
-
-
0032628065
-
Acomparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion
-
May
-
K. Gopalan, T. R. Anderson, and E. Cupples, "Acomparison of speaker identification results using features based on cepstrum and Fourier-Bessel expansion," IEEE Trans. Speech Audio Process., vol. 7, no. 3, pp. 289-294, May 1999.
-
(1999)
IEEE Trans. Speech Audio Process.
, vol.7
, Issue.3
, pp. 289-294
-
-
Gopalan, K.1
Anderson, T.R.2
Cupples, E.3
-
39
-
-
38249004577
-
Signal processing via Fourier-Bessel series expansion
-
Apr.
-
J. Schroeder, "Signal processing via Fourier-Bessel series expansion," Digital Signal Process., vol. 3, no. 2, pp. 112-124, Apr. 1993.
-
(1993)
Digital Signal Process.
, vol.3
, Issue.2
, pp. 112-124
-
-
Schroeder, J.1
-
40
-
-
35248825924
-
EEG signal analysis using FB expansion and second-order linear TVAR process
-
Feb.
-
R. B. Pachori and P. Sircar, "EEG signal analysis using FB expansion and second-order linear TVAR process," Signal Process., vol. 88, no. 2, pp. 415-420, Feb. 2008.
-
(2008)
Signal Process.
, vol.88
, Issue.2
, pp. 415-420
-
-
Pachori, R.B.1
Sircar, P.2
-
42
-
-
0000330384
-
On decomposing speech into modulated components
-
May
-
A. Rao and R. Kumaresan, "On decomposing speech into modulated components," IEEE Trans. Speech, Audio Process., vol. 8, no. 3, pp. 240-254, May 2000.
-
(2000)
IEEE Trans. Speech, Audio Process.
, vol.8
, Issue.3
, pp. 240-254
-
-
Rao, A.1
Kumaresan, R.2
-
44
-
-
0025593242
-
Tracking the frequencies of superimposed time-varying harmonics
-
C. L. DiMonte and K. S. Arun, "Tracking the frequencies of superimposed time-varying harmonics," in Proc. Int. Conf. Acoust., Speech, Signal Process., Apr. 1990, pp. 2539-2542.
-
Proc. Int. Conf. Acoust., Speech, Signal Process., Apr. 1990
, pp. 2539-2542
-
-
DiMonte, C.L.1
Arun, K.S.2
-
45
-
-
5444236478
-
The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis
-
N. E. Huang et al., "The empirical mode decomposition and the Hilbert spectrum for nonlinear and non-stationary time series analysis," in Proc. R. Soc. London A, Mar. 1998, vol. 454, no. 1971, pp. 903-995.
-
Proc. R. Soc. London A, Mar. 1998
, vol.454
, Issue.1971
, pp. 903-995
-
-
Huang, N.E.1
-
46
-
-
85090317334
-
A pitch extraction reference database
-
F. Plante, G. F. Meyer, and W. A. Ainsworth, "A pitch extraction reference database," in Proc. Eur. Conf. Speech Commun., Sep. 1995, pp. 837-840.
-
Proc. Eur. Conf. Speech Commun., Sep. 1995
, pp. 837-840
-
-
Plante, F.1
Meyer, G.F.2
Ainsworth, W.A.3
-
47
-
-
85093707396
-
Enhanced pitch tracking and the processing of F0 contours for computer aided and intonation teaching
-
P. C. Bagshaw, S.M. Hiller, and M. A. Jack, "Enhanced pitch tracking and the processing of F0 contours for computer aided and intonation teaching," in Proc. Eur. Conf. Speech Commun., Sep. 1993, vol. 2, pp. 1003-1006.
-
Proc. Eur. Conf. Speech Commun., Sep. 1993
, vol.2
, pp. 1003-1006
-
-
Bagshaw, P.C.1
Hiller, S.M.2
Jack, M.A.3
-
49
-
-
84911363459
-
-
[Online]. Available
-
[Online]. Available: www.speech.cs.cmu.edu/comp.speech/Section1/Data/noisex.html
-
-
-
-
50
-
-
0036214787
-
YIN, a fundamental frequency estimator for speech and music
-
Apr.
-
A. de Cheveigne and H. Kawahara, "YIN, a fundamental frequency estimator for speech and music," J. Acoust. Soc. Amer., vol. 111, no. 4, pp. 1917-1930, Apr. 2002.
-
(2002)
J. Acoust. Soc. Amer.
, vol.111
, Issue.4
, pp. 1917-1930
-
-
De Cheveigne, A.1
Kawahara, H.2
-
52
-
-
84911362960
-
-
Burlington, MA, USA: Academic
-
R. J. Freund, W. J. Wilson, and D. J. Mohr, Stastical Methods. Burlington, MA, USA: Academic, 2010.
-
(2010)
Stastical Methods
-
-
Freund, R.J.1
Wilson, W.J.2
Mohr, D.J.3
|