SCOPUS Information Search Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 19, Issue 5, 2011, Pages 1080-1090

Phase Minimization for Glottal Model Estimation

(3) Degottex, Gilles (a); Roebel, Axel (a); Rodet, Xavier (a)

(a) IRCAM (France)

Author keywords

glottal closure instants (GCIs); glottal model; glottal shape; joint estimation; phase minimization; voice analysis

EID: 85008008295; PISSN: 15587916; EISSN: 15587924; Source Type: Journal
DOI: 10.1109/TASL.2010.2076806; Document Type: Article

Times cited: 52

References (40)

1
- 17644370535
- The voice source as a causal/anticausal linear filter
- B. Doval, C. d'Alessandro, and N. Henrich, “The voice source as a causal/anticausal linear filter,” VOQUAL, 2003.
- (2003) VOQUAL
- Doval, B.¹ d'Alessandro, C.² Henrich, N.³

2
- 38049028071
- The LF-model revisited. Transformations and frequency domain analysis
- G. Fant, “The LF-model revisited. Transformations and frequency domain analysis,” STL-QPSR, vol. 36, no. 2–3, pp. 119–156, 1995.
- (1995) STL-QPSR , vol.36 , Issue.2-3 , pp. 119-156
- Fant, G.¹

3
- 33947684811
- A four-parameter model of glottal flow
- G. Fant, J. Liljencrants, and Q.-G. Lin, “A four-parameter model of glottal flow,” STL-QPSR, vol. 26, no. 4, pp. 1–13, 1985.
- (1985) STL-QPSR , vol.26 , Issue.4 , pp. 1-13
- Fant, G.¹ Liljencrants, J.² Lin, Q.-G.³

4
- 0032875050
- A method for generating natural-sounding speech stimuli for cognitive brain research
- P. Alku, H. Tiitinen, and R. Naatanen, “A method for generating natural-sounding speech stimuli for cognitive brain research,” Clinical Neurophysiol., vol. 110, no. 8, pp. 1329–1333, 1999.
- (1999) Clinical Neurophysiol. , vol.110 , Issue.8 , pp. 1329-1333
- Alku, P.¹ Tiitinen, H.² Naatanen, R.³

5
- 4344646427
- Ph.D. dissertation, Stanford Univ., Stanford, CA
- H.-L. Lu, “Toward a high-quality singing synthesizer with vocal texture control,” Ph.D. dissertation, Stanford Univ., Stanford, CA, 2002.
- (2002) Toward a high-quality singing synthesizer with vocal texture control
- Lu, H.-L.¹

6
- 78049381054
- Estimation of LF glottal source parameters based on an ARX model
- D. Vincent, O. Rosec, and T. Chonavel, “Estimation of LF glottal source parameters based on an ARX model,” in Proc. Interspeech, 2005.
- (2005) Proc. Interspeech
- Vincent, D.¹ Rosec, O.² Chonavel, T.³

7
- 85008061329
- Glottal source estimation robustness
- T. Drugman, T. Dubuisson, A. Moinet, N. D'Alessandro, and T. Dutoit, “Glottal source estimation robustness,” in Proc. SIGMAP, 2008.
- (2008) Proc. SIGMAP
- Drugman, T.¹ Dubuisson, T.² Moinet, A.³ D'Alessandro, N.⁴ Dutoit, T.⁵

8
- 0003874959
- Berlin, Germany: Springer Verlag
- J. D. Markel and A. H. Gray, Linear Prediction of Speech. Berlin, Germany: Springer Verlag, 1976.
- (1976) Linear Prediction of Speech
- Markel, J.D.¹ Gray, A.H.²

9
- 0027623462
- Lossless pole-zero modeling of speech signals
- Jul.
- I.-T. Lim and B. G. Lee, “Lossless pole-zero modeling of speech signals,” IEEE Trans. Speech Audio Process., vol. 1, no. 3, pp. 269–276, Jul. 1993.
- (1993) IEEE Trans. Speech Audio Process. , vol.1 , Issue.3 , pp. 269-276
- Lim, I.-T.¹ Lee, B.G.²

10
- 0026106454
- Discrete all-pole modeling
- Feb.
- A. El-Jaroudi and J. Makhoul, “Discrete all-pole modeling,” IEEE Trans. Signal Process., vol. 39, no. 2, pp. 411–423, Feb. 1991.
- (1991) IEEE Trans. Signal Process. , vol.39 , Issue.2 , pp. 411-423
- El-Jaroudi, A.¹ Makhoul, J.²

11
- 0001628038
- Nonlinear filtering of multiplied and convolved signals
- Aug.
- A. Oppenheim, R. Schafer, and T. Stockham, “Nonlinear filtering of multiplied and convolved signals,” Proc. IEEE, vol. 56, no. 8, pp. 1264–1291, Aug. 1968.
- (1968) Proc. IEEE , vol.56 , Issue.8 , pp. 1264-1291
- Oppenheim, A.¹ Schafer, R.² Stockham, T.³

12
- 17644365443
- Zeros of z-transform representation with application to source-filter separation in speech
- Apr.
- B. Bozkurt, B. Doval, C. D'Alessandro, and T. Dutoit, “Zeros of z-transform representation with application to source-filter separation in speech,” IEEE Signal Process. Lett., vol. 12, no. 4, pp. 344–347, Apr. 2005.
- (2005) IEEE Signal Process. Lett. , vol.12 , Issue.4 , pp. 344-347
- Bozkurt, B.¹ Doval, B.² D'Alessandro, C.³ Dutoit, T.⁴

13
- 0018653975
- Least squares glottal inverse filtering from the acoustic speech waveform
- Aug.
- D. Wong, J. D. Markel, and A. H. Gray, “Least squares glottal inverse filtering from the acoustic speech waveform,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 4, pp. 350–355, Aug. 1979.
- (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.4 , pp. 350-355
- Wong, D.¹ Markel, J.D.² Gray, A.H.³

14
- 0036299143
- The DYPSA algorithm for estimation of glottal closure instants in voiced speech
- A. Kounoudes, P. A. Naylor, and M. Brookes, “The DYPSA algorithm for estimation of glottal closure instants in voiced speech,” in Proc. ICASSP, 2002, pp. I-349–I-352.
- (2002) Proc. ICASSP , pp. I-349-I-352
- Kounoudes, A.¹ Naylor, P.A.² Brookes, M.³

15
- 0029375490
- Determination of instants of significant excitation in speech using group delay function
- Sep.
- R. Smits and B. Yegnanarayana, “Determination of instants of significant excitation in speech using group delay function,” IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 325–333, Sep. 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 325-333
- Smits, R.¹ Yegnanarayana, B.²

16
- 78049393696
- Joint estimate of shape and time-synchronization of a glottal source model by phase flatness
- G. Degottex, A. Roebel, and X. Rodet, “Joint estimate of shape and time-synchronization of a glottal source model by phase flatness,” in Proc. ICASSP, 2010, pp. 5058–5061.
- (2010) Proc. ICASSP , pp. 5058-5061
- Degottex, G.¹ Roebel, A.² Rodet, X.³

17
- 0026881384
- Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering
- P. Alku, “Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering,” Speech Commun., vol. 11, no. 2–3, pp. 109–118, 1992.
- (1992) Speech Commun. , vol.11 , Issue.2-3 , pp. 109-118
- Alku, P.¹

18
- 8644229278
- Ph.D. dissertation, Mass. Inst. of Technol., Cambridge
- R. Fernandez, “A computational model for the automatic recognition of affect in speech,” Ph.D. dissertation, Mass. Inst. of Technol., Cambridge, 2004.
- (2004) A computational model for the automatic recognition of affect in speech
- Fernandez, R.¹

19
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, “Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds,” Speech Commun., vol. 27, 1999.
- (1999) Speech Commun. , vol.27
- Kawahara, H.¹ Masuda-Katsuse, I.² Cheveigne, A.³

20
- 33646796046
- Ph.D. dissertation, Nagoya Inst. of Technol., Nagoya, Japan
- T. Yoshimura, “Simultaneous modeling of phonetic and prosodic parameters, and characteristic conversion for HMM-based text-to-speech systems,” Ph.D. dissertation, Nagoya Inst. of Technol., Nagoya, Japan, 2002.
- (2002) Simultaneous modeling of phonetic and prosodic parameters, and characteristic conversion for HMM-based text-to-speech systems
- Yoshimura, T.¹

21
- 84867209230
- HMM-based Finnish text-to-speech system utilizing glottal inverse filtering
- T. Raitio, A. Suni, H. Pulakka, M. Vainio, and P. Alku, “HMM-based Finnish text-to-speech system utilizing glottal inverse filtering,” in Proc. Interspeech, 2008.
- (2008) Proc. Interspeech
- Raitio, T.¹ Suni, A.² Pulakka, H.³ Vainio, M.⁴ Alku, P.⁵

22
- 0003757962
- Berlin, Germany: Springer Verlag
- J. L. Flanagan, Speech Analysis Synthesis and Perception. Berlin, Germany: Springer Verlag, 1972.
- (1972) Speech Analysis Synthesis and Perception
- Flanagan, J.L.¹

23
- 0003793552
- 2nd ed. Englewood Cliffs: Prentice-Hall
- A. V. Oppenheim and R. W. Schafer, Digital Signal Processing, 2nd ed. Englewood Cliffs: Prentice-Hall, 1978.
- (1978) Digital Signal Processing
- Oppenheim, A.V.¹ Schafer, R.W.²

24
- 0030701388
- Spectral correlates of glottal waveform models: An analytic study
- B. Doval and C. d'Alessandro, “Spectral correlates of glottal waveform models: An analytic study,” in Proc. ICASSP, 2000, pp. 1295–1298.
- (2000) Proc. ICASSP , pp. 1295-1298
- Doval, B.¹ d'Alessandro, C.²

25
- 84863772450
- Speech analysis/synthesis based on a sinusoidal representation
- Aug.
- R. McAulay and T. Quatieri, “Speech analysis/synthesis based on a sinusoidal representation,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 4, pp. 744–754, Aug. 1986.
- (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-34 , Issue.4 , pp. 744-754
- McAulay, R.¹ Quatieri, T.²

26
- 0036214787
- Yin, a fundamental frequency estimator for speech and music
- Apr.
- A. de Cheveigne and H. Kawahara, “Yin, a fundamental frequency estimator for speech and music,” JASA, vol. 111, Apr. 2002.
- (2002) JASA , vol.111
- de Cheveigne, A.¹ Kawahara, H.²

27
- 70350354569
- Ph.D. dissertation, Univ. of Florida, Gainesville, USA, Dec.
- A. Camacho, “SWIPE: A Sawtooth Waveform Inspired Pitch Estimator for Speech and Music,” Ph.D. dissertation, Univ. of Florida, Gainesville, USA, Dec. 2007.
- (2007) SWIPE: A Sawtooth Waveform Inspired Pitch Estimator for Speech and Music
- Camacho, A.¹

28
- 84902534716
- A new score function for joint evaluation of multiple f0 hypothesis
- Naples, Italy, Oct.
- C. Yeh and A. Roebel, “A new score function for joint evaluation of multiple f0 hypothesis,” in Proc. DAFx, Naples, Italy, Oct. 2004, pp. 234–239.
- (2004) Proc. DAFx , pp. 234-239
- Yeh, C.¹ Roebel, A.²

29
- 78649266362
- (in French) Ph.D. dissertation ENST, Paris, France
- D. Vincent, “Analyse et controle du signal glottique en synthese de la parole,” (in French) Ph.D. dissertation, ENST, Paris, France, 2007.
- (2007) Analyse et controle du signal glottique en synthese de la parole
- Vincent, D.¹

30
- 85009105007
- Spectral correlates of voice open quotient and glottal flow asymmetry: Theory, limits and experimental data
- N. Henrich, C. d'Alessandro, and B. Doval, “Spectral correlates of voice open quotient and glottal flow asymmetry: Theory, limits and experimental data,” in Proc. Eurospeech, 2001.
- (2001) Proc. Eurospeech
- Henrich, N.¹ d'Alessandro, C.² Doval, B.³

31
- 34547522367
- Estimation of the voicing cut-off frequency contour based on a cumulative harmonicity score
- Nov.
- K. Hermus, H. Van Hamme, and S. Irhimeh, “Estimation of the voicing cut-off frequency contour based on a cumulative harmonicity score,” IEEE Signal Process. Lett., vol. 14, no. 11, pp. 820–823, Nov. 2007.
- (2007) IEEE Signal Process. Lett. , vol.14 , Issue.11 , pp. 820-823
- Hermus, K.¹ Van Hamme, H.² Irhimeh, S.³

32
- 0003465464
- Englewood Cliffs, NJ: Prentice-Hall
- R. P. Brent, Algorithms for Minimization Without Derivatives. Englewood Cliffs, NJ: Prentice-Hall, 1973.
- (1973) Algorithms for Minimization Without Derivatives
- Brent, R.P.¹

33
- 44949246143
- Toolkit for voice inverse filtering and parametrization
- M. Airas, H. Pulakka, T. Backstrom, and P. Alku, “Toolkit for voice inverse filtering and parametrization,” in Proc. Interspeech, 2005, pp. 2145–2148.
- (2005) Proc. Interspeech , pp. 2145-2148
- Airas, M.¹ Pulakka, H.² Backstrom, T.³ Alku, P.⁴

34
- 56149091205
- A comparative evaluation of the zeros of z transform representation for voice source estimation
- N. Sturmel, C. d'Alessandro, and B. Doval, “A comparative evaluation of the zeros of z transform representation for voice source estimation,” in Proc. Interspeech, 2007.
- (2007) Proc. Interspeech
- Sturmel, N.¹ d'Alessandro, C.² Doval, B.³

35
- 70450170853
- Complex cepstrum-based decomposition of speech for glottal source estimation
- T. Drugman, B. Bozkurt, and T. Dutoit, “Complex cepstrum-based decomposition of speech for glottal source estimation,” in Proc. Interspeech, 2009.
- (2009) Proc. Interspeech
- Drugman, T.¹ Bozkurt, B.² Dutoit, T.³

36
- 0020281396
- A digital simulation method of the vocal-tract system
- S. Maeda, “A digital simulation method of the vocal-tract system,” Speech Commun., 1982.
- (1982) Speech Commun.
- Maeda, S.¹

37
- 69249091414
- The sigma algorithm: A glottal activity detector for electroglottographic signals
- Nov.
- M. R. P. Thomas and P. A. Naylor, “The sigma algorithm: A glottal activity detector for electroglottographic signals,” IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 8, pp. 1557–1566, Nov. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.8 , pp. 1557-1566
- Thomas, M.R.P.¹ Naylor, P.A.²

38
- 1542286741
- On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation
- N. Henrich, C. d'Alessandro, B. Doval, and M. Castellengo, “On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation,” J. Acoust. Soc. Amer., vol. 115, no. 3, pp. 1321–1332, 2004.
- (2004) J. Acoust. Soc. Amer. , vol.115 , Issue.3 , pp. 1321-1332
- Henrich, N.¹ d'Alessandro, C.² Doval, B.³ Castellengo, M.⁴

39
- 84870244113
- Glottal closure instant detection from a glottal shape estimate
- G. Degottex, A. Roebel, and X. Rodet, “Glottal closure instant detection from a glottal shape estimate,” in Proc. SPECOM, 2009, pp. 226–231.
- (2009) Proc. SPECOM , pp. 226-231
- Degottex, G.¹ Roebel, A.² Rodet, X.³

40
- 33646773080
- J. Kominek and A. W. Black, “CMU arctic databases for speech synthesis,” 2003.
- (2003) CMU arctic databases for speech synthesis
- Kominek, J.¹ Black, A.W.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.