Volume 19, Issue 5, 2011, Pages 1080-1090

Phase Minimization for Glottal Model Estimation

Author keywords

glottal closure instants (GCIs); Glottal model; glottal shape; joint estimation; phase minimization; voice analysis

EID: 85008008295     PISSN: 15587916     EISSN: 15587924     Source Type: Journal    
DOI: 10.1109/TASL.2010.2076806     Document Type: Article
Times cited : (52)

References (40)
  • 1
    • 17644370535
    • The voice source as a causal/anticausal linear filter
    • B. Doval, C. d'Alessandro, and N. Henrich, “The voice source as a causal/anticausal linear filter,” VOQUAL, 2003.
    • (2003) VOQUAL
    • Doval, B.1    d'Alessandro, C.2    Henrich, N.3
  • 2
    • 38049028071
    • The LF-model revisited. Transformations and frequency domain analysis
    • G. Fant “The LF-model revisited. Transformations and frequency domain analysis,” STL-QPSR, vol. 36, no. 2–3, pp. 119–156, 1995.
    • (1995) STL-QPSR , vol.36 , Issue.2 , pp. 119-156
    • Fant, G.1
  • 3
    • 33947684811
    • A four-parameter model of glottal flow
    • G. Fant, J. Liljencrants, and Q.-G. Lin “A four-parameter model of glottal flow,” STL-QPSR, vol. 26, no. 4, pp. 1–13, 1985.
    • (1985) STL-QPSR , vol.26 , Issue.4 , pp. 1-13
    • Fant, G.1    Liljencrants, J.2    Lin, Q.-G.3
  • 4
    • 0032875050
    • A method for generating natural-sounding speech stimuli for cognitive brain research
    • P. Alku, H. Tiitinen, and R. Naatanen “A method for generating natural-sounding speech stimuli for cognitive brain research,” Clinical Neurophysiol., vol. 110, no. 8, pp. 1329–1333, 1999.
    • (1999) Clinical Neurophysiol. , vol.110 , Issue.8 , pp. 1329-1333
    • Alku, P.1    Tiitinen, H.2    Naatanen, R.3
  • 6
    • 78049381054
    • Estimation of LF glottal source parameters based on an ARX model
    • D. Vincent, O. Rosec, and T. Chonavel, “Estimation of LF glottal source parameters based on an ARX model,” in Proc. Interspeech, 2005.
    • (2005) Proc. Interspeech
    • Vincent, D.1    Rosec, O.2    Chonavel, T.3
  • 9
    • 0027623462
    • Lossless pole-zero modeling of speech signals
    • I.-T. Lim and B. G. Lee, “Lossless pole-zero modeling of speech signals,” IEEE Trans. Speech Audio Process., vol. 1, no. 3, pp. 269–276, Jul. 1993.
    • (1993) IEEE Trans. Speech Audio Process. , vol.1 , Issue.3 , pp. 269-276
    • Lim, I.-T.1    Lee, B.G.2
  • 10
    • 0026106454
    • Discrete all-pole modeling
    • A. El-Jaroudi and J. Makhoul “Discrete all-pole modeling,” IEEE Trans. Signal Process., vol. 39, no. 2, pp. 411–423, Feb. 1991.
    • (1991) IEEE Trans. Signal Process. , vol.39 , Issue.2 , pp. 411-423
    • El-Jaroudi, A.1    Makhoul, J.2
  • 11
    • 0001628038
    • Nonlinear filtering of multiplied and convolved signals
    • A. Oppenheim, R. Schafer, and T. Stockham “Nonlinear filtering of multiplied and convolved signals,” Proc. IEEE, vol. 56, no. 8, pp. 1264–1291, Aug. 1968.
    • (1968) Proc. IEEE , vol.56 , Issue.8 , pp. 1264-1291
    • Oppenheim, A.1    Schafer, R.2    Stockham, T.3
  • 12
    • 17644365443
    • Zeros of z-transform representation with application to source-filter separation in speech
    • B. Bozkurt, B. Doval, C. D'Alessandro, and T. Dutoit “Zeros of z-transform representation with application to source-filter separation in speech,” IEEE Signal Process. Lett., vol. 12, no. 4, pp. 344–347, Apr. 2005.
    • (2005) IEEE Signal Process. Lett. , vol.12 , Issue.4 , pp. 344-347
    • Bozkurt, B.1    Doval, B.2    D'Alessandro, C.3    Dutoit, T.4
  • 13
    • 0018653975
    • Least squares glottal inverse filtering from the acoustic speech waveform
    • D. Wong, J. D. Markel, and A. H. Gray “Least squares glottal inverse filtering from the acoustic speech waveform,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-27, no. 4, pp. 350–355, Aug. 1979.
    • (1979) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-27 , Issue.4 , pp. 350-355
    • Wong, D.1    Markel, J.D.2    Gray, A.H.3
  • 14
    • 0036299143
    • The DYPSA algorithm for estimation of glottal closure instants in voiced speech
    • A. Kounoudes, P. A. Naylor, and M. Brookes, “The DYPSA algorithm for estimation of glottal closure instants in voiced speech,” in Proc. ICASSP, 2002, pp. I-349–I-352.
    • (2002) Proc. ICASSP , pp. I-349-I-352
    • Kounoudes, A.1    Naylor, P.A.2    Brookes, M.3
  • 15
    • 0029375490
    • Determination of instants of significant excitation in speech using group delay function
    • R. Smits and B. Yegnanarayana “Determination of instants of significant excitation in speech using group delay function,” IEEE Trans. Speech Audio Process., vol. 3, no. 5, pp. 325–333, Sep. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.5 , pp. 325-333
    • Smits, R.1    Yegnanarayana, B.2
  • 16
    • 78049393696
    • Joint estimate of shape and time-synchronization of a glottal source model by phase flatness
    • G. Degottex, A. Roebel, and X. Rodet, “Joint estimate of shape and time-synchronization of a glottal source model by phase flatness,” in Proc. ICASSP, 2010, pp. 5058–5061.
    • (2010) Proc. ICASSP , pp. 5058-5061
    • Degottex, G.1    Roebel, A.2    Rodet, X.3
  • 17
    • 0026881384
    • Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering
    • P. Alku, “Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering,” Speech Commun., vol. 11, no. 2–3, pp. 109–118, 1992.
    • (1992) Speech Commun. , vol.11 , Issue.2 , pp. 109-118
    • Alku, P.1
  • 19
    • 0032673049
    • Restructuring speech representations using a pitch-adaptative time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, “Restructuring speech representations using a pitch-adaptative time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds,” Speech Commun., vol. 27, 1999.
    • (1999) Speech Commun. , vol.27
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigne, A.3
  • 21
    • 84867209230
    • HMM-based Finnish text-to-speech system utilizing glottal inverse filtering
    • T. Raitio, A. Suni, H. Pulakka, M. Vainio, and P. Alku, “HMM-based Finnish text-to-speech system utilizing glottal inverse filtering,” in Proc. Interspeech, 2008.
    • (2008) Proc. Interspeech
    • Raitio, T.1    Suni, A.2    Pulakka, H.3    Vainio, M.4    Alku, P.5
  • 24
    • 0030701388
    • Spectral correlates of glottal waveform models: An analytic study
    • B. Doval and C. d'Alessandro, “Spectral correlates of glottal waveform models: An analytic study,” in Proc. ICASSP, 2000, pp. 1295–1298.
    • (2000) Proc. ICASSP , pp. 1295-1298
    • Doval, B.1    d'Alessandro, C.2
  • 25
    • 84863772450
    • Speech analysis/synthesis based on a sinusoidal representation
    • R. McAulay and T. Quatieri “Speech analysis/synthesis based on a sinusoidal representation,” IEEE Trans. Acoust., Speech, Signal Process., vol. ASSP-34, no. 4, pp. 744–754, Aug. 1986.
    • (1986) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-34 , Issue.4 , pp. 744-754
    • McAulay, R.1    Quatieri, T.2
  • 26
    • 0036214787
    • Yin, a fundamental frequency estimator for speech and music
    • A. de Cheveigne and H. Kawahara, “Yin, a fundamental frequency estimator for speech and music,” JASA, vol. 111, Apr. 2002.
    • (2002) JASA , vol.111
    • de Cheveigne, A.1    Kawahara, H.2
  • 28
    • 84902534716
    • A new score function for joint evaluation of multiple f0 hypothesis
    • C. Yeh and A. Roebel, “A new score function for joint evaluation of multiple f0 hypothesis,” in Proc. DAFx, Naples, Italy, Oct. 2004, pp. 234–239.
    • (2004) Proc. DAFx , pp. 234-239
    • Yeh, C.1    Roebel, A.2
  • 30
    • 85009105007
    • Spectral correlates of voice open quotient and glottal flow asymmetry: Theory, limits and experimental data
    • N. Henrich, C. d'Alessandro, and B. Doval, “Spectral correlates of voice open quotient and glottal flow asymmetry: Theory, limits and experimental data,” in Proc. Eurospeech, 2001.
    • (2001) Proc. Eurospeech
    • Henrich, N.1    d'Alessandro, C.2    Doval, B.3
  • 31
    • 34547522367
    • Estimation of the voicing cut-off frequency contour based on a cumulative harmonicity score
    • K. Hermus, H. Van Hamme, and S. Irhimeh “Estimation of the voicing cut-off frequency contour based on a cumulative harmonicity score,” IEEE Signal Process. Lett., vol. 14, no. 11, pp. 820–823, Nov. 2007.
    • (2007) IEEE Signal Process. Lett. , vol.14 , Issue.11 , pp. 820-823
    • Hermus, K.1    Van Hamme, H.2    Irhimeh, S.3
  • 33
    • 44949246143
    • Toolkit for voice inverse filtering and parametrization
    • M. Airas, H. Pulakka, T. Backstrom, and P. Alku, “Toolkit for voice inverse filtering and parametrization,” in Proc. Interspeech, 2005, pp. 2145–2148.
    • (2005) Proc. Interspeech , pp. 2145-2148
    • Airas, M.1    Pulakka, H.2    Backstrom, T.3    Alku, P.4
  • 34
    • 56149091205
    • A comparative evaluation of the zeros of z transform representation for voice source estimation
    • N. Sturmel, C. d'Alessandro, and B. Doval, “A comparative evaluation of the zeros of z transform representation for voice source estimation,” in Proc. Interspeech, 2007.
    • (2007) Proc. Interspeech
    • Sturmel, N.1    d'Alessandro, C.2    Doval, B.3
  • 35
    • 70450170853
    • Complex cepstrum-based decomposition of speech for glottal source estimation
    • T. Drugman, B. Bozkurt, and T. Dutoit, “Complex cepstrum-based decomposition of speech for glottal source estimation,” in Proc. Interspeech, 2009.
    • (2009) Proc. Interspeech
    • Drugman, T.1    Bozkurt, B.2    Dutoit, T.3
  • 36
    • 0020281396
    • A digital simulation method of the vocal-tract system
    • S. Maeda, “A digital simulation method of the vocal-tract system,” Speech Commun., 1982.
    • (1982) Speech Commun.
    • Maeda, S.1
  • 37
    • 69249091414
    • The sigma algorithm: A glottal activity detector for electroglottographic signals
    • M. R. P. Thomas and P. A. Naylor “The sigma algorithm: A glottal activity detector for electroglottographic signals,” IEEE Trans. Audio, Speech, Lang. Process., vol. 17, no. 8, pp. 1557–1566, Nov. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.8 , pp. 1557-1566
    • Thomas, M.R.P.1    Naylor, P.A.2
  • 38
    • 1542286741
    • On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation
    • N. Henrich, C. d'Alessandro, B. Doval, and M. Castellengo “On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation,” J. Acoust. Soc. Amer., vol. 115, no. 3, pp. 1321–1332, 2004.
    • (2004) J. Acoust. Soc. Amer. , vol.115 , Issue.3 , pp. 1321-1332
    • Henrich, N.1    d'Alessandro, C.2    Doval, B.3    Castellengo, M.4
  • 39
    • 84870244113
    • Glottal closure instant detection from a glottal shape estimate
    • G. Degottex, A. Roebel, and X. Rodet, “Glottal closure instant detection from a glottal shape estimate,” in Proc. SPECOM, 2009, pp. 226–231.
    • (2009) Proc. SPECOM , pp. 226-231
    • Degottex, G.1    Roebel, A.2    Rodet, X.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.