메뉴 건너뛰기




Volumn 19, Issue 4, 2011, Pages 977-989

Time-varying autoregressions in speech: Detection theory and applications

Author keywords

Glottal airflow; likelihood ratio test; linear prediction; nonstationary time series; vocal tract variation

Indexed keywords

GLOTTAL AIRFLOW; LIKELIHOOD RATIO TEST; LINEAR PREDICTION; NONSTATIONARY TIME SERIES; VOCAL-TRACTS;

EID: 79953288197     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2010.2073704     Document Type: Article
Times cited : (27)

References (53)
  • 1
    • 70450216598 scopus 로고    scopus 로고
    • Time-varying autoregressive tests for multiscale speech analysis
    • [Online].Available
    • D. Rudoy, T. F. Quatieri, and P. J. Wolfe, "Time-varying autoregressive tests for multiscale speech analysis," in Proc. 10th Annu. Conf. Int. Speech Commun. Assoc., 2009, pp. 2839-2842 [Online].Available: http://sisl.seas.harvard.edu
    • (2009) Proc. 10th Annu. Conf. Int. Speech Commun. Assoc. , pp. 2839-2842
    • Rudoy, D.1    Quatieri, T.F.2    Wolfe, P.J.3
  • 3
    • 0020497768 scopus 로고
    • Detecting and estimating parameter jumps using ladder algorithms and likelihood ratio tests
    • A. V. Brandt, "Detecting and estimating parameter jumps using ladder algorithms and likelihood ratio tests," in Proc. IEEE Int. Conf. Acoust. Speech, Signal Process., 1983, vol. 8, pp. 1017-1020.
    • (1983) Proc. IEEE Int. Conf. Acoust. Speech, Signal Process. , vol.8 , pp. 1017-1020
    • Brandt, A.V.1
  • 4
    • 0023831656 scopus 로고
    • A new statistical approach for the automatic segmentation of continuous speech signals
    • DOI 10.1109/29.1486
    • R. Andre-Obrecht, "A new statistical approach for the automatic segmentation of continuous speech signals," IEEE Trans. Acoust., Speech, Signal Process., vol. 36, no. 1, pp. 29-40, Jan. 1988. (Pubitemid 18013484)
    • (1988) IEEE Transactions on Acoustics, Speech, and Signal Processing , vol.36 , Issue.1 , pp. 29-40
    • Andre-Obrecht, R.1
  • 5
    • 0025536708 scopus 로고
    • Detection of the glottal closure by jumps in the statistical properties of the speech signal
    • E. Moulines and R. D. Francesco, "Detection of the glottal closure by jumps in the statistical properties of the speech signal," Speech Commun., vol. 9, pp. 401-418, 1990.
    • (1990) Speech Commun. , vol.9 , pp. 401-418
    • Moulines, E.1    Francesco, R.D.2
  • 6
    • 0016735638 scopus 로고
    • Linear estimation of non-stationary signals
    • L. A. Liporace, "Linear estimation of non-stationary signals," J. Acoust. Soc. Amer., vol. 58, pp. 1268-1295, 1975.
    • (1975) J. Acoust. Soc. Amer. , vol.58 , pp. 1268-1295
    • Liporace, L.A.1
  • 8
    • 0020749186 scopus 로고
    • Time-varying parametric modeling of speech
    • M. G. Hall, A. V. Oppenheim, and A. S.Willsky, "Time-varying parametric modeling of speech," Signal Process., vol. 5, pp. 267-285, 1983.
    • (1983) Signal Process. , vol.5 , pp. 267-285
    • Hall, M.G.1    Oppenheim, A.V.2    Willsky, A.S.3
  • 10
    • 0024088635 scopus 로고
    • Autoregressive models with time-dependent log area ratios
    • Dec.
    • Y. Grenier, "Autoregressive models with time-dependent log area ratios," IEEE Trans. Acoust., Speech, Signal Process., vol. 36, no. 10, pp. 1602-1612, Dec. 1988.
    • (1988) IEEE Trans. Acoust., Speech, Signal Process. , vol.36 , Issue.10 , pp. 1602-1612
    • Grenier, Y.1
  • 11
    • 0026142442 scopus 로고
    • A time-varying analysis method for rapid transitions in speech
    • Apr.
    • K. S. Nathan, Y. T. Lee, and H. F. Silverman, "A time-varying analysis method for rapid transitions in speech," IEEE Trans. Signal Process., vol. 39, no. 4, pp. 815-824, Apr. 1991.
    • (1991) IEEE Trans. Signal Process. , vol.39 , Issue.4 , pp. 815-824
    • Nathan, K.S.1    Lee, Y.T.2    Silverman, H.F.3
  • 12
    • 0028460992 scopus 로고
    • Time-varying feature selection and classification of unvoiced stop consonants
    • Jul.
    • K. S. Nathan and H. F. Silverman, "Time-varying feature selection and classification of unvoiced stop consonants," IEEE Trans. Speech Audio Process., vol. 2, no. 3, pp. 395-405, Jul. 1994.
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , Issue.3 , pp. 395-405
    • Nathan, K.S.1    Silverman, H.F.2
  • 13
    • 0030270734 scopus 로고    scopus 로고
    • Generalized feature extraction for time-varying autoregressive models
    • PII S1053587X96071462
    • J. J. Rajan and P. J. W. Rayner, "Generalized feature extraction for time-varying autoregressive models," IEEE Trans. Signal Process., vol. 44, no. 10, pp. 2498-2507, Dec. 1996. (Pubitemid 126776552)
    • (1996) IEEE Transactions on Signal Processing , vol.44 , Issue.10 , pp. 2498-2507
    • Rajan, J.J.1    Rayner, P.J.W.2
  • 14
    • 0036508204 scopus 로고    scopus 로고
    • Particle methods for Bayesian modeling and enhancement of speech signals
    • DOI 10.1109/TSA.2002.1001982, PII S1063667602039743
    • J. Vermaak, C. Andrieu, A. Doucet, and S. J. Godsill, "Particle methods for Bayesian modeling and enhancement of speech signals," IEEE Trans. Speech Audio Process., vol. 10, no. 3, pp. 173-185, Mar. 2002. (Pubitemid 34692541)
    • (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.3 , pp. 173-185
    • Vermaak, J.1    Andrieu, C.2    Doucet, A.3    Godsill, S.J.4
  • 15
    • 0042362199 scopus 로고    scopus 로고
    • Blind single channel deconvolution using nonstationary signal processing
    • Sep.
    • J. R. Hopgood and P. J. W. Rayner, "Blind single channel deconvolution using nonstationary signal processing," IEEE Trans. Speech Audio Process., vol. 11, no. 5, pp. 476-488, Sep. 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.5 , pp. 476-488
    • Hopgood, J.R.1    Rayner, P.J.W.2
  • 17
    • 0000777281 scopus 로고
    • The fitting of non-stationary time series models with time dependent parameters
    • T. S. Rao, "The fitting of non-stationary time series models with time dependent parameters," J. R. Stat. Soc. B, vol. 32, pp. 312-322, 1970.
    • (1970) J. R. Stat. Soc. B , vol.32 , pp. 312-322
    • Rao, T.S.1
  • 18
    • 0019006979 scopus 로고
    • The order determination problem for linear time-varying AR models
    • F. Kozin and F. Nakajima, "The order determination problem for linear time-varying AR models," IEEE Trans. Autom. Control, vol. AC-25, no. 2, pp. 250-257, Apr. 1980. (Pubitemid 10475307)
    • (1980) IEEE Transactions on Automatic Control , vol.AC-25 , Issue.2 , pp. 250-257
    • Kozin, F.1    Nakajima, F.2
  • 19
    • 0021819062 scopus 로고
    • A smoothness priors time-varying AR coefficient modeling of nonstationary covariance time series
    • G. Kitagawa and W. Gersch, "A smoothness priors time-varying AR coefficient modeling of nonstationary covariance time series," IEEE Trans. Autom. Control, vol. 30, no. 1, pp. 48-56, Jan. 1985. (Pubitemid 15453454)
    • (1985) IEEE Transactions on Automatic Control , vol.AC-30 , Issue.1 , pp. 48-56
    • Kitagawa Genshiro1    Gersch Will2
  • 20
    • 0027807710 scopus 로고
    • Time-varying system identification and model validation using wavelets
    • Dec.
    • M. K. Tsatsanis and G. B. Giannakis, "Time-varying system identification and model validation using wavelets," IEEE Trans. Signal Process., vol. 41, no. 12, pp. 3512-3523, Dec. 1993.
    • (1993) IEEE Trans. Signal Process. , vol.41 , Issue.12 , pp. 3512-3523
    • Tsatsanis, M.K.1    Giannakis, G.B.2
  • 21
    • 0031213093 scopus 로고    scopus 로고
    • Bayesian approach to parameter estimation and interpolation of time-varying autoregressive processes using the Gibbs sampler
    • J. J. Rajan, P. J. W. Rayner, and S. J. Godsill, "Bayesian approach to parameter estimation and interpolation of time-varying autoregressive processes using the Gibbs sampler," IEE Vision, Image, Signal Process., vol. 144, pp. 249-256, 1997.
    • (1997) IEE Vision, Image, Signal Process. , vol.144 , pp. 249-256
    • Rajan, J.J.1    Rayner, P.J.W.2    Godsill, S.J.3
  • 23
    • 48849100791 scopus 로고    scopus 로고
    • Identification of time-varying autoregressive systems using maximum a posteriori estimation
    • Aug.
    • T. Hsiao, "Identification of time-varying autoregressive systems using maximum a posteriori estimation," IEEE Trans. Signal Process., vol. 56, no. 8, pp. 3497-3509, Aug. 2008.
    • (2008) IEEE Trans. Signal Process. , vol.56 , Issue.8 , pp. 3497-3509
    • Hsiao, T.1
  • 24
    • 41849124922 scopus 로고    scopus 로고
    • A new nonstationarity detector
    • Apr.
    • S. M. Kay, "A new nonstationarity detector," IEEE Trans. Signal Process., vol. 56, no. 4, pp. 1440-1451, Apr. 2008.
    • (2008) IEEE Trans. Signal Process. , vol.56 , Issue.4 , pp. 1440-1451
    • Kay, S.M.1
  • 26
    • 34249787614 scopus 로고    scopus 로고
    • Order estimation and discrimination between stationary and time-varying (TVAR) autoregressive models
    • DOI 10.1109/TSP.2007.893966
    • Y. I. Abramovich, N. K. Spencer, and M. D. E. Turley, "Order estimation and discrimination between stationary and time-varying (TVAR) autoregressive models," IEEE Trans. Signal Process., vol. 55, no. 6, pp. 2861-2876, Jun. 2007. (Pubitemid 46845677)
    • (2007) IEEE Transactions on Signal Processing , vol.55 , Issue.6 , pp. 2861-2876
    • Abramovich, Y.I.1    Spencer, N.K.2    Turley, M.D.E.3
  • 28
    • 77951147585 scopus 로고    scopus 로고
    • Superposition frames for adaptive time-frequency representations and fast reconstruction
    • May
    • D. Rudoy, P. Basu, and P. J.Wolfe, "Superposition frames for adaptive time-frequency representations and fast reconstruction," IEEE Trans. Signal Process., vol. 58, no. 5, pp. 2581-2596, May 2010.
    • (2010) IEEE Trans. Signal Process. , vol.58 , Issue.5 , pp. 2581-2596
    • Rudoy, D.1    Basu, P.2    Wolfe, P.J.3
  • 29
    • 34548475179 scopus 로고    scopus 로고
    • Adaptive time segmentation for improved speech enhancement
    • Nov.
    • R. C. Hendriks, R. Heusdens, and J. Jensen, "Adaptive time segmentation for improved speech enhancement," IEEE Trans. Audio Speech Lang. Process., vol. 14, no. 6, pp. 2064-2074, Nov. 2006.
    • (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , Issue.6 , pp. 2064-2074
    • Hendriks, R.C.1    Heusdens, R.2    Jensen, J.3
  • 30
    • 33746457336 scopus 로고    scopus 로고
    • On variable-scale piecewise stationary spectral analysis of speech signals for ASR
    • DOI 10.1016/j.specom.2006.04.002, PII S0167639306000446
    • V. Tyagi, H. Bourlard, and C.Wellekens, "On variable-scale piecewise stationary spectral analysis of signals for ASR," Speech Commun., vol. 48, pp. 1182-1191, 2006. (Pubitemid 44128616)
    • (2006) Speech Communication , vol.48 , Issue.9 , pp. 1182-1191
    • Tyagi, V.1    Bourlard, H.2    Wellekens, C.3
  • 31
    • 64649092391 scopus 로고    scopus 로고
    • Likelihood-ratio forensic voice comparison using parametric representations of the formant trajectories of diphthongs
    • G. S. Morrison, "Likelihood-ratio forensic voice comparison using parametric representations of the formant trajectories of diphthongs," J. Acoust. Soc. Amer., vol. 125, pp. 2387-2397, 2009.
    • (2009) J. Acoust. Soc. Amer. , vol.125 , pp. 2387-2397
    • Morrison, G.S.1
  • 32
  • 33
    • 0032595183 scopus 로고    scopus 로고
    • Modeling of the glottal flow derivative waveform with application to speaker identification
    • Sep.
    • M. D. Plumpe, T. F. Quatieri, and D. A. Reynolds, "Modeling of the glottal flow derivative waveform with application to speaker identification," IEEE Trans. Speech Audio Process., vol. 7, no. 5, pp. 569-586, Sep. 1999.
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , Issue.5 , pp. 569-586
    • Plumpe, M.D.1    Quatieri, T.F.2    Reynolds, D.A.3
  • 34
    • 33947141366 scopus 로고    scopus 로고
    • A quantitative assessment of group delay methods of identifying glottal closures in voiced speech
    • Mar.
    • M. Brookes, P. A. Taylor, and J. Gudnasson, "A quantitative assessment of group delay methods of identifying glottal closures in voiced speech," IEEE Trans. Audio, Speech, Lang. Process., vol. 14, no. 2, pp. 456-466, Mar. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.2 , pp. 456-466
    • Brookes, M.1    Taylor, P.A.2    Gudnasson, J.3
  • 35
    • 0021645227 scopus 로고
    • Electroglottography for laryngeal function assessment and speech analysis
    • D. G. Childers and J. N. Larar, "Electroglottography for laryngeal function assessment and speech analysis," IEEE Trans. Biomed. Eng., vol. BME-31, no. 12, pp. 807-817, Dec. 1984. (Pubitemid 15453675)
    • (1984) IEEE Transactions on Biomedical Engineering , vol.BME-31 , Issue.12 , pp. 807-817
    • Childers Donald, G.1    Larar, J.N.2
  • 37
    • 41049089736 scopus 로고    scopus 로고
    • Estimation of glottal closure instants in voiced speech using the DYPSA algorithm
    • Jan.
    • P. A. Naylor, A. Kounoudes, J. Gudnasson, and M. Brookes, "Estimation of glottal closure instants in voiced speech using the DYPSA algorithm," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 1, pp. 34-43, Jan. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.1 , pp. 34-43
    • Naylor, P.A.1    Kounoudes, A.2    Gudnasson, J.3    Brookes, M.4
  • 41
    • 0018011435 scopus 로고
    • Kronecker products and matrix calculus in system theory
    • Sep.
    • J. W. Brewer, "Kronecker products and matrix calculus in system theory," IEEE Trans. Circuits Syst., vol. CAS-25, no. 9, pp. 772-781, Sep. 1978.
    • (1978) IEEE Trans. Circuits Syst. , vol.CAS-25 , Issue.9 , pp. 772-781
    • Brewer, J.W.1
  • 44
    • 35348856844 scopus 로고    scopus 로고
    • A fusion approach for automatic speech segmentation of large corpora with application to speech synthesis
    • DOI 10.1016/j.specom.2007.07.001, PII S0167639307001215
    • S. Jarifia, D. Pastora, and O. Rosec, "A fusion approach for automatic speech segmentation of large corpora with application to speech synthesis," Speech Commun., vol. 50, pp. 67-80, 2008. (Pubitemid 47592927)
    • (2008) Speech Communication , vol.50 , Issue.1 , pp. 67-80
    • Jarifi, S.1    Pastor, D.2    Rosec, O.3
  • 45
    • 84947392504 scopus 로고
    • Tests of the hypothesis that a linear regression system obeys two separate regimes
    • R. E. Quandt, "Tests of the hypothesis that a linear regression system obeys two separate regimes," J. Amer. Stat. Assoc., vol. 55, pp. 324-330, 1960.
    • (1960) J. Amer. Stat. Assoc. , vol.55 , pp. 324-330
    • Quandt, R.E.1
  • 46
    • 0000211477 scopus 로고
    • Block-Toeplitz matrix inversion
    • H. Akaike, "Block-Toeplitz matrix inversion," Siam J. Appl. Math., vol. 24, pp. 234-241, 1973.
    • (1973) Siam J. Appl. Math. , vol.24 , pp. 234-241
    • Akaike, H.1
  • 47
    • 79953285823 scopus 로고    scopus 로고
    • WaveSurfer 1.8.5 for Windows, KTH Royal Inst. of Technol. [Online]. Available
    • K. Sjölander and J. Beskow, 2005, WaveSurfer 1.8.5 for Windows, KTH Royal Inst. of Technol. [Online]. Available: http://www.speech. kth.se/wavesurfer/wavesurfer-185-win.zip
    • (2005)
    • Sjölander, K.1    Beskow, J.2
  • 49
    • 84863739389 scopus 로고    scopus 로고
    • Detection of glottal closing and opening instants using an improved DYPSA framework
    • Glasgow, U.K.
    • M. P. Thomas, J. Gudnason, and P. A. Naylor, "Detection of glottal closing and opening instants using an improved DYPSA framework," in Proc. 17th Eur. Signal Process. Conf, Glasgow, U.K., 2009.
    • (2009) Proc. 17th Eur. Signal Process. Conf
    • Thomas, M.P.1    Gudnason, J.2    Naylor, P.A.3
  • 50
    • 1542286741 scopus 로고    scopus 로고
    • On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation
    • DOI 10.1121/1.1646401
    • N. Henrich, C. d'Alessandro, B. Doval, and M. Castellengo, "On the use of the derivative of electroglottographic signals for characterization of nonpathological phonation," J. Acoust. Soc. Amer., vol. 115, pp. 1321-1332, 2004. (Pubitemid 38298582)
    • (2004) Journal of the Acoustical Society of America , vol.115 , Issue.3 , pp. 1321-1332
    • Henrich, N.1    D'Alessandro, C.2    Doval, B.3    Castellengo, M.4
  • 51
    • 0020255315 scopus 로고
    • Calculation of true glottal flow and its components
    • T. V. Ananthapadmanabha and G. Fant, "Calculation of true glottal flow and its components," Speech Commun., vol. 1, pp. 167-184, 1982. (Pubitemid 13556172)
    • (1982) Speech Communication , vol.1 , Issue.3-4 , pp. 167-184
    • Ananthapadmanabha, T.V.1    Fant, G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.