메뉴 건너뛰기




Volumn 54, Issue 2, 2012, Pages 199-211

Data-driven voice source waveform analysis and synthesis

Author keywords

Gaussian mixture model; Inverse filtering; Principal component analysis; Segmental signal to reconstruction ratio; Vocal tract modeling; Voice source signal

Indexed keywords

GAUSSIAN MIXTURE MODEL; INVERSE FILTERING; PRINCIPAL COMPONENTS; SEGMENTAL SIGNAL TO RECONSTRUCTION RATIO; VOCAL-TRACTS; VOICE SOURCE SIGNAL;

EID: 80055082229     PISSN: 01676393     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.specom.2011.08.003     Document Type: Article
Times cited : (15)

References (54)
  • 2
    • 0033664616 scopus 로고    scopus 로고
    • A finite-element model of vocal-fold vibration
    • F. Alipour, D.A. Berry, and I.R. Titze A finite-element model of vocal-fold vibration J. Acoust. Soc. Amer. 108 2000 3003 3012
    • (2000) J. Acoust. Soc. Amer. , vol.108 , pp. 3003-3012
    • Alipour, F.1    Berry, D.A.2    Titze, I.R.3
  • 3
    • 0026881384 scopus 로고
    • Glottal wave analysis with pitch synchronous iterative adaptive inverse filtering
    • P. Alku Glottal wave analysis with pitch synchronous iterative adaptive filtering Speech Comm. 11 1992 109 118 (Pubitemid 23572504)
    • (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 109-118
    • Alku Paavo1
  • 4
    • 65549101092 scopus 로고    scopus 로고
    • Closed phase covariance analysis based on constrained linear prediction for glottal inverse filtering
    • P. Alku, C. Magi, S. Yrttiaho, T. Bäckström, and B. Story Closed phase covariance analysis based on constrained linear prediction for glottal inverse filtering J. Acoust. Soc. Amer. 125 2009 3289 3305
    • (2009) J. Acoust. Soc. Amer. , vol.125 , pp. 3289-3305
    • Alku, P.1    Magi, C.2    Yrttiaho, S.3    Bäckström, T.4    Story, B.5
  • 6
    • 0015112070 scopus 로고
    • Speech analysis and synthesis by linear prediction of the speech wave
    • B.S. Atal, and S.L. Hanauer Speech analysis and synthesis by linear prediction of the speech wave J. Acoust. Soc. Amer. 50 1971 637 655
    • (1971) J. Acoust. Soc. Amer. , vol.50 , pp. 637-655
    • Atal, B.S.1    Hanauer, S.L.2
  • 7
    • 0036508041 scopus 로고    scopus 로고
    • Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range
    • DOI 10.1109/TSA.2002.1001983, PII S1063667602028043
    • T. Backstrom, P. Alku, and E. Vilkman Time-domain parameterization of the closing phase of glottal airflow waveform from voices over a large intensity range IEEE Trans. Speech Audio Process. 10 2002 186 192 (Pubitemid 34692542)
    • (2002) IEEE Transactions on Speech and Audio Processing , vol.10 , Issue.3 , pp. 186-192
    • Backstrom, T.1    Alku, P.2    Vilkman, E.3
  • 9
    • 0009625201 scopus 로고
    • Speaker characteristics from a glottal airflow model using glottal inverse filtering
    • D.M. Brookes, and D.S. Chan Speaker characteristics from a glottal airflow model using glottal inverse filtering Proc. Inst. Acoust. 15 1994 501 508
    • (1994) Proc. Inst. Acoust. , vol.15 , pp. 501-508
    • Brookes, D.M.1    Chan, D.S.2
  • 12
    • 84865737826 scopus 로고
    • Variability of excitation parameters derived from robust closed phase glottal inverse filtering
    • D.S.F. Chan, and D.M. Brookes Variability of excitation parameters derived from robust closed phase glottal inverse filtering Proc. Eur. Conf. Speech Comm. Technol. 33 1989
    • (1989) Proc. Eur. Conf. Speech Comm. Technol. , vol.33
    • Chan, D.S.F.1    Brookes, D.M.2
  • 14
    • 0029183215 scopus 로고
    • Glottal models for digital speech processing - A historical survey and new results
    • K.E. Cummings, and M.A. Clements Glottal models for digital speech processing - a historical survey and new results Digital Signal Process. 5 1995 21 42
    • (1995) Digital Signal Process. , vol.5 , pp. 21-42
    • Cummings, K.E.1    Clements, M.A.2
  • 21
    • 33947684811 scopus 로고
    • A four-parameter model of glottal flow
    • G. Fant, J. Liljencrants, and Q. Lin A four-parameter model of glottal flow STL-QPSR 26 1985 1 13
    • (1985) STL-QPSR , vol.26 , pp. 1-13
    • Fant, G.1    Liljencrants, J.2    Lin, Q.3
  • 26
    • 70450162429 scopus 로고    scopus 로고
    • Voice source waveform analysis and synthesis using principal component analysis and Gaussian mixture modeling
    • Brighton, UK
    • Gudnason, J.; Thomas, M.R.P.; Naylor, P.A.; Ellis, D.P.W.; 2009. Voice source waveform analysis and synthesis using principal component analysis and Gaussian mixture modelling. In: Proc. Interspeech Conf.; Brighton, UK.
    • (2009) Proc. Interspeech Conf.
    • Gudnason, J.1    Thomas, M.R.P.2    Naylor, P.A.3    Ellis, D.P.W.4
  • 27
    • 0001138328 scopus 로고
    • A k-means clustering algorithm
    • J. Hartigan, and M. Wang A k-means clustering algorithm Appl. Statist. 28 1979 100 108
    • (1979) Appl. Statist. , vol.28 , pp. 100-108
    • Hartigan, J.1    Wang, M.2
  • 30
    • 84990424053 scopus 로고
    • Synthesis of voiced sounds from a two-mass model of the vocal cords
    • K. Ishizaka, and J. Flanagan Synthesis of voiced sounds from a two-mass model of the vocal cords Bell Syst. Tech. J. 51 1972 1233 1268
    • (1972) Bell Syst. Tech. J. , vol.51 , pp. 1233-1268
    • Ishizaka, K.1    Flanagan, J.2
  • 32
    • 0025321354 scopus 로고
    • Analysis, synthesis, and perception of voice quality variations among female and male talkers
    • D.H. Klatt, and L.C. Klatt Analysis, synthesis and perception of voice quality variations among female and male talkers J. Acoust. Soc. Amer. 87 1990 820 857 (Pubitemid 20129722)
    • (1990) Journal of the Acoustical Society of America , vol.87 , Issue.2 , pp. 820-857
    • Klatt, D.H.1    Klatt, L.C.2
  • 34
    • 0016707703 scopus 로고
    • The electroglottography and its relation to glottal activity
    • F. Lecluse, M. Brocaar, and J. Verschuure The electroglottography and its relation to glottal activity Folia Phoniatr. 17 1975 215 224
    • (1975) Folia Phoniatr. , vol.17 , pp. 215-224
    • Lecluse, F.1    Brocaar, M.2    Verschuure, J.3
  • 36
    • 0028417076 scopus 로고
    • A Frobenius norm approach to glottal closure detection from the speech signal
    • C. Ma, Y. Kamp, and L.F. Willems A Frobenius norm approach to glottal closure detection from the speech signal IEEE Trans. Speech Audio Process. 2 1994 258 265
    • (1994) IEEE Trans. Speech Audio Process. , vol.2 , pp. 258-265
    • Ma, C.1    Kamp, Y.2    Willems, L.F.3
  • 37
    • 0016495091 scopus 로고
    • Linear prediction: A tutorial review
    • J. Makhoul Linear prediction: a tutorial review Proc. IEEE 63 1975 561 580
    • (1975) Proc. IEEE , vol.63 , pp. 561-580
    • Makhoul, J.1
  • 39
    • 0025543906 scopus 로고
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines, and F. Charpentier Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones Speech Comm. 9 1990 453 467
    • (1990) Speech Comm. , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2
  • 41
    • 0032595183 scopus 로고    scopus 로고
    • Modeling of the glottal flow derivative waveform with application to speaker identification
    • M.D. Plumpe, T.F. Quatieri, and D.A. Reynolds Modeling of the glottal flow derivative waveform with application to speaker identification IEEE Trans. Speech Audio Process. 7 1999 569 576
    • (1999) IEEE Trans. Speech Audio Process. , vol.7 , pp. 569-576
    • Plumpe, M.D.1    Quatieri, T.F.2    Reynolds, D.A.3
  • 42
    • 0015015215 scopus 로고
    • Effect of glottal pulse shape on the quality of natural vowels
    • A.E. Rosenberg Effect of glottal pulse shape on the quality of natural vowels J. Acoust. Soc. Amer. 49 1971 583 590
    • (1971) J. Acoust. Soc. Amer. , vol.49 , pp. 583-590
    • Rosenberg, A.E.1
  • 43
    • 0028515948 scopus 로고
    • Speech coding: A tutorial review
    • A.S. Spanias Speech coding: a tutorial review Proc. IEEE 82 1994 1541 1582
    • (1994) Proc. IEEE , vol.82 , pp. 1541-1582
    • Spanias, A.S.1
  • 44
    • 0028833120 scopus 로고
    • Voice simulation with a body-cover model of the vocal folds
    • B.H. Story, and I.R. Titze Voice simulation with a body-cover model of the vocal folds J. Acoust. Soc. Amer. 97 1994 1249 1260
    • (1994) J. Acoust. Soc. Amer. , vol.97 , pp. 1249-1260
    • Story, B.H.1    Titze, I.R.2
  • 45
    • 0016129045 scopus 로고
    • Determination of the instant of glottal closure from the speech wave
    • H.W. Strube Determination of the instant of glottal closure from the speech wave J. Acoust. Soc. Amer. 56 1974 1625 1629
    • (1974) J. Acoust. Soc. Amer. , vol.56 , pp. 1625-1629
    • Strube, H.W.1
  • 52
    • 0029991874 scopus 로고    scopus 로고
    • Videokymography: High-speed line scanning of vocal fold vibration
    • DOI 10.1016/S0892-1997(96)80047-6
    • J. Svec, and H. Schutte Videokymography: high-speed line scanning of vocal fold vibration J. Voice 10 1996 201 205 (Pubitemid 26142601)
    • (1996) Journal of Voice , vol.10 , Issue.2 , pp. 201-205
    • Svec, J.G.1    Schutte, H.K.2
  • 54
    • 0030355541 scopus 로고    scopus 로고
    • A new speech synthesis system based on the ARX speech production model
    • Zhu, W.; Kasuya, H.; 1996. A new speech synthesis system based on the ARX speech production model. In: Proc. Internat. Conf. on Spoken Language Processing, pp. 1413-1416.
    • (1996) Proc. Internat. Conf. on Spoken Language Processing , pp. 1413-1416
    • Zhu, W.1    Kasuya, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.