메뉴 건너뛰기




Volumn 20, Issue 3, 2012, Pages 806-817

Voice conversion using dynamic kernel partial least squares regression

Author keywords

Kernel methods; partial least squares regression; voice conversion

Indexed keywords

APERIODICITY; AUXILIARY INFORMATION; CONVERSION FUNCTION; GAUSSIANS; KERNEL METHODS; KERNEL PARTIAL LEAST SQUARES; NONLINEAR MODELING; PARTIAL LEAST SQUARES REGRESSION; SIMPLE AND EFFICIENT ALGORITHMS; SOURCE FEATURES; SPECTRAL FEATURE; SPEECH FEATURES; VOICE CONVERSION; VOICE CONVERSION ALGORITHM;

EID: 84856141218     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2011.2165944     Document Type: Article
Times cited : (142)

References (33)
  • 4
    • 0033154052 scopus 로고    scopus 로고
    • Speaker transformation algorithm using segmental codebooks (STASC)
    • Jun
    • L. Arslan, "Speaker transformation algorithm using segmental codebooks (STASC)," Speech Commun., vol. 28, no. 3, pp. 211-226, Jun. 1999.
    • (1999) Speech Commun. , vol.28 , Issue.3 , pp. 211-226
    • Arslan, L.1
  • 5
    • 79959836789 scopus 로고    scopus 로고
    • Maximum a posteriori voice conversion using sequential Monte Carlo methods
    • Sep
    • E. Helander, H. Silen, J. Miguez, and M. Gabbouj, "Maximum a posteriori voice conversion using sequential Monte Carlo methods," in Proc. Interspeech, Sep. 2010, pp. 1716-1719.
    • (2010) Proc. Interspeech , pp. 1716-1719
    • Helander, E.1    Silen, H.2    Miguez, J.3    Gabbouj, M.4
  • 7
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-tospeech synthesis
    • Seattle, WA, May
    • A. Kain and M. W. Macon, "Spectral voice conversion for text-tospeech synthesis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Seattle, WA, May 1998, vol. 1, pp. 285-288.
    • (1998) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 285-288
    • Kain, A.1    MacOn, M.W.2
  • 9
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • Nov
    • T. Toda, A. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.2    Tokuda, K.3
  • 10
    • 84966348891 scopus 로고    scopus 로고
    • An HMM-based speech synthesis system applied to English
    • Sep
    • K. Tokuda, H. Zen, and A. W. Black, "An HMM-based speech synthesis system applied to English," in Proc. IEEE Workshop Speech Synth., Sep. 2002, pp. 227-230.
    • (2002) Proc. IEEE Workshop Speech Synth. , pp. 227-230
    • Tokuda, K.1    Zen, H.2    Black, A.W.3
  • 12
    • 84905560807 scopus 로고    scopus 로고
    • Voice conversion with smoothedGMM and MAP adaptation
    • Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothedGMM and MAP adaptation," in Proc. Eurospeech, 2003, pp. 2413-2416.
    • (2003) Proc. Eurospeech , pp. 2413-2416
    • Chen, Y.1    Chu, M.2    Chang, E.3    Liu, J.4    Liu, R.5
  • 13
  • 14
    • 33745171182 scopus 로고    scopus 로고
    • New York: Wiley, , ch. Introduction to Scientific Data Mining: Direct Kernel Methods & Applications
    • M. J. Embrechts and B. Szymanski, Computationally Intelligent Hybrid Systems. New York: Wiley, 2005, ch. Introduction to Scientific Data Mining: Direct Kernel Methods & Applications, pp. 317-365.
    • (2005) Computationally Intelligent Hybrid Systems , pp. 317-365
    • Embrechts, M.J.1    Szymanski, B.2
  • 15
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and a instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • Apr
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and a instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
    • (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3
  • 19
    • 0347243182 scopus 로고    scopus 로고
    • Nonlinear Component Analysis as a Kernel Eigenvalue Problem
    • B. Schölkopf, A. J. Smola, and K.-R. Müller, "Nonlinear component analysis as a kernel eigenvalue problem," Neural Comput., vol. 10, no. 5, pp. 1299-1319, 1998. (Pubitemid 128463674)
    • (1998) Neural Computation , vol.10 , Issue.5 , pp. 1299-1319
    • Scholkopf, B.1    Smola, A.2    Muller, K.-R.3
  • 20
    • 2442514721 scopus 로고    scopus 로고
    • ser. NATO Science Series. Series III: Computer and Systems Sciences. Amsterdam, The Netherlands: IOS Press , ch. An Optimization Perspective on Kernel Partial Least Squares Regression
    • K. P. Bennett and M. J. Embrechts, Advances in Learning Theory: Methods, Models and Applications, ser. NATO Science Series. Series III: Computer and Systems Sciences. Amsterdam, The Netherlands: IOS Press, 2003, vol. 190, ch. An Optimization Perspective on Kernel Partial Least Squares Regression, pp. 227-250.
    • (2003) Advances in Learning Theory: Methods, Models and Applications , vol.190 , pp. 227-250
    • Bennett, K.P.1    Embrechts, M.J.2
  • 21
    • 0027530250 scopus 로고
    • SIMPLS: An alternative approach to partial least squares regression
    • Mar
    • S. de Jong, "SIMPLS: An alternative approach to partial least squares regression," Chemometrics Intell. Lab. Syst., vol. 18, no. 3, pp. 251-263, Mar. 1993.
    • (1993) Chemometrics Intell. Lab. Syst. , vol.18 , Issue.3 , pp. 251-263
    • De Jong, S.1
  • 22
    • 0038259120 scopus 로고    scopus 로고
    • Kernel partial least squares regression in reproducing kernel Hilbert space
    • Dec
    • R. Rosipal and L. Trejo, "Kernel partial least squares regression in reproducing kernel Hilbert space," J. Mach. Learn. Res., vol. 2, pp. 97-123, Dec. 2001.
    • (2001) J. Mach. Learn. Res. , vol.2 , pp. 97-123
    • Rosipal, R.1    Trejo, L.2
  • 23
    • 33846405723 scopus 로고    scopus 로고
    • Details of the nitech HMM-based speech synthesis system for the blizzard challenge 2005
    • DOI 10.1093/ietisy/e90-1.1.325
    • H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of the Nitech HMM-based speech synthesis system for the Blizzard challenge 2005," IEICE Trans. Inf. Syst., vol. E90-D, no. 1, pp. 325-333, Jan. 2007. (Pubitemid 46145336)
    • (2007) IEICE Transactions on Information and Systems , vol.E90-D , Issue.1 , pp. 325-333
    • Zen, H.1    Toda, T.2    Nakamura, M.3    Tokuda, K.4
  • 24
    • 44949143155 scopus 로고    scopus 로고
    • Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation," in Proc. Interspeech, 2006, pp. 2266-2269.
    • (2006) Proc. Interspeech , pp. 2266-2269
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 26
    • 33746653351 scopus 로고    scopus 로고
    • Robust processing techniques for voice conversion
    • DOI 10.1016/j.csl.2005.06.001, PII S088523080500029X
    • O. Turk and L. Arslan, "Robust processing techniques for voice conversion," Comput. Speech Lang., vol. 4, no. 20, pp. 441-467, Oct. 2006. (Pubitemid 44150541)
    • (2006) Computer Speech and Language , vol.20 , Issue.4 , pp. 441-467
    • Turk, O.1    Arslan, L.M.2
  • 32
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • Jan
    • D. Reynolds and R. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.1    Rose, R.2
  • 33
    • 77956826012 scopus 로고    scopus 로고
    • Automatic speaker recognition as a measurement of voice imitation and conversion
    • M. Farrus, M. Wagner, D. Erro, and J. Hernando, "Automatic speaker recognition as a measurement of voice imitation and conversion," Int. J. Speech Lang. Law, vol. 17, no. 1, 2010.
    • (2010) Int. J. Speech Lang. Law , vol.17 , Issue.1
    • Farrus, M.1    Wagner, M.2    Erro, D.3    Hernando, J.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.