메뉴 건너뛰기




Volumn 1, Issue , 2012, Pages 98-101

Effects of speaker adaptive training on tensor-based arbitrary speaker conversion

Author keywords

Eigenvoice; Gaussian mixture model; Speaker adaptive training; Tucker decomposition; Voice conversion

Indexed keywords

EIGENVOICES; GAUSSIAN MIXTURE MODEL; SPEAKER ADAPTIVE TRAININGS; TUCKER DECOMPOSITIONS; VOICE CONVERSION;

EID: 84878378722     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (16)
  • 2
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to-speech synthesis
    • A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," Proc. ICASSP, vol. 1, pp. 285-288, 1998.
    • (1998) Proc. ICASSP , vol.1 , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 3
    • 0034855352 scopus 로고    scopus 로고
    • High-performance robust speech recognition using stereo training data
    • L. Deng, A. Acero, L. Jiang, J. Droppo, and X. Huang, "High-performance robust speech recognition using stereo training data," Proc. ICASSP, pp. 301-304, 2001.
    • (2001) Proc. ICASSP , pp. 301-304
    • Deng, L.1    Acero, A.2    Jiang, L.3    Droppo, J.4    Huang, X.5
  • 5
    • 44949210554 scopus 로고    scopus 로고
    • Map-based adaptation for speech conversion using adaptation data selection and non-parallel training
    • C. H. Lee and C. H. Wu, "Map-based adaptation for speech conversion using adaptation data selection and non-parallel training," Proc. INTERSPEECH, pp. 2254-2257, 2006.
    • (2006) Proc. INTERSPEECH , pp. 2254-2257
    • Lee, C.H.1    Wu, C.H.2
  • 6
    • 34547512822 scopus 로고    scopus 로고
    • Eigenvoice conversion based on Gaussian mixture model
    • T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on Gaussian mixture model," Proc. INTERSPEECH, pp. 2446-2449, 2006.
    • (2006) Proc. INTERSPEECH , pp. 2446-2449
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 9
    • 84865798483 scopus 로고    scopus 로고
    • One-tomany voice conversion based on tensor representation of speaker space
    • D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-tomany voice conversion based on tensor representation of speaker space," Proc. INTERSPEECH, pp. 653-656, 2011.
    • (2011) Proc. INTERSPEECH , pp. 653-656
    • Saito, D.1    Yamamoto, K.2    Minematsu, N.3    Hirose, K.4
  • 11
    • 70450182468 scopus 로고    scopus 로고
    • Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Speaker adaptive training for one-to-many eigenvoice conversion based on Gaussian mixture model," Proc. INTERSPEECH, pp. 1981-1984, 2007.
    • (2007) Proc. INTERSPEECH , pp. 1981-1984
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 12
    • 0013953617 scopus 로고
    • Some mathematical notes on three-mode factor analysis
    • L. R. Tucker, "Some mathematical notes on three-mode factor analysis," Psychometrika, vol. 31, no. 3, pp. 279-311, 1966.
    • (1966) Psychometrika , vol.31 , Issue.3 , pp. 279-311
    • Tucker, L.R.1
  • 13
    • 78049396810 scopus 로고    scopus 로고
    • Speaker adaptation based on the multilinear decomposition of training speaker models
    • Y. Jeong, "Speaker adaptation based on the multilinear decomposition of training speaker models," Proc. ICASSP, pp. 4870-4873, 2010.
    • (2010) Proc. ICASSP , pp. 4870-4873
    • Jeong, Y.1
  • 15
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A.de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol.27, pp.187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3
  • 16
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. on Audio, Speech, and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
    • (2007) IEEE Trans. on Audio, Speech, and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.