메뉴 건너뛰기




Volumn , Issue , 2014, Pages 7909-7913

Non-parallel voice conversion using joint optimization of alignment by temporal context and spectral distortion

Author keywords

Gaussian Mixture Model (GMM); INCA; Non Parallel Voice Conversion; Spectral Distance

Indexed keywords

CODES (SYMBOLS); ITERATIVE METHODS; QUALITY CONTROL; SIGNAL PROCESSING;

EID: 84905234183     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2014.6855140     Document Type: Conference Paper
Times cited : (21)

References (19)
  • 2
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to-speech synthesis
    • A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis, " in Proc. ICASSP, IEEE, 1998, vol. 1, pp. 285-288.
    • (1998) Proc. ICASSP, IEEE , vol.1 , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 3
    • 0034841948 scopus 로고    scopus 로고
    • Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction
    • A. Kain and M. W. Macon, "Design and evaluation of a voice conversion algorithm based on spectral envelope mapping and residual prediction, " in Proc. ICASSP, IEEE, 2001, vol. 2, pp. 813-816.
    • (2001) Proc. ICASSP, IEEE , vol.2 , pp. 813-816
    • Kain, A.1    Macon, M.W.2
  • 4
    • 0034842552 scopus 로고    scopus 로고
    • Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum
    • T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum, " in Proc. ICASSP, IEEE, 2001, vol. 2, pp. 841-844.
    • (2001) Proc. ICASSP, IEEE , vol.2 , pp. 841-844
    • Toda, T.1    Saruwatari, H.2    Shikano, K.3
  • 5
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
    • (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 8
    • 51449121435 scopus 로고    scopus 로고
    • Textindependent voice conversion based on state mapped codebook
    • M. Zhang, J. Tao, J. Tian, and X. Wang, "Textindependent voice conversion based on state mapped codebook, " in Proc. ICASSP, IEEE, 2008, pp. 4605-4608.
    • (2008) Proc. ICASSP, IEEE , pp. 4605-4608
    • Zhang, M.1    Tao, J.2    Tian, J.3    Wang, X.4
  • 9
    • 84890484652 scopus 로고    scopus 로고
    • Non-parallel training for voice conversion based on adaptation method
    • P. Song, W. Zheng, and L. Zhao, "Non-parallel training for voice conversion based on adaptation method, " in Proc. ICASSP, IEEE, 2013.
    • (2013) Proc. ICASSP, IEEE
    • Song, P.1    Zheng, W.2    Zhao, L.3
  • 10
    • 4544297119 scopus 로고    scopus 로고
    • Nonparallel training for voice conversion by maximum likelihood constrained adaptation
    • A. Mouchtaris, J. Van der Spiegel, and P. Mueller, "Nonparallel training for voice conversion by maximum likelihood constrained adaptation, " in Proc. ICASSP, IEEE, 2004, vol. 1, pp. I-1.
    • (2004) Proc. ICASSP, IEEE , vol.1
    • Mouchtaris, A.1    Spiegel Der J.Van2    Mueller, P.3
  • 11
    • 34547512822 scopus 로고    scopus 로고
    • Eigenvoice conversion based on Gaussian mixture model
    • T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on Gaussian mixture model, " in Proc. ICSLP, pp. 2446-2449.
    • Proc. ICSLP , pp. 2446-2449
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 12
    • 77953725318 scopus 로고    scopus 로고
    • INCA algorithm for training voice conversion systems from nonparallel corpora
    • D. Erro, A. Moreno, and A. Bonafonte, "INCA algorithm for training voice conversion systems from nonparallel corpora, " IEEE Trans. Audio, Speech and Language Processing, vol. 18, no. 5, pp. 944-953, 2010.
    • (2010) IEEE Trans. Audio, Speech and Language Processing , vol.18 , Issue.5 , pp. 944-953
    • Erro, D.1    Moreno, A.2    Bonafonte, A.3
  • 14
    • 0001560954 scopus 로고
    • Information geometry and alternatingminimization procedures
    • I. Csiszar and G. Tusnady, "Information geometry and alternatingminimization procedures, " Statistics and Decisions, vol. 1, pp. 205-237, 1984.
    • (1984) Statistics and Decisions , vol.1 , pp. 205-237
    • Csiszar, I.1    Tusnady, G.2
  • 16
    • 84905247158 scopus 로고    scopus 로고
    • http://aholab. ehu. es/users/derro/software. html.
  • 17
    • 85135177301 scopus 로고
    • Highquality speech modifcation based on a harmonic + noise model
    • Y. Stylianou, J. Laroche, and E. Moulines, "Highquality speech modifcation based on a harmonic + noise model, " in Proc. EUROSPEECH, 1995.
    • (1995) Proc. EUROSPEECH
    • Stylianou, Y.1    Laroche, J.2    Moulines, E.3
  • 18
    • 0035127703 scopus 로고    scopus 로고
    • Applying the harmonic plus noise model in concatenative speech synthesis
    • Y. Stylianou, "Applying the harmonic plus noise model in concatenative speech synthesis, " IEEE Trans. Speech and Audio Processing, vol. 9, no. 1, pp. 21-29, 2001.
    • (2001) IEEE Trans. Speech and Audio Processing , vol.9 , Issue.1 , pp. 21-29
    • Stylianou, Y.1
  • 19
    • 51449107658 scopus 로고    scopus 로고
    • LSF mapping for voice conversion with very small training sets
    • E. Helander, J. Nurminen, and M. Gabbouj, "LSF mapping for voice conversion with very small training sets, " in Proc. ICASSP, IEEE, 2008, pp. 4669-4672.
    • (2008) Proc. ICASSP, IEEE , pp. 4669-4672
    • Helander, E.1    Nurminen, J.2    Gabbouj, M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.