Volume , Issue , 2007, Pages 107-112

An Evaluation of Many-to-One Voice Conversion Algorithms with Pre-Stored Speaker Data Sets

Author keywords

EVC; many to one VC; SAT; speaker selection; voice conversion

Indexed keywords

DATA SET; EIGENVOICE CONVERSION; EIGENVOICES; MANY-TO-ONE; MANY-TO-ONE VOICE CONVERSION; SPEAKER ADAPTIVE TRAININGS; SPEAKER SELECTION; VOICE CONVERSION; VOICE CONVERSION ALGORITHM;

EID: 78649286010     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (1)

References (18)
  • 1
    • 0029256373
    • Acoustic characteristics of speaker individuality control and conversion
    • H. Kuwabara, and Y. Sagisaka. Acoustic characteristics of speaker individuality control and conversion. Speech Communication, Vol. 16, No. 2, pp. 165-173, 1995.
    • (1995) Speech Communication , vol.16 , Issue.2 , pp. 165-173
    • Kuwabara, H.1    Sagisaka, Y.2
  • 2
    • 0031623661
    • Spectral voice conversion for text-to-speech synthesis
    • Seattle, USA, May
    • A. Kain and M.W. Macon. Spectral voice conversion for text-to-speech synthesis. Proc. ICASSP, pp. 285-288, Seattle, USA, May 1998.
    • (1998) Proc. ICASSP , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 3
    • 0033692729
    • Narrowband to wideband conversion of speech using GMM based transformation
    • K.-Y. Park and H. S. Kim. Narrowband to wideband conversion of speech using GMM based transformation. Proc. ICASSP, pp. 1843-1846, 2000.
    • (2000) Proc. ICASSP , pp. 1843-1846
    • Park, K.-Y.1    Kim, H. S.2
  • 4
    • 44949187612
    • Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion
    • Pittsburgh, USA, Sep
    • M. Nakagiri, T. Toda, H. Kashioka and K. Shikano. Improving Body Transmitted Unvoiced Speech with Statistical Voice Conversion. Proc. INTERSPEECH2006-ICSLP, pp. 2270-2273, Pittsburgh, USA, Sep. 2006.
    • (2006) Proc. INTERSPEECH2006-ICSLP , pp. 2270-2273
    • Nakagiri, M.1    Toda, T.2    Kashioka, H.3    Shikano, K.4
  • 6
    • 4544306344
    • Cross-language voice conversion using bilingual database
    • July
    • M. Mashimo, T. Toda, H. Kawanami, K. Shikano and N. Campbell. Cross-language voice conversion using bilingual database. IPSJ Journal, Vol. 43, No. 7, pp. 2177-2185, July 2002.
    • (2002) IPSJ Journal , vol.43 , Issue.7 , pp. 2177-2185
    • Mashimo, M.1    Toda, T.2    Kawanami, H.3    Shikano, K.4    Campbell, N.5
  • 8
    • 33646779506
    • Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
    • T. Toda, A.W. Black and K. Tokuda. Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter. Proc. ICASSP, pp. 9-12, 2005.
    • (2005) Proc. ICASSP , pp. 9-12
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 9
    • 34047254509
    • Quality-enhanced voice morphing using maximum likelihood transformations
    • H. Ye and S. Young. Quality-enhanced voice morphing using maximum likelihood transformations. IEEE Trans. ASLP, Vol. 14, No. 4, pp. 1301-1312, 2006.
    • (2006) IEEE Trans. ASLP , vol.14 , Issue.4 , pp. 1301-1312
    • Ye, H.1    Young, S.2
  • 11
    • 4544297119
    • Non-parallel training for voice conversion by maximum likelihood constrained adaptation
    • May
    • A. Mouchtaris, J. Spiegel, and P. Mueller. Non-parallel training for voice conversion by maximum likelihood constrained adaptation. Proc. ICASSP, pp. 1-4, May 2004.
    • (2004) Proc. ICASSP , pp. 1-4
    • Mouchtaris, A.1    Spiegel, J.2    Mueller, P.3
  • 12
    • 85084062692
    • MAP-based adaptation for speech conversion using adaptation data selection and nonparallel training
    • Sept
    • C.H. Lee and C.H. Wu. MAP-based adaptation for speech conversion using adaptation data selection and nonparallel training. Proc. ICSLP, pp. 1164-1167, Sept. 2006.
    • (2006) Proc. ICSLP , pp. 1164-1167
    • Lee, C.H.1    Wu, C.H.2
  • 13
    • 34547512822
    • Eigenvoice Conversion Based on Gaussian Mixture Model
    • Sept
    • T. Toda, Y. Ohtani and K. Shikano. Eigenvoice Conversion Based on Gaussian Mixture Model. Proc. ICSLP, pp. 2446-2449, Sept. 2006.
    • (2006) Proc. ICSLP , pp. 2446-2449
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 14
    • 34547496175
    • One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices
    • Apr
    • T. Toda, Y. Ohtani and K. Shikano. One-to-Many and Many-to-One Voice Conversion Based on Eigenvoices. Proc. ICASSP, Vol. 4, pp. 1249-1252, Apr. 2007.
    • (2007) Proc. ICASSP , vol.4 , pp. 1249-1252
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 15
    • 0034848875
    • Unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers
    • May
    • S. Yoshizawa, A. Baba, K. Matsunami, Y. Mera, M. Yamada and K. Shikano. Unsupervised speaker adaptation based on sufficient HMM statistics of selected speakers. Proc. ICASSP2001, pp. 341-344, May 2001.
    • (2001) Proc. ICASSP2001 , pp. 341-344
    • Yoshizawa, S.1    Baba, A.2    Matsunami, K.3    Mera, Y.4    Yamada, M.5    Shikano, K.6
  • 18
    • 0032673049
    • Restructuring speech representations using a pitch adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné. Restructuring speech representations using a pitch adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. Speech Communication, Vol. 27, No. 3-4, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigné, A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.