메뉴 건너뛰기




Volumn 2015-January, Issue , 2015, Pages 2759-2763

System fusion for high-performance voice conversion

Author keywords

Frequency warping; GMM; Highperformance; System fusion; Voice conversion

Indexed keywords

GAUSSIAN DISTRIBUTION; SPEECH COMMUNICATION;

EID: 84959163883     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (13)

References (28)
  • 6
    • 84910087395 scopus 로고    scopus 로고
    • Sequenceerror (SE) minimization training of neural networkfor voice conversion
    • F.-L. Xie, Y. Qian, Y. Fan, F. K. Soong, and H. Li, "Sequenceerror (SE) minimization training of neural networkfor voice conversion, " in INTERSPEECH, 2014.
    • (2014) INTERSPEECH
    • Xie, F.-L.1    Qian, Y.2    Fan, Y.3    Soong, F.K.4    Li, H.5
  • 8
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversionbased on maximum-likelihood estimation of spectral parametertrajectory
    • T. Toda, A. W. Black, and K. Tokuda, "Voice conversionbased on maximum-likelihood estimation of spectral parametertrajectory, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
    • (2007) IEEE Transactions on Audio, Speech, and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 9
    • 84865754815 scopus 로고    scopus 로고
    • Voice conversion using GMMwith enhanced global variance
    • H. Benisty and D. Malah, "Voice conversion using GMMwith enhanced global variance, " in INTERSPEECH, 2011, pp. 669-672.
    • (2011) INTERSPEECH , pp. 669-672
    • Benisty, H.1    Malah, D.2
  • 12
    • 84911369131 scopus 로고    scopus 로고
    • Exemplarbasedsparse representation with residual compensationfor voice conversion
    • Z. Wu, T. Virtanen, E. S. Chng, and H. Li, "Exemplarbasedsparse representation with residual compensationfor voice conversion, " IEEE Transactions on Speech and Audio Processing, vol. 22, no. 10, pp. 1506-1521, 2014.
    • (2014) IEEE Transactions on Speech and Audio Processing , vol.22 , Issue.10 , pp. 1506-1521
    • Wu, Z.1    Virtanen, T.2    Chng, E.S.3    Li, H.4
  • 16
    • 84872177757 scopus 로고    scopus 로고
    • Parametric voice conversionbased on bilinear frequency warping plus amplitudescaling
    • D. Erro, E. Navas, and I. Hernaez, "Parametric voice conversionbased on bilinear frequency warping plus amplitudescaling, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 21, no. 3, pp. 556-566, 2013.
    • (2013) IEEE Transactions on Audio, Speech, and Language Processing , vol.21 , Issue.3 , pp. 556-566
    • Erro, D.1    Navas, E.2    Hernaez, I.3
  • 19
    • 84857498745 scopus 로고    scopus 로고
    • Voice conversionusing dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
    • E. Godoy, O. Rosec, and T. Chonavel, "Voice conversionusing dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora, " IEEE Transactions onAudio, Speech, and Language Processing, vol. 20, no. 4, pp. 1313-1323, 2012.
    • (2012) IEEE Transactions OnAudio, Speech, and Language Processing , vol.20 , Issue.4 , pp. 1313-1323
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 20
    • 0030245128 scopus 로고    scopus 로고
    • Robust continuous speechrecognition using parallel model combination
    • M. J. Gales and S. J. Young, "Robust continuous speechrecognition using parallel model combination, " IEEETransactions on Speech and Audio Processing, vol. 4, no. 5, pp. 352-359, 1996.
    • (1996) IEEETransactions on Speech and Audio Processing , vol.4 , Issue.5 , pp. 352-359
    • Gales, M.J.1    Young, S.J.2
  • 23
    • 0026880275 scopus 로고
    • Voice transformationusing PSOLA technique
    • H. Valbret, E. Moulines, and J.-P. Tubach, "Voice transformationusing PSOLA technique, " Speech Communication, vol. 11, no. 2, pp. 175-187, 1992.
    • (1992) Speech Communication , vol.11 , Issue.2 , pp. 175-187
    • Valbret, H.1    Moulines, E.2    Tubach, J.-P.3
  • 26
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitchadaptivetime-frequency smoothing and an instantaneousfrequency-based F0 extraction: Possible role of a repetitivestructure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitchadaptivetime-frequency smoothing and an instantaneousfrequency-based F0 extraction: Possible role of a repetitivestructure in sounds, " Speech communication, vol. 27, no. 3, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3
  • 27
    • 84878390910 scopus 로고    scopus 로고
    • Implementationof computationally efficient real-time voice conversion
    • T. Toda, T. Muramatsu, and H. Banno, "Implementationof computationally efficient real-time voice conversion. "in INTERSPEECH, 2012.
    • (2012) INTERSPEECH
    • Toda, T.1    Muramatsu, T.2    Banno, H.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.