메뉴 건너뛰기




Volumn , Issue , 2016, Pages 44-51

An Automatic Voice Conversion Evaluation Strategy Based on Perceptual Background Noise Distortion and Speaker Similarity

Author keywords

objective measures; speaker similarity score; speech quality assessment; subjective listening tests; Voice conversion

Indexed keywords

ACOUSTIC NOISE; SPEECH COMMUNICATION; SPEECH ENHANCEMENT; SPEECH SYNTHESIS;

EID: 85075288991     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (7)

References (34)
  • 4
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 15, no. 8, pp. 2222-2235, 2007.
    • (2007) Audio, Speech, and Language Processing, IEEE Transactions on , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 8
    • 84911369131 scopus 로고    scopus 로고
    • Exemplar-based sparse representation with residual compensation for voice conversion
    • Z. Wu, T. Virtanen, E. Chng, and H. Li, "Exemplar-based sparse representation with residual compensation for voice conversion," IEEE/ACM Trans. Audio, Speech & Language Processing, vol. 22, pp. 1506-1521, 2014.
    • (2014) IEEE/ACM Trans. Audio, Speech & Language Processing , vol.22 , pp. 1506-1521
    • Wu, Z.1    Virtanen, T.2    Chng, E.3    Li, H.4
  • 14
    • 84866873313 scopus 로고    scopus 로고
    • Prediction of perceived sound quality of synthetic speech
    • Xi'an, China
    • D.-Y. Huang, "Prediction of perceived sound quality of synthetic speech," in In: Proc. APSIPA ASC., Xi'an, China, 2011.
    • (2011) Proc. APSIPA ASC
    • Huang, D.-Y.1
  • 17
    • 84910071971 scopus 로고    scopus 로고
    • A comparative study of spectral transformation techniques for singing voice synthesis
    • S. W. Lee, Z. Wu, M. Dong, X. Tian, and H. Li, "A comparative study of spectral transformation techniques for singing voice synthesis," in Proc. Interspeech, 2014, pp. 2499-2503.
    • (2014) Proc. Interspeech , pp. 2499-2503
    • Lee, S. W.1    Wu, Z.2    Dong, M.3    Tian, X.4    Li, H.5
  • 20
    • 80053068819 scopus 로고    scopus 로고
    • Voice conversion using support vector regression
    • September
    • P. Song, Y. Q. Bao, L. Zhao, and C. R. Zou, "Voice conversion using support vector regression," Electronics Letters, vol. 47, no. 18, pp. 1045-1046, September 2011.
    • (2011) Electronics Letters , vol.47 , Issue.18 , pp. 1045-1046
    • Song, P.1    Bao, Y. Q.2    Zhao, L.3    Zou, C. R.4
  • 23
    • 84857498745 scopus 로고    scopus 로고
    • Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
    • May
    • E. Godoy, O. Rosec, and T. Chonavel, "Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora," IEEE Transactions on Audio, Speech, and Language Processing, vol. 20, no. 4, pp. 1313-1323, May 2012.
    • (2012) IEEE Transactions on Audio, Speech, and Language Processing , vol.20 , Issue.4 , pp. 1313-1323
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 29
    • 84946020861 scopus 로고    scopus 로고
    • Sparse representation for frequency warping based voice conversion
    • X. Tian, Z. Wu, S. W. Lee, N. Q. Hy, E. Chng, and M. Dong, "Sparse representation for frequency warping based voice conversion." in ICASSP. IEEE, 2015, pp. 4235-4239.
    • (2015) ICASSP. IEEE , pp. 4235-4239
    • Tian, X.1    Wu, Z.2    Lee, S. W.3    Hy, N. Q.4    Chng, E.5    Dong, M.6
  • 31
    • 85118466198 scopus 로고    scopus 로고
    • An effective quality evaluation protocol for speech enhancement algorithms
    • Sydney, Australia
    • J. Hansen and B. Pellom, "An effective quality evaluation protocol for speech enhancement algorithms," in Proc. ICSLP, Sydney, Australia, 1998, pp. 81-84.
    • (1998) Proc. ICSLP , pp. 81-84
    • Hansen, J.1    Pellom, B.2
  • 32
    • 84989489267 scopus 로고
    • Prediction of perceived phonetic distance from criticalband spectra: A first step
    • D. Klatt, "Prediction of perceived phonetic distance from criticalband spectra: A first step," in Proc. IEEE ICASSP, 1982, pp. 1278-1281.
    • (1982) Proc. IEEE ICASSP , pp. 1278-1281
    • Klatt, D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.