메뉴 건너뛰기




Volumn , Issue , 2017, Pages 5535-5539

Non-parallel voice conversion using i-vector PLDA: Towards unifying speaker verification and transformation

Author keywords

i vector; non parallel training; Voice conversion

Indexed keywords


EID: 85023740493     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2017.7953215     Document Type: Conference Paper
Times cited : (93)

References (25)
  • 2
    • 70350125882 scopus 로고    scopus 로고
    • An overview of text-independent speaker recognition: From features to supervectors
    • January
    • T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: from features to supervectors," Speech Communication, vol. 52, no. 1, pp. 12-40, January 2010.
    • (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
    • Kinnunen, T.1    Li, H.2
  • 4
    • 33947714703 scopus 로고    scopus 로고
    • Effect of speech transformation on impostor acceptance
    • Toulouse, France, May
    • D. Matrouf, J.-F. Bonastre, and C. Fredouille, "Effect of speech transformation on impostor acceptance," in Proc. ICASSP, Toulouse, France, May 2006, pp. 933-936.
    • (2006) Proc. ICASSP , pp. 933-936
    • Matrouf, D.1    Bonastre, J.-F.2    Fredouille, C.3
  • 5
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • January
    • D.A. Reynolds, T.F. Quatieri, and R.B. Dunn, "Speaker verification using adapted gaussian mixture models," Digital Signal Processing, vol. 10, no. 1, pp. 19-41, January 2000.
    • (2000) Digital Signal Processing , vol.10 , Issue.1 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 8
    • 84858973723 scopus 로고    scopus 로고
    • Bayesian speaker verification with heavy-tailed priors
    • Brno, Czech Republic, June
    • P. Kenny, "Bayesian speaker verification with heavy-tailed priors," in Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 2010, p. 14.
    • (2010) Odyssey 2010: The Speaker and Language Recognition Workshop , pp. 14
    • Kenny, P.1
  • 10
    • 84959112868 scopus 로고    scopus 로고
    • A study of speaker adaptation for DNN-based speech synthesis
    • Dresden, Germany
    • Z. Wu, P. Swietojanski, C. Veaux, S. Renals, and S. King, "A study of speaker adaptation for DNN-based speech synthesis," in Proc. Interspeech, Dresden, Germany, 2015, pp. 879-883.
    • (2015) Proc. Interspeech , pp. 879-883
    • Wu, Z.1    Swietojanski, P.2    Veaux, C.3    Renals, S.4    King, S.5
  • 11
    • 80051608660 scopus 로고    scopus 로고
    • A frame mapping based HMM approach to cross-lingual voice transformation
    • Czech Republic, May
    • Yao Qian, Ji Xu, and Frank K. Soong, "A frame mapping based HMM approach to cross-lingual voice transformation," Prague, Czech Republic, May 2011, pp. 5120-5123.
    • (2011) Prague , pp. 5120-5123
    • Qian, Y.1    Xu, J.2    Soong, F.K.3
  • 14
    • 34547512822 scopus 로고    scopus 로고
    • Eigenvoice conversion based on Gaussian mixture model
    • Pittsburgh, USA, September
    • T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on gaussian mixture model," in Proc. Interspeech, Pittsburgh, USA, September 2006.
    • (2006) Proc. Interspeech
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 15
    • 84869384026 scopus 로고    scopus 로고
    • Mixture of factor analyzers using priors from non-parallel speech for voice conversion
    • Z. Wu, T. Kinnunen, E.S. Chng, and H. Li, "Mixture of factor analyzers using priors from non-parallel speech for voice conversion," IEEE Signal Process. Lett., vol. 19, no. 12, pp. 914-917, 2012.
    • (2012) IEEE Signal Process. Lett. , vol.19 , Issue.12 , pp. 914-917
    • Wu, Z.1    Kinnunen, T.2    Chng, E.S.3    Li, H.4
  • 16
    • 84984920236 scopus 로고    scopus 로고
    • Non-parallel training in voice conversion using an adaptive restricted boltzmann machine
    • T. Nakashika, T. Takiguchi, and Y. Minami, "Non-parallel training in voice conversion using an adaptive restricted boltzmann machine," IEEE/ACM Trans. Audio, Speech & Language Processing, vol. 24, no. 11, pp. 2032-2045, 2016.
    • (2016) IEEE/ACM Trans. Audio, Speech & Language Processing , vol.24 , Issue.11 , pp. 2032-2045
    • Nakashika, T.1    Takiguchi, T.2    Minami, Y.3
  • 19
    • 33947637189 scopus 로고    scopus 로고
    • Joint factor analysis of speaker and session variability: Theory and algorithms
    • P. Kenny, "Joint factor analysis of speaker and session variability: theory and algorithms," technical report CRIM-06/08-14, 2006.
    • (2006) Technical Report CRIM-06/08-14
    • Kenny, P.1
  • 20
    • 84906311190 scopus 로고    scopus 로고
    • Unifying probabilistic linear discriminant analysis variants in biometric authentication
    • Syntactic, and Statistical Pattern Recognition - Joint IAPR International Workshop, S+SSPR 2014, Joensuu, Finland, August 20-22 Proceedings, 2014
    • A. Sizov, K.-A. Lee, and T. Kinnunen, "Unifying probabilistic linear discriminant analysis variants in biometric authentication," in Structural, Syntactic, and Statistical Pattern Recognition - Joint IAPR International Workshop, S+SSPR 2014, Joensuu, Finland, August 20-22, 2014. Proceedings, 2014, pp. 464-475.
    • (2014) Structural , pp. 464-475
    • Sizov, A.1    Lee, K.-A.2    Kinnunen, T.3
  • 22
    • 0016495091 scopus 로고
    • Linear prediction: A tutorial review
    • April
    • J. Makhoul, "Linear prediction: a tutorial review," Proceedings of the IEEE, vol. 64, no. 4, pp. 561-580, April 1975.
    • (1975) Proceedings of the IEEE , vol.64 , Issue.4 , pp. 561-580
    • Makhoul, J.1
  • 23
    • 33646236798 scopus 로고    scopus 로고
    • Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end
    • B. Milner and X. Shao, "Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end," Speech Communication, vol. 48, no. 6, pp. 697-715, 2006.
    • (2006) Speech Communication , vol.48 , Issue.6 , pp. 697-715
    • Milner, B.1    Shao, X.2
  • 24
    • 58149310662 scopus 로고    scopus 로고
    • On the inversion of melfrequency cepstral coefficients for speech enhancement applications
    • September
    • L.E. Boucheron and P.L. De Leon , "On the inversion of melfrequency cepstral coefficients for speech enhancement applications," in Int. Conf. Signals and Electronic Systems (ICSES), September 2008, pp. 485-488.
    • (2008) Int. Conf. Signals and Electronic Systems (ICSES) , pp. 485-488
    • Boucheron, L.E.1    De Leon, P.L.2
  • 25
    • 33645887246 scopus 로고    scopus 로고
    • Support vector machines using GMM supervectors for speaker verification
    • W.M. Campbell, D.E. Sturim, and D.A. Reynolds, "Support vector machines using GMM supervectors for speaker verification," IEEE Signal Process. Lett., vol. 13, no. 5, pp. 308-311, 2006.
    • (2006) IEEE Signal Process. Lett. , vol.13 , Issue.5 , pp. 308-311
    • Campbell, W.M.1    Sturim, D.E.2    Reynolds, D.A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.