SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn , Issue , 2017, Pages 5535-5539

Non-parallel voice conversion using i-vector PLDA: Towards unifying speaker verification and transformation

(4) Kinnunen, Tomi a Juvela, Lauri b Alku, Paavo b Yamagishi, Junichi c,d

a UNIVERSITY OF EASTERN FINLAND (Finland)

b AALTO UNIVERSITY (Finland)

c NATIONAL INSTITUTE OF INFORMATICS (Japan)

d UNIVERSITY OF EDINBURGH (United Kingdom)

Author keywords

i vector; non parallel training; Voice conversion

Indexed keywords

EID: 85023740493 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2017.7953215 Document Type: Conference Paper

Times cited : (93)

References (25)

1
- 70349197715
- Voice transformation: A survey
- Taipei, Taiwan, April
- Y. Stylianou, "Voice transformation: A survey," in Proc. Int. conference on acoustics, speech, and signal processing (ICASSP 2009), Taipei, Taiwan, April 2009, pp. 3585-3588.
- (2009) Proc. Int. Conference on Acoustics, Speech, and Signal Processing (ICASSP 2009) , pp. 3585-3588
- Stylianou, Y.¹

2
- 70350125882
- An overview of text-independent speaker recognition: From features to supervectors
- January
- T. Kinnunen and H. Li, "An overview of text-independent speaker recognition: from features to supervectors," Speech Communication, vol. 52, no. 1, pp. 12-40, January 2010.
- (2010) Speech Communication , vol.52 , Issue.1 , pp. 12-40
- Kinnunen, T.¹ Li, H.²

3
- 0032026483
- Continuous probabilistic transform for voice conversion
- March
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. on Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, March 1998.
- (1998) IEEE Trans. on Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

4
- 33947714703
- Effect of speech transformation on impostor acceptance
- Toulouse, France, May
- D. Matrouf, J.-F. Bonastre, and C. Fredouille, "Effect of speech transformation on impostor acceptance," in Proc. ICASSP, Toulouse, France, May 2006, pp. 933-936.
- (2006) Proc. ICASSP , pp. 933-936
- Matrouf, D.¹ Bonastre, J.-F.² Fredouille, C.³

5
- 0033884858
- Speaker verification using adapted Gaussian mixture models
- January
- D.A. Reynolds, T.F. Quatieri, and R.B. Dunn, "Speaker verification using adapted gaussian mixture models," Digital Signal Processing, vol. 10, no. 1, pp. 19-41, January 2000.
- (2000) Digital Signal Processing , vol.10 , Issue.1 , pp. 19-41
- Reynolds, D.A.¹ Quatieri, T.F.² Dunn, R.B.³

6
- 79951609039
- Front-end factor analysis for speaker verification
- May
- N. Dehak, P.J. Kenny, R. Dehak, P. Dumouchel, and P. Ouellet, "Front-end factor analysis for speaker verification," IEEE Trans. Audio, Speech and Language Processing, vol. 19, no. 4, pp. 788-798, May 2011.
- (2011) IEEE Trans. Audio, Speech and Language Processing , vol.19 , Issue.4 , pp. 788-798
- Dehak, N.¹ Kenny, P.J.² Dehak, R.³ Dumouchel, P.⁴ Ouellet, P.⁵

7
- 50649094277
- Probabilistic linear discriminant analysis for inferences about identity
- S. J. D. Prince and J. H. Elder, "Probabilistic linear discriminant analysis for inferences about identity," in Proc. IEEE 11th Int. Conf. on Computer Vision (ICCV), 2007.
- (2007) Proc. IEEE 11th Int. Conf. on Computer Vision (ICCV)
- Prince, S.J.D.¹ Elder, J.H.²

8
- 84858973723
- Bayesian speaker verification with heavy-tailed priors
- Brno, Czech Republic, June
- P. Kenny, "Bayesian speaker verification with heavy-tailed priors," in Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 2010, p. 14.
- (2010) Odyssey 2010: The Speaker and Language Recognition Workshop , pp. 14
- Kenny, P.¹

9
- 85073103063
- The speaker partitioning problem
- Brno, Czech Republic, June
- N. Brümmer and E. de Villiers, "The speaker partitioning problem," in Odyssey 2010: The Speaker and Language Recognition Workshop, Brno, Czech Republic, June 2010, p. 34.
- (2010) Odyssey 2010: The Speaker and Language Recognition Workshop , pp. 34
- Brümmer, N.¹ De Villiers, E.²

10
- 84959112868
- A study of speaker adaptation for DNN-based speech synthesis
- Dresden, Germany
- Z. Wu, P. Swietojanski, C. Veaux, S. Renals, and S. King, "A study of speaker adaptation for DNN-based speech synthesis," in Proc. Interspeech, Dresden, Germany, 2015, pp. 879-883.
- (2015) Proc. Interspeech , pp. 879-883
- Wu, Z.¹ Swietojanski, P.² Veaux, C.³ Renals, S.⁴ King, S.⁵

11
- 80051608660
- A frame mapping based HMM approach to cross-lingual voice transformation
- Czech Republic, May
- Yao Qian, Ji Xu, and Frank K. Soong, "A frame mapping based HMM approach to cross-lingual voice transformation," Prague, Czech Republic, May 2011, pp. 5120-5123.
- (2011) Prague , pp. 5120-5123
- Qian, Y.¹ Xu, J.² Soong, F.K.³

12
- 84867198185
- On the impact of alignment on voice conversion performance
- E. Helander, J. Schwarz, J. Nurminen, H. Silen, and M. Gabbouj, "On the impact of alignment on voice conversion performance," in Proc. Interspeech, 2008.
- (2008) Proc. Interspeech
- Helander, E.¹ Schwarz, J.² Nurminen, J.³ Silen, H.⁴ Gabbouj, M.⁵

13
- 34047245444
- Nonparallel training for voice conversion based on a parameter adaptation approach
- May
- A. Mouchtaris, J. Van der Spiegel, and P. Mueller, "Nonparallel training for voice conversion based on a parameter adaptation approach," IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 3, pp. 952-963, May 2006.
- (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , Issue.3 , pp. 952-963
- Mouchtaris, A.¹ Van Der Spiegel, J.² Mueller, P.³

14
- 34547512822
- Eigenvoice conversion based on Gaussian mixture model
- Pittsburgh, USA, September
- T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on gaussian mixture model," in Proc. Interspeech, Pittsburgh, USA, September 2006.
- (2006) Proc. Interspeech
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

15
- 84869384026
- Mixture of factor analyzers using priors from non-parallel speech for voice conversion
- Z. Wu, T. Kinnunen, E.S. Chng, and H. Li, "Mixture of factor analyzers using priors from non-parallel speech for voice conversion," IEEE Signal Process. Lett., vol. 19, no. 12, pp. 914-917, 2012.
- (2012) IEEE Signal Process. Lett. , vol.19 , Issue.12 , pp. 914-917
- Wu, Z.¹ Kinnunen, T.² Chng, E.S.³ Li, H.⁴

16
- 84984920236
- Non-parallel training in voice conversion using an adaptive restricted boltzmann machine
- T. Nakashika, T. Takiguchi, and Y. Minami, "Non-parallel training in voice conversion using an adaptive restricted boltzmann machine," IEEE/ACM Trans. Audio, Speech & Language Processing, vol. 24, no. 11, pp. 2032-2045, 2016.
- (2016) IEEE/ACM Trans. Audio, Speech & Language Processing , vol.24 , Issue.11 , pp. 2032-2045
- Nakashika, T.¹ Takiguchi, T.² Minami, Y.³

17
- 85073232294
- A small footprint i-vector extractor
- Singapore, June
- P. Kenny, "A small footprint i-vector extractor," in Proc. Odyssey 2012: the Speaker and Language Recognition Workshop, Singapore, June 2012.
- (2012) Proc. Odyssey 2012: The Speaker and Language Recognition Workshop
- Kenny, P.¹

18
- 85023756121
- arXiv e-prints
- N. Brümmer, "VB calibration to improve the interface between phone recognizer and i-vector extractor," arXiv e-prints, 2015.
- (2015) VB Calibration to Improve the Interface Between Phone Recognizer and I-vector Extractor
- Brümmer, N.¹

19
- 33947637189
- Joint factor analysis of speaker and session variability: Theory and algorithms
- P. Kenny, "Joint factor analysis of speaker and session variability: theory and algorithms," technical report CRIM-06/08-14, 2006.
- (2006) Technical Report CRIM-06/08-14
- Kenny, P.¹

20
- 84906311190
- Unifying probabilistic linear discriminant analysis variants in biometric authentication
- Syntactic, and Statistical Pattern Recognition - Joint IAPR International Workshop, S+SSPR 2014, Joensuu, Finland, August 20-22 Proceedings, 2014
- A. Sizov, K.-A. Lee, and T. Kinnunen, "Unifying probabilistic linear discriminant analysis variants in biometric authentication," in Structural, Syntactic, and Statistical Pattern Recognition - Joint IAPR International Workshop, S+SSPR 2014, Joensuu, Finland, August 20-22, 2014. Proceedings, 2014, pp. 464-475.
- (2014) Structural , pp. 464-475
- Sizov, A.¹ Lee, K.-A.² Kinnunen, T.³

21
- 85023762159
- Master's thesis, Aalto University, Espoo, Finland
- L. Juvela, Perceptual spectral matching utilizing mel-scale filterbanks for statistical parametric speech synthesis with glottal excitation vocoder, Master's thesis, Aalto University, Espoo, Finland, 2015.
- (2015) Perceptual Spectral Matching Utilizing Mel-scale Filterbanks for Statistical Parametric Speech Synthesis with Glottal Excitation Vocoder
- Juvela, L.¹

22
- 0016495091
- Linear prediction: A tutorial review
- April
- J. Makhoul, "Linear prediction: a tutorial review," Proceedings of the IEEE, vol. 64, no. 4, pp. 561-580, April 1975.
- (1975) Proceedings of the IEEE , vol.64 , Issue.4 , pp. 561-580
- Makhoul, J.¹

23
- 33646236798
- Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end
- B. Milner and X. Shao, "Clean speech reconstruction from MFCC vectors and fundamental frequency using an integrated front-end," Speech Communication, vol. 48, no. 6, pp. 697-715, 2006.
- (2006) Speech Communication , vol.48 , Issue.6 , pp. 697-715
- Milner, B.¹ Shao, X.²

24
- 58149310662
- On the inversion of melfrequency cepstral coefficients for speech enhancement applications
- September
- L.E. Boucheron and P.L. De Leon , "On the inversion of melfrequency cepstral coefficients for speech enhancement applications," in Int. Conf. Signals and Electronic Systems (ICSES), September 2008, pp. 485-488.
- (2008) Int. Conf. Signals and Electronic Systems (ICSES) , pp. 485-488
- Boucheron, L.E.¹ De Leon, P.L.²

25
- 33645887246
- Support vector machines using GMM supervectors for speaker verification
- W.M. Campbell, D.E. Sturim, and D.A. Reynolds, "Support vector machines using GMM supervectors for speaker verification," IEEE Signal Process. Lett., vol. 13, no. 5, pp. 308-311, 2006.
- (2006) IEEE Signal Process. Lett. , vol.13 , Issue.5 , pp. 308-311
- Campbell, W.M.¹ Sturim, D.E.² Reynolds, D.A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.