SCOPUS Information Retrieval Platform

IEEE Transactions on Audio, Speech and Language Processing

Volume 20, Issue 3, 2012, Pages 806-817

Voice conversion using dynamic kernel partial least squares regression

(4) Helander, Elina (a); Silen, Hanna (a); Virtanen, Tuomas (a); Gabbouj, Moncef (a)

(a) Tampere University of Technology (Finland)

Author keywords

Kernel methods; partial least squares regression; voice conversion

Indexed keywords

APERIODICITY; AUXILIARY INFORMATION; CONVERSION FUNCTION; GAUSSIANS; KERNEL METHODS; KERNEL PARTIAL LEAST SQUARES; NONLINEAR MODELING; PARTIAL LEAST SQUARES REGRESSION; SIMPLE AND EFFICIENT ALGORITHMS; SOURCE FEATURES; SPECTRAL FEATURE; SPEECH FEATURES; VOICE CONVERSION; VOICE CONVERSION ALGORITHM;

ALGORITHMS; DYNAMICS; LINEAR TRANSFORMATIONS; MATHEMATICAL TRANSFORMATIONS; REGRESSION ANALYSIS;

SPEECH PROCESSING;

EID: 84856141218 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2011.2165944 Document Type: Article

Times cited : (142)

References (33)

1
- 0023739214
- Voice conversion through vector quantization
- New York, Apr
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, New York, Apr. 1988, pp. 565-568.
- (1988) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , pp. 565-568
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

2
- 77953707533
- Spectral mapping using artificial neural networks for voice conversion
- Jul
- S. Desai, A. Black, B. Yegnanarayana, and K. Prahallad, "Spectral mapping using artificial neural networks for voice conversion," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 954-964, Jul. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.5 , pp. 954-964
- Desai, S.¹ Black, A.² Yegnanarayana, B.³ Prahallad, K.⁴

3
- 85010815133
- Voice transformation using PSOLA technique
- Mar
- H. Valbret, E. Moulines, and J. Tubach, "Voice transformation using PSOLA technique," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Mar. 1992, vol. 1, pp. 145-148.
- (1992) Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing , vol.1 , pp. 145-148
- Valbret, H.¹ Moulines, E.² Tubach, J.³

4
- 0033154052
- Speaker transformation algorithm using segmental codebooks (STASC)
- Jun
- L. Arslan, "Speaker transformation algorithm using segmental codebooks (STASC)," Speech Commun., vol. 28, no. 3, pp. 211-226, Jun. 1999.
- (1999) Speech Commun. , vol.28 , Issue.3 , pp. 211-226
- Arslan, L.¹

5
- 79959836789
- Maximum a posteriori voice conversion using sequential Monte Carlo methods
- Sep
- E. Helander, H. Silen, J. Miguez, and M. Gabbouj, "Maximum a posteriori voice conversion using sequential Monte Carlo methods," in Proc. Interspeech, Sep. 2010, pp. 1716-1719.
- (2010) Proc. Interspeech , pp. 1716-1719
- Helander, E.¹ Silen, H.² Miguez, J.³ Gabbouj, M.⁴

6
- 0032026483
- Continuous probabilistic transform for voice conversion
- PII S1063667698017386
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol. 6, no. 2, pp. 131-142, Mar. 1998. (Pubitemid 128720639)
- (1998) IEEE Transactions on Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

7
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- Seattle, WA, May
- A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Seattle, WA, May 1998, vol. 1, pp. 285-288.
- (1998) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

8
- 77953712499
- Voice conversion using partial least squares regression
- Jul
- E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 912-921, Jul. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.5 , pp. 912-921
- Helander, E.¹ Virtanen, T.² Nurminen, J.³ Gabbouj, M.⁴

9
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- Nov
- T. Toda, A. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.² Tokuda, K.³

10
- 84966348891
- An HMM-based speech synthesis system applied to English
- Sep
- K. Tokuda, H. Zen, and A. W. Black, "An HMM-based speech synthesis system applied to English," in Proc. IEEE Workshop Speech Synth., Sep. 2002, pp. 227-230.
- (2002) Proc. IEEE Workshop Speech Synth. , pp. 227-230
- Tokuda, K.¹ Zen, H.² Black, A.W.³

11
- 34547496175
- One-to-many and many-to-one voice conversion based on eigenvoices
- DOI 10.1109/ICASSP.2007.367303, 4218334, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
- T. Toda, Y. Ohtani, and K. Shikano, "One-to-many and many-to-one voice conversion based on eigenvoices," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 2007, vol. 4, pp. IV-1249-IV-1252. (Pubitemid 47178603)
- (2007) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.4
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

12
- 84905560807
- Voice conversion with smoothed GMM and MAP adaptation
- Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothed GMM and MAP adaptation," in Proc. Eurospeech, 2003, pp. 2413-2416.
- (2003) Proc. Eurospeech , pp. 2413-2416
- Chen, Y.¹ Chu, M.² Chang, E.³ Liu, J.⁴ Liu, R.⁵

13
- 77953727123
- Voice conversion based on weighted frequency warping
- Jul
- D. Erro, A. Moreno, and A. Bonafonte, "Voice conversion based on weighted frequency warping," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 922-931, Jul. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.5 , pp. 922-931
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

14
- 33745171182
- New York: Wiley, , ch. Introduction to Scientific Data Mining: Direct Kernel Methods & Applications
- M. J. Embrechts and B. Szymanski, Computationally Intelligent Hybrid Systems. New York: Wiley, 2005, ch. Introduction to Scientific Data Mining: Direct Kernel Methods & Applications, pp. 317-365.
- (2005) Computationally Intelligent Hybrid Systems , pp. 317-365
- Embrechts, M.J.¹ Szymanski, B.²

15
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- Apr
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Commun., vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
- (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigné, A.³

16
- 77953708096
- Thousands of voices for HMM-based speech synthesis-Analysis and application of TTS systems built on various ASR corpora
- Jul
- J. Yamagishi, B. Usabaev, S. King, O. Watts, J. Dines, J. Tian, Y. Guan, R. Hu, K. Oura, Y.-J. Wu, K. Tokuda, R. Karhila, and M. Kurimo, "Thousands of voices for HMM-based speech synthesis-Analysis and application of TTS systems built on various ASR corpora," IEEE Trans. Audio, Speech, Lang. Process., vol. 18, no. 5, pp. 984-1004, Jul. 2010.
- (2010) IEEE Trans. Audio, Speech, Lang. Process. , vol.18 , Issue.5 , pp. 984-1004
- Yamagishi, J.¹ Usabaev, B.² King, S.³ Watts, O.⁴ Dines, J.⁵ Tian, J.⁶ Guan, Y.⁷ Hu, R.⁸ Oura, K.⁹ Wu, Y.-J.¹⁰ Tokuda, K.¹¹ Karhila, R.¹² Kurimo, M.¹³

17
- 0004244302
- Upper Saddle River, NJ: Prentice-Hall
- L. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition. Upper Saddle River, NJ: Prentice-Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.-H.²

18
- 18144401294
- A novel kernel method for clustering
- May
- F. Camastra and A. Verri, "A novel kernel method for clustering," IEEE Trans. Pattern Anal. Mach. Intell., vol. 27, no. 5, pp. 801-804, May 2005.
- (2005) IEEE Trans. Pattern Anal. Mach. Intell. , vol.27 , Issue.5 , pp. 801-804
- Camastra, F.¹ Verri, A.²

19
- 0347243182
- Nonlinear Component Analysis as a Kernel Eigenvalue Problem
- B. Schölkopf, A. J. Smola, and K.-R. Müller, "Nonlinear component analysis as a kernel eigenvalue problem," Neural Comput., vol. 10, no. 5, pp. 1299-1319, 1998. (Pubitemid 128463674)
- (1998) Neural Computation , vol.10 , Issue.5 , pp. 1299-1319
- Schölkopf, B.¹ Smola, A.J.² Müller, K.-R.³

20
- 2442514721
- ser. NATO Science Series. Series III: Computer and Systems Sciences. Amsterdam, The Netherlands: IOS Press , ch. An Optimization Perspective on Kernel Partial Least Squares Regression
- K. P. Bennett and M. J. Embrechts, Advances in Learning Theory: Methods, Models and Applications, ser. NATO Science Series. Series III: Computer and Systems Sciences. Amsterdam, The Netherlands: IOS Press, 2003, vol. 190, ch. An Optimization Perspective on Kernel Partial Least Squares Regression, pp. 227-250.
- (2003) Advances in Learning Theory: Methods, Models and Applications , vol.190 , pp. 227-250
- Bennett, K.P.¹ Embrechts, M.J.²

21
- 0027530250
- SIMPLS: An alternative approach to partial least squares regression
- Mar
- S. de Jong, "SIMPLS: An alternative approach to partial least squares regression," Chemometrics Intell. Lab. Syst., vol. 18, no. 3, pp. 251-263, Mar. 1993.
- (1993) Chemometrics Intell. Lab. Syst. , vol.18 , Issue.3 , pp. 251-263
- De Jong, S.¹

22
- 0038259120
- Kernel partial least squares regression in reproducing kernel Hilbert space
- Dec
- R. Rosipal and L. Trejo, "Kernel partial least squares regression in reproducing kernel Hilbert space," J. Mach. Learn. Res., vol. 2, pp. 97-123, Dec. 2001.
- (2001) J. Mach. Learn. Res. , vol.2 , pp. 97-123
- Rosipal, R.¹ Trejo, L.²

23
- 33846405723
- Details of the Nitech HMM-based speech synthesis system for the Blizzard Challenge 2005
- DOI 10.1093/ietisy/e90-d.1.325
- H. Zen, T. Toda, M. Nakamura, and K. Tokuda, "Details of the Nitech HMM-based speech synthesis system for the Blizzard challenge 2005," IEICE Trans. Inf. Syst., vol. E90-D, no. 1, pp. 325-333, Jan. 2007. (Pubitemid 46145336)
- (2007) IEICE Transactions on Information and Systems , vol.E90-D , Issue.1 , pp. 325-333
- Zen, H.¹ Toda, T.² Nakamura, M.³ Tokuda, K.⁴

24
- 44949143155
- Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation," in Proc. Interspeech, 2006, pp. 2266-2269.
- (2006) Proc. Interspeech , pp. 2266-2269
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

25
- 0028996842
- CELP coding based on mel-cepstral analysis
- May
- K. Koishida, K. Tokuda, T. Kobayashi, and S. Imai, "CELP coding based on mel-cepstral analysis," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., May 1995, vol. 1, pp. 33-36.
- (1995) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 33-36
- Koishida, K.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

26
- 33746653351
- Robust processing techniques for voice conversion
- DOI 10.1016/j.csl.2005.06.001, PII S088523080500029X
- O. Turk and L. Arslan, "Robust processing techniques for voice conversion," Comput. Speech Lang., vol. 20, no. 4, pp. 441-467, Oct. 2006. (Pubitemid 44150541)
- (2006) Computer Speech and Language , vol.20 , Issue.4 , pp. 441-467
- Turk, O.¹ Arslan, L.M.²

27
- 85009097254
- Mixed excitation for HMM-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Mixed excitation for HMM-based speech synthesis," in Proc. Eurospeech, 2001, pp. 2263-2266.
- (2001) Proc. Eurospeech , pp. 2263-2266
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

28
- 70349218136
- Voice conversion based on simultaneous modeling of spectrum and F0
- May
- K. Yutani, Y. Uto, Y. Nankaku, A. Lee, and K. Tokuda, "Voice conversion based on simultaneous modeling of spectrum and F0," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., May 2009, pp. 3897-3900.
- (2009) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 3897-3900
- Yutani, K.¹ Uto, Y.² Nankaku, Y.³ Lee, A.⁴ Tokuda, K.⁵

29
- 34547520011
- A novel method for prosody prediction in voice conversion
- DOI 10.1109/ICASSP.2007.366961, 4218149, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '07
- E. Helander and J. Nurminen, "A novel method for prosody prediction in voice conversion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Apr. 2007, vol. 4, pp. IV-509-IV-512. (Pubitemid 47178423)
- (2007) ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings , vol.4
- Helander, E.E.¹ Nurminen, J.²

30
- 5444243681
- Speaker-specific pitch contour modelling and modification
- Seattle, WA, May
- D. Chapell and J. Hansen, "Speaker-specific pitch contour modelling and modification," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., Seattle, WA, May 1998, pp. 885-888.
- (1998) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 885-888
- Chapell, D.¹ Hansen, J.²

31
- 84856159343
- CSLU: Voices
- Philadelphia, PA
- A. Kain, "CSLU: Voices," in Linguistic Data Consortium, Philadelphia, PA, 2006.
- (2006) Linguistic Data Consortium
- Kain, A.¹

32
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture speaker models
- Jan
- D. Reynolds and R. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.¹ Rose, R.²

33
- 77956826012
- Automatic speaker recognition as a measurement of voice imitation and conversion
- M. Farrus, M. Wagner, D. Erro, and J. Hernando, "Automatic speaker recognition as a measurement of voice imitation and conversion," Int. J. Speech Lang. Law, vol. 17, no. 1, 2010.
- (2010) Int. J. Speech Lang. Law , vol.17 , Issue.1
- Farrus, M.¹ Wagner, M.² Erro, D.³ Hernando, J.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.