SCOPUS 정보 검색 플랫폼

IEEE Transactions on Audio, Speech and Language Processing

Volumn 18, Issue 5, 2010, Pages 922-931

Voice conversion based on weighted frequency warping

(3) Erro, Daniel a Moreno, Asunción b Bonafonte, Antonio b

a UNIVERSITY OF THE BASQUE COUNTRY UPV EHU (Spain)

b UNIVERSITAT POLITÈCNICA DE CATALUNYA (Spain)

Author keywords

Gaussian mixture models (GMMs); Harmonic plus stochastic model (HSM); Speech synthesis; Voice conversion; Weighted frequency warping

Indexed keywords

FREQUENCY WARPING; GAUSSIAN MIXTURE MODELS; GAUSSIAN MIXTURE MODELS (GMMS); HARMONIC PLUS STOCHASTIC MODEL (HSM); VOICE CONVERSION;

LINEAR TRANSFORMATIONS; PIECEWISE LINEAR TECHNIQUES; SPEECH SYNTHESIS; STOCHASTIC SYSTEMS; WEAVING;

STOCHASTIC MODELS;

EID: 77953727123 PISSN: 15587916 EISSN: None Source Type: Journal
DOI: 10.1109/TASL.2009.2038663 Document Type: Article

Times cited : (162)

References (35)

1
- 4444251929
- Voice conversion: State of the art and perspectives
- E. Moulines and Y. Sagisaka Eds. Elsevier
- E. Moulines and Y. Sagisaka, Eds., "Voice conversion: State of the art and perspectives," Special Iss. Speech Commun., vol.16(2), 1995, Elsevier.
- (1995) Special Iss. Speech Commun. , vol.16 , Issue.2

2
- 0023739214
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1988, pp. 655-658.
- (1988) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

3
- 0026394044
- Speaker adaptation and voice conversion by codebook mapping
- K. Shikano, S. Nakamura, and M. Abe, "Speaker adaptation and voice conversion by codebook mapping," in Proc. IEEE Int. Symp. Circuits Syst., 1991, vol.1, pp. 594-597.
- (1991) Proc. IEEE Int. Symp. Circuits Syst. , vol.1 , pp. 594-597
- Shikano, K.¹ Nakamura, S.² Abe, M.³

4
- 33646900967
- Voice conversion based on piecewise linear conversion rules of formant frequency and spectrum tilt
- H. Mizuno and M. Abe, "Voice conversion based on piecewise linear conversion rules of formant frequency and spectrum tilt," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1994, vol.1, pp. 469-472.
- (1994) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 469-472
- Mizuno, H.¹ Abe, M.²

5
- 0033154052
- Speaker transformation algorithm using segmental codebooks (STASC)
- L. M. Arslan, "Speaker transformation algorithm using segmental codebooks (STASC)," Speech Commun., no.28, 1999.
- (1999) Speech Commun. , Issue.28
- Arslan, L.M.¹

6
- 33746653351
- Robust processing techniques for voice conversion
- O. Turk and L. M. Arslan, "Robust processing techniques for voice conversion," Comput. Speech Lang., vol.20, no.4, pp. 441-467, 2006.
- (2006) Comput. Speech Lang. , vol.20 , Issue.4 , pp. 441-467
- Turk, O.¹ Arslan, L.M.²

7
- 85010815133
- Voice transformation using PSOLA technique
- H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Commun., vol.1, pp. 145-148, 1992.
- (1992) Speech Commun , vol.1 , pp. 145-148
- Valbret, H.¹ Moulines, E.² Tubach, J.P.³

8
- 84948175540
- VTLN-based voice conversion
- D. Sündermann and H. Ney, "VTLN-based voice conversion," in Proc. IEEE Symp. Signal Process. Inf. Technol., 2003, pp. 556-559.
- (2003) Proc. IEEE Symp. Signal Process. Inf. Technol. , pp. 556-559
- Sündermann, D.¹ Ney, H.²

9
- 4544361661
- Voice conversion through transformation of spectral and intonation features
- D. Rentzos, S. Vaseghi, Q. Yan, and C. H. Ho, "Voice conversion through transformation of spectral and intonation features," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2004, vol.1, pp. 21-24.
- (2004) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 21-24
- Rentzos, D.¹ Vaseghi, S.² Yan, Q.³ Ho, C.H.⁴

10
- 34547507542
- Frequency warping based on mapping formant parameters
- Z. W. Shuang, R. Bakis, S. Shechtman, D. Chazan, and Y. Qin, "Frequency warping based on mapping formant parameters," in Proc. Int. Conf. Spoken Lang. Process., 2006.
- (2006) Proc. Int. Conf. Spoken Lang. Process.
- Shuang, Z.W.¹ Bakis, R.² Shechtman, S.³ Chazan, D.⁴ Qin, Y.⁵

11
- 85064715894
- Speech spectrum transformation by speaker interpolation
- N. Iwahashi and Y. Sagisaka, "Speech spectrum transformation by speaker interpolation," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1994, vol.1, pp. 461-464.
- (1994) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 461-464
- Iwahashi, N.¹ Sagisaka, Y.²

12
- 0029254176
- Transformation of formants for voice conversion using artificial neural networks
- M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech Commun., vol.16, no.2, pp. 207-216, 1995.
- (1995) Speech Commun , vol.16 , Issue.2 , pp. 207-216
- Narendranath, M.¹ Murthy, H.A.² Rajendran, S.³ Yegnanarayana, B.⁴

13
- 0003447548
- Ph.D. dissertation, École Nationale Superieure Des Télécommunications, Paris, France
- Y. Stylianou, "Harmonic plus noise models for speech, combined with statistical methods, for speech and speaker modification," Ph.D. dissertation, École Nationale Superieure Des Télé communications, Paris, France, 1996.
- (1996) Harmonic Plus Noise Models for Speech, Combined with Statistical Methods, for Speech and Speaker Modification
- Stylianou, Y.¹

14
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1998, vol.6, pp. 131-142.
- (1998) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.6 , pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

15
- 4444285698
- Ph.D. dissertation, OGI School of Sci. and Eng., Beaverton, OR
- A. Kain, "High resolution voice transformation," Ph.D. dissertation, OGI School of Sci. and Eng., Beaverton, OR, 2001.
- (2001) High Resolution Voice Transformation
- Kain, A.¹

16
- 33846972308
- Residual prediction
- D. Sündermann, H. Höge, A. Bonafonte, and H. Duxans, "Residual prediction," in Proc. IEEE Symp. Signal Process. Inf. Technol., 2005, pp. 512-516.
- (2005) Proc. IEEE Symp. Signal Process. Inf. Technol. , pp. 512-516
- Sündermann, D.¹ Höge, H.² Bonafonte, A.³ Duxans, H.⁴

17
- 34047254509
- Quality-enhanced voice morphing using maximum likelihood transformations
- Jul.
- H. Ye and S. Young, "Quality-enhanced voice morphing using maximum likelihood transformations," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.4, pp. 1301-1312, Jul. 2006.
- (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.4 , pp. 1301-1312
- Ye, H.¹ Young, S.²

18
- 44949143155
- Maximum likelihood voice conversion based on GMM with straight mixed excitation
- Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with straight mixed excitation," in Proc. Interspeech, 2006.
- (2006) Proc. Interspeech
- Ohtani, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

19
- 84867216755
- The linear transformation of LF glottal waveforms for voice conversion
- A. del Pozo and S. Young, "The linear transformation of LF glottal waveforms for voice conversion," in Proc. Interspeech, 2008, pp. 1457-1460.
- (2008) Proc. Interspeech , pp. 1457-1460
- Del Pozo, A.¹ Young, S.²

20
- 0034842552
- Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum
- T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2001, pp. 841-844.
- (2001) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 841-844
- Toda, T.¹ Saruwatari, H.² Shikano, K.³

21
- 33646779506
- Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
- T. Toda, A. W. Black, and K. Tokuda, "Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2005, vol.1, pp. 9-12.
- (2005) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 9-12
- Toda, T.¹ Black, A.W.² Tokuda, K.³

22
- 57749193836
- Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
- Nov.
- T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.8, pp. 2222-2235, Nov. 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.W.² Tokuda, K.³

23
- 0029725605
- Speech synthesis usingHMMSwith dynamic features
- T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Speech synthesis usingHMMSwith dynamic features," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1996, pp. 389-392.
- (1996) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 389-392
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

24
- 0030696416
- Voice characteristics conversion for HMM-based speech synthesis system
- T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Voice characteristics conversion for HMM-based speech synthesis system," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1997, pp. 1611-1614.
- (1997) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 1611-1614
- Masuko, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

25
- 67650854725
- Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
- Jan.
- J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.1, pp. 66-83, Jan. 2009.
- (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.1 , pp. 66-83
- Yamagishi, J.¹ Kobayashi, T.² Nakano, Y.³ Ogata, K.⁴ Isogai, J.⁵

26
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis," Speech Commun., vol.51, no.11, pp. 1039-1064, 2009.
- (2009) Speech Commun. , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

27
- 77953717562
- Continuous stochastic feature mapping based on trajectory HMMs
- H. Zen, Y. Nankaku, and K. Tokuda, "Continuous stochastic feature mapping based on trajectory HMMs," in Proc. 2nd One Day Meeting on Unified Models for Speech Recognition and Synthesis, 2009.
- (2009) Proc. 2nd One Day Meeting on Unified Models for Speech Recognition and Synthesis
- Zen, H.¹ Nankaku, Y.² Tokuda, K.³

28
- 77953697940
- Ph.D. dissertation, Univ. Politècnica de Catalunya, Barcelona, Spain
- D. Erro, "Intra-lingual and cross-lingual voice conversion using harmonic plus stochastic models," Ph.D. dissertation, Univ. Politècnica de Catalunya, Barcelona, Spain, 2008.
- (2008) Intra-lingual and Cross-lingual Voice Conversion Using Harmonic Plus Stochastic Models
- Erro, D.¹

29
- 85068458327
- Weighted frequency warping for voice conversion
- D. Erro and A. Moreno, "Weighted frequency warping for voice conversion," in Proc. Interspeech, 2007, pp. 1965-1968.
- (2007) Proc. Interspeech , pp. 1965-1968
- Erro, D.¹ Moreno, A.²

30
- 51449124416
- Flexible harmonic/stochastic speech synthesis
- D. Erro, A. Moreno, and A. Bonafonte, "Flexible harmonic/stochastic speech synthesis," in Proc. 6th ISCA Workshop Speech Synth., 2007.
- (2007) Proc. 6th ISCA Workshop Speech Synth.
- Erro, D.¹ Moreno, A.² Bonafonte, A.³

31
- 56149126421
- Voice conversion of non-aligned data using unit selection
- H. Duxans, D. Erro, J. Pérez, F. Diego, A. Bonafonte, and A. Moreno, "Voice conversion of non-aligned data using unit selection," in Proc. TC-STAR Workshop Speech to Speech Transl., 2006.
- (2006) Proc. TC-STAR Workshop Speech to Speech Transl.
- Duxans, H.¹ Erro, D.² Pérez, J.³ Diego, F.⁴ Bonafonte, A.⁵ Moreno, A.⁶

32
- 0026106454
- Discrete all-pole modeling
- Feb.
- A. El-Jaroudi and J. Makhoul, "Discrete all-pole modeling," IEEE Trans. Signal Process., vol.39, no.2, pp. 411-423, Feb. 1991.
- (1991) IEEE Trans. Signal Process. , vol.39 , Issue.2 , pp. 411-423
- El-Jaroudi, A.¹ Makhoul, J.²

33
- 79961212205
- TC-STAR: Specifications of language resources and evaluation for speech synthesis
- A. Bonafonte, H. Höge, I. Kiss, A. Moreno, U. Ziegenhain, H. van Den Heuvel, H. U. Hain, X. S.Wang, and M. N. Garcia, "TC-STAR: Specifications of language resources and evaluation for speech synthesis," in Proc. Int. Conf. Lang. Resources Eval., 2006.
- (2006) Proc. Int. Conf. Lang. Resources Eval.
- Bonafonte, A.¹ Höge, H.² Kiss, I.³ Moreno, A.⁴ Ziegenhain, U.⁵ Van Den Heuvel, H.⁶ Hain, H.U.⁷ Wang, X.S.⁸ Garcia, M.N.⁹

34
- 77953708737
- The UPC TTS system description for the 2007 Blizzard Challenge
- A. Bonafonte, J. Adell, P. D. Agüero, D. Erro, I. Esquerra, A. Moreno, J. Pérez, and T. Polyakova, "The UPC TTS system description for the 2007 Blizzard Challenge," in Proc. 6th ISCA Workshop Speech Synth., 2007.
- (2007) Proc. 6th ISCA Workshop Speech Synth.
- Bonafonte, A.¹ Adell, J.² Agüero, P.D.³ Erro, D.⁴ Esquerra, I.⁵ Moreno, A.⁶ Pérez, J.⁷ Polyakova, T.⁸

35
- 77953705600
- [Online]. Available
- D. Mostefa, O. Hamon, N. Moreau, and K. Choukri, Evaluation report, deliverable D30 of the EU funded project TC-STAR 2007 [Online]. Available: http://www.tc-star.org
- (2007) Evaluation Report Deliverable D30 of the EU Funded Project TC-STAR
- Mostefa, D.¹ Hamon, O.² Moreau, N.³ Choukri, K.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.