메뉴 건너뛰기




Volumn 18, Issue 5, 2010, Pages 922-931

Voice conversion based on weighted frequency warping

Author keywords

Gaussian mixture models (GMMs); Harmonic plus stochastic model (HSM); Speech synthesis; Voice conversion; Weighted frequency warping

Indexed keywords

FREQUENCY WARPING; GAUSSIAN MIXTURE MODELS; GAUSSIAN MIXTURE MODELS (GMMS); HARMONIC PLUS STOCHASTIC MODEL (HSM); VOICE CONVERSION;

EID: 77953727123     PISSN: 15587916     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASL.2009.2038663     Document Type: Article
Times cited : (162)

References (35)
  • 1
    • 4444251929 scopus 로고
    • Voice conversion: State of the art and perspectives
    • E. Moulines and Y. Sagisaka Eds. Elsevier
    • E. Moulines and Y. Sagisaka, Eds., "Voice conversion: State of the art and perspectives," Special Iss. Speech Commun., vol.16(2), 1995, Elsevier.
    • (1995) Special Iss. Speech Commun. , vol.16 , Issue.2
  • 3
    • 0026394044 scopus 로고
    • Speaker adaptation and voice conversion by codebook mapping
    • K. Shikano, S. Nakamura, and M. Abe, "Speaker adaptation and voice conversion by codebook mapping," in Proc. IEEE Int. Symp. Circuits Syst., 1991, vol.1, pp. 594-597.
    • (1991) Proc. IEEE Int. Symp. Circuits Syst. , vol.1 , pp. 594-597
    • Shikano, K.1    Nakamura, S.2    Abe, M.3
  • 4
    • 33646900967 scopus 로고
    • Voice conversion based on piecewise linear conversion rules of formant frequency and spectrum tilt
    • H. Mizuno and M. Abe, "Voice conversion based on piecewise linear conversion rules of formant frequency and spectrum tilt," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 1994, vol.1, pp. 469-472.
    • (1994) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 469-472
    • Mizuno, H.1    Abe, M.2
  • 5
    • 0033154052 scopus 로고    scopus 로고
    • Speaker transformation algorithm using segmental codebooks (STASC)
    • L. M. Arslan, "Speaker transformation algorithm using segmental codebooks (STASC)," Speech Commun., no.28, 1999.
    • (1999) Speech Commun. , Issue.28
    • Arslan, L.M.1
  • 6
    • 33746653351 scopus 로고    scopus 로고
    • Robust processing techniques for voice conversion
    • O. Turk and L. M. Arslan, "Robust processing techniques for voice conversion," Comput. Speech Lang., vol.20, no.4, pp. 441-467, 2006.
    • (2006) Comput. Speech Lang. , vol.20 , Issue.4 , pp. 441-467
    • Turk, O.1    Arslan, L.M.2
  • 7
    • 85010815133 scopus 로고
    • Voice transformation using PSOLA technique
    • H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Commun., vol.1, pp. 145-148, 1992.
    • (1992) Speech Commun , vol.1 , pp. 145-148
    • Valbret, H.1    Moulines, E.2    Tubach, J.P.3
  • 12
    • 0029254176 scopus 로고
    • Transformation of formants for voice conversion using artificial neural networks
    • M. Narendranath, H. A. Murthy, S. Rajendran, and B. Yegnanarayana, "Transformation of formants for voice conversion using artificial neural networks," Speech Commun., vol.16, no.2, pp. 207-216, 1995.
    • (1995) Speech Commun , vol.16 , Issue.2 , pp. 207-216
    • Narendranath, M.1    Murthy, H.A.2    Rajendran, S.3    Yegnanarayana, B.4
  • 15
    • 4444285698 scopus 로고    scopus 로고
    • Ph.D. dissertation, OGI School of Sci. and Eng., Beaverton, OR
    • A. Kain, "High resolution voice transformation," Ph.D. dissertation, OGI School of Sci. and Eng., Beaverton, OR, 2001.
    • (2001) High Resolution Voice Transformation
    • Kain, A.1
  • 17
    • 34047254509 scopus 로고    scopus 로고
    • Quality-enhanced voice morphing using maximum likelihood transformations
    • Jul.
    • H. Ye and S. Young, "Quality-enhanced voice morphing using maximum likelihood transformations," IEEE Trans. Audio, Speech, Lang. Process., vol.14, no.4, pp. 1301-1312, Jul. 2006.
    • (2006) IEEE Trans. Audio, Speech, Lang. Process. , vol.14 , Issue.4 , pp. 1301-1312
    • Ye, H.1    Young, S.2
  • 18
    • 44949143155 scopus 로고    scopus 로고
    • Maximum likelihood voice conversion based on GMM with straight mixed excitation
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with straight mixed excitation," in Proc. Interspeech, 2006.
    • (2006) Proc. Interspeech
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 19
    • 84867216755 scopus 로고    scopus 로고
    • The linear transformation of LF glottal waveforms for voice conversion
    • A. del Pozo and S. Young, "The linear transformation of LF glottal waveforms for voice conversion," in Proc. Interspeech, 2008, pp. 1457-1460.
    • (2008) Proc. Interspeech , pp. 1457-1460
    • Del Pozo, A.1    Young, S.2
  • 20
    • 0034842552 scopus 로고    scopus 로고
    • Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum
    • T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on gaussian mixture model with dynamic frequency warping of straight spectrum," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2001, pp. 841-844.
    • (2001) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , pp. 841-844
    • Toda, T.1    Saruwatari, H.2    Shikano, K.3
  • 21
    • 33646779506 scopus 로고    scopus 로고
    • Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
    • T. Toda, A. W. Black, and K. Tokuda, "Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2005, vol.1, pp. 9-12.
    • (2005) Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. , vol.1 , pp. 9-12
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 22
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • Nov.
    • T. Toda, A. W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.8, pp. 2222-2235, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 25
    • 67650854725 scopus 로고    scopus 로고
    • Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
    • Jan.
    • J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm," IEEE Trans. Audio, Speech, Lang. Process., vol.17, no.1, pp. 66-83, Jan. 2009.
    • (2009) IEEE Trans. Audio, Speech, Lang. Process. , vol.17 , Issue.1 , pp. 66-83
    • Yamagishi, J.1    Kobayashi, T.2    Nakano, Y.3    Ogata, K.4    Isogai, J.5
  • 26
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis," Speech Commun., vol.51, no.11, pp. 1039-1064, 2009.
    • (2009) Speech Commun. , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 29
    • 85068458327 scopus 로고    scopus 로고
    • Weighted frequency warping for voice conversion
    • D. Erro and A. Moreno, "Weighted frequency warping for voice conversion," in Proc. Interspeech, 2007, pp. 1965-1968.
    • (2007) Proc. Interspeech , pp. 1965-1968
    • Erro, D.1    Moreno, A.2
  • 32


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.