메뉴 건너뛰기




Volumn , Issue , 2008, Pages 224-229

Phoneme-based spectral voice conversion using temporal decomposition and Gaussian mixture model

Author keywords

Gaussian mixture model (GMM); Spectral voice conversion; Temporal decomposition

Indexed keywords

CANTILEVER BEAMS; MAGNETOSTRICTIVE DEVICES; PHOTODEGRADATION; TARGETS;

EID: 51549110156     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (2)

References (22)
  • 2
    • 85009266993 scopus 로고    scopus 로고
    • Transformation of spectral envelope for voice conversion based on radial basis function networks
    • T. Watanabe, T. Murakami, M. Namba, T. Hoya, and Y. Ishida, "Transformation of spectral envelope for voice conversion based on radial basis function networks," Proc. ICSLP, pp. 285-288, 2002.
    • (2002) Proc. ICSLP , pp. 285-288
    • Watanabe, T.1    Murakami, T.2    Namba, M.3    Hoya, T.4    Ishida, Y.5
  • 3
    • 85135141647 scopus 로고    scopus 로고
    • Hidden Markov model based voice conversion using dynamic characteristics of speaker
    • E. K. Kim, S. Lee, and Y. H. Oh, "Hidden Markov model based voice conversion using dynamic characteristics of speaker," Proc. Eurospeech, pp. 2519-2522, 1997.
    • (1997) Proc. Eurospeech , pp. 2519-2522
    • Kim, E.K.1    Lee, S.2    Oh, Y.H.3
  • 4
  • 5
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-to-speech synthesis
    • A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," Proc. ICASSP, pp. 285-288, 1998.
    • (1998) Proc. ICASSP , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 6
    • 0034842552 scopus 로고    scopus 로고
    • Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
    • T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum," Proc. ICASSP, pp. 841-844, 2001.
    • (2001) Proc. ICASSP , pp. 841-844
    • Toda, T.1    Saruwatari, H.2    Shikano, K.3
  • 7
    • 84905560807 scopus 로고    scopus 로고
    • Voice conversion with smoothed GMM and MAP adaptation
    • Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothed GMM and MAP adaptation," Proc. Eurospeech, pp. 2413-2416, 2003.
    • (2003) Proc. Eurospeech , pp. 2413-2416
    • Chen, Y.1    Chu, M.2    Chang, E.3    Liu, J.4    Liu, R.5
  • 8
    • 0141702280 scopus 로고    scopus 로고
    • Using phone and diphone based acoustic models for voice conversion: A step towards creating voice fonts
    • A. Kumar and A. Verma, "Using phone and diphone based acoustic models for voice conversion: A step towards creating voice fonts," Proc. ICASSP, pp. 720-723, 2003.
    • (2003) Proc. ICASSP , pp. 720-723
    • Kumar, A.1    Verma, A.2
  • 9
    • 84994241109 scopus 로고    scopus 로고
    • Including dynamic and phonetic information in voice conversion systems
    • H. Duxans, A. Bonafonte, A. Kain, and J. van Santen, "Including dynamic and phonetic information in voice conversion systems," Proc. ICSLP, pp. 1193-1196, 2004.
    • (2004) Proc. ICSLP , pp. 1193-1196
    • Duxans, H.1    Bonafonte, A.2    Kain, A.3    van Santen, J.4
  • 10
    • 34047254509 scopus 로고    scopus 로고
    • Quality-enhanced voice morphing using maximum likelihood transformations
    • H. Ye and S. Young, "Quality-enhanced voice morphing using maximum likelihood transformations," IEEE Trans. on Audio, Speech and lang. Proc., pp. 1301-1312, 2006.
    • (2006) IEEE Trans. on Audio, Speech and lang. Proc , pp. 1301-1312
    • Ye, H.1    Young, S.2
  • 11
    • 51549090536 scopus 로고    scopus 로고
    • High quality voice conversion through combining modified GMM and formant mapping for Mandarin
    • K. Liu, J. Zhang, and Y. Yan, "High quality voice conversion through combining modified GMM and formant mapping for Mandarin," Proc. ICDT, p. 10, 2007.
    • (2007) Proc. ICDT , pp. 10
    • Liu, K.1    Zhang, J.2    Yan, Y.3
  • 12
    • 85068458327 scopus 로고    scopus 로고
    • Weighted frequency warping for voice conversion
    • D. Erro and A. Moreno, "Weighted frequency warping for voice conversion," Proc. Interspeech, pp. 1965-1968, 2007.
    • (2007) Proc. Interspeech , pp. 1965-1968
    • Erro, D.1    Moreno, A.2
  • 13
    • 51549106452 scopus 로고    scopus 로고
    • Control of spectral dynamics using temporal decomposition in voice conversion and concatenative speech synthesis
    • B. P. Nguyen and M. Akagi, "Control of spectral dynamics using temporal decomposition in voice conversion and concatenative speech synthesis," Proc. NCSP, pp. 279-282, 2008.
    • (2008) Proc. NCSP , pp. 279-282
    • Nguyen, B.P.1    Akagi, M.2
  • 14
    • 0028997012 scopus 로고
    • Spectral dynamics is more important than spectral distortion
    • H. P. Knagenhjelm and W. B. Kleijn, "Spectral dynamics is more important than spectral distortion," Proc. ICASSP, pp. 732-735, 1995.
    • (1995) Proc. ICASSP , pp. 732-735
    • Knagenhjelm, H.P.1    Kleijn, W.B.2
  • 15
    • 0020602364 scopus 로고
    • Efficient coding of LPC parameters by temporal decomposition
    • B. S. Atal, "Efficient coding of LPC parameters by temporal decomposition," Proc. ICASSP, pp. 81-84, 1983.
    • (1983) Proc. ICASSP , pp. 81-84
    • Atal, B.S.1
  • 16
    • 0038719980 scopus 로고    scopus 로고
    • Modified restricted temporal decomposition and its application to low bit rate speech coding
    • P. C. Nguyen, T. Ochi, and M. Akagi, "Modified restricted temporal decomposition and its application to low bit rate speech coding," IEICE Transactions on Information and Systems, vol. E86-D, pp. 397-405, 2003.
    • (2003) IEICE Transactions on Information and Systems , vol.E86-D , pp. 397-405
    • Nguyen, P.C.1    Ochi, T.2    Akagi, M.3
  • 17
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Journal of Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Journal of Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigné, A.3
  • 19
    • 0141703296 scopus 로고    scopus 로고
    • Temporal decomposition: A promising approach to VQ-based speaker identification
    • P. C. Nguyen, M. Akagi, and T. B. Ho, "Temporal decomposition: A promising approach to VQ-based speaker identification," Proc. ICASSP, pp. 184-187, 2003.
    • (2003) Proc. ICASSP , pp. 184-187
    • Nguyen, P.C.1    Akagi, M.2    Ho, T.B.3
  • 20
    • 51549087731 scopus 로고    scopus 로고
    • A study on voice conversion method for synthesizing stimuli to perform gender perception experiments of speech
    • T. Shibata and M. Akagi, "A study on voice conversion method for synthesizing stimuli to perform gender perception experiments of speech," Proc. NCSP, pp. 180-183, 2008.
    • (2008) Proc. NCSP , pp. 180-183
    • Shibata, T.1    Akagi, M.2
  • 22
    • 51549089733 scopus 로고    scopus 로고
    • Voice conversion Matlab toolbox,
    • Technical Report, Siemens Corporate Technology, Munich, Germany
    • D. Suendermann, "Voice conversion Matlab toolbox," Technical Report, Siemens Corporate Technology, Munich, Germany, 2007.
    • (2007)
    • Suendermann, D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.