SCOPUS Information Search Platform

HUT-ICCE 2008 - 2nd International Conference on Communications and Electronics

Volume , Issue , 2008, Pages 224-229

Phoneme-based spectral voice conversion using temporal decomposition and Gaussian mixture model

(2) Binh, Phu Nguyen ᵃ; Akagi, Masato ᵃ

ᵃ JAPAN ADVANCED INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

Author keywords

Gaussian mixture model (GMM); Spectral voice conversion; Temporal decomposition

Indexed keywords

CANTILEVER BEAMS; MAGNETOSTRICTIVE DEVICES; PHOTODEGRADATION; TARGETS;

GAUSSIAN MIXTURE MODEL (GMM); SPECTRAL VOICE CONVERSION; TEMPORAL DECOMPOSITION; VOICE CONVERSION;

SPEECH;

EID: 51549110156
PISSN: None
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: (2)

References (22)

1
- 0023739214
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," Proc. ICASSP, pp. 655-658, 1988.
- (1988) Proc. ICASSP, pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

2
- 85009266993
- Transformation of spectral envelope for voice conversion based on radial basis function networks
- T. Watanabe, T. Murakami, M. Namba, T. Hoya, and Y. Ishida, "Transformation of spectral envelope for voice conversion based on radial basis function networks," Proc. ICSLP, pp. 285-288, 2002.
- (2002) Proc. ICSLP, pp. 285-288
- Watanabe, T.¹ Murakami, T.² Namba, M.³ Hoya, T.⁴ Ishida, Y.⁵

3
- 85135141647
- Hidden Markov model based voice conversion using dynamic characteristics of speaker
- E. K. Kim, S. Lee, and Y. H. Oh, "Hidden Markov model based voice conversion using dynamic characteristics of speaker," Proc. Eurospeech, pp. 2519-2522, 1997.
- (1997) Proc. Eurospeech, pp. 2519-2522
- Kim, E.K.¹ Lee, S.² Oh, Y.H.³

4
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio, vol. 6, pp. 131-142, 1998.
- (1998) IEEE Trans. Speech Audio, vol. 6, pp. 131-142
- Stylianou, Y.¹ Cappé, O.² Moulines, E.³

5
- 0031623661
- Spectral voice conversion for text-to-speech synthesis
- A. Kain and M. W. Macon, "Spectral voice conversion for text-to-speech synthesis," Proc. ICASSP, pp. 285-288, 1998.
- (1998) Proc. ICASSP, pp. 285-288
- Kain, A.¹ Macon, M.W.²

6
- 0034842552
- Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
- T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum," Proc. ICASSP, pp. 841-844, 2001.
- (2001) Proc. ICASSP, pp. 841-844
- Toda, T.¹ Saruwatari, H.² Shikano, K.³

7
- 84905560807
- Voice conversion with smoothed GMM and MAP adaptation
- Y. Chen, M. Chu, E. Chang, J. Liu, and R. Liu, "Voice conversion with smoothed GMM and MAP adaptation," Proc. Eurospeech, pp. 2413-2416, 2003.
- (2003) Proc. Eurospeech, pp. 2413-2416
- Chen, Y.¹ Chu, M.² Chang, E.³ Liu, J.⁴ Liu, R.⁵

8
- 0141702280
- Using phone and diphone based acoustic models for voice conversion: A step towards creating voice fonts
- A. Kumar and A. Verma, "Using phone and diphone based acoustic models for voice conversion: A step towards creating voice fonts," Proc. ICASSP, pp. 720-723, 2003.
- (2003) Proc. ICASSP, pp. 720-723
- Kumar, A.¹ Verma, A.²

9
- 84994241109
- Including dynamic and phonetic information in voice conversion systems
- H. Duxans, A. Bonafonte, A. Kain, and J. van Santen, "Including dynamic and phonetic information in voice conversion systems," Proc. ICSLP, pp. 1193-1196, 2004.
- (2004) Proc. ICSLP, pp. 1193-1196
- Duxans, H.¹ Bonafonte, A.² Kain, A.³ van Santen, J.⁴

10
- 34047254509
- Quality-enhanced voice morphing using maximum likelihood transformations
- H. Ye and S. Young, "Quality-enhanced voice morphing using maximum likelihood transformations," IEEE Trans. on Audio, Speech and Lang. Proc., pp. 1301-1312, 2006.
- (2006) IEEE Trans. on Audio, Speech and Lang. Proc., pp. 1301-1312
- Ye, H.¹ Young, S.²

11
- 51549090536
- High quality voice conversion through combining modified GMM and formant mapping for Mandarin
- K. Liu, J. Zhang, and Y. Yan, "High quality voice conversion through combining modified GMM and formant mapping for Mandarin," Proc. ICDT, p. 10, 2007.
- (2007) Proc. ICDT, p. 10
- Liu, K.¹ Zhang, J.² Yan, Y.³

12
- 85068458327
- Weighted frequency warping for voice conversion
- D. Erro and A. Moreno, "Weighted frequency warping for voice conversion," Proc. Interspeech, pp. 1965-1968, 2007.
- (2007) Proc. Interspeech, pp. 1965-1968
- Erro, D.¹ Moreno, A.²

13
- 51549106452
- Control of spectral dynamics using temporal decomposition in voice conversion and concatenative speech synthesis
- B. P. Nguyen and M. Akagi, "Control of spectral dynamics using temporal decomposition in voice conversion and concatenative speech synthesis," Proc. NCSP, pp. 279-282, 2008.
- (2008) Proc. NCSP, pp. 279-282
- Nguyen, B.P.¹ Akagi, M.²

14
- 0028997012
- Spectral dynamics is more important than spectral distortion
- H. P. Knagenhjelm and W. B. Kleijn, "Spectral dynamics is more important than spectral distortion," Proc. ICASSP, pp. 732-735, 1995.
- (1995) Proc. ICASSP, pp. 732-735
- Knagenhjelm, H.P.¹ Kleijn, W.B.²

15
- 0020602364
- Efficient coding of LPC parameters by temporal decomposition
- B. S. Atal, "Efficient coding of LPC parameters by temporal decomposition," Proc. ICASSP, pp. 81-84, 1983.
- (1983) Proc. ICASSP, pp. 81-84
- Atal, B.S.¹

16
- 0038719980
- Modified restricted temporal decomposition and its application to low bit rate speech coding
- P. C. Nguyen, T. Ochi, and M. Akagi, "Modified restricted temporal decomposition and its application to low bit rate speech coding," IEICE Transactions on Information and Systems, vol. E86-D, pp. 397-405, 2003.
- (2003) IEICE Transactions on Information and Systems, vol. E86-D, pp. 397-405
- Nguyen, P.C.¹ Ochi, T.² Akagi, M.³

17
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Journal of Speech Communication, vol. 27, pp. 187-207, 1999.
- (1999) Journal of Speech Communication, vol. 27, pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigné, A.³

18
- 0002629270
- Maximum likelihood from incomplete data via the EM algorithm
- A. Dempster, N. Laird, and D. Rubin, "Maximum likelihood from incomplete data via the EM algorithm," Journal of the Royal Statistical Society Series B, vol. 39, pp. 1-38, 1977.
- (1977) Journal of the Royal Statistical Society Series B, vol. 39, pp. 1-38
- Dempster, A.¹ Laird, N.² Rubin, D.³

19
- 0141703296
- Temporal decomposition: A promising approach to VQ-based speaker identification
- P. C. Nguyen, M. Akagi, and T. B. Ho, "Temporal decomposition: A promising approach to VQ-based speaker identification," Proc. ICASSP, pp. 184-187, 2003.
- (2003) Proc. ICASSP, pp. 184-187
- Nguyen, P.C.¹ Akagi, M.² Ho, T.B.³

20
- 51549087731
- A study on voice conversion method for synthesizing stimuli to perform gender perception experiments of speech
- T. Shibata and M. Akagi, "A study on voice conversion method for synthesizing stimuli to perform gender perception experiments of speech," Proc. NCSP, pp. 180-183, 2008.
- (2008) Proc. NCSP, pp. 180-183
- Shibata, T.¹ Akagi, M.²

21
- 33646815712
- The MOCHA-TIMIT articulatory database
- A. Wrench, "The MOCHA-TIMIT articulatory database," Queen Margaret University College, http://www.cstr.ed.ac.uk/artic/mocha.html, 1999.
- (1999) Queen Margaret University College, http://www.cstr.ed.ac.uk/artic/mocha.html
- Wrench, A.¹

22
- 51549089733
- Voice conversion Matlab toolbox
- D. Suendermann, "Voice conversion Matlab toolbox," Technical Report, Siemens Corporate Technology, Munich, Germany, 2007.
- (2007) Technical Report, Siemens Corporate Technology, Munich, Germany
- Suendermann, D.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.