2013, Pages 428-433

Speech analysis/synthesis by Gaussian mixture approximation of the speech spectrum for voice conversion

Author keywords

Analysis Synthesis; Feature Extraction; GMM; STRAIGHT; Voice Conversion

Indexed keywords

FEATURE EXTRACTION; INFORMATION TECHNOLOGY; SIGNAL PROCESSING; SPEECH ANALYSIS;

EID: 84899122951     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ISSPIT.2013.6781919     Document Type: Conference Paper
Times cited: 3

References (11)
  • 2
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, J. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds", in Speech Communication, pp. 187-207, Elsevier, 1999.
    • (1999) Speech Communication , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, J.2    De Cheveigné, A.3
  • 3
    • 0034842552
    • Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum
    • T. Toda, H. Saruwatari, and K. Shikano, "Voice conversion algorithm based on Gaussian mixture model with dynamic frequency warping of STRAIGHT spectrum", in Proc. ICASSP, 2001, pp. 841-844.
    • (2001) Proc. ICASSP , pp. 841-844
    • Toda, T.1    Saruwatari, H.2    Shikano, K.3
  • 4
    • 0031623661
    • Spectral voice conversion for text-to-speech synthesis
    • A. Kain and M. W. Macon, "Spectral voice conversion for Text-to-Speech Synthesis", in Proc. ICASSP, 1998, pp. 285-288.
    • (1998) Proc. ICASSP , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 5
    • 4544260276
    • Bayesian modelling of the speech spectrum using mixture of Gaussians
    • P. Zolfaghari, S. Watanabe, A. Nakamura, and S. Katagiri, "Bayesian modelling of the speech spectrum using mixture of Gaussians", in Proc. ICASSP, 2004, pp. 553-556.
    • (2004) Proc. ICASSP , pp. 553-556
    • Zolfaghari, P.1    Watanabe, S.2    Nakamura, A.3    Katagiri, S.4
  • 6
    • 67649297853
    • Spectral modification for voice gender conversion using temporal decomposition
    • B. Nguyen and M. Akagi, "Spectral modification for voice gender conversion using temporal decomposition", Journal of Signal Processing, vol. 11, pp. 333-336, 2007.
    • (2007) Journal of Signal Processing , vol.11 , pp. 333-336
    • Nguyen, B.1    Akagi, M.2
  • 7
    • 78650273608
    • Speech spectral envelope estimation through explicit control of peak evolution in time
    • E. Godoy, O. Rosec, and T. Chonavel, "Speech spectral envelope estimation through explicit control of peak evolution in time", in Proc. ISSPA, 2010, pp. 209-212.
    • (2010) Proc. ISSPA , pp. 209-212
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 8
    • 84857498745
    • Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
    • E. Godoy, O. Rosec, and T. Chonavel, "Voice Conversion Using Dynamic Frequency Warping With Amplitude Scaling, for Parallel or Nonparallel Corpora", IEEE Trans. Audio, Speech, and Language Proc, vol. 20, pp. 1313-1323, 2012.
    • (2012) IEEE Trans. Audio, Speech, and Language Proc , vol.20 , pp. 1313-1323
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 9
    • 0033677157
    • Speech reconstruction from Mel frequency cepstral coefficients and pitch frequency
    • D. Chazan, R. Hoory, G. Cohen, and M. Zibulski, "Speech reconstruction from Mel frequency cepstral coefficients and pitch frequency", in Proc. ICASSP, 2000, pp. 1299-1302.
    • (2000) Proc. ICASSP , pp. 1299-1302
    • Chazan, D.1    Hoory, R.2    Cohen, G.3    Zibulski, M.4


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.