메뉴 건너뛰기




Volumn , Issue , 2010, Pages 2406-2409

Sub-band basis spectrum model for pitch-synchronous log-spectrum and phase based on approximation of sparse coding

Author keywords

Sparse coding; Spectrum parameter; Speech synthesis; Sub band basis spectrum model; Voice adaptation

Indexed keywords

SPEECH COMMUNICATION; SPEECH SYNTHESIS; CODES (SYMBOLS); CONTINUOUS SPEECH RECOGNITION; SPECTRUM ANALYSIS; SPEECH CODING;

EID: 79959839864     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (3)

References (11)
  • 1
    • 29144484191 scopus 로고    scopus 로고
    • Concatenative speech synthesis based on the plural unit selection and fusion method
    • DOI 10.1093/ietisy/e88-d.11.2565
    • T. Mizutani and T. Kagoshima, "Concatenative speech synthesis based on the plural unit selection and fusion method," IEICE Trans. E88-D, 11, pp.2565-2572, 2005. (Pubitemid 41816802)
    • (2005) IEICE Transactions on Information and Systems , vol.E88-D , Issue.11 , pp. 2565-2572
    • Mizutani, T.1    Kagoshima, T.2
  • 3
    • 85016140477 scopus 로고
    • An adaptive algorithm for mel-cepstral analysis of speech
    • T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech," Proc. ICASSP, pp. I-137 - I-140, 1992.
    • (1992) Proc. ICASSP
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 4
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Kasuse and A. Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, 27, pp.187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Kasuse, I.2    Cheveigne, A.3
  • 6
    • 70450176940 scopus 로고    scopus 로고
    • Speech synthesis based on the plural unit selection and fusion method using FWF model
    • R. Morinaka, M. Tamura, M. Morita, T. Kagoshima, "Speech synthesis based on the plural unit selection and fusion method using FWF model," Proc. INTERSPEECH, pp. 2083-2086, 2009.
    • (2009) Proc. INTERSPEECH , pp. 2083-2086
    • Morinaka, R.1    Tamura, M.2    Morita, M.3    Kagoshima, T.4
  • 7
    • 44949084615 scopus 로고    scopus 로고
    • A weight estimation method using LDA for multi-band speech recognition
    • K. Iwano, K. Kojima, and S. Furui, "A weight estimation method using LDA for multi-band speech recognition," Proc. INTERSPEECH, pp. 2534 - 2537, 2006.
    • (2006) Proc. INTERSPEECH , pp. 2534-2537
    • Iwano, K.1    Kojima, K.2    Furui, S.3
  • 8
    • 0029938380 scopus 로고    scopus 로고
    • Emergence of simple-cell receptive field properties by learning a sparse code for natural images
    • B. A. Olshausen and D. J. Field, "Emergence of simple-cell receptive field properties by learning a sparse code for natural images," Nature, vol 381, 1996.
    • (1996) Nature , vol.381
    • Olshausen, B.A.1    Field, D.J.2
  • 11
    • 0035472456 scopus 로고    scopus 로고
    • Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech
    • DOI 10.1109/89.952489, PII S1063667601082335
    • P. Jackson and C. H. Shadle, "Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech," IEEE Trans. Speech and Audio Processing, vol.9 pp.713-726, 2001. (Pubitemid 32992835)
    • (2001) IEEE Transactions on Speech and Audio Processing , vol.9 , Issue.7 , pp. 713-726
    • Jackson, P.J.B.1    Shadle, C.H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.