메뉴 건너뛰기




Volumn , Issue , 2010, Pages 853-856

Conversational spontaneous speech synthesis using average voice model

Author keywords

Average voice model; Conversational speech; HMM based speech synthesis; Speaker adaptation; Spontaneous speech; Style adaptation

Indexed keywords

HIDDEN MARKOV MODELS; SPEECH SYNTHESIS; SPEECH COMMUNICATION;

EID: 79959835828     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (15)
  • 1
    • 3042741062 scopus 로고    scopus 로고
    • Toward spontaneous speech synthesis-utilizing language model information in TTS
    • S. Werner, M. Eichner, M. Wolff, and R. Hoffmann, "Toward spontaneous speech synthesis-utilizing language model information in TTS," IEEE Trans. Speech Audio Processing, vol. 12, no. 4, pp. 436-445, 2004.
    • (2004) IEEE Trans. Speech Audio Processing , vol.12 , Issue.4 , pp. 436-445
    • Werner, S.1    Eichner, M.2    Wolff, M.3    Hoffmann, R.4
  • 2
    • 79959842873 scopus 로고    scopus 로고
    • Toward hidden Markov model-based spontaneous speech synthesis
    • T. Akagawa, K. Iwano, and S. Furui, "Toward hidden Markov model-based spontaneous speech synthesis," J. Acoust. Soc. America, vol. 120, pp. 3037-3038, 2006.
    • (2006) J. Acoust. Soc. America , vol.120 , pp. 3037-3038
    • Akagawa, T.1    Iwano, K.2    Furui, S.3
  • 4
    • 79959855113 scopus 로고    scopus 로고
    • A study on the statistical models for HMM-based spontaneous speech synthesis
    • T. Akagawa, K. Iwano, and S. Furui, "A study on the Statistical models for HMM-based spontaneous speech synthesis," IEICE technical report (in Japanese), vol. 107, no. 77, pp. 13-18, 2007.
    • (2007) IEICE Technical Report (in Japanese) , vol.107 , Issue.77 , pp. 13-18
    • Akagawa, T.1    Iwano, K.2    Furui, S.3
  • 5
    • 79959817255 scopus 로고    scopus 로고
    • Pronunciation variation generation for spontaneous speech synthesis using state-based voice transformation
    • C. Lee, C. Wu, and J. Guo, "Pronunciation variation generation for spontaneous speech synthesis using state-based voice transformation," INTERSPEECH, 2010.
    • (2010) INTERSPEECH
    • Lee, C.1    Wu, C.2    Guo, J.3
  • 6
    • 24144437793 scopus 로고    scopus 로고
    • Developments in corpus-based speech synthesis: Approaching natural conversational speech
    • DOI 10.1093/ietisy/e88-d.3.376
    • N. Campbell, "Developments in corpus-based speech synthesis: approaching natural conversational speech," IEICE Trans. Inf. & Syst., vol. 88, no. 3, pp. 376-383, 2005. (Pubitemid 41228045)
    • (2005) IEICE Transactions on Information and Systems , vol.E88-D , Issue.3 , pp. 376-383
    • Campbell, N.1
  • 7
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • Sept.
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis," in Proc. EUROSPEECH, Sept. 1999, pp. 2347-2350.
    • (1999) Proc. EUROSPEECH , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 8
    • 67650854725 scopus 로고    scopus 로고
    • Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm
    • Jan.
    • J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm," IEEE Trans. Audio, Speech, and Language Process., vol. 17, no. 1, pp. 66-83, Jan. 2009.
    • (2009) IEEE Trans. Audio, Speech, and Language Process. , vol.17 , Issue.1 , pp. 66-83
    • Yamagishi, J.1    Kobayashi, T.2    Nakano, Y.3    Ogata, K.4    Isogai, J.5
  • 9
    • 51449098017 scopus 로고    scopus 로고
    • Speaker and style adaptation using average voice model for style control in hmm-based speech synthesis
    • M. Tachibana, S. Izawa, T. Nose, and T. Kobayashi, "Speaker and style adaptation using average voice model for style control in hmm-based speech synthesis," in ICASSP, 2008.
    • (2008) ICASSP
    • Tachibana, M.1    Izawa, S.2    Nose, T.3    Kobayashi, T.4
  • 12
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • Apr.
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
    • (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigne, A.3
  • 15
    • 0030362995 scopus 로고    scopus 로고
    • A compact model for speaker-adaptive training
    • T. Aanastasakos, "A compact model for speaker-adaptive training," ICSLP, vol. 2, 1996.
    • (1996) ICSLP , vol.2
    • Aanastasakos, T.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.