메뉴 건너뛰기




Volumn 17, Issue 6, 2009, Pages 1231-1239

A Cross-Language State Sharing and Mapping Approach to Bilingual (Mandarin-English) TTS

Author keywords

Bilingual hidden Markov model (HMM) based speech synthesis; Kullback Leibler divergence (KLD); new language synthesis

Indexed keywords


EID: 85008020260     PISSN: 15587916     EISSN: 15587924     Source Type: Journal    
DOI: 10.1109/TASL.2009.2015708     Document Type: Article
Times cited : (67)

References (18)
  • 1
    • 84856249636 scopus 로고    scopus 로고
    • From multilingual to polyglot speech synthesis
    • C. Traber et al., “From multilingual to polyglot speech synthesis,” in Proc. Eurospeech, 1999, pp. 835–838.
    • (1999) Proc. Eurospeech , pp. 835-838
    • Traber, C.1
  • 4
    • 33646769932 scopus 로고    scopus 로고
    • Polyglot synthesis using a mixture of monolingual corpora
    • J. Latorre, K. Iwano, and S. Furui, “Polyglot synthesis using a mixture of monolingual corpora,” in Proc. ICASSP, 2005, vol. 1, pp. 1–4.
    • (2005) Proc. ICASSP , vol.1 , pp. 1-4
    • Latorre, J.1    Iwano, K.2    Furui, S.3
  • 5
    • 0141480034 scopus 로고    scopus 로고
    • Microsoft Mulan—A bilingual TTS system
    • M. Chu, H. Peng, Y. Zhao, Z. Y. Niu, and E. Chang, “Microsoft Mulan—A bilingual TTS system,” in Proc. ICASSP, 2003, vol. 1, pp. 264–267.
    • (2003) Proc. ICASSP , vol.1 , pp. 264-267
    • Chu, M.1    Peng, H.2    Zhao, Y.3    Niu, Z.Y.4    Chang, E.5
  • 6
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • K. Tokuda, T. Kobayashi, T. Masuko, T. Kobayashi, and T. Kitamura, “Speech parameter generation algorithms for HMM-based speech synthesis,” in Proc. ICASSP, 2000, vol. 3, pp. 1315–1318.
    • (2000) Proc. ICASSP , vol.3 , pp. 1315-1318
    • Tokuda, K.1    Kobayashi, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 7
    • 0141479954 scopus 로고    scopus 로고
    • Optimal clustering of multivariate normal distributions using divergence and its application to HMM adaptation
    • T. A. Myrvoll and F. K. Soong, “Optimal clustering of multivariate normal distributions using divergence and its application to HMM adaptation,” in Proc. ICASSP, 2003, vol. 1, pp. 552–555.
    • (2003) Proc. ICASSP , vol.1 , pp. 552-555
    • Myrvoll, T.A.1    Soong, F.K.2
  • 10
    • 51449111086 scopus 로고    scopus 로고
    • A cross-language state mapping approach to bilingual (Mandarin-English) TTS
    • H. Liang, Y. Qian, F. K. Soong, and G. Liu, “A cross-language state mapping approach to bilingual (Mandarin-English) TTS,” in Proc. ICASSP, 2008, pp. 4641–4644.
    • (2008) Proc. ICASSP , pp. 4641-4644
    • Liang, H.1    Qian, Y.2    Soong, F.K.3    Liu, G.4
  • 11
    • 4544354696 scopus 로고    scopus 로고
    • Seg-mental tonal modeling for phone set design in Mandarin LVCSR
    • C. Huang, Y. Shi, J.-L. Zhou, M. Chu, T. Wang, and E. Chang, “Seg-mental tonal modeling for phone set design in Mandarin LVCSR,” in Proc. ICASSP, 2004, vol. 1, pp. 901–904.
    • (2004) Proc. ICASSP , vol.1 , pp. 901-904
    • Huang, C.1    Shi, Y.2    Zhou, J.-L.3    Chu, M.4    Wang, T.5    Chang, E.6
  • 12
    • 0021157408 scopus 로고
    • Line Spectrum Pair (LSP) and speech data compression
    • F. K. Soong and B.-H. Juang, “Line Spectrum Pair (LSP) and speech data compression,” in Proc. ICASSP, 1984, pp. 1.10.1–1.10.4.
    • (1984) Proc. ICASSP
    • Soong, F.K.1    Juang, B.-H.2
  • 14
    • 85009129569 scopus 로고    scopus 로고
    • Evaluation of cross-language voice conversion based on GMM and STRAIGHT
    • M. Mashimo, T. Toda, K. Shikano, and N. Campbell, “Evaluation of cross-language voice conversion based on GMM and STRAIGHT,” in Proc. Eurospeech, 2001, pp. 361–364.
    • (2001) Proc. Eurospeech , pp. 361-364
    • Mashimo, M.1    Toda, T.2    Shikano, K.3    Campbell, N.4
  • 16
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using pitch-adaptive time-frequency smoothing and instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne “Restructuring speech representations using pitch-adaptive time-frequency smoothing and instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds,” Speech Commun., vol. 27, pp. 187–207, 1999.
    • (1999) Speech Commun. , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigne, A.3
  • 17
    • 0033906251 scopus 로고    scopus 로고
    • MDL-based context-dependent subword modeling for speech recognition
    • K. Shinoda and T. Watanable, “MDL-based context-dependent subword modeling for speech recognition,” J. Acoust. Soc. Jpn.(E), vol. 21, no. 2, pp. 79–86, 2000.
    • (2000) J. Acoust. Soc. Jpn.(E) , vol.21 , Issue.2 , pp. 79-86
    • Shinoda, K.1    Watanable, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.