2011, Pages 1825-1828

Phonological knowledge guided HMM state mapping for cross-lingual speaker adaptation

Author keywords

Cross lingual speaker adaptation; HMM based TTS; Minimum generation error; Phonological knowledge

Indexed keywords

AVERAGE-VOICE; CROSS-LINGUAL; HMM-BASED TTS; KULLBACK LEIBLER DIVERGENCE; MAPPING CONSTRUCTION; MAPPING RULES; OBJECTIVE EVALUATION; PHONOLOGICAL KNOWLEDGE; SPEAKER ADAPTATION; STATE DISTRIBUTIONS; SUBJECTIVE LISTENING TEST

EID: 84865786646; PISSN: None; EISSN: 1990-9772; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper
Times cited: 6

References (12)
  • 1
    • J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training", IEICE Trans. on Information and Systems, vol. E90-D, no. 2, pp. 533-543, Feb. 2007.
    • Scopus EID: 33847129573
  • 2
    • Y. Qian, H. Liang, and F. K. Soong, "A cross-language state sharing and mapping approach to bilingual (Mandarin-English) TTS", IEEE Trans. on Audio, Speech and Language Processing, vol. 17, no. 6, pp. 1231-1239, Aug. 2009.
    • Scopus EID: 85008020260
  • 3
    • Y.-J. Wu, Y. Nankaku, and K. Tokuda, "State mapping based method for cross-lingual speaker adaptation in HMM-based speech synthesis", in Proc. of Interspeech, Sep. 2009, pp. 528-531.
    • Scopus EID: 70450192740
  • 4
    • H. Liang, J. Dines, and L. Saheer, "A comparison of supervised and unsupervised cross-lingual speaker adaptation approaches for HMM-based speech synthesis", in Proc. of ICASSP, Mar. 2010, pp. 4598-4601.
    • Scopus EID: 78049369783
  • 6
    • H. Liang and J. Dines, "An analysis of language mismatch in HMM state mapping-based cross-lingual speaker adaptation", in Proc. of Interspeech, Sep. 2010, pp. 622-625.
    • Scopus EID: 79959843446
  • 7
    • Y.-J. Wu, S. King, and K. Tokuda, "Cross-lingual speaker adaptation for HMM-based speech synthesis", in Proc. of ISCSLP, Dec. 2008, pp. 1-4.
    • Scopus EID: 60849092922
  • 8
    • Y.-J. Wu, W. Guo, and R.-H. Wang, "Minimum generation error criterion for tree-based clustering of context-dependent HMMs", in Proc. of Interspeech, Sep. 2006, pp. 2046-2049.
    • Scopus EID: 44949167764
  • 10
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds", Speech Communication, vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
    • Scopus EID: 0032673049
  • 11
    • J. Yamagishi, T. Kobayashi, Y. Nakano, K. Ogata, and J. Isogai, "Analysis of speaker adaptation algorithms for HMM-based speech synthesis and a constrained SMAPLR adaptation algorithm", IEEE Trans. on Audio, Speech and Language Processing, vol. 17, no. 1, pp. 66-83, Jan. 2009.
    • Scopus EID: 67650854725


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.