Volume 2015-August, 2015, Pages 4475-4479

Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis

Author keywords

deep neural networks; multi task learning; statistical parametric speech synthesis; transfer learning

Indexed keywords

AUDIO SIGNAL PROCESSING; LINGUISTICS; METADATA; SPEECH COMMUNICATION; SPEECH SYNTHESIS

EID: 84946051934     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178817     Document Type: Conference Paper
Times cited: 132

References (10)
  • 3
    • 84910047819
    • TTS synthesis with bidirectional LSTM based recurrent neural networks
    • Yuchen Fan, Yao Qian, Feng-Long Xie, and Frank K Soong, TTS synthesis with bidirectional LSTM based recurrent neural networks, in INTERSPEECH, 2014
    • (2014) INTERSPEECH
    • Fan, Y.1    Qian, Y.2    Xie, F.-L.3    Soong, F.K.4
  • 4
    • 67651002140
    • Statistical parametric speech synthesis
    • Heiga Zen, Keiichi Tokuda, and Alan W Black, Statistical parametric speech synthesis, Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 8
    • 84904548965
    • Deep learning of representations for unsupervised and transfer learning
    • Yoshua Bengio, Deep learning of representations for unsupervised and transfer learning, in ICML Unsupervised and Transfer Learning, 2012, pp. 17-36
    • (2012) ICML Unsupervised and Transfer Learning , pp. 17-36
    • Bengio, Y.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.