메뉴 건너뛰기




Volumn 2017-December, Issue , 2017, Pages 2963-2971

Deep voice 2: Multi-speaker neural text-to-speech

Author keywords

[No Author keywords available]

Indexed keywords

SPEECH SYNTHESIS;

EID: 85046637415     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (411)

References (24)
  • 1
    • 84890452886 scopus 로고    scopus 로고
    • Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code
    • O. Abdel-Hamid and H. Jiang. Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code. In ICASSP, 2013.
    • (2013) ICASSP
    • Abdel-Hamid, O.1    Jiang, H.2
  • 5
    • 84946051934 scopus 로고    scopus 로고
    • Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis
    • Y. Fan, Y. Qian, F. K. Soong, and L. He. Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis. In IEEE ICASSP, 2015.
    • (2015) IEEE ICASSP
    • Fan, Y.1    Qian, Y.2    Soong, F.K.3    He, L.4
  • 6
    • 33749259827 scopus 로고    scopus 로고
    • Connectionist temporal classification: Labelling unsegmented sequence data with recurrent neural networks
    • A. Graves, S. Fernández, F. Gomez, and J. Schmidhuber. Connectionist temporal classification: labelling unsegmented sequence data with recurrent neural networks. In ICML, 2006.
    • (2006) ICML
    • Graves, A.1    Fernández, S.2    Gomez, F.3    Schmidhuber, J.4
  • 14
    • 0033884858 scopus 로고    scopus 로고
    • Speaker verification using adapted Gaussian mixture models
    • D. A. Reynolds, T. F. Quatieri, and R. B. Dunn. Speaker verification using adapted gaussian mixture models. Digital signal processing, 10(1-3):19-41, 2000.
    • (2000) Digital Signal Processing , vol.10 , Issue.1-3 , pp. 19-41
    • Reynolds, D.A.1    Quatieri, T.F.2    Dunn, R.B.3
  • 15
    • 85047003030 scopus 로고    scopus 로고
    • Crowdmos: An approach for crowdsourcing mean opinion score studies
    • F. Ribeiro, D. Florêncio, C. Zhang, and M. Seltzer. Crowdmos: An approach for crowdsourcing mean opinion score studies. In IEEE ICASSP, 2011.
    • (2011) IEEE ICASSP
    • Ribeiro, F.1    Florêncio, D.2    Zhang, C.3    Seltzer, M.4
  • 23
    • 85047016414 scopus 로고    scopus 로고
    • Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis
    • H. Zen and H. Sak. Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis. In IEEE ICASSP, 2015.
    • (2015) IEEE ICASSP
    • Zen, H.1    Sak, H.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.