Volume 1, 2017, Pages 264-273

Deep voice: Real-time neural text-to-speech

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; SPEECH SYNTHESIS;

EID: 85039156048     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 247

References (27)
  • 3
    • 4444257069
    • Praat, a system for doing phonetics by computer
    • Boersma, Paulus Petrus Gerardus et al. Praat, a system for doing phonetics by computer. Glot international, 5, 2002.
    • (2002) Glot International , pp. 5
    • Boersma, P.P.G.1
  • 11
    • 84976902575
    • World: A vocoder-based high-quality speech synthesis system for real-time applications
    • Morise, Masanori, Yokomori, Fumiya, and Ozawa, Kenji. World: a vocoder-based high-quality speech synthesis system for real-time applications. IEICE TRANSACTIONS on Information and Systems, 99(7):1877-1884, 2016.
    • (2016) IEICE TRANSACTIONS on Information and Systems , vol.99 , Issue.7 , pp. 1877-1884
    • Morise, M.1    Yokomori, F.2    Ozawa, K.3
  • 14
    • 85048678744
    • Multi-output rnn-lstm for multiple speaker speech synthesis with α-interpolation model
    • Pascual, Santiago and Bonafonte, Antonio. Multi-output rnn-lstm for multiple speaker speech synthesis with α-interpolation model. way, 1000:2, 2016.
    • (2016) Way , vol.1000 , pp. 2
    • Pascual, S.1    Bonafonte, A.2
  • 18
    • 84994213378
    • A template-based approach for speech synthesis intonation generation using lstms
    • Ronanki, Srikanth, Henter, Gustav Eje, Wu, Zhizheng, and King, Simon. A template-based approach for speech synthesis intonation generation using lstms. Interspeech 2016, pp. 2463-2467, 2016.
    • (2016) Interspeech 2016 , pp. 2463-2467
    • Ronanki, S.1    Henter, G.E.2    Wu, Z.3    King, S.4
  • 21
    • 84925160976
    • Text-to-Speech Synthesis
    • Taylor, Paul. Text-to-Speech Synthesis. Cambridge University Press, New York, NY, USA, 1st edition, 2009. ISBN 0521899273, 9780521899277.
    • (2009) Text-to-speech Synthesis
    • Taylor, P.1
  • 26
    • 84946045510
    • Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis
    • IEEE
    • Zen, Heiga and Sak, Hasim. Unidirectional long short-term memory recurrent neural network with recurrent output layer for low-latency speech synthesis. In Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on, pp. 4470-4474. IEEE, 2015.
    • (2015) Acoustics, Speech and Signal Processing (ICASSP), 2015 IEEE International Conference on , pp. 4470-4474
    • Zen, H.1    Sak, H.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.