메뉴 건너뛰기




Volumn , Issue , 2018, Pages

Deep Voice 3: Scaling text-to-speech with convolutional sequence learning

Author keywords

[No Author keywords available]

Indexed keywords

CONVOLUTION; SPEECH SYNTHESIS;

EID: 85083953940     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (286)

References (30)
  • 1
    • 84930664922 scopus 로고    scopus 로고
    • Vocaine the vocoder and applications in speech synthesis
    • Yannis Agiomyrgiannakis. Vocaine the vocoder and applications in speech synthesis. In ICASSP, 2015.
    • (2015) ICASSP
    • Agiomyrgiannakis, Y.1
  • 5
    • 85083953689 scopus 로고    scopus 로고
    • Neural machine translation by jointly learning to align and translate
    • Dzmitry Bahdanau, Kyunghyun Cho, and Yoshua Bengio. Neural machine translation by jointly learning to align and translate. In ICLR, 2015.
    • (2015) ICLR
    • Bahdanau, D.1    Cho, K.2    Bengio, Y.3
  • 9
    • 85048443641 scopus 로고    scopus 로고
    • Language modeling with gated convolutional networks
    • Yann Dauphin, Angela Fan, Michael Auli, and David Grangier. Language modeling with gated convolutional networks. In ICML, 2017.
    • (2017) ICML
    • Dauphin, Y.1    Fan, A.2    Auli, M.3    Grangier, D.4
  • 11
  • 13
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time–frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • Hideki Kawahara, Ikuyo Masuda-Katsuse, and Alain De Cheveigne. Restructuring speech representations using a pitch-adaptive time–frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds. Speech communication, 1999.
    • (1999) Speech Communication
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigne, A.3
  • 19
    • 85048524283 scopus 로고    scopus 로고
    • Online and linear-time attention by enforcing monotonic alignments
    • Colin Raffel, Thang Luong, Peter J Liu, Ron J Weiss, and Douglas Eck. Online and linear-time attention by enforcing monotonic alignments. In ICML, 2017.
    • (2017) ICML
    • Raffel, C.1    Luong, T.2    Liu, P.J.3    Weiss, R.J.4    Eck, D.5
  • 20
    • 85047003030 scopus 로고    scopus 로고
    • CrowDMOS: An approach for crowdsourcing mean opinion score studies
    • Flávio Ribeiro, Dinei Florêncio, Cha Zhang, and Michael Seltzer. Crowdmos: An approach for crowdsourcing mean opinion score studies. In IEEE ICASSP, 2011.
    • (2011) IEEE ICASSP
    • Ribeiro, F.1    Florêncio, D.2    Zhang, C.3    Seltzer, M.4
  • 21
    • 85011836388 scopus 로고    scopus 로고
    • A neural attention model for abstractive sentence summarization
    • Alexander M Rush, Sumit Chopra, and Jason Weston. A neural attention model for abstractive sentence summarization. In EMNLP, 2015.
    • (2015) EMNLP
    • Rush, A.M.1    Chopra, S.2    Weston, J.3
  • 22
    • 85017457992 scopus 로고    scopus 로고
    • Weight normalization: A simple reparameterization to accelerate training of deep neural networks
    • Tim Salimans and Diederik P Kingma. Weight normalization: A simple reparameterization to accelerate training of deep neural networks. In NIPS, 2016.
    • (2016) NIPS
    • Salimans, T.1    Kingma, D.P.2
  • 24
    • 84928547704 scopus 로고    scopus 로고
    • Sequence to sequence learning with neural networks
    • Ilya Sutskever, Oriol Vinyals, and Quoc V Le. Sequence to sequence learning with neural networks. In NIPS, 2014.
    • (2014) NIPS
    • Sutskever, I.1    Vinyals, O.2    Le, Q.V.3
  • 26
    • 84925160976 scopus 로고    scopus 로고
    • Cambridge University Press, New York, NY, USA, 1st edition, ISBN
    • Paul Taylor. Text-to-Speech Synthesis. Cambridge University Press, New York, NY, USA, 1st edition, 2009. ISBN 0521899273, 9780521899277.
    • (2009) Text-to-Speech Synthesis
    • Taylor, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.