Volume 2018-December, 2018, Pages 10019-10029

Neural voice cloning with a few samples

Author keywords

[No Author keywords available]

Indexed keywords

ENCODING (SYMBOLS); SIGNAL ENCODING;

EID: 85064829543     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 302

References (39)
  • 1
    • 85064830948
    • Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code
    • O. Abdel-Hamid and H. Jiang. Fast speaker adaptation of hybrid NN/HMM model for speech recognition based on discriminative learning of speaker code. In IEEE ICASSP, 2013.
    • (2013) IEEE ICASSP
    • Abdel-Hamid, O.; Jiang, H.
  • 2
    • 85064816663
    • Voice morphing that improves TTS quality using an optimal dynamic frequency warping-and-weighting transform
    • Y. Agiomyrgiannakis and Z. Roupakia. Voice morphing that improves TTS quality using an optimal dynamic frequency warping-and-weighting transform. In IEEE ICASSP, 2016.
    • (2016) IEEE ICASSP
    • Agiomyrgiannakis, Y.; Roupakia, Z.
  • 12
    • 85049871154
    • Progressive growing of GANs for improved quality, stability, and variation
    • T. Karras, T. Aila, S. Laine, and J. Lehtinen. Progressive growing of GANs for improved quality, stability, and variation. CoRR, abs/1710.10196, 2017.
    • (2017) CoRR
    • Karras, T.; Aila, T.; Laine, S.; Lehtinen, J.
  • 13
    • 84898998554
    • One-shot learning by inverting a compositional causal process
    • B. M. Lake, R. Salakhutdinov, and J. B. Tenenbaum. One-shot learning by inverting a compositional causal process. In NIPS, 2013.
    • (2013) NIPS
    • Lake, B.M.; Salakhutdinov, R.; Tenenbaum, J.B.
  • 15
    • 84949683101
    • Human-level concept learning through probabilistic program induction
    • B. M. Lake, R. Salakhutdinov, and J. B. Tenenbaum. Human-level concept learning through probabilistic program induction. Science, 2015.
    • (2015) Science
    • Lake, B.M.; Salakhutdinov, R.; Tenenbaum, J.B.
  • 16
    • 84959173377
    • Modeling speaker variability using long short-term memory networks for speech recognition
    • X. Li and X. Wu. Modeling speaker variability using long short-term memory networks for speech recognition. In INTERSPEECH, 2015.
    • (2015) INTERSPEECH
    • Li, X.; Wu, X.
  • 22
    • 85023755462
    • Librispeech: An ASR corpus based on public domain audio books
    • V. Panayotov, G. Chen, D. Povey, and S. Khudanpur. Librispeech: An ASR corpus based on public domain audio books. In IEEE ICASSP, 2015.
    • (2015) IEEE ICASSP
    • Panayotov, V.; Chen, G.; Povey, D.; Khudanpur, S.
  • 24
    • 50649094277
    • Probabilistic linear discriminant analysis for inferences about identity
    • S. Prince and J. Elder. Probabilistic linear discriminant analysis for inferences about identity. In ICCV, 2007.
    • (2007) ICCV
    • Prince, S.; Elder, J.
  • 26
    • 84998631632
    • One-shot generalization in deep generative models
    • D. Rezende, S. Mohamed, I. Danihelka, K. Gregor, and D. Wierstra. One-shot generalization in deep generative models. In ICML, 2016.
    • (2016) ICML
    • Rezende, D.; Mohamed, S.; Danihelka, I.; Gregor, K.; Wierstra, D.
  • 30
    • 85083953646
    • VoiceLoop: Voice fitting and synthesis via a phonological loop
    • Y. Taigman, L. Wolf, A. Polyak, and E. Nachmani. VoiceLoop: Voice fitting and synthesis via a phonological loop. In ICLR, 2018.
    • (2018) ICLR
    • Taigman, Y.; Wolf, L.; Polyak, A.; Nachmani, E.
  • 34
    • 84994351528
    • Analysis of the voice conversion challenge 2016 evaluation results
    • M. Wester, Z. Wu, and J. Yamagishi. Analysis of the voice conversion challenge 2016 evaluation results. In INTERSPEECH, pages 1637-1641, September 2016.
    • (2016) INTERSPEECH, pp. 1637-1641
    • Wester, M.; Wu, Z.; Yamagishi, J.
  • 35
    • 84994247053
    • Locally linear embedding for exemplar-based spectral conversion
    • Y.-C. Wu, H.-T. Hwang, C.-C. Hsu, Y. Tsao, and H.-M. Wang. Locally linear embedding for exemplar-based spectral conversion. In INTERSPEECH, pages 1652-1656, September 2016.
    • (2016) INTERSPEECH, pp. 1652-1656
    • Wu, Y.-C.; Hwang, H.-T.; Hsu, C.-C.; Tsao, Y.; Wang, H.-M.
  • 39
    • 85064811346
    • KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition
    • D. Yu, K. Yao, H. Su, G. Li, and F. Seide. KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition. In IEEE ICASSP, 2013.
    • (2013) IEEE ICASSP
    • Yu, D.; Yao, K.; Su, H.; Li, G.; Seide, F.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.