메뉴 건너뛰기




Volumn 2017-August, Issue , 2017, Pages 4011-4015

Siri on-device deep learning-guided unit selection text-To-speech system

Author keywords

Hybrid; Recurrent mixture density network, on device; Speech synthesis; Unit selection

Indexed keywords

DEEP LEARNING; MIXTURES; SPEECH; SPEECH SYNTHESIS;

EID: 85039170210     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: 10.21437/Interspeech.2017-1798     Document Type: Conference Paper
Times cited : (75)

References (17)
  • 1
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech syn thesis
    • H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 3
    • 85015421291 scopus 로고    scopus 로고
    • Unit size in unit selection speech synthesis
    • S. P. Kishore and A. W. Black, "Unit size in unit selection speech synthesis." in Interspeech, 2003.
    • (2003) Interspeech
    • Kishore, S.P.1    Black, A.W.2
  • 5
    • 84959124410 scopus 로고    scopus 로고
    • Using deep bidirectional recurrent neural networks for prosodictarget prediction in a unit-selection text-To-speech system
    • R. Fernandez, A. Rendel, B. Ramabhadran, and R. Hoory, "Using deep bidirectional recurrent neural networks for prosodictarget prediction in a unit-selection text-To-speech system." in Interspeech, 2015, pp. 1606-1610.
    • (2015) Interspeech , pp. 1606-1610
    • Fernandez, R.1    Rendel, A.2    Ramabhadran, B.3    Hoory, R.4
  • 8
    • 34047123652 scopus 로고    scopus 로고
    • Multisyn: Open-domain unit selection for the festival speech synthesis system
    • R. A. Clark, K. Richmond, and S. King, "Multisyn: Open-domain unit selection for the festival speech synthesis system, " Speech Communication, vol. 49, no. 4, pp. 317-330, 2007.
    • (2007) Speech Communication , vol.49 , Issue.4 , pp. 317-330
    • Clark, R.A.1    Richmond, K.2    King, S.3
  • 9
    • 84994309294 scopus 로고    scopus 로고
    • Recent advances in google real-Time HMM-driven unit selection synthesizer
    • X. Gonzalvo, S. Tazari, C.-A. Chan, M. Becker, A. Gutkin, and H. Silen, "Recent advances in google real-Time HMM-driven unit selection synthesizer, " in Interspeech, 2016, pp. 2238-2242.
    • (2016) Interspeech , pp. 2238-2242
    • Gonzalvo, X.1    Tazari, S.2    Chan, C.-A.3    Becker, M.4    Gutkin, A.5    Silen, H.6
  • 15
    • 34047268342 scopus 로고    scopus 로고
    • Conversational speech synthesis and the need for some laughter
    • N. Campbell, "Conversational speech synthesis and the need for some laughter, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 14, no. 4, pp. 1171-1178, 2006.
    • (2006) IEEE Transactions on Audio, Speech, and Language Processing , vol.14 , Issue.4 , pp. 1171-1178
    • Campbell, N.1
  • 16
    • 84959121380 scopus 로고    scopus 로고
    • Pruning redundant synthesis units based on static and delta unit appearance frequency
    • H. Lu, W. Zhang, X. Shao, Q. Zhou, W. Lei, H. Zhou, and A. Breen, "Pruning redundant synthesis units based on static and delta unit appearance frequency, " in Interspeech, 2015.
    • (2015) Interspeech
    • Lu, H.1    Zhang, W.2    Shao, X.3    Zhou, Q.4    Lei, W.5    Zhou, H.6    Breen, A.7
  • 17
    • 79551478696 scopus 로고    scopus 로고
    • The Romanian speech synthesis (RSS) corpus: Building a high quality HMMbased speech synthesis system using a high sampling rate
    • A. Stan, J. Yamagishi, S. King, and M. Aylett, "The Romanian speech synthesis (RSS) corpus: Building a high quality HMMbased speech synthesis system using a high sampling rate, " Speech Communication, vol. 53, no. 3, pp. 442-450, 2011.
    • (2011) Speech Communication , vol.53 , Issue.3 , pp. 442-450
    • Stan, A.1    Yamagishi, J.2    King, S.3    Aylett, M.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.