메뉴 건너뛰기




Volumn 2015-January, Issue , 2015, Pages 864-868

Sequence generation error (SGE) minimization based deep neural networks training for text-to-speech synthesis

Author keywords

Deep neural networks(DNNs); Sequence generation error (SGE) minimization training; Speech synthesis

Indexed keywords

DECISION TREES; ERRORS; SPEECH; SPEECH RECOGNITION; SPEECH SYNTHESIS;

EID: 84959172579     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (13)

References (18)
  • 1
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametricspeech synthesis
    • H. Zen, K. Tokuda, and W. Black, Alan, "Statistical parametricspeech synthesis", Speech Communication, Volume 51, Issue 11, pp. 1039-1064, 2009.
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Alan, B.W.3
  • 3
    • 84055222005 scopus 로고    scopus 로고
    • Context-dependentpre-trained deep neural networks for large-vocabulary speechrecognition
    • G. E. Dahl, D. Yu, L. Deng, and A. Acero, "Context-dependentpre-trained deep neural networks for large-vocabulary speechrecognition, " IEEE Trans. on Audio, Speech, and LanguageProcessing, vol. 20, no. 1, pp. 30-42, 2012.
    • (2012) IEEE Trans. on Audio, Speech, and LanguageProcessing , vol.20 , Issue.1 , pp. 30-42
    • Dahl, G.E.1    Yu, D.2    Deng, L.3    Acero, A.4
  • 4
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speechtranscription using context-dependent deep neural networks
    • F. Seide, G. Li, and D. Yu, "Conversational speechtranscription using context-dependent deep neural networks, " inProc. InterSpeech, pp. 437-440, 2011
    • (2011) Proc. InterSpeech , pp. 437-440
    • Seide, F.1    Li, G.2    Yu, D.3
  • 5
    • 84890490547 scopus 로고    scopus 로고
    • Statistical parametric speechsynthesis using deep neural networks
    • H. Zen, A. Senior and M. Senior, "Statistical Parametric SpeechSynthesis Using Deep Neural Networks", in Proc. ICASSP, pp. 8012-8016, 2013.
    • (2013) Proc. ICASSP , pp. 8012-8016
    • Zen, H.1    Senior, A.2    Senior, M.3
  • 6
    • 84905251808 scopus 로고    scopus 로고
    • On the trainingaspects of deep neural network (DNN) for parametric TTSsynthesis
    • Y. Qian, Y.-C. Fan, W.-P. Hu and F. K. Soong, "On the trainingaspects of deep neural network (DNN) for parametric TTSsynthesis", in Proc. ICASSP, pp. 3829-3833, 2014.
    • (2014) Proc. ICASSP , pp. 3829-3833
    • Qian, Y.1    Fan, Y.-C.2    Hu, W.-P.3    Soong, F.K.4
  • 7
    • 84929157442 scopus 로고    scopus 로고
    • Combining a vector spacerepresentation of linguistic context with a deep neural networkfor text-to-speech synthesis
    • H. Lu, S. King, and O. Watts, "Combining a vector spacerepresentation of linguistic context with a deep neural networkfor text-to-speech synthesis", in 8th ISCA Workshop on SpeechSynthesis, pp. 281-285, 2013.
    • (2013) 8th ISCA Workshop on SpeechSynthesis , pp. 281-285
    • Lu, H.1    King, S.2    Watts, O.3
  • 8
    • 84890527090 scopus 로고    scopus 로고
    • Multi-distribution deep beliefnetwork for speech synthesis
    • S. Kang, X. Qian, and H. Meng, "Multi-distribution deep beliefnetwork for speech synthesis", in Proc. ICASSP, pp. 7962-7966, 2013.
    • (2013) Proc. ICASSP , pp. 7962-7966
    • Kang, S.1    Qian, X.2    Meng, H.3
  • 9
    • 84890447002 scopus 로고    scopus 로고
    • Modeling spectral envelopesusing restricted Boltzmann machines for statistical parametricspeech synthesis
    • Z.-H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopesusing restricted Boltzmann machines for statistical parametricspeech synthesis", in Proc. ICASSP, pp. 7825-7829, 2013.
    • (2013) Proc. ICASSP , pp. 7825-7829
    • Ling, Z.-H.1    Deng, L.2    Yu, D.3
  • 10
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMMbasedspeech synthesis
    • K. Tokuda, T. Kobayashi, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMMbasedspeech synthesis", in Proc. ICASSP, pp. 1315-1318, 2000.
    • (2000) Proc. ICASSP , pp. 1315-1318
    • Tokuda, K.1    Kobayashi, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 11
    • 33846429403 scopus 로고    scopus 로고
    • Minimum generation error trainingfor HMM-based speech synthesis
    • Y.-J. Wu and R.-H. Wang, "Minimum generation error trainingfor HMM-based speech synthesis", in Proc. ICASSP, pp. I, I, 2006.
    • (2006) Proc. ICASSP , pp. I-I
    • Wu, Y.-J.1    Wang, R.-H.2
  • 12
    • 79959840616 scopus 로고    scopus 로고
    • Investigation of full-sequence training of deep belief networks for speechrecognition
    • A.-R. Mohamed, D. Yu and L. Deng, "Investigation of Full-Sequence Training of Deep Belief Networks for SpeechRecognition", in Proc. Interspeech, pp. 2846-2849, 2010.
    • (2010) Proc. Interspeech , pp. 2846-2849
    • Mohamed, A.-R.1    Yu, D.2    Deng, L.3
  • 13
    • 84890543852 scopus 로고    scopus 로고
    • Error back propagation forsequence training of context-dependent deep networks forconversational speech transcription
    • H. Su, G. Li, D. Yu, and F. Seide, "Error back propagation forsequence training of context-dependent deep networks forconversational speech transcription", in Proc. ICASSP, pp. 6664-6668, 2013
    • (2013) Proc. ICASSP , pp. 6664-6668
    • Su, H.1    Li, G.2    Yu, D.3    Seide, F.4
  • 14
    • 0022471098 scopus 로고
    • Learningrepresentations by back-propagating errors
    • D. E. Rumelhart, G. E. Hinton, and R. J. Williams, "Learningrepresentations by back-propagating errors, " Nature, vol. 323, no. 9, pp. 533-536, 1986.
    • (1986) Nature , vol.323 , Issue.9 , pp. 533-536
    • Rumelhart, D.E.1    Hinton, G.E.2    Williams, R.J.3
  • 17
    • 84959149034 scopus 로고    scopus 로고
    • Amazon Mechanical Turk
    • Amazon Mechanical Turk, Avaliable: https: //www. mturk. com/mturk/welcome
  • 18
    • 84910087395 scopus 로고    scopus 로고
    • Sequence error(SE) minimization training of neural network for voiceconversion
    • F.-L. Xie, Y. Qian, Y.-C. Fan, F. K. Soong, "Sequence Error(SE) Minimization Training of Neural Network for VoiceConversion", in Proc. Interspeech, 2014.
    • (2014) Proc. Interspeech
    • Xie, F.-L.1    Qian, Y.2    Fan, Y.-C.3    Soong, F.K.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.