



Volume 2015-January, 2015, Pages 854-858

Fusion of multiple parameterisations for DNN-based sinusoidal speech synthesis with multi-task learning

Author keywords

Deep neural network; Fusion vocoder; Sinusoidal model; Statistical speech synthesis

Indexed keywords

DECISION TREES; LEARNING SYSTEMS; PARAMETERIZATION; SPEECH; SPEECH SYNTHESIS; VOCODERS;

EID: 84959144342     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (32)

References (29)
  • 1
    • 84930664922
    • Vocaine the vocoder and applications in speech synthesis
    • Y. Agiomyrgiannakis. Vocaine the vocoder and applications in speech synthesis. In Proc. ICASSP, 2015.
    • (2015) Proc. ICASSP
    • Agiomyrgiannakis, Y.1
  • 3
  • 5
    • 84856248602
    • The deterministic plus stochastic model of the residual signal and its applications
    • T. Drugman and T. Dutoit. The deterministic plus stochastic model of the residual signal and its applications. IEEE Transactions on Audio, Speech and Language Processing, 20 (3): 968-981, 2012.
    • (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , Issue.3 , pp. 968-981
    • Drugman, T.1    Dutoit, T.2
  • 9
    • 84946025802
    • Methods for applying dynamic sinusoidal models to statistical parametric speech synthesis
    • Q. Hu, Y. Stylianou, R. Maia, K. Richmond, and J. Yamagishi. Methods for applying dynamic sinusoidal models to statistical parametric speech synthesis. In Proc. ICASSP, 2015.
    • (2015) Proc. ICASSP
    • Hu, Q.1    Stylianou, Y.2    Maia, R.3    Richmond, K.4    Yamagishi, J.5
  • 10
    • 84910049275
    • An investigation of the application of dynamic sinusoidal models to statistical parametric speech synthesis
    • Q. Hu, Y. Stylianou, R. Maia, K. Richmond, J. Yamagishi, and J. Latorre. An investigation of the application of dynamic sinusoidal models to statistical parametric speech synthesis. In Proc. Interspeech, 2014.
    • (2014) Proc. Interspeech
    • Hu, Q.1    Stylianou, Y.2    Maia, R.3    Richmond, K.4    Yamagishi, J.5    Latorre, J.6
  • 13
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné. Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds. Speech Communication, 27 (3): 187-207, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigné, A.3
  • 14
    • 78649238036
    • Synthesizer voice quality of new languages calibrated with mean mel cepstral distortion
    • J. Kominek, T. Schultz, and A. Black. Synthesizer voice quality of new languages calibrated with mean mel cepstral distortion. In Proc. SLTU, 2008.
    • (2008) Proc. SLTU
    • Kominek, J.1    Schultz, T.2    Black, A.3
  • 16
    • 85009167968
    • Multitask learning in connectionist robust ASR using recurrent neural networks
    • S. Parveen and P. Green. Multitask learning in connectionist robust ASR using recurrent neural networks. In INTERSPEECH, 2003.
    • (2003) INTERSPEECH
    • Parveen, S.1    Green, P.2
  • 17
    • 84905251808
    • On the training aspects of deep neural network for parametric TTS synthesis
    • Y. Qian, Y. Fan, W. Hu, and F. Soong. On the training aspects of deep neural network for parametric TTS synthesis. In Proc. ICASSP, 2014.
    • (2014) Proc. ICASSP
    • Qian, Y.1    Fan, Y.2    Hu, W.3    Soong, F.4
  • 20
    • 79959858197
    • Sinusoidal model parameterization for HMM-based TTS system
    • S. Shechtman and A. Sorin. Sinusoidal model parameterization for HMM-based TTS system. In Proc. Interspeech, 2010.
    • (2010) Proc. Interspeech
    • Shechtman, S.1    Sorin, A.2
  • 22
    • 38549096029
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda. A speech parameter generation algorithm considering global variance for HMM-based speech synthesis. IEICE Transactions on Information and Systems, 90 (5): 816-824, 2007.
    • (2007) IEICE Transactions on Information and Systems , vol.90 , Issue.5 , pp. 816-824
    • Toda, T.1    Tokuda, K.2
  • 24
    • 33947651202
    • Multitask learning for spoken language understanding
    • G. Tur. Multitask learning for spoken language understanding. In ICASSP, 2006.
    • (2006) ICASSP
    • Tur, G.1
  • 25
    • 84946033275
    • Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis
    • Z. Wu, C. Botinhao, O. Watts, and S. King. Deep neural networks employing multi-task learning and stacked bottleneck features for speech synthesis. In Proc. ICASSP, 2015.
    • (2015) Proc. ICASSP
    • Wu, Z.1    Botinhao, C.2    Watts, O.3    King, S.4
  • 27
    • 84905262874
    • Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis
    • H. Zen and A. Senior. Deep mixture density networks for acoustic modeling in statistical parametric speech synthesis. In Proc. ICASSP, 2014.
    • (2014) Proc. ICASSP
    • Zen, H.1    Senior, A.2
  • 28
    • 84890490547
    • Statistical parametric speech synthesis using deep neural networks
    • H. Zen, A. Senior, and M. Schuster. Statistical parametric speech synthesis using deep neural networks. In Proc. ICASSP, 2013.
    • (2013) Proc. ICASSP
    • Zen, H.1    Senior, A.2    Schuster, M.3
  • 29
    • 67651002140
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. Black. Statistical parametric speech synthesis. Speech Communication, 51 (11): 1039-1064, 2009.
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.