메뉴 건너뛰기




Volumn , Issue , 2014, Pages 2273-2277

Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree

Author keywords

DCT; Deep neural network; F0 contour; Hidden Markov model; Speech synthesis

Indexed keywords

DECISION TREES; DISPENSERS; HIDDEN MARKOV MODELS; SPEECH COMMUNICATION; SPEECH SYNTHESIS; TREES (MATHEMATICS);

EID: 84910044428     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (18)
  • 1
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for hmm-based speech synthesis
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis, " in Proc. of ICASSP, vol. 3, 2000, pp. 1315 - 1318.
    • (2000) Proc. of ICASSP , vol.3 , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 3
    • 0028996993 scopus 로고
    • Speech parameter generation from hmm using dynamic features
    • K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features, " in Proc. of ICASSP, vol. 1, 1995, pp. 660-663.
    • (1995) Proc. of ICASSP , vol.1 , pp. 660-663
    • Tokuda, K.1    Kobayashi, T.2    Imai, S.3
  • 5
    • 0032678076 scopus 로고    scopus 로고
    • Hidden markov models based on multi-space probability distribution for pitch pattern modeling
    • K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden markov models based on multi-space probability distribution for pitch pattern modeling, " in Proc. of ICASSP, vol. 1, 1999, pp. 229-232.
    • (1999) Proc. of ICASSP , vol.1 , pp. 229-232
    • Tokuda, K.1    Masuko, T.2    Miyazaki, N.3    Kobayashi, T.4
  • 6
    • 70450161503 scopus 로고    scopus 로고
    • Context-dependent additive log f0 model for hmm-based speech synthesis
    • H. Zen and N. Braunschweiler, "Context-dependent additive log f0 model for HMM-based speech synthesis." in Proc. of INTERSPEECH, 2009, pp. 2091-2094.
    • (2009) Proc. of Inter Speech , pp. 2091-2094
    • Zen, H.1    Braunschweiler, N.2
  • 7
    • 60849084576 scopus 로고    scopus 로고
    • Multi-layer f0 modeling for hmm-based speech synthesis
    • C.-C.Wang, Z.-H. Ling, B.-F. Zhang, and L.-R. Dai, "Multi-layer f0 modeling for HMM-based speech synthesis, " in Proc. of ISCSLP, 2008, pp. 129-132.
    • (2008) Proc. of ISCSLP , pp. 129-132
    • Wang C.-c.1    Ling, Z.-H.2    Zhang, B.-F.3    Dai, L.-R.4
  • 8
    • 33646780328 scopus 로고    scopus 로고
    • F0 modeling with multi-layer additive modeling based on a statistical learning technique
    • IEEE
    • S. Sakai, "F0 modeling with multi-layer additive modeling based on a statistical learning technique, " in Proc. of ISCA SSW5. IEEE, 2004, pp. 151-154.
    • (2004) Proc. of ISCA SSW5 , pp. 151-154
    • Sakai, S.1
  • 9
    • 84867200235 scopus 로고    scopus 로고
    • Generating natural f0 trajectory with additive trees
    • Y. Qian, H. Liang, and F. K. Soong, "Generating natural f0 trajectory with additive trees." in Proc. of INTER SPEECH, 2008, pp. 2126-2129.
    • (2008) Proc. of Inter Speech , pp. 2126-2129
    • Qian, Y.1    Liang, H.2    Soong, F.K.3
  • 10
    • 79959844205 scopus 로고    scopus 로고
    • A hierarchical f0 modeling method for hmm-based speech synthesis
    • M. Lei, Y.-J.Wu, F. K. Soong, Z.-H. Ling, and L.-R. Dai, "A hierarchical f0 modeling method for HMM-based speech synthesis." in Proc. of INTER SPEECH, 2010, pp. 2170-2173.
    • (2010) Proc. of Inter Speech , pp. 2170-2173
    • Lei, M.1    Wu Y.-j.2    Soong, F.K.3    Ling, Z.-H.4    Dai, L.-R.5
  • 11
    • 84867589421 scopus 로고    scopus 로고
    • Modeling pitch trajectory by hierarchical hmm with minimum generation error training
    • March
    • Y.-J. Wu and F. K. Soong, "Modeling pitch trajectory by hierarchical HMM with minimum generation error training, " in Proc. of ICASSP, March 2012, pp. 4017-4020.
    • (2012) Proc. of ICASSP , pp. 4017-4020
    • Wu, Y.-J.1    Soong, F.K.2
  • 12
    • 85008039410 scopus 로고    scopus 로고
    • Improved prosody generation by maximizing joint probability of state and longer units
    • Y. Qian, Z.Wu, B. Gao, and F. K. Soong, "Improved prosody generation by maximizing joint probability of state and longer units, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 6, pp. 1702-1710, 2011.
    • (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.6 , pp. 1702-1710
    • Qian, Y.1    Wu, Z.2    Gao, B.3    Soong, F.K.4
  • 13
    • 84867194192 scopus 로고    scopus 로고
    • Multilevel parametric-base f0 model for speech synthesis
    • J. Latorre and M. Akamine, "Multilevel parametric-base f0 model for speech synthesis, " in Proc. of INTER SPEECH, 2008, pp. 2274-2277.
    • (2008) Proc. of Inter Speech , pp. 2274-2277
    • Latorre, J.1    Akamine, M.2
  • 14
    • 84890543760 scopus 로고    scopus 로고
    • Accent group modeling for improved prosody in statistical parameteric speech synthesis
    • G. Krishna and A. W. Black, "Accent group modeling for improved prosody in statistical parameteric speech synthesis, " in Proc. of ICASSP, 2013, pp. 6890-6894.
    • (2013) Proc. of ICASSP , pp. 6890-6894
    • Krishna, G.1    Black, A.W.2
  • 15
    • 60849112575 scopus 로고    scopus 로고
    • Modeling and generating tone contour with phrase intonation for mandarin chinese speech
    • Dec
    • Z. Wu, Y. Qian, F. K. Soong, and B. Zhang, "Modeling and generating tone contour with phrase intonation for mandarin chinese speech, " in Proc. of ISCSLP, Dec 2008, pp. 1-4.
    • (2008) Proc. of ISCSLP , pp. 1-4
    • Wu, Z.1    Qian, Y.2    Soong, F.K.3    Zhang, B.4
  • 16
    • 51449117929 scopus 로고    scopus 로고
    • Modelling and synthesising f0 contours with the discrete cosine transform
    • J. Teutenberg, C. Watson, and P. Riddle, "Modelling and synthesising f0 contours with the discrete cosine transform, " in Proc. of ICASSP, 2008, pp. 3973-3976.
    • (2008) Proc. of ICASSP , pp. 3973-3976
    • Teutenberg, J.1    Watson, C.2    Riddle, P.3
  • 17
    • 84890490547 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis using deep neural networks
    • H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks, " in Proc. of ICASSP, 2013, pp. 7962-7966.
    • (2013) Proc. of ICASSP , pp. 7962-7966
    • Zen, H.1    Senior, A.2    Schuster, M.3
  • 18
    • 0001455934 scopus 로고
    • A robust algorithm for pitch tracking (rapt)
    • D. Talkin, "A robust algorithm for pitch tracking (RAPT), " Speech coding and synthesis, pp. 495-518, 1995.
    • (1995) Speech Coding and Synthesis , pp. 495-518
    • Talkin, D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.