SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2014, Pages 2273-2277

Modeling DCT parameterized F0 trajectory at intonation phrase level with DNN or decision tree

(7) Yin, Xiang a,b Lei, Ming b Qian, Yao b Soong, Frank K b He, Lei b Ling, Zhen Hua a Dai, Li Rong a

a National Engineering Laboratory for Speech and Language Information Processing (China)

b MICROSOFT (United States)

Author keywords

DCT; Deep neural network; F0 contour; Hidden Markov model; Speech synthesis

Indexed keywords

DECISION TREES; DISPENSERS; HIDDEN MARKOV MODELS; SPEECH COMMUNICATION; SPEECH SYNTHESIS; TREES (MATHEMATICS);

DCT; DEEP NEURAL NETWORKS; DISCRETE COSINE TRANSFORM COEFFICIENTS; F0 CONTOURS; LONG-TERM STRUCTURES; PARAMETERIZED; PHRASE INTONATION; SPEECH PROSODY;

DISCRETE COSINE TRANSFORMS;

EID: 84910044428 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (9)

References (18)

1
- 0033708106
- Speech parameter generation algorithms for hmm-based speech synthesis
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis, " in Proc. of ICASSP, vol. 3, 2000, pp. 1315 - 1318.
- (2000) Proc. of ICASSP , vol.3 , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

2
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in hmm-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, " in Proceedings of 6th European Conference on Speech Communication and Technology, vol. 6, 1999, pp. 2347-2350.
- (1999) Proceedings of 6th European Conference on Speech Communication and Technology , vol.6 , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

3
- 0028996993
- Speech parameter generation from hmm using dynamic features
- K. Tokuda, T. Kobayashi, and S. Imai, "Speech parameter generation from HMM using dynamic features, " in Proc. of ICASSP, vol. 1, 1995, pp. 660-663.
- (1995) Proc. of ICASSP , vol.1 , pp. 660-663
- Tokuda, K.¹ Kobayashi, T.² Imai, S.³

4
- 67650816595
- The ustc and iflytek speech synthesis systems for blizzard challenge 2007
- Z.-H. Ling, L. Qin, H. Lu, Y. Gao, L.-R. Dai, R.-H. Wang, Y. Jiang, Z.-W. Zhao, J.-H. Yang, J. Chen et al., "The USTC and iflytek speech synthesis systems for blizzard challenge 2007, " in Blizzard Challenge Workshop, 2007.
- (2007) Blizzard Challenge Workshop
- Ling, Z.-H.¹ Qin, L.² Lu, H.³ Gao, Y.⁴ Dai, L.-R.⁵ Wang, R.-H.⁶ Jiang, Y.⁷ Zhao, Z.-W.⁸ Yang, J.-H.⁹ Chen, J.¹⁰

5
- 0032678076
- Hidden markov models based on multi-space probability distribution for pitch pattern modeling
- K. Tokuda, T. Masuko, N. Miyazaki, and T. Kobayashi, "Hidden markov models based on multi-space probability distribution for pitch pattern modeling, " in Proc. of ICASSP, vol. 1, 1999, pp. 229-232.
- (1999) Proc. of ICASSP , vol.1 , pp. 229-232
- Tokuda, K.¹ Masuko, T.² Miyazaki, N.³ Kobayashi, T.⁴

6
- 70450161503
- Context-dependent additive log f0 model for hmm-based speech synthesis
- H. Zen and N. Braunschweiler, "Context-dependent additive log f0 model for HMM-based speech synthesis." in Proc. of INTERSPEECH, 2009, pp. 2091-2094.
- (2009) Proc. of Inter Speech , pp. 2091-2094
- Zen, H.¹ Braunschweiler, N.²

7
- 60849084576
- Multi-layer f0 modeling for hmm-based speech synthesis
- C.-C.Wang, Z.-H. Ling, B.-F. Zhang, and L.-R. Dai, "Multi-layer f0 modeling for HMM-based speech synthesis, " in Proc. of ISCSLP, 2008, pp. 129-132.
- (2008) Proc. of ISCSLP , pp. 129-132
- Wang C.-c.¹ Ling, Z.-H.² Zhang, B.-F.³ Dai, L.-R.⁴

8
- 33646780328
- F0 modeling with multi-layer additive modeling based on a statistical learning technique
- IEEE
- S. Sakai, "F0 modeling with multi-layer additive modeling based on a statistical learning technique, " in Proc. of ISCA SSW5. IEEE, 2004, pp. 151-154.
- (2004) Proc. of ISCA SSW5 , pp. 151-154
- Sakai, S.¹

9
- 84867200235
- Generating natural f0 trajectory with additive trees
- Y. Qian, H. Liang, and F. K. Soong, "Generating natural f0 trajectory with additive trees." in Proc. of INTER SPEECH, 2008, pp. 2126-2129.
- (2008) Proc. of Inter Speech , pp. 2126-2129
- Qian, Y.¹ Liang, H.² Soong, F.K.³

10
- 79959844205
- A hierarchical f0 modeling method for hmm-based speech synthesis
- M. Lei, Y.-J.Wu, F. K. Soong, Z.-H. Ling, and L.-R. Dai, "A hierarchical f0 modeling method for HMM-based speech synthesis." in Proc. of INTER SPEECH, 2010, pp. 2170-2173.
- (2010) Proc. of Inter Speech , pp. 2170-2173
- Lei, M.¹ Wu Y.-j.² Soong, F.K.³ Ling, Z.-H.⁴ Dai, L.-R.⁵

11
- 84867589421
- Modeling pitch trajectory by hierarchical hmm with minimum generation error training
- March
- Y.-J. Wu and F. K. Soong, "Modeling pitch trajectory by hierarchical HMM with minimum generation error training, " in Proc. of ICASSP, March 2012, pp. 4017-4020.
- (2012) Proc. of ICASSP , pp. 4017-4020
- Wu, Y.-J.¹ Soong, F.K.²

12
- 85008039410
- Improved prosody generation by maximizing joint probability of state and longer units
- Y. Qian, Z.Wu, B. Gao, and F. K. Soong, "Improved prosody generation by maximizing joint probability of state and longer units, " IEEE Transactions on Audio, Speech, and Language Processing, vol. 19, no. 6, pp. 1702-1710, 2011.
- (2011) IEEE Transactions on Audio, Speech, and Language Processing , vol.19 , Issue.6 , pp. 1702-1710
- Qian, Y.¹ Wu, Z.² Gao, B.³ Soong, F.K.⁴

13
- 84867194192
- Multilevel parametric-base f0 model for speech synthesis
- J. Latorre and M. Akamine, "Multilevel parametric-base f0 model for speech synthesis, " in Proc. of INTER SPEECH, 2008, pp. 2274-2277.
- (2008) Proc. of Inter Speech , pp. 2274-2277
- Latorre, J.¹ Akamine, M.²

14
- 84890543760
- Accent group modeling for improved prosody in statistical parameteric speech synthesis
- G. Krishna and A. W. Black, "Accent group modeling for improved prosody in statistical parameteric speech synthesis, " in Proc. of ICASSP, 2013, pp. 6890-6894.
- (2013) Proc. of ICASSP , pp. 6890-6894
- Krishna, G.¹ Black, A.W.²

15
- 60849112575
- Modeling and generating tone contour with phrase intonation for mandarin chinese speech
- Dec
- Z. Wu, Y. Qian, F. K. Soong, and B. Zhang, "Modeling and generating tone contour with phrase intonation for mandarin chinese speech, " in Proc. of ISCSLP, Dec 2008, pp. 1-4.
- (2008) Proc. of ISCSLP , pp. 1-4
- Wu, Z.¹ Qian, Y.² Soong, F.K.³ Zhang, B.⁴

16
- 51449117929
- Modelling and synthesising f0 contours with the discrete cosine transform
- J. Teutenberg, C. Watson, and P. Riddle, "Modelling and synthesising f0 contours with the discrete cosine transform, " in Proc. of ICASSP, 2008, pp. 3973-3976.
- (2008) Proc. of ICASSP , pp. 3973-3976
- Teutenberg, J.¹ Watson, C.² Riddle, P.³

17
- 84890490547
- Statistical parametric speech synthesis using deep neural networks
- H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks, " in Proc. of ICASSP, 2013, pp. 7962-7966.
- (2013) Proc. of ICASSP , pp. 7962-7966
- Zen, H.¹ Senior, A.² Schuster, M.³

18
- 0001455934
- A robust algorithm for pitch tracking (rapt)
- D. Talkin, "A robust algorithm for pitch tracking (RAPT), " Speech coding and synthesis, pp. 495-518, 1995.
- (1995) Speech Coding and Synthesis , pp. 495-518
- Talkin, D.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.