SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2014, Pages 1959-1963

Statistical parametric speech synthesis using weighted multi-distribution deep belief network

(2) Kang, Shiyin a Meng, Helen a

a CHINESE UNIVERSITY OF HONG KONG (Hong Kong)

Author keywords

Deep belief network; Restricted Boltzmann machine; Speech synthesis

Indexed keywords

SPEECH COMMUNICATION; SPEECH SYNTHESIS;

CONTEXT DEPENDENT; DEEP BELIEF NETWORKS; FUNDAMENTAL FREQUENCIES; RESTRICTED BOLTZMANN MACHINE; STATISTICAL PARAMETRIC SPEECH SYNTHESIS; SYNTHESIZED SPEECH; TRAINING PROCEDURES; WEIGHTING COEFFICIENT;

CONTINUOUS SPEECH RECOGNITION;

EID: 84910030421 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (15)

References (20)

1
- 67651002140
- Statistical parametric speech synthesis
- H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
- (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

2
- 85009139544
- Simultaneous modeling of spectrum, pitch and duration in hmm-based speech synthesis
- T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, " in Euro speech, 1999, pp. 2347- 2350.
- (1999) Euro Speech , pp. 2347-2350
- Yoshimura, T.¹ Tokuda, K.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

3
- 0003805597
- J. J. Odell, "The use of context in large vocabulary speech recognition, " 1995.
- (1995) The Use of Context in Large Vocabulary Speech Recognition
- Odell, J.J.¹

4
- 84890527090
- Multi-distribution deep belief network for speech synthesis
- S.-Y. Kang, X.-J. Qian, and H. Meng, "Multi-distribution deep belief network for speech synthesis, " in Proc. ICASSP, 2013, pp. 8012-8016.
- (2013) Proc. ICASSP , pp. 8012-8016
- Kang, S.-Y.¹ Qian, X.-J.² Meng, H.³

5
- 84890490547
- Statistical parametric speech synthesis using deep neural networks
- H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks, " in Proc. ICASSP, 2013, pp. 7962-7966.
- (2013) Proc. ICASSP , pp. 7962-7966
- Zen, H.¹ Senior, A.² Schuster, M.³

6
- 84929157442
- Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis
- H. Lu, S. King, and O. Watts, "Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis, " in Proc. ISCA SSW8, 2013, pp. 261- 265.
- (2013) Proc. ISCA SSW8 , pp. 261-265
- Lu, H.¹ King, S.² Watts, O.³

7
- 84890447002
- Modeling spectral envelopes using restricted boltzmann machines for statistical parametric speech synthesis
- Z.-H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using restricted Boltzmann machines for statistical parametric speech synthesis, " in Proc. ICASSP, 2013, pp. 7825-7829.
- (2013) Proc. ICASSP , pp. 7825-7829
- Ling, Z.-H.¹ Deng, L.² Yu, D.³

8
- 84901237776
- Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis
- Z.-H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 10, pp. 2129-2139, 2013.
- (2013) Audio, Speech, and Language Processing, IEEE Transactions on , vol.21 , Issue.10 , pp. 2129-2139
- Ling, Z.-H.¹ Deng, L.² Yu, D.³

9
- 84890522099
- F0 contour prediction with a deep belief network-gaussian process hybrid model
- R. Fernandez, A. Rendel, B. Ramabhadran, and R. Hoory, "F0 contour prediction with a deep belief network-Gaussian process hybrid model, " in Proc. ICASSP, 2013, pp. 6885-6889.
- (2013) Proc. ICASSP , pp. 6885-6889
- Fernandez, R.¹ Rendel, A.² Ramabhadran, B.³ Hoory, R.⁴

10
- 84994214710
- Deep learning in speech synthesis
- H. Zen, "Deep learning in speech synthesis, " Keynote speech given at ISCA SSW8, 2013.
- (2013) Keynote Speech Given at ISCA SSW8
- Zen, H.¹

11
- 79955538498
- Context adaptive training with factorized decision trees for hmm-based statistical parametric speech synthesis
- K. Yu, H. Zen, F. Mairesse, and S. Young, "Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis, " Speech Communication, vol. 53, no. 6, pp. 914-923, 2011.
- (2011) Speech Communication , vol.53 , Issue.6 , pp. 914-923
- Yu, K.¹ Zen, H.² Mairesse, F.³ Young, S.⁴

12
- 33745805403
- A fast learning algorithm for deep belief nets
- G. E. Hinton, S. Osindero, and Y. W. Teh, "A fast learning algorithm for deep belief nets, " Neural Computation, vol. 18, no. 7, pp. 1527-1554, 2006.
- (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.W.³

13
- 0000329993
- Information processing in dynamical systems: Foundations of harmony theory
- D. Rumelhart and M. J.L. Eds. MIT Press, ch. 6
- P. Smolensky, "Information processing in dynamical systems: Foundations of harmony theory, " in Parallel Distributed Processing, D. Rumelhart and M. J.L., Eds. MIT Press, 1986, vol. 1, ch. 6, pp. 194 - 281.
- (1986) Parallel Distributed Processing , vol.1 , pp. 194-281
- Smolensky, P.¹

14
- 14344259207
- Solving large scale linear prediction problems using stochastic gradient descent algorithms
- ACM
- T. Zhang, "Solving large scale linear prediction problems using stochastic gradient descent algorithms, " in Proceedings of the twenty-first international conference on Machine learning. ACM, 2004, p. 116.
- (2004) Proceedings of the Twenty-first International Conference on Machine Learning
- Zhang, T.¹

15
- 0013344078
- Training products of experts by minimizing contrastive divergence
- G. Hinton, "Training products of experts by minimizing contrastive divergence, " Neural Computation, vol. 14, no. 8, pp. 1711-1800, 2002.
- (2002) Neural Computation , vol.14 , Issue.8 , pp. 1711-1800
- Hinton, G.¹

16
- 70450180820
- Syllable hmm based mandarin tts and comparison with concatenative tts
- Z. Shuang, S. Kang, Q. Shi, Y. Qin, and L. Cai, "Syllable HMM based mandarin TTS and comparison with concatenative TTS, " in INTER SPEECH, 2009, pp. 1767-1770.
- (2009) Inter Speech , pp. 1767-1770
- Shuang, Z.¹ Kang, S.² Shi, Q.³ Qin, Y.⁴ Cai, L.⁵

17
- 85131821539
- Melgeneralized cepstral analysis - A unified approach to speech spectral estimation
- K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Melgeneralized cepstral analysis - A unified approach to speech spectral estimation, " in ICSLP, 1994.
- (1994) ICSLP
- Tokuda, K.¹ Kobayashi, T.² Masuko, T.³ Imai, S.⁴

18
- 85016140477
- An adaptive algorithm for mel-cepstral analysis of speech
- T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech, " in ICASSP, vol. 1, 1992, pp. 137-140.
- (1992) ICASSP , vol.1 , pp. 137-140
- Fukada, T.¹ Tokuda, K.² Kobayashi, T.³ Imai, S.⁴

19
- 0033708106
- Speech parameter generation algorithms for hmm-based speech synthesis
- K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis, " in ICASSP, 2000, pp. 1315-1318.
- (2000) ICASSP , pp. 1315-1318
- Tokuda, K.¹ Yoshimura, T.² Masuko, T.³ Kobayashi, T.⁴ Kitamura, T.⁵

20
- 70349197691
- Voice conversion using artificial neural networks
- S. Desai, E. V. Raghavendra, B. Yegnanarayana, A.W. Black, and K. Prahallad, "Voice conversion using artificial neural networks, " in ICASSP, 2009, pp. 3893-3896.
- (2009) ICASSP , pp. 3893-3896
- Desai, S.¹ Raghavendra, E.V.² Yegnanarayana, B.³ Black, A.W.⁴ Prahallad, K.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.