메뉴 건너뛰기




Volumn , Issue , 2014, Pages 1959-1963

Statistical parametric speech synthesis using weighted multi-distribution deep belief network

Author keywords

Deep belief network; Restricted Boltzmann machine; Speech synthesis

Indexed keywords

SPEECH COMMUNICATION; SPEECH SYNTHESIS;

EID: 84910030421     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (15)

References (20)
  • 1
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 2
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in hmm-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis, " in Euro speech, 1999, pp. 2347- 2350.
    • (1999) Euro Speech , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 4
    • 84890527090 scopus 로고    scopus 로고
    • Multi-distribution deep belief network for speech synthesis
    • S.-Y. Kang, X.-J. Qian, and H. Meng, "Multi-distribution deep belief network for speech synthesis, " in Proc. ICASSP, 2013, pp. 8012-8016.
    • (2013) Proc. ICASSP , pp. 8012-8016
    • Kang, S.-Y.1    Qian, X.-J.2    Meng, H.3
  • 5
    • 84890490547 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis using deep neural networks
    • H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks, " in Proc. ICASSP, 2013, pp. 7962-7966.
    • (2013) Proc. ICASSP , pp. 7962-7966
    • Zen, H.1    Senior, A.2    Schuster, M.3
  • 6
    • 84929157442 scopus 로고    scopus 로고
    • Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis
    • H. Lu, S. King, and O. Watts, "Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis, " in Proc. ISCA SSW8, 2013, pp. 261- 265.
    • (2013) Proc. ISCA SSW8 , pp. 261-265
    • Lu, H.1    King, S.2    Watts, O.3
  • 7
    • 84890447002 scopus 로고    scopus 로고
    • Modeling spectral envelopes using restricted boltzmann machines for statistical parametric speech synthesis
    • Z.-H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using restricted Boltzmann machines for statistical parametric speech synthesis, " in Proc. ICASSP, 2013, pp. 7825-7829.
    • (2013) Proc. ICASSP , pp. 7825-7829
    • Ling, Z.-H.1    Deng, L.2    Yu, D.3
  • 8
    • 84901237776 scopus 로고    scopus 로고
    • Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis
    • Z.-H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis, " Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 10, pp. 2129-2139, 2013.
    • (2013) Audio, Speech, and Language Processing, IEEE Transactions on , vol.21 , Issue.10 , pp. 2129-2139
    • Ling, Z.-H.1    Deng, L.2    Yu, D.3
  • 9
    • 84890522099 scopus 로고    scopus 로고
    • F0 contour prediction with a deep belief network-gaussian process hybrid model
    • R. Fernandez, A. Rendel, B. Ramabhadran, and R. Hoory, "F0 contour prediction with a deep belief network-Gaussian process hybrid model, " in Proc. ICASSP, 2013, pp. 6885-6889.
    • (2013) Proc. ICASSP , pp. 6885-6889
    • Fernandez, R.1    Rendel, A.2    Ramabhadran, B.3    Hoory, R.4
  • 11
    • 79955538498 scopus 로고    scopus 로고
    • Context adaptive training with factorized decision trees for hmm-based statistical parametric speech synthesis
    • K. Yu, H. Zen, F. Mairesse, and S. Young, "Context adaptive training with factorized decision trees for HMM-based statistical parametric speech synthesis, " Speech Communication, vol. 53, no. 6, pp. 914-923, 2011.
    • (2011) Speech Communication , vol.53 , Issue.6 , pp. 914-923
    • Yu, K.1    Zen, H.2    Mairesse, F.3    Young, S.4
  • 12
    • 33745805403 scopus 로고    scopus 로고
    • A fast learning algorithm for deep belief nets
    • G. E. Hinton, S. Osindero, and Y. W. Teh, "A fast learning algorithm for deep belief nets, " Neural Computation, vol. 18, no. 7, pp. 1527-1554, 2006.
    • (2006) Neural Computation , vol.18 , Issue.7 , pp. 1527-1554
    • Hinton, G.E.1    Osindero, S.2    Teh, Y.W.3
  • 13
    • 0000329993 scopus 로고
    • Information processing in dynamical systems: Foundations of harmony theory
    • D. Rumelhart and M. J.L. Eds. MIT Press, ch. 6
    • P. Smolensky, "Information processing in dynamical systems: Foundations of harmony theory, " in Parallel Distributed Processing, D. Rumelhart and M. J.L., Eds. MIT Press, 1986, vol. 1, ch. 6, pp. 194 - 281.
    • (1986) Parallel Distributed Processing , vol.1 , pp. 194-281
    • Smolensky, P.1
  • 15
    • 0013344078 scopus 로고    scopus 로고
    • Training products of experts by minimizing contrastive divergence
    • G. Hinton, "Training products of experts by minimizing contrastive divergence, " Neural Computation, vol. 14, no. 8, pp. 1711-1800, 2002.
    • (2002) Neural Computation , vol.14 , Issue.8 , pp. 1711-1800
    • Hinton, G.1
  • 16
    • 70450180820 scopus 로고    scopus 로고
    • Syllable hmm based mandarin tts and comparison with concatenative tts
    • Z. Shuang, S. Kang, Q. Shi, Y. Qin, and L. Cai, "Syllable HMM based mandarin TTS and comparison with concatenative TTS, " in INTER SPEECH, 2009, pp. 1767-1770.
    • (2009) Inter Speech , pp. 1767-1770
    • Shuang, Z.1    Kang, S.2    Shi, Q.3    Qin, Y.4    Cai, L.5
  • 17
    • 85131821539 scopus 로고
    • Melgeneralized cepstral analysis - A unified approach to speech spectral estimation
    • K. Tokuda, T. Kobayashi, T. Masuko, and S. Imai, "Melgeneralized cepstral analysis - A unified approach to speech spectral estimation, " in ICSLP, 1994.
    • (1994) ICSLP
    • Tokuda, K.1    Kobayashi, T.2    Masuko, T.3    Imai, S.4
  • 18
    • 85016140477 scopus 로고
    • An adaptive algorithm for mel-cepstral analysis of speech
    • T. Fukada, K. Tokuda, T. Kobayashi, and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech, " in ICASSP, vol. 1, 1992, pp. 137-140.
    • (1992) ICASSP , vol.1 , pp. 137-140
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 19
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for hmm-based speech synthesis
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis, " in ICASSP, 2000, pp. 1315-1318.
    • (2000) ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.