Volume 2015-August, 2015, Pages 4455-4459

The effect of neural networks in statistical parametric speech synthesis

Author keywords

deep neural network; hidden Markov model; statistical parametric speech synthesis

Indexed keywords

DEEP NEURAL NETWORKS; HIDDEN MARKOV MODELS; SPEECH COMMUNICATION; SPEECH SYNTHESIS; STATISTICS

EID: 84946074523     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2015.7178813     Document Type: Conference Paper
Times cited: 46

References (21)
  • 2
    • 0029765811
    • Unit selection in a concatenative speech synthesis system using a large speech database
    • A. Hunt and A.W. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," Proceedings of ICASSP 1996, pp. 373-376, 1996
    • (1996) Proceedings of ICASSP 1996, pp. 373-376
    • Hunt, A.1    Black, A.W.2
  • 3
    • 0034842740
    • Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
    • M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR," Proceedings of ICASSP 2001, pp. 805-808, 2001
    • (2001) Proceedings of ICASSP 2001, pp. 805-808
    • Tamura, M.1    Masuko, T.2    Tokuda, K.3    Kobayashi, T.4
  • 6
    • 33847129573
    • Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training
    • J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using HSMM-based speaker adaptation and adaptive training," IEICE Transactions on Information & Systems, vol. E90-D, no. 2, pp. 533-543, 2007
    • (2007) IEICE Transactions on Information & Systems, vol. E90-D, no. 2, pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 7
    • 33846935000
    • HMM-based Korean speech synthesis system for hand-held devices
    • S.J. Kim, J.J. Kim, and M.S. Hahn, "HMM-based Korean speech synthesis system for hand-held devices," IEEE Trans. Consum. Electron., vol. 52, no. 4, pp. 1384-1390, 2006
    • (2006) IEEE Trans. Consum. Electron., vol. 52, no. 4, pp. 1384-1390
    • Kim, S.J.1    Kim, J.J.2    Hahn, M.S.3
  • 11
    • 84890490547
    • Statistical parametric speech synthesis using deep neural networks
    • H. Zen, A. Senior, and M. Schuster, "Statistical parametric speech synthesis using deep neural networks," Proceedings of ICASSP 2013, pp. 7962-7966, 2013
    • (2013) Proceedings of ICASSP 2013, pp. 7962-7966
    • Zen, H.1    Senior, A.2    Schuster, M.3
  • 12
    • 84929157442
    • Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis
    • H. Lu, S. King, and O. Watts, "Combining a vector space representation of linguistic context with a deep neural network for text-to-speech synthesis," Proceedings of ISCA SSW8, pp. 281-285, 2013
    • (2013) Proceedings of ISCA SSW8, pp. 281-285
    • Lu, H.1    King, S.2    Watts, O.3
  • 13
    • 84905251808
    • On the training aspects of deep neural network (DNN) for parametric TTS synthesis
    • Y. Qian, Y. Fan, H. Wenping, and F.K. Soong, "On the training aspects of deep neural network (DNN) for parametric TTS synthesis," Proceedings of ICASSP 2014, pp. 3857-3861, 2014
    • (2014) Proceedings of ICASSP 2014, pp. 3857-3861
    • Qian, Y.1    Fan, Y.2    Wenping, H.3    Soong, F.K.4
  • 17
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999
    • (1999) Speech Communication, vol. 27, pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigné, A.3
  • 18
    • 85135145174
    • Acoustic modeling based on the MDL criterion for speech recognition
    • K. Shinoda and T. Watanabe, "Acoustic modeling based on the MDL criterion for speech recognition," Proceedings of Eurospeech 1997, pp. 99-102, 1997
    • (1997) Proceedings of Eurospeech 1997, pp. 99-102
    • Shinoda, K.1    Watanabe, T.2
  • 21
    • 38549096029
    • A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
    • T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for HMM-based speech synthesis," IEICE Transactions on Information & Systems, vol. E90-D, no. 5, pp. 816-824, 2007
    • (2007) IEICE Transactions on Information & Systems, vol. E90-D, no. 5, pp. 816-824
    • Toda, T.1    Tokuda, K.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.