메뉴 건너뛰기




Volumn , Issue , 2017, Pages

On the training of DNN-based average voice model for speech synthesis

Author keywords

[No Author keywords available]

Indexed keywords

DEEP NEURAL NETWORKS; LINGUISTICS; SPEECH PROCESSING;

EID: 85013762788     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/APSIPA.2016.7820818     Document Type: Conference Paper
Times cited : (19)

References (28)
  • 1
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis," Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009.
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 6
    • 84901237776 scopus 로고    scopus 로고
    • Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis
    • Z.-H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis," Audio, Speech, and Language Processing, IEEE Transactions on, vol. 21, no. 10, pp. 2129-2139, 2013.
    • (2013) Audio, Speech, and Language Processing, IEEE Transactions on , vol.21 , Issue.10 , pp. 2129-2139
    • Ling, Z.-H.1    Deng, L.2    Yu, D.3
  • 12
    • 33847129573 scopus 로고    scopus 로고
    • Average-voice-based speech synthesis using hsmm-based speaker adaptation and adaptive training
    • J. Yamagishi and T. Kobayashi, "Average-voice-based speech synthesis using hsmm-based speaker adaptation and adaptive training," IEICE TRANSACTIONS on Information and Systems, vol. 90, no. 2, pp. 533- 543, 2007.
    • (2007) IEICE TRANSACTIONS on Information and Systems , vol.90 , Issue.2 , pp. 533-543
    • Yamagishi, J.1    Kobayashi, T.2
  • 18
    • 84959106025 scopus 로고    scopus 로고
    • Sentence-level control vectors for deep neural network speech synthesis
    • O. Watts, Z. Wu, and S. King, "Sentence-level control vectors for deep neural network speech synthesis," in Interspeech, 2015.
    • (2015) Interspeech
    • Watts, O.1    Wu, Z.2    King, S.3
  • 19
    • 84865733857 scopus 로고    scopus 로고
    • Analysis of i-vector length normalization in speaker recognition systems
    • D. Garcia-Romero and C. Y. Espy-Wilson, "Analysis of i-vector length normalization in speaker recognition systems." in Interspeech, 2011, pp. 249-252.
    • (2011) Interspeech , pp. 249-252
    • Garcia-Romero, D.1    Espy-Wilson, C.Y.2
  • 22
    • 0000764772 scopus 로고
    • The use of multiple measurements in taxonomic problems
    • R. A. Fisher, "The use of multiple measurements in taxonomic problems," Annals of eugenics, vol. 7, no. 2, pp. 179-188, 1936.
    • (1936) Annals of Eugenics , vol.7 , Issue.2 , pp. 179-188
    • Fisher, R.A.1
  • 23
    • 0001565436 scopus 로고
    • The utilization of multiple measurements in problems of biological classification
    • C. R. Rao, "The utilization of multiple measurements in problems of biological classification," Journal of the Royal Statistical Society. Series B (Methodological), vol. 10, no. 2, pp. 159-203, 1948.
    • (1948) Journal of the Royal Statistical Society. Series B (Methodological) , vol.10 , Issue.2 , pp. 159-203
    • Rao, C.R.1
  • 24
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. De Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds," Speech communication, vol. 27, no. 3, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    De Cheveigne, A.3
  • 26
    • 85111073935 scopus 로고    scopus 로고
    • Merlin: An open source neural network speech synthesis system
    • Sunnyvale, CA, USA, September
    • Z. Wu, O. Watts, and S. King, "Merlin: An open source neural network speech synthesis system," in 9th ISCA Speech Synthesis Workshop (SSW9), Sunnyvale, CA, USA, September 2016.
    • (2016) 9th ISCA Speech Synthesis Workshop (SSW9)
    • Wu, Z.1    Watts, O.2    King, S.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.