SCOPUS 정보 검색 플랫폼

ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings

Volumn 2015-August, Issue , 2015, Pages 4475-4479

Multi-speaker modeling and speaker adaptation for DNN-based TTS synthesis

(4) Fan, Yuchen a Qian, Yao a Soong, Frank K a He, Lei a

a MICROSOFT RESEARCH ASIA (China)

Author keywords

deep neural networks; multi task learning; statistical parametric speech synthesis; transfer learning

Indexed keywords

AUDIO SIGNAL PROCESSING; LINGUISTICS; METADATA; SPEECH COMMUNICATION; SPEECH SYNTHESIS;

ACOUSTIC PARAMETERS; COMPLEX TRANSFORMATIONS; LAYERED ARCHITECTURE; LIMITED TRAINING DATA; LINGUISTIC FEATURES; MULTITASK LEARNING; STATISTICAL PARAMETRIC SPEECH SYNTHESIS; TRANSFER LEARNING;

DEEP NEURAL NETWORKS;

EID: 84946051934 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICASSP.2015.7178817 Document Type: Conference Paper

Times cited : (132)

References (10)

1
- 84890490547
- Statistical parametric speech synthesis using deep neural networks
- Heiga Zen, Andrew Senior, and Mike Schuster, Statistical parametric speech synthesis using deep neural networks, in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 7962-7966
- (2013) Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE , pp. 7962-7966
- Zen, H.¹ Senior, A.² Schuster, M.³

2
- 84905251808
- On the training aspects of deep neural network (DNN) for parametric tts synthesis
- Yao Qian, Yuchen Fan, Wenping Hu, and Frank K Soong, On the training aspects of deep neural network (DNN) for parametric tts synthesis, in Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference on. IEEE, 2014, pp. 3829-3833
- (2014) Acoustics, Speech and Signal Processing (ICASSP), 2014 IEEE International Conference On. IEEE , pp. 3829-3833
- Qian, Y.¹ Fan, Y.² Hu, W.³ Soong, F.K.⁴

3
- 84910047819
- TTS synthesis with bidirectional LSTM based recurrent neural networks
- Yuchen Fan, Yao Qian, Feng-Long Xie, and Frank K Soong, TTS synthesis with bidirectional LSTM based recurrent neural networks, in INTERSPEECH, 2014
- (2014) INTERSPEECH
- Fan, Y.¹ Qian, Y.² Xie, F.-L.³ Soong, F.K.⁴

4
- 67651002140
- Statistical parametric speech synthesis
- Heiga Zen, Keiichi Tokuda, and Alan W Black, Statistical parametric speech synthesis, Speech Communication, vol. 51, no. 11, pp. 1039-1064, 2009
- (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
- Zen, H.¹ Tokuda, K.² Black, A.W.³

5
- 85008006694
- Robust speaker-adaptive HMM-based text-to-speech synthesis
- Junichi Yamagishi, Takashi Nose, Heiga Zen, Zhen-Hua Ling, Tomoki Toda, Keiichi Tokuda, Simon King, and Steve Renals, Robust speaker-adaptive HMM-based text-to-speech synthesis, Audio, Speech, and Language Processing, IEEE Transactions on, vol. 17, no. 6, pp. 1208-1230, 2009
- (2009) Audio, Speech, and Language Processing, IEEE Transactions on , vol.17 , Issue.6 , pp. 1208-1230
- Yamagishi, J.¹ Nose, T.² Zen, H.³ Ling, Z.-H.⁴ Toda, T.⁵ Tokuda, K.⁶ King, S.⁷ Renals, S.⁸

6
- 1942470793
- Springer
- Rich Caruana, Multitask learning, Springer, 1998
- (1998) Multitask Learning
- Caruana, R.¹

7
- 77956031473
- A survey on transfer learning
- Sinno Jialin Pan and Qiang Yang, A survey on transfer learning, Knowledge and Data Engineering, IEEE Transactions on, vol. 22, no. 10, pp. 1345-1359, 2010
- (2010) Knowledge and Data Engineering, IEEE Transactions on , vol.22 , Issue.10 , pp. 1345-1359
- Jialin Pan, S.¹ Yang, Q.²

8
- 84904548965
- Deep learning of representations for unsupervised and transfer learning
- Yoshua Bengio, Deep learning of representations for unsupervised and transfer learning., in ICML Unsupervised and Transfer Learning, 2012, pp. 17-36
- (2012) ICML Unsupervised and Transfer Learning , pp. 17-36
- Bengio, Y.¹

9
- 84890527497
- Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers
- Jui-Ting Huang, Jinyu Li, Dong Yu, Li Deng, and Yifan Gong, Cross-language knowledge transfer using multilingual deep neural network with shared hidden layers, in Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference on. IEEE, 2013, pp. 7304-7308
- (2013) Acoustics, Speech and Signal Processing (ICASSP), 2013 IEEE International Conference On. IEEE , pp. 7304-7308
- Huang, J.-T.¹ Li, J.² Yu, D.³ Deng, L.⁴ Gong, Y.⁵

10
- 84858976070
- Feature engineering in context-dependent deep neural networks for conversational speech transcription
- Frank Seide, Gang Li, Xie Chen, and Dong Yu, Feature engineering in context-dependent deep neural networks for conversational speech transcription, in Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop on. IEEE, 2011, pp. 24-29
- (2011) Automatic Speech Recognition and Understanding (ASRU), 2011 IEEE Workshop On. IEEE , pp. 24-29
- Seide, F.¹ Li, G.² Chen, X.³ Yu, D.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.