-
1
-
-
84930664922
-
Vocaine the vocoder and applicationsin speech synthesis
-
Y. Agiomyrgiannakis. Vocaine the vocoder and applicationsin speech synthesis. In Proc. ICASSP, 2015.
-
(2015)
Proc. ICASSP
-
-
Agiomyrgiannakis, Y.1
-
5
-
-
84856248602
-
The deterministic plus stochasticmodel of the residual signal and its applications
-
T. Drugman and T. Dutoit. The deterministic plus stochasticmodel of the residual signal and its applications. IEEETransactions on Audio, Speech and Language Processing, 20 (3): 968-981, 2012.
-
(2012)
IEEETransactions on Audio, Speech and Language Processing
, vol.20
, Issue.3
, pp. 968-981
-
-
Drugman, T.1
Dutoit, T.2
-
6
-
-
84897865577
-
Harmonicsplus noise model based vocoder for statistical parametricspeech synthesis
-
D. Erro, I. Sainz, E. Navas, and I. Hernaez. Harmonicsplus noise model based vocoder for statistical parametricspeech synthesis. IEEE Journal of Selected Topics in SignalProcessing, 8 (2): 184-194, 2014.
-
(2014)
IEEE Journal of Selected Topics in SignalProcessing
, vol.8
, Issue.2
, pp. 184-194
-
-
Erro, D.1
Sainz, I.2
Navas, E.3
Hernaez, I.4
-
7
-
-
85032751458
-
Sainath deep neural networks for acoustic modelingin speech recognition: The shared views of four researchgroups
-
G. Hinton, L. Deng, D. Yu, G. Dahl, A. Mohamed, N. Jaitly, A. Senior, V. Vanhoucke, P. Nguyen, and T. Sainath. Deep neural networks for acoustic modelingin speech recognition: The shared views of four researchgroups. Signal Processing Magazine, IEEE, 29 (6): 82-97, 2012.
-
(2012)
Signal Processing Magazine, IEEE
, vol.29
, Issue.6
, pp. 82-97
-
-
Hinton, G.1
Deng, L.2
Yu, D.3
Dahl, G.4
Mohamed, A.5
Jaitly, N.6
Senior, A.7
Vanhoucke, V.8
Nguyen, T.P.9
-
9
-
-
84946025802
-
Methods for applying dynamic sinusoidal modelsto statistical parametric speech synthesis
-
Q. Hu, Y. Stylianou, R. Maia, K. Richmond, and J. Yamagishi. Methods for applying dynamic sinusoidal modelsto statistical parametric speech synthesis. In Proc. ICASSP, 2015.
-
(2015)
Proc. ICASSP
-
-
Hu, Q.1
Stylianou, Y.2
Maia, R.3
Richmond, K.4
Yamagishi, J.5
-
10
-
-
84910049275
-
An investigation of the application of dynamicsinusoidal models to statistical parametric speechsynthesis
-
Q. Hu, Y. Stylianou, R. Maia, K. Richmond, J. Yamagishi, and J. Latorre. An investigation of the application of dynamicsinusoidal models to statistical parametric speechsynthesis. In Proc. Interspeech, 2014.
-
(2014)
Proc. Interspeech
-
-
Hu, Q.1
Stylianou, Y.2
Maia, R.3
Richmond, K.4
Yamagishi, J.5
Latorre, J.6
-
11
-
-
84905280900
-
A fixed dimension and perceptually baseddynamic sinusoidal model of speech
-
Q. Hu, Y. Stylianou, K. Richmond, R. Maia, J. Yamagishi, and J. Latorre. A fixed dimension and perceptually baseddynamic sinusoidal model of speech. In Proc. ICASSP, 2014.
-
(2014)
Proc ICASSP
-
-
Hu, Q.1
Stylianou, Y.2
Richmond, K.3
Maia, R.4
Yamagishi, J.5
Latorre, J.6
-
12
-
-
84976212707
-
Sinusoidal speechsynthesis using deep neural networks
-
Q. Hu, Z. Wu, K. Richmond, J. Yamagishi, Y. Stylianou, R. Maia, S. King, and M. Akamine. Sinusoidal speechsynthesis using deep neural networks. manuscript, 2015.
-
(2015)
Manuscript
-
-
Hu, Q.1
Wu, Z.2
Richmond, K.3
Yamagishi, J.4
Stylianou, Y.5
Maia, R.6
King, S.7
Akamine, M.8
-
13
-
-
0032673049
-
Restructuring speech representations using a pitchadaptivetime-frequency smoothing and an instantaneousfrequency-based F0 extraction: Possible role of a repetitivestructure in sounds
-
H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné. Restructuring speech representations using a pitchadaptivetime-frequency smoothing and an instantaneousfrequency-based F0 extraction: Possible role of a repetitivestructure in sounds. Speech communication, 27 (3): 187-207, 1999.
-
(1999)
Speech Communication
, vol.27
, Issue.3
, pp. 187-207
-
-
Kawahara, H.1
Masuda-Katsuse, I.2
De Cheveigné, A.3
-
14
-
-
78649238036
-
Synthesizer voicequality of new languages calibrated with mean mel cepstraldistortion
-
J. Kominek, T. Schultz, and A. Black. Synthesizer voicequality of new languages calibrated with mean mel cepstraldistortion. In Pro. SLTU, 2008.
-
(2008)
Pro. SLTU
-
-
Kominek, J.1
Schultz, T.2
Black, A.3
-
15
-
-
84890458846
-
Multitask learning in connectionistspeech recognition
-
Y. Lu, F. Lu, S. Sehgal, S. Gupta, J. Du, C. Tham, P. Green, and V. Wan. Multitask learning in connectionistspeech recognition. In Proceedings of the AustralianInternational Conference on Speech Science and Technology, 2004.
-
(2004)
Proceedings of the AustralianInternational Conference on Speech Science and Technology
-
-
Lu, Y.1
Lu, F.2
Sehgal, S.3
Gupta, S.4
Du, J.5
Tham, C.6
Green, P.7
Wan, V.8
-
16
-
-
85009167968
-
Multitask learning in connectionistrobust asr using recurrent neural networks
-
S. Parveen and P. Green. Multitask learning in connectionistrobust asr using recurrent neural networks. In INTERSPEECH, 2003.
-
(2003)
INTERSPEECH
-
-
Parveen, S.1
Green, P.2
-
17
-
-
84905251808
-
On the training aspectsof deep neural network for parametric tts synthesis. in Proc
-
Y. Qian, Y. Fan, W. Hu, and F. Soong. On the training aspectsof deep neural network for parametric tts synthesis. In Proc. ICASSP, 2014.
-
(2014)
ICASSP
-
-
Qian, Y.1
Fan, Y.2
Hu, W.3
Soong, F.4
-
18
-
-
77957744515
-
HMM-based speech synthesisutilizing glottal inverse filtering
-
T. Raitio, A. Suni, J. Yamagishi, H. Pulakka, J. Nurminen, M. Vainio, and P. Alku. HMM-based speech synthesisutilizing glottal inverse filtering. IEEE Transactions onAudio, Speech, and Language Processing, 19 (1): 153-165, 2011.
-
(2011)
IEEE Transactions OnAudio, Speech, and Language Processing
, vol.19
, Issue.1
, pp. 153-165
-
-
Raitio, T.1
Suni, A.2
Yamagishi, J.3
Pulakka, H.4
Nurminen, J.5
Vainio, M.6
Alku, P.7
-
20
-
-
79959858197
-
Sinusoidal model parameterizationfor HMM-based TTS system
-
S. Shechtman and A. Sorin. Sinusoidal model parameterizationfor HMM-based TTS system. In Proc. Interspeech, 2010.
-
(2010)
Proc. Interspeech
-
-
Shechtman, S.1
Sorin, A.2
-
22
-
-
38549096029
-
A speech parameter generationalgorithm considering global variance for HMM-basedspeech synthesis
-
T. Toda and K. Tokuda. A speech parameter generationalgorithm considering global variance for HMM-basedspeech synthesis. IEICE Transactions on Information and Systems, 90 (5): 816-824, 2007.
-
(2007)
IEICE Transactions on Information and Systems
, vol.90
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, A.2
-
23
-
-
0033708106
-
Speech parameter generation algorithms forHMM-based speech synthesis
-
K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura. Speech parameter generation algorithms forHMM-based speech synthesis. In Proc. ICASSP, 2000.
-
(2000)
Proc. ICASSP
-
-
Tokuda, K.1
Yoshimura, T.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
24
-
-
33947651202
-
Multitask learning for spoken language understand ing
-
G. Tur. Multitask learning for spoken language understand ing. In ICASSP, 2006.
-
(2006)
ICASSP
-
-
Tur, G.1
-
25
-
-
84946033275
-
Deep neuralnetworks employing multi-task learning and stacked bottleneckfeatures for speech synthesis
-
Z. Wu, C. Botinhao, O. Watts, and S. King. Deep neuralnetworks employing multi-task learning and stacked bottleneckfeatures for speech synthesis. In Proc. ICASSP, 2015.
-
(2015)
Proc. ICASSP
-
-
Wu, Z.1
Botinhao, C.2
Watts, O.3
King, S.4
-
26
-
-
85009139544
-
Simultaneous modeling of spectrum, pitchand duration in HMM-based speech synthesis
-
T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi, and T. Kitamura. Simultaneous modeling of spectrum, pitchand duration in HMM-based speech synthesis. In Proc. Eurospeech, 1999.
-
(1999)
Proc. Eurospeech
-
-
Yoshimura, T.1
Tokuda, K.2
Masuko, T.3
Kobayashi, T.4
Kitamura, T.5
-
27
-
-
84905262874
-
Deep mixture density networks foracoustic modeling in statistical parametric speech synthesis
-
H. Zen and A. Senior. Deep mixture density networks foracoustic modeling in statistical parametric speech synthesis. In Proc. ICASSP, 2014.
-
(2014)
Proc. ICASSP
-
-
Zen, H.1
Senior, A.2
-
28
-
-
84890490547
-
Statistical parametricspeech synthesis using deep neural networks
-
H. Zen, A. Senior, and M. Schuster. Statistical parametricspeech synthesis using deep neural networks. In Proc. ICASSP, 2013.
-
(2013)
Proc. ICASSP
-
-
Zen, H.1
Senior, A.2
Schuster, M.3
-
29
-
-
67651002140
-
Statistical parametricspeech synthesis
-
H. Zen, K. Tokuda, and A. Black. Statistical parametricspeech synthesis. Speech Communication, 51 (11): 1039-1064, 2009.
-
(2009)
Speech Communication
, vol.51
, Issue.11
, pp. 1039-1064
-
-
Zen, H.1
Tokuda, K.2
Black, A.3
|