-
2
-
-
84865747520
-
Intonation conversion from neutral to expressive speech
-
C. Veaux and X. Rodet, "Intonation conversion from neutral to expressive speech, " Proc. Interspeech, pp.2765-2768, 2011.
-
(2011)
Proc. Interspeech
, pp. 2765-2768
-
-
Veaux, C.1
Rodet, X.2
-
3
-
-
80052698826
-
Speaking-aid systems using gmm-based voice conversion for electrolaryngeal speech
-
K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Speaking-aid systems using gmm-based voice conversion for electrolaryngeal speech, " Speech Commun., vol.54, no.1, pp.134-146, 2012.
-
(2012)
Speech Commun.
, vol.54
, Issue.1
, pp. 134-146
-
-
Nakamura, K.1
Toda, T.2
Saruwatari, H.3
Shikano, K.4
-
4
-
-
0034855352
-
High-performance robust speech recognition using stereo training data
-
L. Deng, A. Acero, L. Jiang, J. Droppo, and X. Huang, "High-performance robust speech recognition using stereo training data, " Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.301-304, 2001.
-
(2001)
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
, pp. 301-304
-
-
Deng, L.1
Acero, A.2
Jiang, L.3
Droppo, J.4
Huang, X.5
-
5
-
-
70450192197
-
Speech generation from hand gestures based on space mapping
-
A. Kunikoshi, Y. Qiao, N. Minematsu, and K. Hirose, "Speech generation from hand gestures based on space mapping, " Proc. Interspeech, pp.308-311, 2009.
-
(2009)
Proc. Interspeech
, pp. 308-311
-
-
Kunikoshi, A.1
Qiao, Y.2
Minematsu, N.3
Hirose, K.4
-
6
-
-
0021412027
-
Vector quantization
-
R. Gray, "Vector quantization, " IEEE ASSP Mag., vol.1, no.2, pp.4-29, 1984.
-
(1984)
IEEE ASSP Mag.
, vol.1
, Issue.2
, pp. 4-29
-
-
Gray, R.1
-
7
-
-
0026880275
-
Voice transformation using psola technique
-
H. Valbret, E. Moulines, and J.P. Tubach, "Voice transformation using psola technique, " Speech Commun., vol.11, no.2, pp.175-187, 1992.
-
(1992)
Speech Commun.
, vol.11
, Issue.2
, pp. 175-187
-
-
Valbret, H.1
Moulines, E.2
Tubach, J.P.3
-
8
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Trans. Speech Audio Process., vol.6, no.2, pp.131-142, 1998.
-
(1998)
IEEE Trans. Speech Audio Process.
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappé, O.2
Moulines, E.3
-
9
-
-
57749193836
-
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
-
T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio Speech Language Process., vol.15, no.8, pp.2222-2235, 2007.
-
(2007)
IEEE Trans. Audio Speech Language Process.
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
10
-
-
77953712499
-
Voice conversion using partial least squares regression
-
E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression, " IEEE Trans. Audio Speech Language Process., vol.18, no.5, pp.912-921, 2010.
-
(2010)
IEEE Trans. Audio Speech Language Process.
, vol.18
, Issue.5
, pp. 912-921
-
-
Helander, E.1
Virtanen, T.2
Nurminen, J.3
Gabbouj, M.4
-
11
-
-
44949210554
-
Map-based adaptation for speech conversion using adaptation data selection and non-parallel training
-
C.H. Lee and C.H. Wu, "Map-based adaptation for speech conversion using adaptation data selection and non-parallel training, " Proc. Interspeech, pp.2254-2257, 2006.
-
(2006)
Proc. Interspeech
, pp. 2254-2257
-
-
Lee, C.H.1
Wu, C.H.2
-
12
-
-
34547512822
-
Eigenvoice conversion based on gaussian mixture model
-
T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on gaussian mixture model, " Proc. Interspeech, pp.2446-2449, 2006.
-
(2006)
Proc. Interspeech
, pp. 2446-2449
-
-
Toda, T.1
Ohtani, Y.2
Shikano, K.3
-
13
-
-
84865798483
-
One-to-many voice conversion based on tensor representation of speaker space
-
D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-to-many voice conversion based on tensor representation of speaker space, " Proc. Interspeech, pp.653-656, 2011.
-
(2011)
Proc. Interspeech
, pp. 653-656
-
-
Saito, D.1
Yamamoto, K.2
Minematsu, N.3
Hirose, K.4
-
14
-
-
79959834571
-
Probabilistic integration of joint density model and speaker model for voice conversion
-
D. Saito, S. Watanabe, A. Nakamura, and N. Minematsu, "Probabilistic integration of joint density model and speaker model for voice conversion, " Proc. Interspeech, pp.1728-1731, 2010.
-
(2010)
Proc. Interspeech
, pp. 1728-1731
-
-
Saito, D.1
Watanabe, S.2
Nakamura, A.3
Minematsu, N.4
-
15
-
-
35148852326
-
Voice conversion using canonical correlation analysis based on gaussian mixture model
-
IEEE
-
Z. Jian and Z. Yang, "Voice conversion using canonical correlation analysis based on gaussian mixture model, " Proc. International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, pp.210-215, IEEE, 2007.
-
(2007)
Proc. International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing
, pp. 210-215
-
-
Jian, Z.1
Yang, Z.2
-
16
-
-
84874248255
-
Exemplar-based voice conversion in noisy environment
-
R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voice conversion in noisy environment, " IEEE Spoken Language Technology Workshop (SLT), pp.313-317, 2012.
-
(2012)
IEEE Spoken Language Technology Workshop (SLT)
, pp. 313-317
-
-
Takashima, R.1
Takiguchi, T.2
Ariki, Y.3
-
17
-
-
70349197691
-
Voice conversion using artificial neural networks
-
S. Desai, E.V. Raghavendra, B. Yegnanarayana, A.W. Black, and K. Prahallad, "Voice conversion using artificial neural networks, " Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.3893-3896, 2009.
-
(2009)
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 3893-3896
-
-
Desai, S.1
Raghavendra, E.V.2
Yegnanarayana, B.3
Black, A.W.4
Prahallad, K.5
-
18
-
-
4544270860
-
Minimum segmentation error based discriminative training for speech synthesis application
-
Y.J. Wu, H. Kawai, J. Ni, and R.H. Wang, "Minimum segmentation error based discriminative training for speech synthesis application, " Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), pp.I-629, 2004.
-
(2004)
Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)
-
-
Wu, Y.J.1
Kawai, H.2
Ni, J.3
Wang, R.H.4
-
19
-
-
34547522070
-
Discriminative training for large-vocabulary speech recognition using minimum classification error
-
E. McDermott, T.J. Hazen, J. Le Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large-vocabulary speech recognition using minimum classification error, " IEEE Trans. Audio Speech Language Process., vol.15, no.1, pp.203-223, 2007.
-
(2007)
IEEE Trans. Audio Speech Language Process.
, vol.15
, Issue.1
, pp. 203-223
-
-
McDermott, E.1
Hazen, T.J.2
Le Roux, J.3
Nakamura, A.4
Katagiri, S.5
-
20
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for hmm-based speech synthesis
-
May
-
T. Toda and K. Tokuda, "A speech parameter generation algorithm considering global variance for hmm-based speech synthesis, " IEICE Trans. Inf. & Syst., vol.E90-D, no.5, pp.816-824, May 2007.
-
(2007)
IEICE Trans. Inf. & Syst.
, vol.E90-D
, Issue.5
, pp. 816-824
-
-
Toda, T.1
Tokuda, K.2
-
21
-
-
84901793334
-
Minimum kullback-leibler divergence parameter generation for hmm-based speech synthesis
-
Z.H. Ling and L.R. Dai, "Minimum kullback-leibler divergence parameter generation for hmm-based speech synthesis, " IEEE Trans. Audio Speech Language Process., vol.20, no.5, pp.1492-1502, 2012.
-
(2012)
IEEE Trans. Audio Speech Language Process.
, vol.20
, Issue.5
, pp. 1492-1502
-
-
Ling, Z.H.1
Dai, L.R.2
-
22
-
-
67650851754
-
Ustc system for blizzard challenge 2006 an improved hmm-based speech synthesis method
-
Z.H. Ling, Y.J. Wu, Y.P. Wang, L. Qin, and R.H. Wang, "Ustc system for blizzard challenge 2006 an improved hmm-based speech synthesis method, " Blizzard Challenge Workshop, 2006.
-
(2006)
Blizzard Challenge Workshop
-
-
Ling, Z.H.1
Wu, Y.J.2
Wang, Y.P.3
Qin, L.4
Wang, R.H.5
-
23
-
-
84901803470
-
Exemplar-based voice conversion using non-negative spectrogram deconvolution
-
Z. Wu, T. Virtanen, T. Kinnunen, E.S. Chng, and H. Li, "Exemplar-based voice conversion using non-negative spectrogram deconvolution, " Proc. 8th ISCA Speech Synthesis Workshop, 2013.
-
(2013)
Proc. 8th ISCA Speech Synthesis Workshop
-
-
Wu, Z.1
Virtanen, T.2
Kinnunen, T.3
Chng, E.S.4
Li, H.5
-
25
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
G.E. Hinton, S. Osindero, and Y.W. Teh, "A fast learning algorithm for deep belief nets, " Neural Computation, vol.18, no.7, pp.1527-1554, 2006.
-
(2006)
Neural Computation
, vol.18
, Issue.7
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.W.3
-
26
-
-
84901237776
-
Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis
-
Z.H. Ling, L. Deng, and D. Yu, "Modeling spectral envelopes using restricted boltzmann machines and deep belief networks for statistical parametric speech synthesis, " IEEE Trans. Audio Speech Language Process., no.10, pp.2129-2139, 2013.
-
(2013)
IEEE Trans. Audio Speech Language Process.
, Issue.10
, pp. 2129-2139
-
-
Ling, Z.H.1
Deng, L.2
Yu, D.3
-
27
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A.r. Mohamed, G.E. Dahl, and G. Hinton, "Acoustic modeling using deep belief networks, " IEEE Trans. Audio Speech Language Process., vol.20, no.1, pp.14-22, 2012.
-
(2012)
IEEE Trans. Audio Speech Language Process.
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.R.1
Dahl, G.E.2
Hinton, G.3
-
29
-
-
84991233704
-
A deep learning approach to machine transliteration
-
Association for Computational Linguistics
-
T. Deselaers, S. Hasan, O. Bender, and H. Ney, "A deep learning approach to machine transliteration, " Proc. Fourth Workshop on Statistical Machine Translation, pp.233-241, Association for Computational Linguistics, 2009.
-
(2009)
Proc. Fourth Workshop on Statistical Machine Translation
, pp. 233-241
-
-
Deselaers, T.1
Hasan, S.2
Bender, O.3
Ney, H.4
-
30
-
-
84906280857
-
Voice conversion in high-order eigen space using deep belief nets
-
T. Nakashika, R. Takashima, T. Takiguchi, and Y. Ariki, "Voice conversion in high-order eigen space using deep belief nets, " Proc. Interspeech, pp.369-372, 2013.
-
(2013)
Proc. Interspeech
, pp. 369-372
-
-
Nakashika, T.1
Takashima, R.2
Takiguchi, T.3
Ariki, Y.4
-
32
-
-
84906225084
-
Joint spectral distribution modeling using restricted boltzmann machines for voice conversion
-
C. Ling-Hui, L. Zhen-Hua, S. Yan, and D. Li-Rong, "Joint spectral distribution modeling using restricted boltzmann machines for voice conversion, " Proc. Interspeech, pp.3052-3056, 2013.
-
(2013)
Proc. Interspeech
, pp. 3052-3056
-
-
Ling-Hui, C.1
Zhen-Hua, L.2
Yan, S.3
Li-Rong, D.4
-
33
-
-
0025475528
-
Atr japanese speech database as a tool of speech recognition and synthesis
-
A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K. Shikano, "Atr japanese speech database as a tool of speech recognition and synthesis, " Speech Commun., no.4, pp.357-363, 1990.
-
(1990)
Speech Commun.
, Issue.4
, pp. 357-363
-
-
Kurematsu, A.1
Takeda, K.2
Sagisaka, Y.3
Katagiri, S.4
Kuwabara, H.5
Shikano, K.6
-
34
-
-
51449108867
-
Tandem-straight: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, f0, and aperiodicity estimation
-
H. Kawahara, M. Morise, T. Takahashi, R. Nisimura, T. Irino, and H. Banno, "Tandem-straight: A temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, f0, and aperiodicity estimation, " Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp.3933-3936, 2008.
-
(2008)
Proc. IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
, pp. 3933-3936
-
-
Kawahara, H.1
Morise, M.2
Takahashi, T.3
Nisimura, R.4
Irino, T.5
Banno, H.6
-
35
-
-
85039958911
-
Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model
-
B. Milner and X. Shao, "Speech reconstruction from mel-frequency cepstral coefficients using a source-filter model, " Proc. Interspeech, 2002.
-
(2002)
Proc. Interspeech
-
-
Milner, B.1
Shao, X.2
-
37
-
-
78650474133
-
A practical guide to training restricted boltzmann machines
-
University of Toronto
-
G. Hinton, "A practical guide to training restricted boltzmann machines, " Tech. Rep. Department of Computer Science, University of Toronto, 2010.
-
(2010)
Tech. Rep. Department of Computer Science
-
-
Hinton, G.1
-
39
-
-
84865581203
-
An analysis of gaussian-binary restricted boltzmann machines for natural images
-
N. Wang, J. Melchior, and L. Wiskott, "An analysis of gaussian-binary restricted boltzmann machines for natural images, " Proc. European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), pp.287-292, 2012.
-
(2012)
Proc. European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN)
, pp. 287-292
-
-
Wang, N.1
Melchior, J.2
Wiskott, L.3