-
3
-
-
80052698826
-
Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech
-
K Nakamura, T Toda, H Saruwatari, K Shikano, Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech. Speech Commun. 54(1), 134–146 (2012).
-
(2012)
Speech Commun
, vol.54
, Issue.1
, pp. 134-146
-
-
Nakamura, K.1
Toda, T.2
Saruwatari, H.3
Shikano, K.4
-
4
-
-
0034855352
-
A Acero, L Jiang
-
L Deng, A Acero, L Jiang, J Droppo, X Huang, in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). High-performance robust speech recognition using stereo training data, (2001), pp. 301–304.
-
(2001)
J Droppo, X Huang, in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). High-performance robust speech recognition using stereo training data
, pp. 301-304
-
-
-
5
-
-
70450192197
-
Minematsu, K Hirose, in Proceedings of Interspeech
-
A Kunikoshi, Y Qiao, N Minematsu, K Hirose, in Proceedings of Interspeech. Speech generation from hand gestures based on space mapping, (2009), pp. 308–311.
-
(2009)
Speech generation from hand gestures based on space mapping
, pp. 308-311
-
-
A Kunikoshi, Y.1
Qiao, N.2
-
6
-
-
0021412027
-
Vector quantization
-
R Gray, Vector quantization. ASSP Mag. IEEE. 1(2), 4–29 (1984).
-
(1984)
ASSP Mag. IEEE
, vol.1
, Issue.2
, pp. 4-29
-
-
Gray, R.1
-
7
-
-
0026880275
-
Voice transformation using PSOLA technique
-
H Valbret, E Moulines, J-P Tubach, Voice transformation using PSOLA technique. Speech Commun. 11(2), 175–187 (1992).
-
(1992)
Speech Commun
, vol.11
, Issue.2
, pp. 175-187
-
-
Valbret, H.1
Moulines, E.2
Tubach, J.-P.3
-
8
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
Y Stylianou, Cappé O, E Moulines, Continuous probabilistic transform for voice conversion. IEEE Trans. Speech Audio Process. 6(2), 131–142 (1998).
-
(1998)
IEEE Trans. Speech Audio Process
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappé, O.2
Moulines, E.3
-
9
-
-
57749193836
-
Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
-
T Toda, AW Black, K Tokuda, Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory. IEEE Trans. Audio Speech Lang. Process. 15(8), 2222–2235 (2007).
-
(2007)
IEEE Trans. Audio Speech Lang. Process
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.W.2
Tokuda, K.3
-
10
-
-
77953712499
-
Voice conversion using partial least squares regression
-
E Helander, T Virtanen, J Nurminen, Gabbouj, Voice conversion using partial least squares regression. IEEE Trans. Audio Speech Lang. Process. 18(5), 912–921 (2010).
-
(2010)
IEEE Trans. Audio Speech Lang. Process
, vol.18
, Issue.5
, pp. 912-921
-
-
Helander, E.1
Virtanen, T.2
Nurminen, J.3
Gabbouj4
-
13
-
-
84865798483
-
K Hirose, in Proceedings of Interspeech
-
D Saito, Yamamoto K, N Minematsu, K Hirose, in Proceedings of Interspeech. One-to-many voice conversion based on tensor representation of speaker space, (2011), pp. 653–656.
-
(2011)
One-to-many voice conversion based on tensor representation of speaker space
, pp. 653-656
-
-
Saito, D.1
Yamamoto, K.2
Minematsu, N.3
-
14
-
-
79959834571
-
Nakamura
-
D Saito, S Watanabe, A Nakamura, N Minematsu, in Proceedings of Interspeech. Probabilistic integration of joint density model and speaker model for voice conversion, (2010), pp. 1728–1731.
-
(2010)
N Minematsu, in Proceedings of Interspeech. Probabilistic integration of joint density model and speaker model for voice conversion
, pp. 1728-1731
-
-
D Saito, S.1
Watanabe, A.2
-
15
-
-
35148852326
-
Z Yang, in Proceedings of International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing
-
Z Jian, Z Yang, in Proceedings of International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing. Voice conversion using canonical correlation analysis based on Gaussian mixture model, (2007), pp. 210–215.
-
(2007)
Voice conversion using canonical correlation analysis based on Gaussian mixture model
, pp. 210-215
-
-
-
17
-
-
0029254176
-
Transformation of formants for voice conversion using artificial neural networks
-
M Narendranath, HA Murthy, S Rajendran, B Yegnanarayana, Transformation of formants for voice conversion using artificial neural networks. Speech Commun. 16(2), 207–216 (1995).
-
(1995)
Speech Commun
, vol.16
, Issue.2
, pp. 207-216
-
-
Narendranath, M.1
Murthy, H.A.2
Rajendran, S.3
Yegnanarayana, B.4
-
18
-
-
70349197691
-
EV Raghavendra, B Yegnanarayana, AW Black, K Prahallad, in Proceedings of
-
S Desai, EV Raghavendra, B Yegnanarayana, AW Black, K Prahallad, in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Voice conversion using artificial neural networks, (2009), pp. 3893–3896.
-
(2009)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Voice conversion using artificial neural networks
, pp. 3893-3896
-
-
-
19
-
-
4544270860
-
-
Y-J Wu, H Kawai, J Ni, R-H Wang, in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Minimum segmentation error based discriminative training for speech synthesis application, (2004), p. 629
-
Y-J Wu, H Kawai, J Ni, R-H Wang, in Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP). Minimum segmentation error based discriminative training for speech synthesis application, (2004), p. 629.
-
(2004)
-
-
-
20
-
-
34547522070
-
Discriminative training for large-vocabulary speech recognition using minimum classification error
-
E McDermott, TJ Hazen, J Le Roux, A Nakamura, S Katagiri, Discriminative training for large-vocabulary speech recognition using minimum classification error. IEEE Trans. Audio Speech Lang. Process. 15(1), 203–223 (2007).
-
(2007)
IEEE Trans. Audio Speech Lang. Process
, vol.15
, Issue.1
, pp. 203-223
-
-
McDermott, E.1
Hazen, T.J.2
Le Roux, J.3
Nakamura, A.4
Katagiri, S.5
-
21
-
-
38549096029
-
A speech parameter generation algorithm considering global variance for HMM-based speech synthesis
-
T Tomoki, K Tokuda, A speech parameter generation algorithm considering global variance for HMM-based speech synthesis. IEICE Trans. Inform. Syst. 90(5), 816–824 (2007).
-
(2007)
IEICE Trans. Inform. Syst
, vol.90
, Issue.5
, pp. 816-824
-
-
Tomoki, T.1
Tokuda, K.2
-
22
-
-
84901793334
-
Minimum Kullback-Leibler divergence parameter generation for HMM-based speech synthesis
-
Z-H Ling, L-R Dai, Minimum Kullback-Leibler divergence parameter generation for HMM-based speech synthesis. IEEE Trans. Audio Speech Lang. Process. 20(5), 1492–1502 (2012).
-
(2012)
IEEE Trans. Audio Speech Lang. Process
, vol.20
, Issue.5
, pp. 1492-1502
-
-
Ling, Z.-H.1
Dai, L.-R.2
-
23
-
-
84924311712
-
Qin, R-H Wang, in Blizzard Challenge Workshop
-
Z-H Ling, Y-J Wu, Y-P Wang, L Qin, R-H Wang, in Blizzard Challenge Workshop. USTC system for blizzard challenge 2006 an improved HMM-based speech synthesis method, (2006).
-
(2006)
USTC system for blizzard challenge 2006 an improved HMM-based speech synthesis method
-
-
Z-H Ling, Y.-J.W.1
Y-P Wang, L.2
-
24
-
-
84924336471
-
Chng, H Li, in Proceedings of the 8th ISCA Speech Synthesis Workshop
-
Z Wu, T Virtanen, T Kinnunen, ES Chng, H Li, in Proceedings of the 8th ISCA Speech Synthesis Workshop. Exemplar-based voice conversion using non-negative spectrogram deconvolution, (2013), pp. 221–226.
-
(2013)
Exemplar-based voice conversion using non-negative spectrogram deconvolution
, pp. 221-226
-
-
Z Wu, T.1
Virtanen, T.2
Kinnunen, E.S.3
-
25
-
-
84906280857
-
Takiguchi, Y Ariki, in Proceedings of Interspeech
-
T Nakashika, R Takashima, T Takiguchi, Y Ariki, in Proceedings of Interspeech. Voice conversion in high-order eigen space using deep belief nets, (2013), pp. 369–372.
-
(2013)
Voice conversion in high-order eigen space using deep belief nets
, pp. 369-372
-
-
T Nakashika, R.1
Takashima, T.2
-
26
-
-
0000329993
-
Information processing in dynamical systems: foundations of harmony theory
-
P Smolensky, Information processing in dynamical systems: foundations of harmony theory. Parallel Distributed Process. 1, 194–281 (1986).
-
(1986)
Parallel Distributed Process
, vol.1
, pp. 194-281
-
-
Smolensky, P.1
-
27
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
GE Hinton, S Osindero, Y-W Teh, A fast learning algorithm for deep belief nets. Neural Comput. 18(7), 1527–1554 (2006).
-
(2006)
Neural Comput
, vol.18
, Issue.7
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.-W.3
-
28
-
-
84901237776
-
Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis
-
Z-H Ling, L Deng, D Yu, Modeling spectral envelopes using restricted Boltzmann machines and deep belief networks for statistical parametric speech synthesis. IEEE Trans. Audio Speech Lang. Process. 21(10), 2129–2139 (2013).
-
(2013)
IEEE Trans. Audio Speech Lang. Process
, vol.21
, Issue.10
, pp. 2129-2139
-
-
Ling, Z.-H.1
Deng, L.2
Yu, D.3
-
29
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A-R Mohamed, GE Dahl, G Hinton, Acoustic modeling using deep belief networks. Audio Speech Lang. Process. IEEE Trans. 20(1), 14–22 (2012).
-
(2012)
Audio Speech Lang. Process. IEEE Trans
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.-R.1
Dahl, G.E.2
Hinton, G.3
-
30
-
-
78149306047
-
3-D object recognition with deep belief nets
-
V Nair, G Hinton, 3-D object recognition with deep belief nets. Adv. Neural Inform. Process. Syst. 22, 1339–1347 (2009).
-
(2009)
Adv. Neural Inform. Process. Syst
, vol.22
, pp. 1339-1347
-
-
Nair, V.1
Hinton, G.2
-
31
-
-
84991233704
-
Bender, H Ney, in Proceedings of the Fourth Workshop on Statistical Machine Translation
-
T Deselaers, S Hasan, O Bender, H Ney, in Proceedings of the Fourth Workshop on Statistical Machine Translation. A deep learning approach to machine transliteration, (2009), pp. 233–241.
-
(2009)
A deep learning approach to machine transliteration
, pp. 233-241
-
-
T Deselaers, S.1
Hasan, O.2
-
32
-
-
84924311711
-
H Li, in Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP)
-
Z Wu, ES Chng, H Li, in Proceedings of the IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP). Conditional restricted Boltzmann machine for voice conversion, (2013).
-
(2013)
Conditional restricted Boltzmann machine for voice conversion
-
-
Wu, Z.1
Chng, E.S.2
-
33
-
-
84924334174
-
Yan, D Li-Rong, in Proceedings of Interspeech
-
C Ling-Hui, L Zhen-Hua, S Yan, D Li-Rong, in Proceedings of Interspeech. Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion, (2013), pp. 3052–3056.
-
(2013)
Joint spectral distribution modeling using restricted Boltzmann machines for voice conversion
, pp. 3052-3056
-
-
C Ling-Hui, L.1
Zhen-Hua, S.2
-
34
-
-
0001578518
-
A learning algorithm for Boltzmann machines
-
DH Ackley, GE Hinton, TJ Sejnowski, A learning algorithm for Boltzmann machines. Cogn. Sci. 9(1), 147–169 (1985).
-
(1985)
Cogn. Sci
, vol.9
, Issue.1
, pp. 147-169
-
-
Ackley, D.H.1
Hinton, G.E.2
Sejnowski, T.J.3
-
35
-
-
0345368881
-
Unsupervised learning of distributions of binary vectors using two layer networks
-
Y Freund, D Haussler, Unsupervised learning of distributions of binary vectors using two layer networks. Adv, Neural Inform. Process. Syst. 4, 912–919 (1991).
-
(1991)
Adv, Neural Inform. Process. Syst
, vol.4
, pp. 912-919
-
-
Freund, Y.1
Haussler, D.2
-
36
-
-
33746600649
-
Reducing the dimensionality of data with neural networks
-
GE Hinton, RR Salakhutdinov, Reducing the dimensionality of data with neural networks. Science. 313(5786), 504–507 (2006).
-
(2006)
Science
, vol.313
, Issue.5786
, pp. 504-507
-
-
Hinton, G.E.1
Salakhutdinov, R.R.2
-
42
-
-
0025475528
-
ATR japanese speech database as a tool of speech recognition and synthesis
-
A Kurematsu, K Takeda, Y Sagisaka, S Katagiri, H Kuwabara, K Shikano, ATR japanese speech database as a tool of speech recognition and synthesis. Speech Communication. 9(4), 357–363 (1990).
-
(1990)
Speech Communication
, vol.9
, Issue.4
, pp. 357-363
-
-
Kurematsu, A.1
Takeda, K.2
Sagisaka, Y.3
Katagiri, S.4
Kuwabara, H.5
Shikano, K.6
-
43
-
-
51449108867
-
M Morise, T Takahashi, R Nisimura, T Irino, H Banno, in Proceedings of
-
H Kawahara, M Morise, T Takahashi, R Nisimura, T Irino, H Banno, in Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Tandem-straight: a temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, f0, and aperiodicity estimation, (2008), pp. 3933–3936.
-
(2008)
IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP). Tandem-straight: a temporally stable power spectral representation for periodic signals and applications to interference-free spectrum, f0, and aperiodicity estimation
, pp. 3933-3936
-
-
|