-
1
-
-
0031623661
-
Spectral voice conversion for text-tospeech synthesis
-
A. Kain and M.W. Macon, "Spectral voice conversion for text-tospeech synthesis, " Proc. ICASSP, vol. 1, pp. 285-288, 1998.
-
(1998)
Proc. ICASSP
, vol.1
, pp. 285-288
-
-
Kain, A.1
Macon, M.W.2
-
2
-
-
84865747520
-
Intonation conversion from neutral to expressive speech
-
C. Veaux and X. Robet, "Intonation conversion from neutral to expressive speech, " in Proc. INTERSPEECH, pp. 2765-2768, 2011.
-
(2011)
Proc. INTERSPEECH
, pp. 2765-2768
-
-
Veaux, C.1
Robet, X.2
-
3
-
-
80052698826
-
Speakingaid systems using GMM-based voice conversion for electrolaryngeal speech
-
K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Speakingaid systems using GMM- based voice conversion for electrolaryngeal speech, " Speech Communication, Vol. 54, No. 1, pp. 134- 146, 2012.
-
(2012)
Speech Communication
, vol.54
, Issue.1
, pp. 134-146
-
-
Nakamura, K.1
Toda, T.2
Saruwatari, H.3
Shikano, K.4
-
4
-
-
0034855352
-
Highperformance robust speech recognition using stereo training data
-
L. Deng, A. Acero, L. Jiang, J. Droppo, and X. Huang, "Highperformance robust speech recognition using stereo training data, " Proc. ICASSP, pp. 301-304, 2001.
-
(2001)
Proc. ICASSP
, pp. 301-304
-
-
Deng, L.1
Acero, A.2
Jiang, L.3
Droppo, J.4
Huang, X.5
-
5
-
-
70450192197
-
Speech generation from hand gestures based on space mapping
-
A. Kunikoshi, Y. Qiao, N. Minematsu, and K. Hirose, "Speech generation from hand gestures based on space mapping, " Proc. INTERSPEECH, pp. 308-311, 2009.
-
(2009)
Proc. INTERSPEECH
, pp. 308-311
-
-
Kunikoshi, A.1
Qiao, Y.2
Minematsu, N.3
Hirose, K.4
-
6
-
-
0023739214
-
Vice conversion through vector quantization
-
M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Vice conversion through vector quantization, " in Proc. ICASSP, pp. 655- 658, 1988.
-
(1988)
Proc. ICASSP
, pp. 655-658
-
-
Abe, M.1
Nakamura, S.2
Shikano, K.3
Kuwabara, H.4
-
7
-
-
0026880275
-
Voice transformation using PSOLA technique
-
H. Valbret, E. Moulines and J. P. Tubach, "Voice transformation using PSOLA technique, " Speech Communication, Vol. 11, No. 2-3, pp. 175-187, 1992.
-
(1992)
Speech Communication
, vol.11
, Issue.2-3
, pp. 175-187
-
-
Valbret, H.1
Moulines, E.2
Tubach, J.P.3
-
8
-
-
0032026483
-
Continuous probabilistic transform for voice conversion
-
Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Trans. Speech and Audio Processing, Vol. 6, No. 2, pp. 131-142, 1998.
-
(1998)
IEEE Trans. Speech and Audio Processing
, vol.6
, Issue.2
, pp. 131-142
-
-
Stylianou, Y.1
Cappe, O.2
Moulines, E.3
-
9
-
-
57749193836
-
Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
-
T. Toda, A. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, Lang. Process., Vol. 15, No. 8, pp. 2222-2235, 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process
, vol.15
, Issue.8
, pp. 2222-2235
-
-
Toda, T.1
Black, A.2
Tokuda, K.3
-
10
-
-
77953712499
-
Voice conversion using partial least squares regression
-
E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression, " IEEE Trans. Audo, Speech, Lang. Process., Vol. 18, No. 5, pp. 912-921, 2010.
-
(2010)
IEEE Trans. Audo, Speech, Lang. Process
, vol.18
, Issue.5
, pp. 912-921
-
-
Helander, E.1
Virtanen, T.2
Nurminen, J.3
Gabbouj, M.4
-
11
-
-
44949210554
-
Map-based adaptation for speech conversion using adaptation data selection and non-parallel training
-
C. H. Lee and C. H. Wu, "Map-based adaptation for speech conversion using adaptation data selection and non-parallel training, " in Proc. INTERSPEECH, pp. 2254-2257, 2006.
-
(2006)
Proc. INTERSPEECH
, pp. 2254-2257
-
-
Lee, C.H.1
Wu, C.H.2
-
12
-
-
84906237458
-
Voice conversion based on probabilistic integration of joint density model and speaker model
-
D. Saito, S. Watanabe, A. Nakamura, N. Minematsu, "Voice conversion based on probabilistic integration of joint density model and speaker model, " in Proc. Acoustic Society of Japan, pp. 335- 338, 2010.
-
(2010)
Proc. Acoustic Society of Japan
, pp. 335-338
-
-
Saito, D.1
Watanabe, S.2
Nakamura, A.3
Minematsu, N.4
-
13
-
-
34547512822
-
Eigenvoice conversion based on Gaussian mixture model
-
T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on Gaussian mixture model, " in Proc. INTERSPEECH, pp. 2446 -2449, 2006.
-
(2006)
Proc. INTERSPEECH
, pp. 2446-2449
-
-
Toda, T.1
Ohtani, Y.2
Shikano, K.3
-
14
-
-
84865798483
-
One-tomany voice conversion based on tensor representation of speaker space
-
D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-tomany voice conversion based on tensor representation of speaker space, " in Proc. INTERSPEECH, pp. 653-656, 2011.
-
(2011)
Proc. INTERSPEECH
, pp. 653-656
-
-
Saito, D.1
Yamamoto, K.2
Minematsu, N.3
Hirose, K.4
-
15
-
-
35148852326
-
Voice conversion using canonical correlation analysis based on gaussian mixture model
-
Z. H. Jian and Z. Yang, "Voice conversion using canonical correlation analysis based on gaussian mixture model, " SNPD, Vol. 1, pp. 210-215, 2007.
-
(2007)
SNPD
, vol.1
, pp. 210-215
-
-
Jian, Z.H.1
Yang, Z.2
-
16
-
-
84874248255
-
Exemplar-based voice conversion in noisy environment
-
R. Takashima, T. Takiguchi, Y. Ariki, "Exemplar-based voice conversion in noisy environment, " SLT, pp.313-317, 2012.
-
(2012)
SLT
, pp. 313-317
-
-
Takashima, R.1
Takiguchi, T.2
Ariki, Y.3
-
17
-
-
70349197691
-
Voice conversion using artificial neural networks
-
S. Desai, E. V. Raghavendra, B. Yegnanarayana, A.W. Black, and K. Prahallad, "Voice conversion using artificial neural networks, " in Proc. ICASSP, pp. 3893-3896, 2009.
-
(2009)
Proc. ICASSP
, pp. 3893-3896
-
-
Desai, S.1
Raghavendra, E.V.2
Yegnanarayana, B.3
Black, A.W.4
Prahallad, K.5
-
18
-
-
4544270860
-
Minimum segmentation error based discriminative training for speech synthesis application
-
Y. J. Wu, H. Kawai, J. Ni, and R. H. Wang, "Minimum segmentation error based discriminative training for speech synthesis application, " in Proc. ICASSP 04, vol. 1, pp. 629-32, 2004.
-
(2004)
Proc. ICASSP 04
, vol.1
, pp. 629-632
-
-
Wu, Y.J.1
Kawai, H.2
Ni, J.3
Wang, R.H.4
-
19
-
-
34547522070
-
Discriminative training for large vocabulary speech recognition using minimum classification error
-
E. McDermott, T. Hazen, J. L. Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition using minimum classification error, " IEEE Transactions on Speech and Audio Processing, vol. 15, no. 1, pp. 203-223, 2007.
-
(2007)
IEEE Transactions on Speech and Audio Processing
, vol.15
, Issue.1
, pp. 203-223
-
-
McDermott, E.1
Hazen, T.2
Roux, J.L.3
Nakamura, A.4
Katagiri, S.5
-
20
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
G. E. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets, " Neural Computation, vol. 18, pp. 1527- 1554, 2006.
-
(2006)
Neural Computation
, vol.18
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.3
-
21
-
-
33745805403
-
A fast learning algorithm for deep belief nets
-
G. E. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets, " Neural Computation, vol. 18, pp. 1527- 1554, 2006.
-
(2006)
Neural Computation
, vol.18
, pp. 1527-1554
-
-
Hinton, G.E.1
Osindero, S.2
Teh, Y.3
-
23
-
-
84991233704
-
A deep learning approach to machine transliteration
-
T Deselaers, S. Hasan, O. Bender, and H. Ney, "A deep learning approach to machine transliteration, " in Proc. EACLWorkshop on Statistical Machine Translation, 2009, pp. 233-241.
-
(2009)
Proc. EACLWorkshop on Statistical Machine Translation
, pp. 233-241
-
-
Deselaers, T.1
Hasan, S.2
Bender, O.3
Ney, H.4
-
24
-
-
84055211743
-
Acoustic modeling using deep belief networks
-
A. Mohamed, G. Dahl, and G. Hinton, "Acoustic Modeling using Deep Belief Networks, " IEEE Trans. on Audio, Speech, and Language Procesing, vol. 20, no. 1, pp. 14-22, 2012.
-
(2012)
IEEE Trans. on Audio, Speech, and Language Procesing
, vol.20
, Issue.1
, pp. 14-22
-
-
Mohamed, A.1
Dahl, G.2
Hinton, G.3
-
25
-
-
0025475528
-
ATR Japanese speech database as a tool of speech recognition and synthesis
-
A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K.Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis, " Speech Communication, vol. 9, pp. 357-363, 1990.
-
(1990)
Speech Communication
, vol.9
, pp. 357-363
-
-
Kurematsu, A.1
Takeda, K.2
Sagisaka, Y.3
Katagiri, S.4
Kuwabara, H.5
Shikano, K.6
|