SCOPUS 정보 검색 플랫폼

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Volumn , Issue , 2013, Pages 369-372

Voice conversion in high-order eigen space using deep belief nets

(4) Nakashika, Toru a Takashima, Ryoichi a Takiguchi, Tetsuya a Ariki, Yasuo a

a KOBE UNIVERSITY (Japan)

Author keywords

Deep belief nets; Deep learning; Voice conversion

Indexed keywords

ABSTRACTING; SPEECH COMMUNICATION;

DEEP ARCHITECTURES; DEEP BELIEF NETS; DEEP LEARNING; MODEL-BASED METHOD; NEURAL NETWORKS (NNS); OBJECTIVE CRITERIA; VOICE CONVERSION; VOICE CONVERSION TECHNIQUES;

SPEECH PROCESSING;

EID: 84906280857 PISSN: 2308457X EISSN: 19909772 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (132)

References (25)

1
- 0031623661
- Spectral voice conversion for text-tospeech synthesis
- A. Kain and M.W. Macon, "Spectral voice conversion for text-tospeech synthesis, " Proc. ICASSP, vol. 1, pp. 285-288, 1998.
- (1998) Proc. ICASSP , vol.1 , pp. 285-288
- Kain, A.¹ Macon, M.W.²

2
- 84865747520
- Intonation conversion from neutral to expressive speech
- C. Veaux and X. Robet, "Intonation conversion from neutral to expressive speech, " in Proc. INTERSPEECH, pp. 2765-2768, 2011.
- (2011) Proc. INTERSPEECH , pp. 2765-2768
- Veaux, C.¹ Robet, X.²

3
- 80052698826
- Speakingaid systems using GMM-based voice conversion for electrolaryngeal speech
- K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Speakingaid systems using GMM- based voice conversion for electrolaryngeal speech, " Speech Communication, Vol. 54, No. 1, pp. 134- 146, 2012.
- (2012) Speech Communication , vol.54 , Issue.1 , pp. 134-146
- Nakamura, K.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

4
- 0034855352
- Highperformance robust speech recognition using stereo training data
- L. Deng, A. Acero, L. Jiang, J. Droppo, and X. Huang, "Highperformance robust speech recognition using stereo training data, " Proc. ICASSP, pp. 301-304, 2001.
- (2001) Proc. ICASSP , pp. 301-304
- Deng, L.¹ Acero, A.² Jiang, L.³ Droppo, J.⁴ Huang, X.⁵

5
- 70450192197
- Speech generation from hand gestures based on space mapping
- A. Kunikoshi, Y. Qiao, N. Minematsu, and K. Hirose, "Speech generation from hand gestures based on space mapping, " Proc. INTERSPEECH, pp. 308-311, 2009.
- (2009) Proc. INTERSPEECH , pp. 308-311
- Kunikoshi, A.¹ Qiao, Y.² Minematsu, N.³ Hirose, K.⁴

6
- 0023739214
- Vice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Vice conversion through vector quantization, " in Proc. ICASSP, pp. 655- 658, 1988.
- (1988) Proc. ICASSP , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

7
- 0026880275
- Voice transformation using PSOLA technique
- H. Valbret, E. Moulines and J. P. Tubach, "Voice transformation using PSOLA technique, " Speech Communication, Vol. 11, No. 2-3, pp. 175-187, 1992.
- (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 175-187
- Valbret, H.¹ Moulines, E.² Tubach, J.P.³

8
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Trans. Speech and Audio Processing, Vol. 6, No. 2, pp. 131-142, 1998.
- (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

9
- 57749193836
- Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- T. Toda, A. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory, " IEEE Trans. Audio, Speech, Lang. Process., Vol. 15, No. 8, pp. 2222-2235, 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.² Tokuda, K.³

10
- 77953712499
- Voice conversion using partial least squares regression
- E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression, " IEEE Trans. Audo, Speech, Lang. Process., Vol. 18, No. 5, pp. 912-921, 2010.
- (2010) IEEE Trans. Audo, Speech, Lang. Process , vol.18 , Issue.5 , pp. 912-921
- Helander, E.¹ Virtanen, T.² Nurminen, J.³ Gabbouj, M.⁴

11
- 44949210554
- Map-based adaptation for speech conversion using adaptation data selection and non-parallel training
- C. H. Lee and C. H. Wu, "Map-based adaptation for speech conversion using adaptation data selection and non-parallel training, " in Proc. INTERSPEECH, pp. 2254-2257, 2006.
- (2006) Proc. INTERSPEECH , pp. 2254-2257
- Lee, C.H.¹ Wu, C.H.²

12
- 84906237458
- Voice conversion based on probabilistic integration of joint density model and speaker model
- D. Saito, S. Watanabe, A. Nakamura, N. Minematsu, "Voice conversion based on probabilistic integration of joint density model and speaker model, " in Proc. Acoustic Society of Japan, pp. 335- 338, 2010.
- (2010) Proc. Acoustic Society of Japan , pp. 335-338
- Saito, D.¹ Watanabe, S.² Nakamura, A.³ Minematsu, N.⁴

13
- 34547512822
- Eigenvoice conversion based on Gaussian mixture model
- T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on Gaussian mixture model, " in Proc. INTERSPEECH, pp. 2446 -2449, 2006.
- (2006) Proc. INTERSPEECH , pp. 2446-2449
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

14
- 84865798483
- One-tomany voice conversion based on tensor representation of speaker space
- D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-tomany voice conversion based on tensor representation of speaker space, " in Proc. INTERSPEECH, pp. 653-656, 2011.
- (2011) Proc. INTERSPEECH , pp. 653-656
- Saito, D.¹ Yamamoto, K.² Minematsu, N.³ Hirose, K.⁴

15
- 35148852326
- Voice conversion using canonical correlation analysis based on gaussian mixture model
- Z. H. Jian and Z. Yang, "Voice conversion using canonical correlation analysis based on gaussian mixture model, " SNPD, Vol. 1, pp. 210-215, 2007.
- (2007) SNPD , vol.1 , pp. 210-215
- Jian, Z.H.¹ Yang, Z.²

16
- 84874248255
- Exemplar-based voice conversion in noisy environment
- R. Takashima, T. Takiguchi, Y. Ariki, "Exemplar-based voice conversion in noisy environment, " SLT, pp.313-317, 2012.
- (2012) SLT , pp. 313-317
- Takashima, R.¹ Takiguchi, T.² Ariki, Y.³

17
- 70349197691
- Voice conversion using artificial neural networks
- S. Desai, E. V. Raghavendra, B. Yegnanarayana, A.W. Black, and K. Prahallad, "Voice conversion using artificial neural networks, " in Proc. ICASSP, pp. 3893-3896, 2009.
- (2009) Proc. ICASSP , pp. 3893-3896
- Desai, S.¹ Raghavendra, E.V.² Yegnanarayana, B.³ Black, A.W.⁴ Prahallad, K.⁵

18
- 4544270860
- Minimum segmentation error based discriminative training for speech synthesis application
- Y. J. Wu, H. Kawai, J. Ni, and R. H. Wang, "Minimum segmentation error based discriminative training for speech synthesis application, " in Proc. ICASSP 04, vol. 1, pp. 629-32, 2004.
- (2004) Proc. ICASSP 04 , vol.1 , pp. 629-632
- Wu, Y.J.¹ Kawai, H.² Ni, J.³ Wang, R.H.⁴

19
- 34547522070
- Discriminative training for large vocabulary speech recognition using minimum classification error
- E. McDermott, T. Hazen, J. L. Roux, A. Nakamura, and S. Katagiri, "Discriminative training for large vocabulary speech recognition using minimum classification error, " IEEE Transactions on Speech and Audio Processing, vol. 15, no. 1, pp. 203-223, 2007.
- (2007) IEEE Transactions on Speech and Audio Processing , vol.15 , Issue.1 , pp. 203-223
- McDermott, E.¹ Hazen, T.² Roux, J.L.³ Nakamura, A.⁴ Katagiri, S.⁵

20
- 33745805403
- A fast learning algorithm for deep belief nets
- G. E. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets, " Neural Computation, vol. 18, pp. 1527- 1554, 2006.
- (2006) Neural Computation , vol.18 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.³

21
- 33745805403
- A fast learning algorithm for deep belief nets
- G. E. Hinton, S. Osindero, and Y. Teh, "A fast learning algorithm for deep belief nets, " Neural Computation, vol. 18, pp. 1527- 1554, 2006.
- (2006) Neural Computation , vol.18 , pp. 1527-1554
- Hinton, G.E.¹ Osindero, S.² Teh, Y.³

22
- 78149306047
- 3-D object recognition with deep belief nets
- V. Nair and G. Hinton, "3-d object recognition with deep belief nets., " in To appear in Advances in Neural Information Processing Systems 22, 2009.
- (2009) To Appear in Advances in Neural Information Processing Systems , vol.22
- Nair, V.¹ Hinton, G.²

23
- 84991233704
- A deep learning approach to machine transliteration
- T Deselaers, S. Hasan, O. Bender, and H. Ney, "A deep learning approach to machine transliteration, " in Proc. EACLWorkshop on Statistical Machine Translation, 2009, pp. 233-241.
- (2009) Proc. EACLWorkshop on Statistical Machine Translation , pp. 233-241
- Deselaers, T.¹ Hasan, S.² Bender, O.³ Ney, H.⁴

24
- 84055211743
- Acoustic modeling using deep belief networks
- A. Mohamed, G. Dahl, and G. Hinton, "Acoustic Modeling using Deep Belief Networks, " IEEE Trans. on Audio, Speech, and Language Procesing, vol. 20, no. 1, pp. 14-22, 2012.
- (2012) IEEE Trans. on Audio, Speech, and Language Procesing , vol.20 , Issue.1 , pp. 14-22
- Mohamed, A.¹ Dahl, G.² Hinton, G.³

25
- 0025475528
- ATR Japanese speech database as a tool of speech recognition and synthesis
- A. Kurematsu, K. Takeda, Y. Sagisaka, S. Katagiri, H. Kuwabara, and K.Shikano, "ATR Japanese speech database as a tool of speech recognition and synthesis, " Speech Communication, vol. 9, pp. 357-363, 1990.
- (1990) Speech Communication , vol.9 , pp. 357-363
- Kurematsu, A.¹ Takeda, K.² Sagisaka, Y.³ Katagiri, S.⁴ Kuwabara, H.⁵ Shikano, K.⁶

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.