SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2012, Pages 313-317

Exemplar-based voice conversion in noisy environment

Author keywords

exemplar based; noise robustness; non negative matrix factorization; sparse coding; voice conversion

Indexed keywords

EXEMPLAR-BASED; NOISE ROBUSTNESS; NONNEGATIVE MATRIX FACTORIZATION; SPARSE CODING; VOICE CONVERSION;

SIGNAL PROCESSING; SPEECH COMMUNICATION; SPEECH RECOGNITION;

SPEECH PROCESSING;

EID: 84874248255 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/SLT.2012.6424242 Document Type: Conference Paper

Times cited : (134)

References (16)

1
- 84876497245
- GMM-based Voice conversion applied to emotional speech synthesis
- Y. Iwami, T. Toda, H. Saruwatari, and K. Shikano, "GMM-based Voice Conversion Applied to Emotional Speech Synthesis," IEEE Trans. Seech and Audio Proc., Vol. 7, pp. 2401-2404, 1999.
- (1999) IEEE Trans. Seech and Audio Proc , vol.7 , pp. 2401-2404
- Iwami, Y.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

2
- 84865747520
- Intonation conversion from neutral to expressive speech
- C. Veaux and X. Robet, "Intonation conversion from neutral to expressive speech," in Proc. INTERSPEECH, pp. 2765-2768, 2011.
- (2011) Proc. INTERSPEECH , pp. 2765-2768
- Veaux, C.¹ Robet, X.²

3
- 80052698826
- Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech
- K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech," Speech Communication, Vol. 54, No. 1, pp. 134-146, 2012.
- (2012) Speech Communication , vol.54 , Issue.1 , pp. 134-146
- Nakamura, K.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

4
- 0023739214
- Vice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Vice conversion through vector quantization," in Proc. ICASSP, pp. 655-658, 1988.
- (1988) Proc. ICASSP , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

5
- 0026880275
- Voice transformation using PSOLA technique
- H. Valbret, E. Moulines and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Communication, Vol. 11, No. 2-3, pp. 175-187, 1992.
- (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 175-187
- Valbret, H.¹ Moulines, E.² Tubach, J.P.³

6
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech and Audio Processing, Vol. 6, No. 2, pp. 131-142, 1998.
- (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

7
- 57749193836
- Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- T. Toda, A. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., Vol. 15, No. 8, pp. 2222-2235, 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.² Tokuda, K.³

8
- 77953712499
- Voice conversion using partial least squares regression
- E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression," IEEE Trans. Audo, Speech, Lang. Process., Vol. 18, No. 5, pp. 912-921, 2010.
- (2010) IEEE Trans. Audo, Speech, Lang. Process , vol.18 , Issue.5 , pp. 912-921
- Helander, E.¹ Virtanen, T.² Nurminen, J.³ Gabbouj, M.⁴

9
- 44949210554
- Map-based adaptation for speech conversion using adaptation data selection and non-parallel training
- C. H. Lee and C. H. Wu, "Map-based adaptation for speech conversion using adaptation data selection and non-parallel training," in Proc. INTERSPEECH, pp. 2254-2257, 2006.
- (2006) Proc. INTERSPEECH , pp. 2254-2257
- Lee, C.H.¹ Wu, C.H.²

10
- 34547512822
- Eigenvoice conversion based on Gaussian mixture model
- T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on Gaussian mixture model," in Proc. INTERSPEECH, pp. 2446-2449, 2006.
- (2006) Proc. INTERSPEECH , pp. 2446-2449
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

11
- 84865798483
- One-to-many voice conversion based on tensor representation of speaker space
- D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-to-many voice conversion based on tensor representation of speaker space," in Proc. INTERSPEECH, pp. 653-656, 2011.
- (2011) Proc. INTERSPEECH , pp. 653-656
- Saito, D.¹ Yamamoto, K.² Minematsu, N.³ Hirose, K.⁴

12
- 84898964201
- Algorithms for nonnegative matrix factorization
- D. D. Lee and H. S. Seung, "Algorithms for nonnegative matrix factorization," in Proc. Neural Information Processing System, pp. 556-562, 2001.
- (2001) Proc. Neural Information Processing System , pp. 556-562
- Lee, D.D.¹ Seung, H.S.²

13
- 50249152311
- Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria
- T. Virtanen, "Monaural sound source separation by nonnegative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., Vol. 15, No. 3, pp. 1066-1074, 2007.
- (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 1066-1074
- Virtanen, T.¹

14
- 44949110218
- Single-channel speech separation using sparse non-negative matrix factorization
- M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrix factorization," in Proc. INTERSPEECH, pp. 2614-2617, 2006.
- (2006) Proc. INTERSPEECH , pp. 2614-2617
- Schmidt, M.N.¹ Olsson, R.K.²

15
- 79960657803
- Exemplar-based sparse representations for noise robust automatic speech recognition
- J. F. Gemmeke, T. Viratnen, and A. Hurmalainen, "Exemplar-Based Sparse Representations for Noise Robust Automatic Speech Recognition," IEEE Trans. Audio, Speech, Lang. Process., Vol. 19, Issue 7, pp. 2067-2080, 2011.
- (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.7 , pp. 2067-2080
- Gemmeke, J.F.¹ Viratnen, T.² Hurmalainen, A.³

16
- 0032673049
- Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A.de Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, Vol.27, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² De Cheveigne, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.