SCOPUS 정보 검색 플랫폼

8th ISCA Workshop on Speech Synthesis, SSW 2013

Volumn , Issue , 2013, Pages 71-75

Noise-Robust Voice Conversion Based on Spectral Mapping on Sparse Space

(4) Takashima, Ryoichi a Aihara, Ryo a Takiguchi, Tetsuya a Ariki, Yasuo a

a KOBE UNIVERSITY (Japan)

Author keywords

noise robustness; nonnegative matrix factorization; sparse representation; Voice conversion

Indexed keywords

GAUSSIAN DISTRIBUTION; MATRIX FACTORIZATION; PHOTOMAPPING;

BASE MATRIX; COMPUTATION TIME; EXEMPLAR BASED METHODS; EXEMPLAR-BASED; NOISE ROBUSTNESS; NOISY ENVIRONMENT; NONNEGATIVE MATRIX FACTORIZATION; SPARSE REPRESENTATION; VOICE CONVERSION; VOICE CONVERSION TECHNIQUES;

MATRIX ALGEBRA;

EID: 84905271796 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (21)

References (18)

1
- 84876497245
- GMM-based voice conversion applied to emotional speech synthesis
- H. Kawanami, Y. Iwami, T. Toda, H. Saruwatari, and K. Shikano, "GMM-based voice conversion applied to emotional speech synthesis," in Proc. INTERSPEECH, 2003, pp. 2401-2404.
- (2003) Proc. INTERSPEECH , pp. 2401-2404
- Kawanami, H.¹ Iwami, Y.² Toda, T.³ Saruwatari, H.⁴ Shikano, K.⁵

2
- 84865747520
- Intonation conversion from neutral to expressive speech
- C. Veaux and X. Robet, "Intonation conversion from neutral to expressive speech," in Proc. INTERSPEECH, 2011, pp. 2765-2768.
- (2011) Proc. INTERSPEECH , pp. 2765-2768
- Veaux, C.¹ Robet, X.²

3
- 80052698826
- Speakingaid systems using gmm-based voice conversion for electrolaryngeal speech
- K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Speakingaid systems using gmm-based voice conversion for electrolaryngeal speech," Speech Communication, vol. 54, no. 1, pp. 134-146, 2012.
- (2012) Speech Communication , vol.54 , Issue.1 , pp. 134-146
- Nakamura, K.¹ Toda, T.² Saruwatari, H.³ Shikano, K.⁴

4
- 77956795483
- Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models
- H. Doi, K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models," IEICE Trans. Information and Systems, vol. E93-D, no. 9, pp. 2472-2482, 2010.
- (2010) IEICE Trans. Information and Systems , vol.E93-D , Issue.9 , pp. 2472-2482
- Doi, H.¹ Nakamura, K.² Toda, T.³ Saruwatari, H.⁴ Shikano, K.⁵

5
- 0023739214
- Voice conversion through vector quantization
- M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Voice conversion through vector quantization," in Proc. ICASSP, 1988, pp. 655-658.
- (1988) Proc. ICASSP , pp. 655-658
- Abe, M.¹ Nakamura, S.² Shikano, K.³ Kuwabara, H.⁴

6
- 0026880275
- Voice transformation using PSOLA technique
- H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Communication, vol. 11, no. 2-3, pp. 175-187, 1992.
- (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 175-187
- Valbret, H.¹ Moulines, E.² Tubach, J. P.³

7
- 0032026483
- Continuous probabilistic transform for voice conversion
- Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech and Audio Processing, vol. 6, no. 2, pp. 131-142, 1998.
- (1998) IEEE Trans. Speech and Audio Processing , vol.6 , Issue.2 , pp. 131-142
- Stylianou, Y.¹ Cappe, O.² Moulines, E.³

8
- 57749193836
- Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
- T. Toda, A. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
- (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
- Toda, T.¹ Black, A.² Tokuda, K.³

9
- 77953712499
- Voice conversion using partial least squares regression
- E. Helander, T. Virtanen, J. Nurminen, and M. Gabbouj, "Voice conversion using partial least squares regression," IEEE Trans. Audio, Speech and Language Processing, vol. 18, no. 5, pp. 912-921, 2010.
- (2010) IEEE Trans. Audio, Speech and Language Processing , vol.18 , Issue.5 , pp. 912-921
- Helander, E.¹ Virtanen, T.² Nurminen, J.³ Gabbouj, M.⁴

10
- 44949210554
- Map-based adaptation for speech conversion using adaptation data selection and non-parallel training
- C. H. Lee and C. H. Wu, "Map-based adaptation for speech conversion using adaptation data selection and non-parallel training," in Proc. INTERSPEECH, 2006, pp. 2254-2257.
- (2006) Proc. INTERSPEECH , pp. 2254-2257
- Lee, C. H.¹ Wu, C. H.²

11
- 34547512822
- Eigenvoice conversion based on gaussian mixture model
- T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on gaussian mixture model," in Proc. INTERSPEECH, 2006, pp. 2446-2449.
- (2006) Proc. INTERSPEECH , pp. 2446-2449
- Toda, T.¹ Ohtani, Y.² Shikano, K.³

12
- 84865798483
- One-tomany voice conversion based on tensor representation of speaker space
- D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-tomany voice conversion based on tensor representation of speaker space," in Proc. INTERSPEECH, 2011, pp. 653-656.
- (2011) Proc. INTERSPEECH , pp. 653-656
- Saito, D.¹ Yamamoto, K.² Minematsu, N.³ Hirose, K.⁴

13
- 84898964201
- Algorithms for non-negative matrix factorization
- D. D. Lee and H. S. Seung, "Algorithms for non-negative matrix factorization," in Proc. Neural Information Processing System, 2001, pp. 556-562.
- (2001) Proc. Neural Information Processing System , pp. 556-562
- Lee, D. D.¹ Seung, H. S.²

14
- 50249152311
- Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria
- T. Virtanen, "Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 3, pp. 1066-1074, 2007.
- (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.3 , pp. 1066-1074
- Virtanen, T.¹

15
- 44949110218
- Single-channel speech separation using sparse non-negative matrix factorization
- M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrix factorization," in Proc. INTERSPEECH, 2006, pp. 2614-2617.
- (2006) Proc. INTERSPEECH , pp. 2614-2617
- Schmidt, M. N.¹ Olsson, R. K.²

16
- 79960657803
- Exemplarbased sparse representations for noise robust automatic speech recognition
- J. F. Gemmeke, T. Viratnen, and A. Hurmalainen, "Exemplarbased sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech and Language Processing, vol. 19, no. 7, pp. 2067-2080, 2011.
- (2011) IEEE Trans. Audio, Speech and Language Processing , vol.19 , Issue.7 , pp. 2067-2080
- Gemmeke, J. F.¹ Viratnen, T.² Hurmalainen, A.³

17
- 84874248255
- Exemplar-based voice conversion in noisy environment
- R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voice conversion in noisy environment," in Proc. SLT, 2012, pp. 313-317.
- (2012) Proc. SLT , pp. 313-317
- Takashima, R.¹ Takiguchi, T.² Ariki, Y.³

18
- 0032673049
- Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
- H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive timefrequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
- (1999) Speech Communication , vol.27 , pp. 187-207
- Kawahara, H.¹ Masuda-Katsuse, I.² de Cheveigne, A.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.