



Volume , Issue , 2013, Pages 71-75

Noise-Robust Voice Conversion Based on Spectral Mapping on Sparse Space

Author keywords

noise robustness; nonnegative matrix factorization; sparse representation; Voice conversion

Indexed keywords

GAUSSIAN DISTRIBUTION; MATRIX FACTORIZATION; PHOTOMAPPING;

EID: 84905271796     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (21)

References (18)
  • 2
    • 84865747520
    • Intonation conversion from neutral to expressive speech
    • C. Veaux and X. Rodet, "Intonation conversion from neutral to expressive speech," in Proc. INTERSPEECH, 2011, pp. 2765-2768.
    • (2011) Proc. INTERSPEECH , pp. 2765-2768
    • Veaux, C.1    Rodet, X.2
  • 3
    • 80052698826
    • Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech
    • K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Speaking-aid systems using GMM-based voice conversion for electrolaryngeal speech," Speech Communication, vol. 54, no. 1, pp. 134-146, 2012.
    • (2012) Speech Communication , vol.54 , Issue.1 , pp. 134-146
    • Nakamura, K.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 4
    • 77956795483
    • Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models
    • H. Doi, K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models," IEICE Trans. Information and Systems, vol. E93-D, no. 9, pp. 2472-2482, 2010.
    • (2010) IEICE Trans. Information and Systems , vol.E93-D , Issue.9 , pp. 2472-2482
    • Doi, H.1    Nakamura, K.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5
  • 6
    • 0026880275
    • Voice transformation using PSOLA technique
    • H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Communication, vol. 11, no. 2-3, pp. 175-187, 1992.
    • (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 175-187
    • Valbret, H.1    Moulines, E.2    Tubach, J. P.3
  • 8
    • 57749193836
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • T. Toda, A. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 8, pp. 2222-2235, 2007.
    • (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.2    Tokuda, K.3
  • 10
    • 44949210554
    • MAP-based adaptation for speech conversion using adaptation data selection and non-parallel training
    • C. H. Lee and C. H. Wu, "MAP-based adaptation for speech conversion using adaptation data selection and non-parallel training," in Proc. INTERSPEECH, 2006, pp. 2254-2257.
    • (2006) Proc. INTERSPEECH , pp. 2254-2257
    • Lee, C. H.1    Wu, C. H.2
  • 11
    • 34547512822
    • Eigenvoice conversion based on Gaussian mixture model
    • T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on Gaussian mixture model," in Proc. INTERSPEECH, 2006, pp. 2446-2449.
    • (2006) Proc. INTERSPEECH , pp. 2446-2449
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 12
    • 84865798483
    • One-to-many voice conversion based on tensor representation of speaker space
    • D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-to-many voice conversion based on tensor representation of speaker space," in Proc. INTERSPEECH, 2011, pp. 653-656.
    • (2011) Proc. INTERSPEECH , pp. 653-656
    • Saito, D.1    Yamamoto, K.2    Minematsu, N.3    Hirose, K.4
  • 14
    • 50249152311
    • Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria
    • T. Virtanen, "Monaural sound source separation by non-negative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech and Language Processing, vol. 15, no. 3, pp. 1066-1074, 2007.
    • (2007) IEEE Trans. Audio, Speech and Language Processing , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 15
    • 44949110218
    • Single-channel speech separation using sparse non-negative matrix factorization
    • M. N. Schmidt and R. K. Olsson, "Single-channel speech separation using sparse non-negative matrix factorization," in Proc. INTERSPEECH, 2006, pp. 2614-2617.
    • (2006) Proc. INTERSPEECH , pp. 2614-2617
    • Schmidt, M. N.1    Olsson, R. K.2
  • 16
  • 17
    • 84874248255
    • Exemplar-based voice conversion in noisy environment
    • R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voice conversion in noisy environment," in Proc. SLT, 2012, pp. 313-317.
    • (2012) Proc. SLT , pp. 313-317
    • Takashima, R.1    Takiguchi, T.2    Ariki, Y.3
  • 18
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigne, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds," Speech Communication, vol. 27, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigne, A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.