메뉴 건너뛰기




Volumn 2015-January, Issue , 2015, Pages 2749-2753

Many-to-many voice conversion based on multiple non-negative matrix factorization

Author keywords

Exemplar based; Many tomany; NMF; Speech synthesis; Voice conversion

Indexed keywords

FACTORIZATION; GAUSSIAN DISTRIBUTION; SPEECH COMMUNICATION; SPEECH PROCESSING; SPEECH SYNTHESIS;

EID: 84959090646     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (3)

References (27)
  • 2
    • 80052698826 scopus 로고    scopus 로고
    • Speakingaidsystems using GMM-based voice conversion for electrolaryngealspeech
    • K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano, "Speakingaidsystems using GMM-based voice conversion for electrolaryngealspeech, " Speech Communication, vol. 54, no. 1, pp. 134-146, 2012.
    • (2012) Speech Communication , vol.54 , Issue.1 , pp. 134-146
    • Nakamura, K.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 3
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text-tospeechsynthesis
    • A. Kain and M. W. Macon, "Spectral voice conversion for text-tospeechsynthesis, " in Proc. ICASSP, vol. 1, pp. 285-288, 1998.
    • (1998) Proc. ICASSP , vol.1 , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 4
    • 84910069658 scopus 로고    scopus 로고
    • A mel-cepstral analysis technique restoring high frequencycomponents from low-sampling-rate speech
    • K. Nakamura, K. Hashimoto, K. Oura, Y. Nankaku, and K. Tokuda, "A mel-cepstral analysis technique restoring high frequencycomponents from low-sampling-rate speech, " in Proc. Interspeech, pp. 2494-2498, 2014.
    • (2014) Proc. Interspeech , pp. 2494-2498
    • Nakamura, K.1    Hashimoto, K.2    Oura, K.3    Nankaku, Y.4    Tokuda, K.5
  • 5
    • 84910024857 scopus 로고    scopus 로고
    • GMM-basedband width extension using sub-band basis spectrum model
    • Y. Ohtani, M. Tamura, M. Morita, and M. Akamine, "GMM-basedband width extension using sub-band basis spectrum model, " inProc. Interspeech, pp. 2489-2493, 2014.
    • (2014) Proc. Interspeech , pp. 2489-2493
    • Ohtani, Y.1    Tamura, M.2    Morita, M.3    Akamine, M.4
  • 6
    • 0023739214 scopus 로고
    • Esophageal speech enhancement based on statistical voice conversionwith Gaussian mixture models
    • M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Esophageal speech enhancement based on statistical voice conversionwith Gaussian mixture models, " in Proc. ICASSP, pp. 655-658, 1988.
    • (1988) Proc. ICASSP , pp. 655-658
    • Abe, M.1    Nakamura, S.2    Shikano, K.3    Kuwabara, H.4
  • 7
    • 0026880275 scopus 로고
    • Voice transformationusing PSOLA technique
    • H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformationusing PSOLA technique, " Speech Communication, vol. 11, no. 2-3, pp. 175-187, 1992.
    • (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 175-187
    • Valbret, H.1    Moulines, E.2    Tubach, J.P.3
  • 8
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based onmaximum likelihood estimation of spectral parameter trajectory
    • T. Toda, A. Black, and K. Tokuda, "Voice conversion based onmaximum likelihood estimation of spectral parameter trajectory, "IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.2    Tokuda, K.3
  • 10
    • 84874248255 scopus 로고    scopus 로고
    • Exemplar-based voiceconversion in noisy environment
    • R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar-based voiceconversion in noisy environment, " in Proc. SLT, pp. 313-317, 2012.
    • (2012) Proc. SLT , pp. 313-317
    • Takashima, R.1    Takiguchi, T.2    Ariki, Y.3
  • 11
    • 84911369131 scopus 로고    scopus 로고
    • Exemplar-basedsparse representation with residual compensation for voice conversion
    • Z. Wu, T. Virtanen, E. S. Chng, and H. Li, "Exemplar-basedsparse representation with residual compensation for voice conversion, "IEEE Trans. Audio, Speech, Lang. Process., vol. 22, no. 10, pp. 1506-1521, 2014.
    • (2014) IEEE Trans. Audio, Speech, Lang. Process , vol.22 , Issue.10 , pp. 1506-1521
    • Wu, Z.1    Virtanen, T.2    Chng, E.S.3    Li, H.4
  • 12
    • 84901806271 scopus 로고    scopus 로고
    • Noiserobustvoice conversion based on sparse spectral mapping usingnon-negative matrix factorization
    • R. Aihara, R. Takashima, T. Takiguchi, and Y. Ariki, "Noiserobustvoice conversion based on sparse spectral mapping usingnon-negative matrix factorization, " IEICE Transactions on Informationand Systems, vol. E97-D, no. 6, pp. 1411-1418, 2014.
    • (2014) IEICE Transactions on Informationand Systems , vol.E97-D , Issue.6 , pp. 1411-1418
    • Aihara, R.1    Takashima, R.2    Takiguchi, T.3    Ariki, Y.4
  • 14
    • 44949110218 scopus 로고    scopus 로고
    • Single-channel speech separationusing sparse non-negative matrix factorization
    • M. N. Schmidt and R. K. Olsson, "Single-channel speech separationusing sparse non-negative matrix factorization, " in Proc. Interspeech, 2006.
    • (2006) Proc. Interspeech
    • Schmidt, M.N.1    Olsson, R.K.2
  • 15
    • 50249152311 scopus 로고    scopus 로고
    • Monaural sound source separation by non-negativematrix factorization with temporal continuity and sparseness criteria
    • T. Virtanen, "Monaural sound source separation by non-negativematrix factorization with temporal continuity and sparseness criteria, "IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 16
    • 79960657803 scopus 로고    scopus 로고
    • Exemplarbasedsparse representations for noise robust automatic speechrecognition
    • J. F. Gemmeke, T. Viratnen, and A. Hurmalainen, "Exemplarbasedsparse representations for noise robust automatic speechrecognition, " IEEE Trans. Audio, Speech, Lang. Process., vol. 19, no. 7, pp. 2067-2080, 2011.
    • (2011) IEEE Trans. Audio, Speech, Lang. Process , vol.19 , Issue.7 , pp. 2067-2080
    • Gemmeke, J.F.1    Viratnen, T.2    Hurmalainen, A.3
  • 17
    • 84905227265 scopus 로고    scopus 로고
    • Voiceconversion based on non-negative matrix factorization usingphoneme-categorized dictionary
    • R. Aihara, T. Nakashika, T. Takiguchi, and Y. Ariki, "Voiceconversion based on non-negative matrix factorization usingphoneme-categorized dictionary, " in Proc. ICASSP, pp. 7944-7948, 2014.
    • (2014) Proc. ICASSP , pp. 7944-7948
    • Aihara, R.1    Nakashika, T.2    Takiguchi, T.3    Ariki, Y.4
  • 18
    • 84901801701 scopus 로고    scopus 로고
    • A preliminarydemonstration of exemplar-based voice conversion for articulationdisorders using an individuality-preserving dictionary
    • R. Aihara, R. Takashima, T. Takiguchi, and Y. Ariki, "A preliminarydemonstration of exemplar-based voice conversion for articulationdisorders using an individuality-preserving dictionary, "EURASIP Journal on Audio, Speech, and Music Processing, vol. 2014: 5, doi: 10. 1186/1687-4722-2014-5, 2014.
    • (2014) EURASIP Journal on Audio, Speech, and Music Processing , vol.2014 , pp. 5
    • Aihara, R.1    Takashima, R.2    Takiguchi, T.3    Ariki, Y.4
  • 19
    • 84910091291 scopus 로고    scopus 로고
    • Multimodalexemplar-based voice conversion using lip features in noisy environments
    • K. Masaka, R. Aihara, T. Takiguchi, and Y. Ariki, "Multimodalexemplar-based voice conversion using lip features in noisy environments, "in Proc. INTERSPEECH, vol. 1159-1163, 2014.
    • (2014) Proc. INTERSPEECH , vol.1159-1163
    • Masaka, K.1    Aihara, R.2    Takiguchi, T.3    Ariki, Y.4
  • 20
    • 44949210554 scopus 로고    scopus 로고
    • MAP-based adaptation for speech conversionusing adaptation data selection and non-parallel training
    • C. H. Lee and C. H. Wu, "MAP-based adaptation for speech conversionusing adaptation data selection and non-parallel training, "in Proc. INTERSPEECH, pp. 2254-2257, 2006.
    • (2006) Proc. INTERSPEECH , pp. 2254-2257
    • Lee, C.H.1    Wu, C.H.2
  • 22
    • 34547512822 scopus 로고    scopus 로고
    • Eigenvoice conversion basedon Gaussian mixture model
    • T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion basedon Gaussian mixture model, " in Proc. Interspeech, pp. 2446-2449, 2006.
    • (2006) Proc. Interspeech , pp. 2446-2449
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 23
    • 70450194389 scopus 로고    scopus 로고
    • Many-tomanyeigenvoice conversion with reference voice
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Many-tomanyeigenvoice conversion with reference voice, " in Proc. Interspeech, pp. 1623-1626, 2009.
    • (2009) Proc. Interspeech , pp. 1623-1626
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 24
    • 84865798483 scopus 로고    scopus 로고
    • One-tomanyvoice conversion based on tensor representation of speakerspace
    • D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One-tomanyvoice conversion based on tensor representation of speakerspace, " in Proc. INTERSPEECH, pp. 653-656, 2011.
    • (2011) Proc. INTERSPEECH , pp. 653-656
    • Saito, D.1    Yamamoto, K.2    Minematsu, N.3    Hirose, K.4
  • 26
    • 33750915991 scopus 로고    scopus 로고
    • STRAIGHT, exploitation of the other aspectof vocoder: Perceptually isomorphic decomposition of speechsounds
    • H. Kawahara, "STRAIGHT, exploitation of the other aspectof vocoder: Perceptually isomorphic decomposition of speechsounds, " Acoustical Science and Technology, pp. 349-353, 2006.
    • (2006) Acoustical Science and Technology , pp. 349-353
    • Kawahara, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.