메뉴 건너뛰기




Volumn , Issue , 2014, Pages

Exemplar-based emotional voice conversion using non-negative matrix factorization

Author keywords

[No Author keywords available]

Indexed keywords

FACE RECOGNITION; MATRIX ALGEBRA; SPEECH COMMUNICATION; SPEECH PROCESSING;

EID: 84949924136     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/APSIPA.2014.7041640     Document Type: Conference Paper
Times cited : (29)

References (31)
  • 1
    • 84966398940 scopus 로고
    • Optimising selection of units from speech database for concatenative synthesis
    • A. W. Black and N. Cambpbell, "Optimising selection of units from speech database for concatenative synthesis," in EUROSPEECH, pp. 581-584, 1995.
    • (1995) EUROSPEECH , pp. 581-584
    • Black, A.W.1    Cambpbell, N.2
  • 2
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMM based speech synthesis
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM based speech synthesis," in ICASSP, pp. 1315-1318, 2000.
    • (2000) ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 4
    • 77949913458 scopus 로고    scopus 로고
    • Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech
    • R. Barra Chicote, J. Yamagichi, S. King, J. M. Montero, and J. Macias Guarasa, "Analysis of statistical parametric and unit selection speech synthesis systems applied to emotional speech," Speech Communication, vol. 52, pp. 394-404, 2010.
    • (2010) Speech Communication , vol.52 , pp. 394-404
    • Barra Chicote, R.1    Yamagichi, J.2    King, S.3    Montero, J.M.4    Macias Guarasa, J.5
  • 6
    • 0031623661 scopus 로고    scopus 로고
    • Spectral voice conversion for text to speech synthesis
    • A. Kain and M. W. Macon, "Spectral voice conversion for text to speech synthesis," in ICASSP, vol. 1, pp. 285-288, 1998.
    • (1998) ICASSP , vol.1 , pp. 285-288
    • Kain, A.1    Macon, M.W.2
  • 7
    • 78649328053 scopus 로고    scopus 로고
    • Survey on speech emotion recognition: Features, classification schemes, and databases
    • M. E. Ayadia, M. S. Kamel, and F. Karray, "Survey on speech emotion recognition: Features, classification schemes, and databases," Pattern Recognition, vol. 44, 2011.
    • (2011) Pattern Recognition , vol.44
    • Ayadia, M.E.1    Kamel, M.S.2    Karray, F.3
  • 8
    • 84874248255 scopus 로고    scopus 로고
    • Exemplar based voice conversion in noisy environment
    • R. Takashima, T. Takiguchi, and Y. Ariki, "Exemplar based voice conversion in noisy environment," in SLT, pp. 313-317, 2012.
    • (2012) SLT , pp. 313-317
    • Takashima, R.1    Takiguchi, T.2    Ariki, Y.3
  • 9
    • 79960657803 scopus 로고    scopus 로고
    • Exemplar based sparse representations for noise robust automatic speech recognition
    • J. F. Gemmeke, T. Viratnen, and A. Hurmalainen, "Exemplar based sparse representations for noise robust automatic speech recognition," IEEE Trans. Audio, Speech and Language Processing, vol. 19, no. 7, pp. 2067-2080, 2011.
    • (2011) IEEE Trans. Audio, Speech and Language Processing , vol.19 , Issue.7 , pp. 2067-2080
    • Gemmeke, J.F.1    Viratnen, T.2    Hurmalainen, A.3
  • 11
    • 50249152311 scopus 로고    scopus 로고
    • Monaural sound source separation by non negative matrix factorization with temporal continuity and sparseness criteria
    • T. Virtanen, "Monaural sound source separation by non negative matrix factorization with temporal continuity and sparseness criteria," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 3, pp. 1066-1074, 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.3 , pp. 1066-1074
    • Virtanen, T.1
  • 12
    • 44949110218 scopus 로고    scopus 로고
    • Single channel speech separation using sparse non negative matrix factorization
    • M. N. Schmidt and R. K. Olsson, "Single channel speech separation using sparse non negative matrix factorization," in Interspeech, 2006.
    • (2006) Interspeech
    • Schmidt, M.N.1    Olsson, R.K.2
  • 13
    • 84905268745 scopus 로고    scopus 로고
    • Active set newton algorithm for non negative sparse coding of audio
    • T. Virtanen, B. Raj, J. F. Gemmeke, and H. Van hamme, "Active set newton algorithm for non negative sparse coding of audio," in ICASSP, pp. 3116-3120, 2014.
    • (2014) ICASSP , pp. 3116-3120
    • Virtanen, T.1    Raj, B.2    Gemmeke, J.F.3    Van Hamme, H.4
  • 14
    • 0023739214 scopus 로고
    • Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models
    • M. Abe, S. Nakamura, K. Shikano, and H. Kuwabara, "Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models," in Proc. ICASSP, pp. 655 658, 1988.
    • (1988) Proc. ICASSP , pp. 655-658
    • Abe, M.1    Nakamura, S.2    Shikano, K.3    Kuwabara, H.4
  • 15
    • 0026880275 scopus 로고
    • Voice transformation using PSOLA technique
    • H. Valbret, E. Moulines, and J. P. Tubach, "Voice transformation using PSOLA technique," Speech Communication, vol. 11, no. 2 3, pp. 175 187, 1992.
    • (1992) Speech Communication , vol.11 , Issue.2-3 , pp. 175-187
    • Valbret, H.1    Moulines, E.2    Tubach, J.P.3
  • 16
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • T. Toda, A. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory," IEEE Trans. Audio, Speech, Lang. Process., vol. 15, no. 8, pp. 2222-2235, 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.2    Tokuda, K.3
  • 18
    • 44949210554 scopus 로고    scopus 로고
    • Map based adaptation for speech conversion using adaptation data selection and non parallel training
    • C. H. Lee and C. H. Wu, "Map based adaptation for speech conversion using adaptation data selection and non parallel training," in Interspeech, pp. 2254-2257, 2006.
    • (2006) Interspeech , pp. 2254-2257
    • Lee, C.H.1    Wu, C.H.2
  • 19
    • 34547512822 scopus 로고    scopus 로고
    • Eigenvoice conversion based on Gaussian mixture model
    • T. Toda, Y. Ohtani, and K. Shikano, "Eigenvoice conversion based on Gaussian mixture model," in Interspeech, pp. 2446-2449, 2006.
    • (2006) Interspeech , pp. 2446-2449
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 20
    • 84865798483 scopus 로고    scopus 로고
    • One to many voice conversion based on tensor representation of speaker space
    • D. Saito, K. Yamamoto, N. Minematsu, and K. Hirose, "One to many voice conversion based on tensor representation of speaker space," in Interspeech, pp. 653-656, 2011.
    • (2011) Interspeech , pp. 653-656
    • Saito, D.1    Yamamoto, K.2    Minematsu, N.3    Hirose, K.4
  • 21
    • 84905227265 scopus 로고    scopus 로고
    • Voice conversion based on non negative matrix factorization using phoneme categorized dictionary
    • R. Aihara, T. Nakashika, T. Takiguchi, and Y. Ariki, "Voice conversion based on non negative matrix factorization using phoneme categorized dictionary," in ICASSP, pp. 7944-7948, 2014.
    • (2014) ICASSP , pp. 7944-7948
    • Aihara, R.1    Nakashika, T.2    Takiguchi, T.3    Ariki, Y.4
  • 22
    • 84905269973 scopus 로고    scopus 로고
    • Multimodal voice conversion using non negative matrix factorization in noisy environments
    • K. Masaka, R. Aihara, T. Takiguchi, and Y. Ariki, "Multimodal voice conversion using non negative matrix factorization in noisy environments," ICASSP2014, pp. 1561-1565, 2014.
    • (2014) ICASSP2014 , pp. 1561-1565
    • Masaka, K.1    Aihara, R.2    Takiguchi, T.3    Ariki, Y.4
  • 23
    • 84890519936 scopus 로고    scopus 로고
    • Individualitypreserving voice conversion for articulation disorders based on nonnegative matrix factorization
    • R. Aihara, R. Takashima, T. Takiguchi, and Y. Ariki, "Individualitypreserving voice conversion for articulation disorders based on Nonnegative Matrix Factorization," in ICASSP, pp. 8037-8040, 2013.
    • (2013) ICASSP , pp. 8037-8040
    • Aihara, R.1    Takashima, R.2    Takiguchi, T.3    Ariki, Y.4
  • 25
    • 77955722263 scopus 로고    scopus 로고
    • Hierarchical prosody conversion using regression based clustering for emotional synthesis
    • C. H. Wu, C. C. Hsia, and C. H. Lee, "Hierarchical prosody conversion using regression based clustering for emotional synthesis," IEEE Trans. Audio, Speech and Lang Proc., 2010.
    • (2010) IEEE Trans. Audio, Speech and Lang Proc
    • Wu, C.H.1    Hsia, C.C.2    Lee, C.H.3
  • 27
    • 84865747520 scopus 로고    scopus 로고
    • Intonation conversion from neutral to expressive speech
    • C. Veaux and X. Robet, "Intonation conversion from neutral to expressive speech," in Interspeech, pp. 2765-2768, 2011.
    • (2011) Interspeech , pp. 2765-2768
    • Veaux, C.1    Robet, X.2
  • 28
  • 29
    • 33750915991 scopus 로고    scopus 로고
    • STRAIGHT, exploitation of the other aspect of vocoder: Perceptually isomorphic decomposition of speech sounds
    • H. Kawahara, "STRAIGHT, exploitation of the other aspect of vocoder: Perceptually isomorphic decomposition of speech sounds," Acoustical Science and Technology, pp. 349-353, 2006.
    • (2006) Acoustical Science and Technology , pp. 349-353
    • Kawahara, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.