메뉴 건너뛰기




Volumn , Issue , 2013, Pages 1057-1061

An investigation of acoustic features for singing voice conversion based on perceptual age

Author keywords

Perceptual age; Singing voice; Spectral and prosodic features; Subjective evaluations; Voice conversion

Indexed keywords

COMPUTER APPLICATIONS; COMPUTER SIMULATION;

EID: 84905262778     PISSN: 2308457X     EISSN: 19909772     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (5)

References (21)
  • 1
    • 84867616167 scopus 로고    scopus 로고
    • Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown
    • Mar
    • H. Kawahara and M. Morise, "Temporally variable multi-aspect auditory morphing enabling extrapolation without objective and perceptual breakdown, " Proc. ICASSP, pp. 5389-5392, Mar. 2012.
    • (2012) Proc. ICASSP , pp. 5389-5392
    • Kawahara, H.1    Morise, M.2
  • 2
    • 0032026483 scopus 로고    scopus 로고
    • Continuous proba- bilistic transform for voice conversion
    • Mar
    • Y. Stylianou, O. Cappé, and E. Moulines, "Continuous proba- bilistic transform for voice conversion, " IEEE Trans. SAP, vol. 6, no. 2, pp. 131-142, Mar. 1998.
    • (1998) IEEE Trans. SAP , vol.6 , Issue.2 , pp. 131-142
    • Stylianou, Y.1    Cappé, O.2    Moulines, E.3
  • 3
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • Nov
    • T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory, " IEEE Trans. ASLP, vol. 15, no. 8, pp. 2222-2235, Nov. 2007.
    • (2007) IEEE Trans. ASLP , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 4
    • 79959827418 scopus 로고    scopus 로고
    • Applying voice conversion to concatenative singing-voice synthesis
    • Sept
    • F. Villavicencio and J. Bonada, "Applying voice conversion to concatenative singing-voice synthesis, " Proc. INTERSPEECH, pp. 2162-2165, Sept. 2010.
    • (2010) Proc. INTERSPEECH , pp. 2162-2165
    • Villavicencio, F.1    Bonada, J.2
  • 5
    • 84874432462 scopus 로고    scopus 로고
    • GMM voice conversion of singing voice using vocal tract area function
    • Speech (Japanese edition), Nov
    • Y. Kawakami, H. Banno, and F. Itakura, "GMM voice conversion of singing voice using vocal tract area function, " IEICE technical report. Speech (Japanese edition), vol. 110, no. 297, pp. 71-76, Nov. 2010.
    • (2010) IEICE Technical Report , vol.110 , Issue.297 , pp. 71-76
    • Kawakami, Y.1    Banno, H.2    Itakura, F.3
  • 6
    • 34547496175 scopus 로고    scopus 로고
    • One-to-many and many-to- one voice conversion based on eigenvoices
    • Apr
    • T. Toda, Y. Ohtani, and K. Shikano, "One-to-many and many-to- one voice conversion based on eigenvoices, " Proc. ICASSP, pp. 1249-1252, Apr. 2007.
    • (2007) Proc. ICASSP , pp. 1249-1252
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 7
    • 84874403435 scopus 로고    scopus 로고
    • Singing voice conversion method based on many-to-many eigenvoice con- version and training data generation using a singing-to-singing synthesis system
    • Nov
    • H. Doi, T. Toda, T. Nakano, M. Goto, and S. Nakamura, "Singing voice conversion method based on many-to-many eigenvoice con- version and training data generation using a singing-to-singing synthesis system, " Proc. APSIPA ASC, Nov. 2012.
    • (2012) Proc. APSIPA ASC
    • Doi, H.1    Toda, T.2    Nakano, T.3    Goto, M.4    Nakamura, S.5
  • 8
    • 70450194389 scopus 로고    scopus 로고
    • Many-to- many eigenvoice conversion with reference voice
    • Sept
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Many-to- many eigenvoice conversion with reference voice, " Proc. INTER- SPEECH, pp. 1623-1626, Sept. 2009.
    • (2009) Proc. INTER- SPEECH , pp. 1623-1626
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 9
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • Nov
    • H. Zen, K. Tokuda, and A. W. Black, "Statistical parametric speech synthesis, " Speech Communication, vol. 51, no. 11, pp. 1039-1064, Nov. 2009.
    • (2009) Speech Communication , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 10
    • 51449114529 scopus 로고    scopus 로고
    • A style control technique for HMM-based expressive speech synthesis (speech and hearing)
    • Sep
    • T. Nose, J. Yamagishi, T. Masuko, and T. Kobayashi, "A style control technique for HMM-based expressive speech synthesis (speech and hearing), " IEICE transactions on information and systems, vol. 90, no. 9, pp. 1406-1413, Sep. 2007.
    • (2007) IEICE Transactions on Information and Systems , vol.90 , Issue.9 , pp. 1406-1413
    • Nose, T.1    Yamagishi, J.2    Masuko, T.3    Kobayashi, T.4
  • 11
    • 44949155552 scopus 로고    scopus 로고
    • A tech- nique for controlling voice quality of synthetic speech using mul- Tiple regression HSMM
    • Sept
    • M. Tachibana, T. Nose, J. Yamagishi, and T. Kobayashi, "A tech- nique for controlling voice quality of synthetic speech using mul- Tiple regression HSMM, " Proc. INTERSPEECH, pp. 2438-2441, Sept. 2006.
    • (2006) Proc. INTERSPEECH , pp. 2438-2441
    • Tachibana, M.1    Nose, T.2    Yamagishi, J.3    Kobayashi, T.4
  • 12
    • 79959847554 scopus 로고    scopus 로고
    • Adaptive voice-quality control based on one-to-many eigenvoice conversion
    • Sept
    • K. Ohta, T. Toda, Y. Ohtani, H. Saruwatari, and K. Shikano, "Adaptive voice-quality control based on one-to-many eigenvoice conversion, " Proc. INTERSPEECH, pp. 2158-2161, Sept. 2010.
    • (2010) Proc. INTERSPEECH , pp. 2158-2161
    • Ohta, K.1    Toda, T.2    Ohtani, Y.3    Saruwatari, H.4    Shikano, K.5
  • 13
    • 79959816772 scopus 로고    scopus 로고
    • Longitudinal changes of selected voice source parameters
    • Sept
    • H. Kasuya, H. Yoshida, S. Ebihara, and H. Mori, "Longitudi- nal changes of selected voice source parameters, " Proc. INTER- SPEECH, pp. 2570-2573, Sept. 2010.
    • (2010) Proc. INTER- SPEECH , pp. 2570-2573
    • Kasuya, H.1    Yoshida, H.2    Ebihara, S.3    Mori, H.4
  • 14
    • 0036299156 scopus 로고    scopus 로고
    • Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers
    • May
    • N. Minematsu, M. Sekiguchi, and K. Hirose, "Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers, " Proc. ICASSP, pp. 137-140, May. 2002.
    • (2002) Proc. ICASSP , pp. 137-140
    • Minematsu, N.1    Sekiguchi, M.2    Hirose, K.3
  • 15
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for HMM-based speech synthesis
    • June
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMM-based speech synthesis, " Proc. ICASSP, pp. 1315-1318, June 2000.
    • (2000) Proc. ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 16
    • 84874199000 scopus 로고    scopus 로고
    • Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and system straight
    • Sept
    • H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and system straight, " Proc. MAVEBA, Sept. 2001.
    • (2001) Proc. MAVEBA
    • Kawahara, H.1    Estill, J.2    Fujimura, O.3
  • 17
    • 44949143155 scopus 로고    scopus 로고
    • Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
    • Sept
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation, " Proc. INTERSPEECH, pp. 2266-2269, Sept. 2006.
    • (2006) Proc. INTERSPEECH , pp. 2266-2269
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 18
    • 84867211725 scopus 로고    scopus 로고
    • Low-delay voice conversion based on maximum likelihood es- Timation of spectral parameter trajectory
    • Sept
    • T. Muramatsu, Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Low-delay voice conversion based on maximum likelihood es- Timation of spectral parameter trajectory, " Proc. INTERSPEECH, pp. 1076-1079, Sept. 2008.
    • (2008) Proc. INTERSPEECH , pp. 1076-1079
    • Muramatsu, T.1    Ohtani, Y.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5
  • 19
    • 84878390910 scopus 로고    scopus 로고
    • Implementation of com- putationally efficient real-time voice conversion
    • Sept
    • T. Toda, T. Muramatsu, and H. Banno, "Implementation of com- putationally efficient real-time voice conversion, " Proc. INTER- SPEECH, Sept. 2012.
    • (2012) Proc. INTER- SPEECH
    • Toda, T.1    Muramatsu, T.2    Banno, H.3
  • 20
    • 0032673049 scopus 로고    scopus 로고
    • Restructur- ing speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • Apr
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigne, "Restructur- ing speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds, " Speech Communication, vol. 27, no. 3-4, pp. 187-207, Apr. 1999.
    • (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigne, A.3
  • 21
    • 84901764550 scopus 로고    scopus 로고
    • AIST humming database: Music database for singing research
    • (Japanese edition), vol. 2005-MUS-61-2, Aug
    • M. Goto and T. Nishimura, "AIST humming database: Music database for singing research, " IPSJ SIG Notes (Technical Report) (Japanese edition), vol. 2005-MUS-61-2, pp. 7-12, Aug. 2005.
    • (2005) IPSJ SIG Notes (Technical Report) , pp. 7-12
    • Goto, M.1    Nishimura, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.