메뉴 건너뛰기




Volumn E97-D, Issue 6, 2014, Pages 1419-1428

Voice timbre control based on perceived age in singing voice conversion

Author keywords

Perceived age; Singing voice; Spectral and prosodic features; Subjective evaluations; Voice conversion

Indexed keywords

INFORMATION SCIENCE; SOFTWARE ENGINEERING;

EID: 84901767453     PISSN: 09168532     EISSN: 17451361     Source Type: Journal    
DOI: 10.1587/transinf.E97.D.1419     Document Type: Article
Times cited : (17)

References (24)
  • 1
    • 70350589495 scopus 로고    scopus 로고
    • V.Morish '09: A morphing-based singing design interface for vocal melodies
    • Springer
    • M. Morise, M. Onishi, H. Kawahara, and H. Katayose, "v. morish '09: A morphing-based singing design interface for vocal melodies, " in Entertainment Computing-ICEC 2009, pp.185-190, Springer, 2009.
    • (2009) Entertainment Computing-ICEC 2009 , pp. 185-190
    • Morise, M.1    Onishi, M.2    Kawahara, H.3    Katayose, H.4
  • 2
    • 0032026483 scopus 로고    scopus 로고
    • Continuous probabilistic transform for voice conversion
    • March
    • Y. Stylianou, O. Cappé, and E. Moulines, "Continuous probabilistic transform for voice conversion, " IEEE Trans. SAP, vol.6, no.2, pp.131-142, March 1998.
    • (1998) IEEE Trans. SAP , vol.6 , Issue.2 , pp. 131-142
    • Stylianou, Y.1    Cappe, O.2    Moulines, E.3
  • 3
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • Nov
    • T. Toda, A.W. Black, and K. Tokuda, "Voice conversion based on maximum likelihood estimation of spectral parameter trajectory, " IEEE Trans. ASLP, vol.15, no.8, pp.2222-2235, Nov. 2007.
    • (2007) IEEE Trans. ASLP , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 4
    • 79959827418 scopus 로고    scopus 로고
    • Applying voice conversion to concatenative singing-voice synthesis
    • Sept
    • F. Villavicencio and J. Bonada, "Applying voice conversion to concatenative singing-voice synthesis, " Proc. INTERSPEECH, pp.2162-2165, Sept. 2010.
    • (2010) Proc. INTERSPEECH , pp. 2162-2165
    • Villavicencio, F.1    Bonada, J.2
  • 5
    • 84874432462 scopus 로고    scopus 로고
    • GMM voice conversion of singing voice using vocal tract area function
    • (Japanese edition), Nov
    • Y. Kawakami, H. Banno, and F. Itakura, "GMM voice conversion of singing voice using vocal tract area function, " IEICE Technical Report, Speech (Japanese edition), vol.110, no.297, pp.71-76, Nov. 2010.
    • (2010) IEICE Technical Report, Speech , vol.110 , Issue.297 , pp. 71-76
    • Kawakami, Y.1    Banno, H.2    Itakura, F.3
  • 6
    • 34547496175 scopus 로고    scopus 로고
    • One-to-many and many-to-one voice conversion based on eigenvoices
    • April
    • T. Toda, Y. Ohtani, and K. Shikano, "One-to-many and many-to-one voice conversion based on eigenvoices, " Proc. ICASSP, pp.1249- 1252, April 2007.
    • (2007) Proc. ICASSP , pp. 1249-1252
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 7
    • 84874403435 scopus 로고    scopus 로고
    • Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system
    • Nov
    • H. Doi, T. Toda, T. Nakano, M. Goto, and S. Nakamura, "Singing voice conversion method based on many-to-many eigenvoice conversion and training data generation using a singing-to-singing synthesis system, " Proc. APSIPA ASC, Nov. 2012.
    • (2012) Proc. APSIPA ASC
    • Doi, H.1    Toda, T.2    Nakano, T.3    Goto, M.4    Nakamura, S.5
  • 8
    • 70450194389 scopus 로고    scopus 로고
    • Many-to-many eigenvoice conversion with reference voice
    • Sept
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Many-to-many eigenvoice conversion with reference voice, " Proc. INTERSPEECH, pp.1623-1626, Sept. 2009.
    • (2009) Proc. INTERSPEECH , pp. 1623-1626
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 9
    • 67651002140 scopus 로고    scopus 로고
    • Statistical parametric speech synthesis
    • Nov
    • H. Zen, K. Tokuda, and A.W. Black, "Statistical parametric speech synthesis, " Speech Commun., vol.51, no.11, pp.1039-1064, Nov. 2009.
    • (2009) Speech Commun. , vol.51 , Issue.11 , pp. 1039-1064
    • Zen, H.1    Tokuda, K.2    Black, A.W.3
  • 10
    • 51449114529 scopus 로고    scopus 로고
    • A style control technique for hmm-based expressive speech synthesis
    • Sep
    • T. Nose, J. Yamagishi, T. Masuko, and T. Kobayashi, "A style control technique for HMM-based expressive speech synthesis, " IEICE Trans. Inf. & Syst., vol.E90-D, no.9, pp.1406-1413, Sep. 2007.
    • (2007) IEICE Trans. Inf. & Syst. , vol.E90-D , Issue.9 , pp. 1406-1413
    • Nose, T.1    Yamagishi, J.2    Masuko, T.3    Kobayashi, T.4
  • 11
    • 84870246600 scopus 로고    scopus 로고
    • An intuitive style control technique in HMM-based expressive speech synthesis using subjective style intensity and multiple-regression global variance model
    • T. Nose and T. Kobayashi, "An intuitive style control technique in HMM-based expressive speech synthesis using subjective style intensity and multiple-regression global variance model, " Speech Commun., vol.55, no.2, pp.347-357, 2013.
    • (2013) Speech Commun. , vol.55 , Issue.2 , pp. 347-357
    • Nose, T.1    Kobayashi, T.2
  • 12
    • 44949155552 scopus 로고    scopus 로고
    • A technique for controlling voice quality of synthetic speech using multiple regression HSMM
    • Sept
    • M. Tachibana, T. Nose, J. Yamagishi, and T. Kobayashi, "A technique for controlling voice quality of synthetic speech using multiple regression HSMM, " Proc. INTERSPEECH, pp.2438-2441, Sept. 2006.
    • (2006) Proc. INTERSPEECH , pp. 2438-2441
    • Tachibana, M.1    Nose, T.2    Yamagishi, J.3    Kobayashi, T.4
  • 13
    • 79959847554 scopus 로고    scopus 로고
    • Adaptive voice-quality control based on one-to-many eigenvoice conversion
    • Sept
    • K. Ohta, T. Toda, Y. Ohtani, H. Saruwatari, and K. Shikano, "Adaptive voice-quality control based on one-to-many eigenvoice conversion, " Proc. INTERSPEECH, pp.2158-2161, Sept. 2010.
    • (2010) Proc. INTERSPEECH , pp. 2158-2161
    • Ohta, K.1    Toda, T.2    Ohtani, Y.3    Saruwatari, H.4    Shikano, K.5
  • 14
    • 0016235711 scopus 로고
    • Perceptual and acoustic correlates of aging in the speech of males
    • W. Ryan and K. Burk, "Perceptual and acoustic correlates of aging in the speech of males, " J. Communication Disorders, vol.7, no.2, pp.181-192, 1974.
    • (1974) J. Communication Disorders , vol.7 , Issue.2 , pp. 181-192
    • Ryan, W.1    Burk, K.2
  • 15
    • 79959816772 scopus 로고    scopus 로고
    • Longitudinal changes of selected voice source parameters
    • Sept
    • H. Kasuya, H. Yoshida, S. Ebihara, and H. Mori, "Longitudinal changes of selected voice source parameters, " Proc. INTERSPEECH, pp.2570-2573, Sept. 2010.
    • (2010) Proc. INTERSPEECH , pp. 2570-2573
    • Kasuya, H.1    Yoshida, H.2    Ebihara, S.3    Mori, H.4
  • 16
    • 77956187922 scopus 로고    scopus 로고
    • Noise and tremor in the perception of vocal aging in males
    • J.D. Harnsberger, W.S. Brown Jr., R. Shrivastav, and H. Rothman, "Noise and tremor in the perception of vocal aging in males, " J. Voice, vol.24, no.5, pp.523-530, 2010.
    • (2010) J. Voice , vol.24 , Issue.5 , pp. 523-530
    • Harnsberger, J.D.1    Brown Jr., W.S.2    Shrivastav, R.3    Rothman, H.4
  • 17
    • 0036299156 scopus 로고    scopus 로고
    • Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers
    • May
    • N. Minematsu, M. Sekiguchi, and K. Hirose, "Automatic estimation of one's age with his/her speech based upon acoustic modeling techniques of speakers, " Proc. ICASSP, pp.137-140, May 2002.
    • (2002) Proc. ICASSP , pp. 137-140
    • Minematsu, N.1    Sekiguchi, M.2    Hirose, K.3
  • 18
    • 0033708106 scopus 로고    scopus 로고
    • Speech parameter generation algorithms for hmmbased speech synthesis
    • June
    • K. Tokuda, T. Yoshimura, T. Masuko, T. Kobayashi, and T. Kitamura, "Speech parameter generation algorithms for HMMbased speech synthesis, " Proc. ICASSP, pp.1315-1318, June 2000.
    • (2000) Proc. ICASSP , pp. 1315-1318
    • Tokuda, K.1    Yoshimura, T.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 19
    • 84874199000 scopus 로고    scopus 로고
    • Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and system straight
    • Sept
    • H. Kawahara, J. Estill, and O. Fujimura, "Aperiodicity extraction and control using mixed mode excitation and group delay manipulation for a high quality speech analysis, modification and system straight, " Proc. MAVEBA, Sept. 2001.
    • (2001) Proc. MAVEBA
    • Kawahara, H.1    Estill, J.2    Fujimura, O.3
  • 20
    • 44949143155 scopus 로고    scopus 로고
    • Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation
    • Sept
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Maximum likelihood voice conversion based on GMM with STRAIGHT mixed excitation, " Proc. INTERSPEECH, pp.2266-2269, Sept. 2006.
    • (2006) Proc. INTERSPEECH , pp. 2266-2269
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 21
    • 84867211725 scopus 로고    scopus 로고
    • Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory
    • Sept
    • T. Muramatsu, Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, "Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory, " Proc. INTERSPEECH, pp.1076-1079, Sept. 2008.
    • (2008) Proc. INTERSPEECH , pp. 1076-1079
    • Muramatsu, T.1    Ohtani, Y.2    Toda, T.3    Saruwatari, H.4    Shikano, K.5
  • 22
    • 84878390910 scopus 로고    scopus 로고
    • Implementation of computationally efficient real-time voice conversion
    • Sept
    • T. Toda, T. Muramatsu, and H. Banno, "Implementation of computationally efficient real-time voice conversion, " Proc. INTERSPEECH, Sept. 2012.
    • (2012) Proc. INTERSPEECH
    • Toda, T.1    Muramatsu, T.2    Banno, H.3
  • 23
    • 0032673049 scopus 로고    scopus 로고
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds
    • April
    • H. Kawahara, I. Masuda-Katsuse, and A. Cheveigné, "Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based f0 extraction: Possible role of a repetitive structure in sounds, " Speech Commun., vol.27, no.3-4, pp.187-207, April 1999.
    • (1999) Speech Commun. , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    Cheveigne, A.3
  • 24
    • 84901764550 scopus 로고    scopus 로고
    • Aist humming database: Music database for singing research
    • (Technical Report) (Japanese edition), vol.2005-MUS-61-2, Aug
    • M. Goto and T. Nishimura, "AIST humming database: Music database for singing research, " IPSJ SIG Notes (Technical Report) (Japanese edition), vol.2005-MUS-61-2, no.82, pp.7-12, Aug. 2005.
    • (2005) IPSJ SIG Notes , Issue.82 , pp. 7-12
    • Goto, M.1    Nishimura, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.