



2007, Pages 101-106

Regression Approaches to Voice Quality Control Based on One-to-Many Eigenvoice Conversion

Author keywords

[No Author keywords available]

Indexed keywords

GAUSSIAN DISTRIBUTION; QUALITY ASSURANCE; QUALITY CONTROL;

EID: 78049390735     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (19)
  • 1
    • 0029256373
    • Acoustic characteristics of speaker individuality: control and conversion
    • H. Kuwabara and Y. Sagisaka, “Acoustic characteristics of speaker individuality: control and conversion,” Speech Communication, Vol. 16, No. 2, pp. 165-173, 1995.
    • (1995) Speech Communication , vol.16 , Issue.2 , pp. 165-173
    • Kuwabara, H.1    Sagisaka, Y.2
  • 2
    • 0030374940
    • Speech morphing by gradually changing spectrum parameter and fundamental frequency
    • M. Abe, “Speech morphing by gradually changing spectrum parameter and fundamental frequency,” in Proc. ICSLP 96, Oct. 1996, Vol. 4, pp. 2235-2238.
    • (1996) Proc. ICSLP 96 , vol.4 , pp. 2235-2238
    • Abe, M.1
  • 3
    • 34247565129
    • Implementation of realtime STRAIGHT speech manipulation system: Report on its implementation
    • H. Banno, H. Hata, M. Morise, T. Takahashi, T. Irino, and H. Kawahara, “Implementation of realtime STRAIGHT speech manipulation system: Report on its implementation,” Acoust. Sci. & Tech., Vol. 28, No. 3, pp. 140-146, 2007.
    • (2007) Acoust. Sci. & Tech , vol.28 , Issue.3 , pp. 140-146
    • Banno, H.1    Hata, H.2    Morise, M.3    Takahashi, T.4    Irino, T.5    Kawahara, H.6
  • 6
    • 33646779506
    • Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter
    • T. Toda, A.W. Black, and K. Tokuda, “Spectral conversion based on maximum likelihood estimation considering global variance of converted parameter,” in Proc. ICASSP 2005, Mar. 2005, Vol. 1, pp. 9-12.
    • (2005) Proc. ICASSP 2005 , vol.1 , pp. 9-12
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 7
    • 34547512822
    • Eigenvoice conversion based on Gaussian mixture model
    • T. Toda, Y. Ohtani, and K. Shikano, “Eigenvoice conversion based on Gaussian mixture model,” in Proc. INTERSPEECH2006-ICSLP, Sep. 2006, pp. 2446-2449.
    • (2006) Proc. INTERSPEECH2006-ICSLP , pp. 2446-2449
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 8
    • 34547496175
    • One-to-many and many-to-one voice conversion based on eigenvoices
    • T. Toda, Y. Ohtani, and K. Shikano, “One-to-many and many-to-one voice conversion based on eigenvoices,” in Proc. ICASSP 2007, Apr. 2007, pp. 1249-1252.
    • (2007) Proc. ICASSP 2007 , pp. 1249-1252
    • Toda, T.1    Ohtani, Y.2    Shikano, K.3
  • 10
    • 85009250381
    • Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks
    • P. Kenny, G. Boulianne, and P. Dumouchel, “Maximum likelihood estimation of eigenvoices and residual variances for large vocabulary speech recognition tasks,” in Proc. ICSLP 2002, Sep. 2002, pp. 57-60.
    • (2002) Proc. ICSLP 2002 , pp. 57-60
    • Kenny, P.1    Boulianne, G.2    Dumouchel, P.3
  • 12
    • 85009069226
    • A style control technique for HMM-based speech synthesis
    • K. Miyanaga, T. Masuko, and T. Kobayashi, “A style control technique for HMM-based speech synthesis,” in Proc. INTERSPEECH2004-ICSLP, Oct. 2004, pp. 1437-1440.
    • (2004) Proc. INTERSPEECH2004-ICSLP , pp. 1437-1440
    • Miyanaga, K.1    Masuko, T.2    Kobayashi, T.3
  • 13
    • 44949155552
    • A technique for controlling voice quality of synthetic speech using multiple regression HSMM
    • M. Tachibana, T. Nose, J. Yamagishi, and T. Kobayashi, “A technique for controlling voice quality of synthetic speech using multiple regression HSMM,” in Proc. INTERSPEECH2006-ICSLP, Sep. 2006, pp. 2438-2441.
    • (2006) Proc. INTERSPEECH2006-ICSLP , pp. 2438-2441
    • Tachibana, M.1    Nose, T.2    Yamagishi, J.3    Kobayashi, T.4
  • 14
  • 15
    • 85133679946
    • Speaker adaptive training for voice conversion based on eigenvoice
    • SP2006-40, [in Japanese]
    • Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano, “Speaker adaptive training for voice conversion based on eigenvoice,” IEICE Tech. Rep., SP2006-40, pp. 31-36, 2006 [in Japanese].
    • (2006) IEICE Tech. Rep , pp. 31-36
    • Ohtani, Y.1    Toda, T.2    Saruwatari, H.3    Shikano, K.4
  • 17
    • 85009291324
    • Extraction of everyday expression associated with voice quality of normal utterance
    • [in Japanese]
    • H. Kido and H. Kasuya, “Extraction of everyday expression associated with voice quality of normal utterance,” J. Acoust. Soc. Jpn., Vol. 55, No. 6, pp. 405-411, 1999 [in Japanese].
    • (1999) J. Acoust. Soc. Jpn , vol.55 , Issue.6 , pp. 405-411
    • Kido, H.1    Kasuya, H.2
  • 18
    • 60649119506
    • Everyday expressions associated with voice quality of normal utterance —Extraction by perceptual evaluation
    • [in Japanese]
    • H. Kido and H. Kasuya, “Everyday expressions associated with voice quality of normal utterance —Extraction by perceptual evaluation—,” J. Acoust. Soc. Jpn., Vol. 57, No. 5, pp. 337-344, 2001 [in Japanese].
    • (2001) J. Acoust. Soc. Jpn , vol.57 , Issue.5 , pp. 337-344
    • Kido, H.1    Kasuya, H.2
  • 19
    • 0032673049
    • Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds
    • H. Kawahara, I. Masuda-Katsuse, and A. de Cheveigné, “Restructuring speech representations using a pitch-adaptive time-frequency smoothing and an instantaneous-frequency-based F0 extraction: possible role of a repetitive structure in sounds,” Speech Communication, Vol. 27, No. 3-4, pp. 187-207, 1999.
    • (1999) Speech Communication , vol.27 , Issue.3-4 , pp. 187-207
    • Kawahara, H.1    Masuda-Katsuse, I.2    de Cheveigné, A.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.