



Volume 30, Issue 1, 2015, Pages 3-15

Interpretable parametric voice conversion functions based on Gaussian mixture models and constrained transformations

Author keywords

Amplitude scaling; Frequency warping; Gaussian mixture models; Spectral tilt; Voice conversion

Indexed keywords

GAUSSIAN DISTRIBUTION;

EID: 84913585254     PISSN: 08852308     EISSN: 10958363     Source Type: Journal    
DOI: 10.1016/j.csl.2014.03.001     Document Type: Article
Times cited: 18

References (31)
  • 2
    • 0033154052
    • Speaker transformation algorithm using segmental codebooks (STASC)
    • L.M. Arslan, Speaker transformation algorithm using segmental codebooks (STASC), Speech Communication 28 (1999) 211-226
    • (1999) Speech Communication , vol.28 , pp. 211-226
    • Arslan, L.M.1
  • 3
    • 84865754815
    • Voice conversion using GMM with enhanced global variance
    • H. Benisty and D. Malah, Voice conversion using GMM with enhanced global variance, Proc. Interspeech (2011) 669-672
    • (2011) Proc. Interspeech , pp. 669-672
    • Benisty, H.1    Malah, D.2
  • 6
    • 84994241109
    • Including dynamic and phonetic information in voice conversion systems
    • H. Duxans, A. Bonafonte, A. Kain, and J. Van Santen, Including dynamic and phonetic information in voice conversion systems, Proc. ICSLP (2004) 1193-1196
    • (2004) Proc. ICSLP , pp. 1193-1196
    • Duxans, H.1    Bonafonte, A.2    Kain, A.3    Van Santen, J.4
  • 8
    • 80051629671
    • HNM-based MFCC + F0 extractor applied to statistical speech synthesis
    • D. Erro, I. Sainz, E. Navas, and I. Hernaez, HNM-based MFCC + F0 extractor applied to statistical speech synthesis, Proc. ICASSP (2011) 4728-4731
    • (2011) Proc. ICASSP , pp. 4728-4731
    • Erro, D.1    Sainz, I.2    Navas, E.3    Hernaez, I.4
  • 9
    • 84878409257
    • Iterative MMSE estimation of vocal tract length normalization factors for voice transformation
    • D. Erro, E. Navas, and I. Hernaez, Iterative MMSE estimation of vocal tract length normalization factors for voice transformation, Proc. Interspeech (2012) 86-89
    • (2012) Proc. Interspeech , pp. 86-89
    • Erro, D.1    Navas, E.2    Hernaez, I.3
  • 12
    • 84857498745
    • Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora
    • E. Godoy, O. Rosec, and T. Chonavel, Voice conversion using dynamic frequency warping with amplitude scaling, for parallel or nonparallel corpora, IEEE Transactions on Audio, Speech and Language Processing 20 (2012) 1313-1323
    • (2012) IEEE Transactions on Audio, Speech and Language Processing , vol.20 , pp. 1313-1323
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 14
    • 4444285698
    • High Resolution Voice Transformation
    • A. Kain, High Resolution Voice Transformation, Oregon Health and Science University, Portland, Oregon, USA (2001)
    • (2001) High Resolution Voice Transformation
    • Kain, A.1
  • 16
    • 78049373493
    • Pronunciation variation generation for spontaneous speech synthesis using state-based voice transformation
    • C.H. Lee, C.H. Wu, and J.C. Guo, Pronunciation variation generation for spontaneous speech synthesis using state-based voice transformation, Proc. ICASSP (2010) 4826-4829
    • (2010) Proc. ICASSP , pp. 4826-4829
    • Lee, C.H.1    Wu, C.H.2    Guo, J.C.3
  • 17
    • 0032657747
    • Speaker adaptation with all-pass transforms
    • J. McDonough and W. Byrne, Speaker adaptation with all-pass transforms, Proc. ICASSP (1999) 757-760
    • (1999) Proc. ICASSP , pp. 757-760
    • McDonough, J.1    Byrne, W.2
  • 18
    • 58149209073
    • Voice conversion: State of the art and perspectives
    • E. Moulines and Y. Sagisaka, Voice conversion: State of the art and perspectives, Speech Communication 16 (1995) 125-126
    • (1995) Speech Communication , vol.16 , pp. 125-126
    • Moulines, E.1    Sagisaka, Y.2
  • 20
    • 27644522706
    • Vocal tract normalization equals linear transformation in cepstral space
    • M. Pitz and H. Ney, Vocal tract normalization equals linear transformation in cepstral space, IEEE Transactions on Speech and Audio Processing 13 (2005) 930-944
    • (2005) IEEE Transactions on Speech and Audio Processing , vol.13 , pp. 930-944
    • Pitz, M.1    Ney, H.2
  • 21
    • 4544361661
    • Voice conversion through transformation of spectral and intonation features
    • D. Rentzos, S. Vaseghi, Q. Yan, and C.H. Ho, Voice conversion through transformation of spectral and intonation features, Proc. ICASSP (2004) 21-24
    • (2004) Proc. ICASSP , pp. 21-24
    • Rentzos, D.1    Vaseghi, S.2    Yan, Q.3    Ho, C.H.4
  • 25
    • 84948175540
    • VTLN-based voice conversion
    • D. Suendermann and H. Ney, VTLN-based voice conversion, Proc. ISSPIT (2003) 556-559
    • (2003) Proc. ISSPIT , pp. 556-559
    • Suendermann, D.1    Ney, H.2
  • 26
    • 80051619373
    • One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model
    • M. Tamura, M. Morita, T. Kagoshima, and M. Akamine, One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model, Proc. ICASSP (2011) 5124-5127
    • (2011) Proc. ICASSP , pp. 5124-5127
    • Tamura, M.1    Morita, M.2    Kagoshima, T.3    Akamine, M.4
  • 27
    • 84946236688
    • High quality voice conversion based on Gaussian mixture model with dynamic frequency warping
    • T. Toda, H. Saruwatari, and K. Shikano, High quality voice conversion based on Gaussian mixture model with dynamic frequency warping, Proc. Interspeech (2001) 349-352
    • (2001) Proc. Interspeech , pp. 349-352
    • Toda, T.1    Saruwatari, H.2    Shikano, K.3
  • 31
    • 84871520443
    • Improving the quality of standard GMM-based voice conversion systems by considering physically motivated linear transformations
    • T.C. Zorila, D. Erro, and I. Hernaez, Improving the quality of standard GMM-based voice conversion systems by considering physically motivated linear transformations, Communications in Computer and Information Science 328 (2012) 30-39
    • (2012) Communications in Computer and Information Science , vol.328 , pp. 30-39
    • Zorila, T.C.1    Erro, D.2    Hernaez, I.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.