메뉴 건너뛰기




Volumn 1, Issue , 2012, Pages 86-89

Iterative MMSE estimation of vocal tract length normalization factors for voice transformation

Author keywords

Frequency warping plus amplitude scaling; Speech synthesis; Vocal tract length normalization; Voice conversion

Indexed keywords

AMPLITUDE CORRECTION; AMPLITUDE SCALING; CEPSTRAL DOMAIN; CONVERSION ACCURACIES; ITERATIVE PROCEDURES; SINGLE PARAMETER; VOCAL TRACT LENGTH NORMALIZATION; VOICE CONVERSION;

EID: 84878409257     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (9)

References (16)
  • 3
    • 57749193836 scopus 로고    scopus 로고
    • Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory
    • T. Toda, A.W. Black, K. Tokuda, "Voice conversion based on maximum-likelihood estimation of spectral parameter trajectory", IEEE Trans. Audio, Speech, Lang. Process., vol. 15(8), pp. 2222-2235, 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2222-2235
    • Toda, T.1    Black, A.W.2    Tokuda, K.3
  • 4
    • 85010815133 scopus 로고
    • Voice transformation using PSOLA technique
    • H. Valbret, E. Moulines, J.P. Tubach, "Voice transformation using PSOLA technique", Speech Commun., vol. 1, pp. 145-148, 1992.
    • (1992) Speech Commun. , vol.1 , pp. 145-148
    • Valbret, H.1    Moulines, E.2    Tubach, J.P.3
  • 7
    • 80051619373 scopus 로고    scopus 로고
    • One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model
    • M. Tamura, M. Morita, T. Kagoshima, M. Akamine, "One sentence voice adaptation using GMM-based frequency-warping and shift with a sub-band basis spectrum model", Proc. ICASSP, pp. 5124-5127, 2011.
    • (2011) Proc. ICASSP , pp. 5124-5127
    • Tamura, M.1    Morita, M.2    Kagoshima, T.3    Akamine, M.4
  • 8
    • 84865717274 scopus 로고    scopus 로고
    • Spectral envelope transformation using DFW and amplitude scaling for voice conversion with parallel or nonparallel corpora
    • E. Godoy, O. Rosec, T. Chonavel, "Spectral envelope transformation using DFW and amplitude scaling for voice conversion with parallel or nonparallel corpora", Proc. Interspeech, pp. 673-676, 2011.
    • (2011) Proc. Interspeech , pp. 673-676
    • Godoy, E.1    Rosec, O.2    Chonavel, T.3
  • 9
    • 4544373000 scopus 로고    scopus 로고
    • Voice characteristics conversion for TTS using reverse VTLN
    • M. Eichner, M. Wolff, R. Hoffmann, "Voice characteristics conversion for TTS using reverse VTLN", Proc. ICASSP, pp. 17-20, 2004.
    • (2004) Proc. ICASSP , pp. 17-20
    • Eichner, M.1    Wolff, M.2    Hoffmann, R.3
  • 10
    • 0009589496 scopus 로고    scopus 로고
    • Vocal tract length normalization for large vocabulary continuous speech recognition
    • P. Zhan, A. Waibel, "Vocal tract length normalization for large vocabulary continuous speech recognition", CMU computer science technical reports, 1997.
    • (1997) CMU Computer Science Technical Reports
    • Zhan, P.1    Waibel, A.2
  • 11
    • 0032657747 scopus 로고    scopus 로고
    • Speaker adaptation with all-pass transforms
    • J. McDonough, W. Byrne, "Speaker adaptation with all-pass transforms", Proc. ICASSP, pp. 757-760, 1999.
    • (1999) Proc. ICASSP , pp. 757-760
    • McDonough, J.1    Byrne, W.2
  • 12
    • 27644522706 scopus 로고    scopus 로고
    • Vocal tract normalization equals linear transformation in cepstral space
    • M. Pitz, H. Ney, "Vocal tract normalization equals linear transformation in cepstral space", IEEE Trans. Speech and Audio Process., vol. 13(5), pp. 930-944, 2005.
    • (2005) IEEE Trans. Speech and Audio Process. , vol.13 , Issue.5 , pp. 930-944
    • Pitz, M.1    Ney, H.2
  • 13
    • 51449094035 scopus 로고    scopus 로고
    • Rapid vocal tract length normalization using maximum likelihood estimation
    • T. Emori, K. Shinoda, "Rapid vocal tract length normalization using maximum likelihood estimation", Proc. Eurospeech, pp. 1649-1652, 2001.
    • (2001) Proc. Eurospeech , pp. 1649-1652
    • Emori, T.1    Shinoda, K.2
  • 16
    • 80051629671 scopus 로고    scopus 로고
    • HNM-based MFCC+f0 extractor applied to statistical speech synthesis
    • Available at:
    • D. Erro, I. Sainz, E. Navas, I. Hernaez, "HNM-based MFCC+f0 extractor applied to statistical speech synthesis", Proc. ICASSP, pp. 4728-4731, 2011. Available at: http://aholab.ehu.es/ahocoder
    • (2011) Proc. ICASSP , pp. 4728-4731
    • Erro, D.1    Sainz, I.2    Navas, E.3    Hernaez, I.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.