



Volume 1, 2012, Pages 94-97

Implementation of computationally efficient real-time voice conversion

Author keywords

Computational efficiency; Low delay conversion; Real time processing; Voice conversion

Indexed keywords

COMPUTATIONALLY EFFICIENT; CONVERSION ALGORITHM; COVARIANCE MATRICES; GAUSSIAN MIXTURE MODEL (GMMS); LOW DELAY; REAL-TIME CONVERSION; REALTIME PROCESSING; VOICE CONVERSION

EID: 84878390910     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 62

References (10)
  • 1
    • H. Doi, K. Nakamura, T. Toda, H. Saruwatari, and K. Shikano. Esophageal speech enhancement based on statistical voice conversion with Gaussian mixture models. IEICE Trans. Inf. & Syst., Vol. E93-D, No. 9, pp. 2472-2482, 2010.
    • Scopus EID: 77956795483
  • 3
    • T. Toda, A.W. Black, and K. Tokuda. Voice conversion based on maximum likelihood estimation of spectral parameter trajectory. IEEE Trans. Audio, Speech and Language Processing, Vol. 15, No. 8, pp. 2222-2235, 2007.
    • Scopus EID: 57749193836
  • 4
    • T. Muramatsu, Y. Ohtani, T. Toda, H. Saruwatari, and K. Shikano. Low-delay voice conversion based on maximum likelihood estimation of spectral parameter trajectory. Proc. INTERSPEECH, pp. 1076-1079, Brisbane, Australia, Sep. 2008.
    • Scopus EID: 84867211725
  • 5
    • K. Tokuda, T. Kobayashi, and S. Imai. Speech parameter generation from HMM using dynamic features. Proc. of ICASSP, pp. 660-663, Detroit, USA, May 1995.
    • Scopus EID: 0028996993
  • 6
    • K. Koishida, K. Tokuda, T. Masuko, and T. Kobayashi. Vector quantization of speech spectral parameters using statistics of dynamic features. IEICE Trans. Information and Systems, Vol. E84-D, No. 10, pp. 1427-1434, 2001.
    • Scopus EID: 0035483059
  • 8
    • T. Toda, K. Nakamura, H. Sekimoto, and K. Shikano. Voice conversion for various types of body transmitted speech. Proc. ICASSP, pp. 3601-3604, Taipei, Taiwan, Apr. 2009.
    • Scopus EID: 70349200844
  • 9
    • M.J.F. Gales. Semi-tied covariance matrices for hidden Markov models. IEEE Trans. Speech and Audio Processing, Vol. 7, No. 3, pp. 272-281, 1999.
    • Scopus EID: 0032638856
  • 10
    • M.J.F. Gales. Maximum likelihood linear transformations for HMM-based speech recognition. Computer Speech and Language, Vol. 12, No. 2, pp. 75-98, 1998.
    • Scopus EID: 0032050110


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.