



Volume E89-D, Issue 11, 2006, Pages 2775-2782

Hybrid voice conversion of unit selection and generation using prosody dependent HMM

Author keywords

HMM; MLLR; Speech synthesis; Unit selection; Voice conversion

Indexed keywords

OPTIMIZATION; PROBABILITY DISTRIBUTIONS; WAVEFORM ANALYSIS

EID: 33845586220     PISSN: 09168532     EISSN: 17451361     Source Type: Journal    
DOI: 10.1093/ietisy/e89-d.11.2775     Document Type: Article
Times cited: 8

References (14)
  • 2
    • 0011946055
    • CHATR: A high-definition speech re-sequencing system
    • N. Campbell, "CHATR: A high-definition speech re-sequencing system," Proc. 3rd ASA/ASJ Joint Meeting, pp.1223-1228, 1996.
    • (1996) Proc. 3rd ASA/ASJ Joint Meeting , pp. 1223-1228
    • Campbell, N.1
  • 3
    • 0029725605
    • Speech synthesis from HMMs using dynamic features
    • T. Masuko, K. Tokuda, T. Kobayashi, and S. Imai, "Speech synthesis from HMMs using dynamic features," Proc. ICASSP96, vol.1, pp.389-392, 1996.
    • (1996) Proc. ICASSP96 , vol.1 , pp. 389-392
    • Masuko, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 4
    • 0030263447
    • Mean and variance adaptation within the MLLR framework
    • M.J.F. Gales and P.C. Woodland, "Mean and variance adaptation within the MLLR framework," Comput. Speech Lang., vol.10, no.4, pp.249-264, 1996.
    • (1996) Comput. Speech Lang. , vol.10 , Issue.4 , pp. 249-264
    • Gales, M.J.F.1    Woodland, P.C.2
  • 5
    • 0034842740
    • Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR
    • M. Tamura, T. Masuko, K. Tokuda, and T. Kobayashi, "Adaptation of pitch and spectrum for HMM-based speech synthesis using MLLR," Proc. ICASSP2001, vol.2, pp.805-808, 2001.
    • (2001) Proc. ICASSP2001 , vol.2 , pp. 805-808
    • Tamura, M.1    Masuko, T.2    Tokuda, K.3    Kobayashi, T.4
  • 7
    • 0032026483
    • Continuous probabilistic transform for voice conversion
    • Y. Stylianou, O. Cappe, and E. Moulines, "Continuous probabilistic transform for voice conversion," IEEE Trans. Speech Audio Process., vol.6, no.2, pp.131-142, March 1998.
    • (1998) IEEE Trans. Speech Audio Process , vol.6 , Issue.2 , pp. 131-142
    • Stylianou, Y.1    Cappe, O.2    Moulines, E.3
  • 8
    • 0031623661
    • Spectral voice conversion for text-to-speech synthesis
    • A. Kain and M. Macon, "Spectral voice conversion for text-to-speech synthesis," Proc. ICASSP98, pp.285-299, 1998.
    • (1998) Proc. ICASSP98 , pp. 285-299
    • Kain, A.1    Macon, M.2
  • 9
    • 85031628788
    • An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features
    • K. Tokuda, T. Masuko, T. Yamada, T. Kobayashi, and S. Imai, "An algorithm for speech parameter generation from continuous mixture HMMs with dynamic features," Proc. EUROSPEECH95, pp.757-760, 1995.
    • (1995) Proc. EUROSPEECH95 , pp. 757-760
    • Tokuda, K.1    Masuko, T.2    Yamada, T.3    Kobayashi, T.4    Imai, S.5
  • 10
    • 0029765811
    • Unit selection in a concatenative speech synthesis system using a large speech database
    • A.J. Hunt and A.W. Black, "Unit selection in a concatenative speech synthesis system using a large speech database," Proc. ICASSP96, pp.373-376, 1996.
    • (1996) Proc. ICASSP96 , pp. 373-376
    • Hunt, A.J.1    Black, A.W.2
  • 11
    • 0028996983
    • Automatic speech synthesizer parameter estimation using HMMs
    • R.E. Donovan and P.C. Woodland, "Automatic speech synthesizer parameter estimation using HMMs," Proc. ICASSP95, pp.640-643, 1995.
    • (1995) Proc. ICASSP95 , pp. 640-643
    • Donovan, R.E.1    Woodland, P.C.2
  • 12
    • 0030683369
    • Recent improvements on Microsoft's trainable text-to-speech system -Whistler
    • X. Huang, A. Acero, H. Hon, Y. Ju, J. Liu, S. Meredith, and M. Plumpe, "Recent improvements on Microsoft's trainable text-to-speech system -Whistler," Proc. ICASSP97, vol.2, pp.959-962, 1997.
    • (1997) Proc. ICASSP97 , vol.2 , pp. 959-962
    • Huang, X.1    Acero, A.2    Hon, H.3    Ju, Y.4    Liu, J.5    Meredith, S.6    Plumpe, M.7
  • 13
    • 0030364809
    • An excitation synchronous pitch waveform extraction method and its application to the VCV-concatenation synthesis of Japanese spoken words
    • Y. Arai, R. Mochizuki, H. Nishimura, and T. Honda, "An excitation synchronous pitch waveform extraction method and its application to the VCV-concatenation synthesis of Japanese spoken words," Proc. ICSLP96, vol.3, pp.1437-1440, 1996.
    • (1996) Proc. ICSLP96 , vol.3 , pp. 1437-1440
    • Arai, Y.1    Mochizuki, R.2    Nishimura, H.3    Honda, T.4
  • 14
    • 0025543906
    • Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones
    • E. Moulines and F. Charpentier, "Pitch-synchronous waveform processing techniques for text-to-speech synthesis using diphones," Speech Commun., vol.9, pp.453-467, 1990.
    • (1990) Speech Commun. , vol.9 , pp. 453-467
    • Moulines, E.1    Charpentier, F.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.