메뉴 건너뛰기




Volumn , Issue , 2000, Pages

Hmm-based text-to-audio-visual speech synthesis

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER VISION; HIDDEN MARKOV MODELS; IMAGE PROCESSING; MARKOV PROCESSES; PRINCIPAL COMPONENT ANALYSIS; SPEECH ANALYSIS; SPEECH SYNTHESIS; VECTOR SPACES;

EID: 85009089413     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (39)

References (13)
  • 1
    • 0017199877 scopus 로고
    • Hearing lips and seeing voices
    • Dec.
    • H. McGurk and J. MacDonald, "Hearing lips and seeing voices", Nature, 264, pp. 746-748, Dec. 1976.
    • (1976) Nature , vol.264 , pp. 746-748
    • McGurk, H.1    MacDonald, J.2
  • 2
    • 78649308717 scopus 로고    scopus 로고
    • Recent developments in facial animation: An Inside View
    • M. M. Cohen, J. Beskow and D. W. Massaro, "Recent developments in facial animation: An Inside View", Proc. AVSP, pp. 201-206, 1998.
    • (1998) Proc. AVSP , pp. 201-206
    • Cohen, M.M.1    Beskow, J.2    Massaro, D.W.3
  • 3
    • 84919370414 scopus 로고    scopus 로고
    • Text-to-audio-visual speech synthesis based on parameter generation from HMM
    • M. Tamura, S. Kondo, T. Masuko, and T. Kobayashi, "Text-to-audio-visual speech synthesis based on parameter generation from HMM", Proc. of EUROSPEECH, Vol 2, pp. 959-962, 1999.
    • (1999) Proc. of EUROSPEECH , vol.2 , pp. 959-962
    • Tamura, M.1    Kondo, S.2    Masuko, T.3    Kobayashi, T.4
  • 5
    • 85009139544 scopus 로고    scopus 로고
    • Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis
    • T. Yoshimura, K. Tokuda, T. Masuko, T. Kobayashi and T. Kitamura, "Simultaneous modeling of spectrum, pitch and duration in HMM-based speech synthesis", Proc. EUROSPEECH, vol. 5, pp. 2347-2350, 1999.
    • (1999) Proc. EUROSPEECH , vol.5 , pp. 2347-2350
    • Yoshimura, T.1    Tokuda, K.2    Masuko, T.3    Kobayashi, T.4    Kitamura, T.5
  • 6
    • 84925596359 scopus 로고    scopus 로고
    • Two- and threedimensional audio-visual speech synthesis
    • N. M. Brooke and S. D. Scott, "Two- and threedimensional audio-visual speech synthesis", Proc. AVSP, pp. 213-218, 1998.
    • (1998) Proc. AVSP , pp. 213-218
    • Brooke, N.M.1    Scott, S.D.2
  • 7
    • 0033721603 scopus 로고    scopus 로고
    • A hidden Markov model based visual speech synthesizer
    • J. J. Williams, A. K. Katsaggelos and M. A. Randolph, "A hidden Markov model based visual speech synthesizer", Proc. ICASSP, vol. 4, pp. 2393-2396, 2000.
    • (2000) Proc. ICASSP , vol.4 , pp. 2393-2396
    • Williams, J.J.1    Katsaggelos, A.K.2    Randolph, M.A.3
  • 9
    • 0032678076 scopus 로고    scopus 로고
    • Hidden Markov models based on multi-space probability distribution for pitch pattern modeling
    • Mar.
    • K. Tokuda, T. Masuko, N. Miyazaki and T. Kobayashi, "Hidden Markov models based on multi-space probability distribution for pitch pattern modeling", Proc. of ICASSP, vol. 1, pp. 229-232, Mar. 1999.
    • (1999) Proc. of ICASSP , vol.1 , pp. 229-232
    • Tokuda, K.1    Masuko, T.2    Miyazaki, N.3    Kobayashi, T.4
  • 11
    • 85016140477 scopus 로고
    • An adaptive algorithm for mel-cepstral analysis of speech
    • T. Fukada, K. Tokuda, T. Kobayashi and S. Imai, "An adaptive algorithm for mel-cepstral analysis of speech", Proc. of ICASSP, vol. 1, pp. 137-140, 1992.
    • (1992) Proc. of ICASSP , vol.1 , pp. 137-140
    • Fukada, T.1    Tokuda, K.2    Kobayashi, T.3    Imai, S.4
  • 12
    • 0020596154 scopus 로고
    • Cepstral analysis synthesis on the mel frequency scale
    • S. Imai, "Cepstral analysis synthesis on the mel frequency scale", Proc. of ICASSP, pp. 93-96, 1983.
    • (1983) Proc. of ICASSP , pp. 93-96
    • Imai, S.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.