메뉴 건너뛰기




Volumn 14, Issue 5, 2004, Pages 682-692

Speech-to-video synthesis using MPEG-4 compliant visual features

Author keywords

Audio visual speech recognition; Correlation hidden Markov models (CHMMs); Facial animation parameters (FAPs); Speech to video synthesis

Indexed keywords

ALGORITHMS; CORRELATION METHODS; DATA REDUCTION; MARKOV PROCESSES; SIGNAL TO NOISE RATIO; SPEECH RECOGNITION; SPEECH SYNTHESIS; SYNCHRONIZATION; TOPOLOGY;

EID: 2542499812     PISSN: 10518215     EISSN: None     Source Type: Journal    
DOI: 10.1109/TCSVT.2004.826760     Document Type: Article
Times cited : (21)

References (32)
  • 1
    • 0031187171 scopus 로고    scopus 로고
    • Speech recognition by machines and humans
    • July
    • R. Lippman, "Speech recognition by machines and humans," Speech Commun., vol. 22, no. 1, pp. 1-15, July 1997.
    • (1997) Speech Commun. , vol.22 , Issue.1 , pp. 1-15
    • Lippman, R.1
  • 2
    • 0029288202 scopus 로고
    • Speech recognition in noisy environments: A survey
    • Y. Gong, "Speech recognition in noisy environments: A survey," Speech Commun., vol. 16, pp. 261-291, 1995.
    • (1995) Speech Commun. , vol.16 , pp. 261-291
    • Gong, Y.1
  • 4
    • 0024610919 scopus 로고
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , pp. 257-286
    • Rabiner, L.R.1
  • 5
    • 0029270677 scopus 로고
    • Converting speech into lip movements: A multimedia telephone for hard of hearing people
    • Mar.
    • F. Lavagetto, "Converting speech into lip movements: A multimedia telephone for hard of hearing people," IEEE Trans. Rehab. Eng., vol. 3, pp. 1-14, Mar. 1995.
    • (1995) IEEE Trans. Rehab. Eng. , vol.3 , pp. 1-14
    • Lavagetto, F.1
  • 6
    • 0000051247 scopus 로고
    • Generation of mouth shapes for a synthetic talking head
    • A. Simons and S. Cox, "Generation of mouth shapes for a synthetic talking head," Proc. Inst. Acoust., vol. 12, pp. 475-482, 1990.
    • (1990) Proc. Inst. Acoust. , vol.12 , pp. 475-482
    • Simons, A.1    Cox, S.2
  • 7
    • 0032074310 scopus 로고    scopus 로고
    • Audio-visual integration in multimedia communication
    • May
    • T. Chen and R. R. Rao, "Audio-visual integration in multimedia communication," Proc. IEEE, vol. 86, pp. 837-852, May 1998.
    • (1998) Proc. IEEE , vol.86 , pp. 837-852
    • Chen, T.1    Rao, R.R.2
  • 8
    • 0030677313 scopus 로고    scopus 로고
    • Video rewrite: Driving visual speech with audio
    • C. Bregler, M. Covell, and M. Slaney, "Video rewrite: Driving visual speech with audio," in Proc. ACM SIGGRAPH, 1997, pp. 353-360.
    • (1997) Proc. ACM SIGGRAPH , pp. 353-360
    • Bregler, C.1    Covell, M.2    Slaney, M.3
  • 9
    • 0000497160 scopus 로고    scopus 로고
    • Baum-welch hidden Markov model inversion for reliable audio-to-video conversion
    • K. Choi and J.-N. Hwang, "Baum-welch hidden Markov model inversion for reliable audio-to-video conversion," in Proc. IEEE 3rd Workshop Multimedia Signal Processing, 1999, pp. 175-180.
    • (1999) Proc. IEEE 3rd Workshop Multimedia Signal Processing , pp. 175-180
    • Choi, K.1    Hwang, J.-N.2
  • 10
    • 0031100269 scopus 로고    scopus 로고
    • Robust speech recognition based on joint model and feature space optimization of hidden Markov models
    • Mar.
    • S. Moon and J.-N. Hwang, "Robust speech recognition based on joint model and feature space optimization of hidden Markov models," IEEE Trans. Neural Networks, vol. 8, pp. 194-204, Mar. 1997.
    • (1997) IEEE Trans. Neural Networks , vol.8 , pp. 194-204
    • Moon, S.1    Hwang, J.-N.2
  • 17
    • 0035472468 scopus 로고    scopus 로고
    • An efficient use of MPEG-4 FAP interpolation for facial animation at 70 bits/frame
    • Oct.
    • F. Lavagetto and R. Pockaj, "An efficient use of MPEG-4 FAP interpolation for facial animation at 70 bits/frame," IEEE Trans. Circuits Syst. Video Technol., vol. 11, pp. 1085-1097, Oct. 2001.
    • (2001) IEEE Trans. Circuits Syst. Video Technol. , vol.11 , pp. 1085-1097
    • Lavagetto, F.1    Pockaj, R.2
  • 19
    • 0036447870 scopus 로고    scopus 로고
    • Audio-visual continuous speech recognition using MPEG-4 compliant visual feature
    • Rochester, NY, Sept.
    • _, "Audio-visual continuous speech recognition using MPEG-4 compliant visual feature," in Proc. Int. Conf. Image Processing, Rochester, NY, Sept. 2002, pp. 960-963.
    • (2002) Proc. Int. Conf. Image Processing , pp. 960-963
  • 26
    • 4544290191 scopus 로고    scopus 로고
    • Recent advances in the automatic recognition of audio-visual speech
    • Sept.
    • G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. W. Senior, "Recent advances in the automatic recognition of audio-visual speech," Proc. IEEE, vol. 91, pp. 1306-1326, Sept. 2003.
    • (2003) Proc. IEEE , vol.91 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.W.5
  • 29
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modeling for continuous speech recognition
    • Mar.
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, pp. 141-151, Mar. 2000.
    • (2000) IEEE Trans. Multimedia , vol.2 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 32
    • 0345134263 scopus 로고    scopus 로고
    • Speech-to-video synthesis using facial animation parameters
    • Barcelona, Spain, Sept.
    • P. S. Aleksic and A. K. Katsaggelos, "Speech-to-video synthesis using facial animation parameters," in Proc. Int. Conf. Image Processing, Barcelona, Spain, Sept. 2003, pp. 1-4.
    • (2003) Proc. Int. Conf. Image Processing , pp. 1-4
    • Aleksic, P.S.1    Katsaggelos, A.K.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.