Volume 52, Issue 6, 2004, Pages 1783-1790

Constrained optimization for audio-to-visual conversion

Author keywords

Audio to visual conversion; HMM; HMMI; Talking heads

Indexed keywords

ALGORITHMS; COMPUTER SIMULATION; CONSTRAINT THEORY; IMAGE ANALYSIS; LAGRANGE MULTIPLIERS; MARKOV PROCESSES; NEURAL NETWORKS; OPTIMIZATION; PARAMETER ESTIMATION; PROBABILITY DISTRIBUTIONS; VECTOR QUANTIZATION;

EID: 2942596586     PISSN: 1053587X     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSP.2004.827153     Document Type: Article
Times cited: 8

References (25)
  • 1
    • 2942629743
    • ISO/IEC JTC1/SC29/WG11 N2501, Nov.
    • ISO/IEC FDIS 14 496-1 Systems, ISO/IEC JTC1/SC29/WG11 N2501, Nov. 1998.
    • (1998) ISO/IEC FDIS 14 496-1 Systems
  • 2
    • 0038669820
    • ISO/IEC JTC1/SC29/WG11 N2502, Nov.
    • ISO/IEC FDIS 14 496-2 Visual, ISO/IEC JTC1/SC29/WG11 N2502, Nov. 1998.
    • (1998) ISO/IEC FDIS 14 496-2 Visual
  • 4
    • 0032683588
    • SeamlessDesign: A face-to-face collaborative virtual/augmented environment for rapid prototyping of geometrically constrained 3-D objects
    • K. Kiyokawa, H. Takemura, and N. Yokoya, "SeamlessDesign: a face-to-face collaborative virtual/augmented environment for rapid prototyping of geometrically constrained 3-D objects," in Proc. IEEE Int. Conf. Multimedia Comput. Syst., vol. 2, 1999, pp. 447-453.
    • (1999) Proc. IEEE Int. Conf. Multimedia Comput. Syst. , vol.2 , pp. 447-453
    • Kiyokawa, K.1    Takemura, H.2    Yokoya, N.3
  • 7
    • 0026156861
    • A media conversion from speech to facial image for intelligent man-machine interface
    • S. Morishima and H. Harashima, "A media conversion from speech to facial image for intelligent man-machine interface," IEEE J. Select. Areas Commun., vol. 9, pp. 594-600, May 1991.
    • (1991) IEEE J. Select. Areas Commun. , vol.9 , pp. 594-600
    • Morishima, S.1    Harashima, H.2
  • 8
    • 0029270677
    • Converting speech into lip movement: A multimedia telephone for hard of hearing people
    • F. Lavagetto, "Converting speech into lip movement: a multimedia telephone for hard of hearing people," IEEE Trans. Rehab. Eng., vol. 3, pp. 90-102, Jan. 1995.
    • (1995) IEEE Trans. Rehab. Eng. , vol.3 , pp. 90-102
    • Lavagetto, F.1
  • 9
    • 0031257449
    • Time-delay neural networks for estimating lip movements from speech analysis: A useful tool in audio-video synchronization
    • F. Lavagetto, "Time-delay neural networks for estimating lip movements from speech analysis: a useful tool in audio-video synchronization," IEEE Trans. Circuits Syst. Video Technol., vol. 7, pp. 786-800, May 1997.
    • (1997) IEEE Trans. Circuits Syst. Video Technol. , vol.7 , pp. 786-800
    • Lavagetto, F.1
  • 10
    • 0031997085
    • Audio-to-visual conversion for multimedia communication
    • R. R. Rao, T. Chen, and R. M. Mersereau, "Audio-to-visual conversion for multimedia communication," IEEE Trans. Ind. Electron., vol. 45, pp. 15-22, Jan. 1998.
    • (1998) IEEE Trans. Ind. Electron. , vol.45 , pp. 15-22
    • Rao, R.R.1    Chen, T.2    Mersereau, R.M.3
  • 11
    • 0035251712
    • Speech-to-lip movement synthesis by maximizing audio-visual joint probability based on the EM algorithm
    • S. Nakamura and E. Yamamoto, "Speech-to-lip movement synthesis by maximizing audio-visual joint probability based on the EM algorithm," J. VLSI Signal Process., vol. 27, pp. 119-126, 2001.
    • (2001) J. VLSI Signal Process. , vol.27 , pp. 119-126
    • Nakamura, S.1    Yamamoto, E.2
  • 15
    • 0031100269
    • Robust speech recognition based on joint model and feature space optimization of hidden Markov models
    • S. Y. Moon and J. N. Hwang, "Robust speech recognition based on joint model and feature space optimization of hidden Markov models," IEEE Trans. Neural Networks, vol. 8, pp. 194-204, Mar. 1997.
    • (1997) IEEE Trans. Neural Networks , vol.8 , pp. 194-204
    • Moon, S.Y.1    Hwang, J.N.2
  • 16
    • 0028317510
    • A projection-based likelihood measure for speech recognition in noise
    • B. A. Carlson and M. A. Clements, "A projection-based likelihood measure for speech recognition in noise," IEEE Trans. Speech Audio Processing, vol. 2, pp. 97-102, Jan. 1994.
    • (1994) IEEE Trans. Speech Audio Processing , vol.2 , pp. 97-102
    • Carlson, B.A.1    Clements, M.A.2
  • 22
    • 0034512820
    • Emotional expressions in audiovisual human computer interaction
    • L. S. Chen and T. S. Huang, "Emotional expressions in audiovisual human computer interaction," in Proc. IEEE Int. Conf. Multimedia Expo, vol. 1, 2000, pp. 423-426.
    • (2000) Proc. IEEE Int. Conf. Multimedia Expo , vol.1 , pp. 423-426
    • Chen, L.S.1    Huang, T.S.2
  • 23
    • 2942630206
    • [Online] http://htk.eng.cam.ac.uk
  • 25
    • 0038370173
    • A probabilistic network for facial feature verification
    • K. H. Choi, J. J. Yoo, T. H. Hwang, J. H. Park, and J. H. Lee, "A probabilistic network for facial feature verification," ETRI J., vol. 25, no. 2, pp. 140-143, 2003.
    • (2003) ETRI J. , vol.25 , Issue.2 , pp. 140-143
    • Choi, K.H.1    Yoo, J.J.2    Hwang, T.H.3    Park, J.H.4    Lee, J.H.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.