Volume 7, Issue 2, 2005, Pages 243-252

Audio/visual mapping with cross-modal hidden Markov models

Author keywords

3-D audio/video processing; Joint media and multimodal processing; Speech reading and lip synchronization

Indexed keywords

ANIMATION; DATA ACQUISITION; FEATURE EXTRACTION; MAPPING; MARKOV PROCESSES; MATHEMATICAL MODELS; MODAL ANALYSIS; VIDEO SIGNAL PROCESSING

EID: 16244385915     PISSN: 15209210     EISSN: None     Source Type: Journal    
DOI: 10.1109/TMM.2005.843341     Document Type: Review
Times cited: 59

References (32)
  • 1
    • 0033336969
    • User evaluation: Synthetic talking faces for interactive services
    • I. Pandzic, J. Ostermann, and D. Millen, "User evaluation: Synthetic talking faces for interactive services," Vis. Comput., vol. 15, no. 7-8, pp. 330-340, 1999.
    • (1999) Vis. Comput. , vol.15 , Issue.7-8 , pp. 330-340
    • Pandzic, I.1    Ostermann, J.2    Millen, D.3
  • 2
    • 78649308717
    • Recent developments in facial animation: An inside view
    • Terrigal, Australia
    • M. M. Cohen, J. Beskow, and D. W. Massaro, "Recent developments in facial animation: An inside view," in Proc. AVSP, Terrigal, Australia, 1998, pp. 201-206.
    • (1998) Proc. AVSP , pp. 201-206
    • Cohen, M.M.1    Beskow, J.2    Massaro, D.W.3
  • 5
    • 0242664388
    • Real-time talking head driven by voice and its application to communication and entertainment
    • Terrigal, Australia
    • S. Morishima, "Real-time talking head driven by voice and its application to communication and entertainment," in Proc. AVSP, Terrigal, Australia, 1998, pp. 195-199.
    • (1998) Proc. AVSP , pp. 195-199
    • Morishima, S.1
  • 6
    • 0030677313
    • Video rewrite: Driving visual speech with audio
    • C. Bregler, M. Covell, and M. Slaney, "Video rewrite: Driving visual speech with audio," in Proc. ACM SIGGRAPH'97, 1997, pp. 353-360.
    • (1997) Proc. ACM SIGGRAPH'97 , pp. 353-360
    • Bregler, C.1    Covell, M.2    Slaney, M.3
  • 7
    • 0026156861
    • A media conversion from speech to facial image for intelligent man-machine interface
    • May
    • S. Morishima and H. Harashima, "A media conversion from speech to facial image for intelligent man-machine interface," IEEE J. Select. Areas Commun., vol. 9, no. 4, pp. 594-600, May 1991.
    • (1991) IEEE J. Select. Areas Commun. , vol.9 , Issue.4 , pp. 594-600
    • Morishima, S.1    Harashima, H.2
  • 8
    • 0029270677
    • Converting speech into lip movements: A multimedia telephone for hard of hearing people
    • Mar.
    • F. Lavagetto, "Converting speech into lip movements: A multimedia telephone for hard of hearing people," IEEE Trans. Rehabil. Eng., vol. 3, no. 1, pp. 90-102, Mar. 1995.
    • (1995) IEEE Trans. Rehabil. Eng. , vol.3 , Issue.1 , pp. 90-102
    • Lavagetto, F.1
  • 9
    • 85133709259
    • Picture my voice: Audio to visual speech synthesis using artificial neural networks
    • D. W. Massaro, Ed., Santa Cruz, CA
    • D. W. Massaro, J. Beskow, M. M. Cohen, C. L. Fry, and T. Rodriguez, "Picture my voice: Audio to visual speech synthesis using artificial neural networks," in Proc. AVSP, D. W. Massaro, Ed., Santa Cruz, CA, 1999, pp. 133-138.
    • (1999) Proc. AVSP , pp. 133-138
    • Massaro, D.W.1    Beskow, J.2    Cohen, M.M.3    Fry, C.L.4    Rodriguez, T.5
  • 10
    • 0036650837
    • Real-time speech-driven face animation with expressions using neural networks
    • Jul.
    • P. Hong, Z. Wen, and T. S. Huang, "Real-time speech-driven face animation with expressions using neural networks," IEEE Trans. Neural Netw., vol. 13, no. 4, pp. 916-927, Jul. 2002.
    • (2002) IEEE Trans. Neural Netw. , vol.13 , Issue.4 , pp. 916-927
    • Hong, P.1    Wen, Z.2    Huang, T.S.3
  • 11
    • 85032752352
    • Audiovisual speech processing: Lip reading and lip synchronization
    • Jan.
    • T. Chen, "Audiovisual speech processing: Lip reading and lip synchronization," IEEE Signal Process. Mag., vol. 18, no. 1, pp. 9-21, Jan. 2001.
    • (2001) IEEE Signal Process. Mag. , vol.18 , Issue.1 , pp. 9-21
    • Chen, T.1
  • 12
    • 0035426641
    • Hidden Markov model inversion for audio-to-visual conversion in an MPEG-4 facial animation system
    • K. Choi, Y. Luo, and J.-N. Hwang, "Hidden Markov model inversion for audio-to-visual conversion in an MPEG-4 facial animation system," J. VLSI Signal Process., vol. 29, no. 1-2, pp. 51-61, 2001.
    • (2001) J. VLSI Signal Process. , vol.29 , Issue.1-2 , pp. 51-61
    • Choi, K.1    Luo, Y.2    Hwang, J.-N.3
  • 13
    • 84937437186
    • Voice puppetry
    • Los Angeles, CA
    • M. Brand, "Voice puppetry," in Proc. SIGGRAPH'99, Los Angeles, CA, 1999, pp. 21-28.
    • (1999) Proc. SIGGRAPH'99 , pp. 21-28
    • Brand, M.1
  • 14
    • 0032179320
    • Lip movement synthesis from speech based on hidden Markov models
    • E. Yamamoto, S. Nakamura, and K. Shikano, "Lip movement synthesis from speech based on hidden Markov models," Speech Commun., vol. 26, no. 1-2, pp. 105-115, 1998.
    • (1998) Speech Commun. , vol.26 , Issue.1-2 , pp. 105-115
    • Yamamoto, E.1    Nakamura, S.2    Shikano, K.3
  • 15
    • 0031997085
    • Audio-to-visual conversion for multimedia communication
    • Feb.
    • R. R. Rao, T. Chen, and R. M. Mersereau, "Audio-to-visual conversion for multimedia communication," IEEE Trans. Ind. Electron., vol. 45, no. 1, pp. 15-22, Feb. 1998.
    • (1998) IEEE Trans. Ind. Electron. , vol.45 , Issue.1 , pp. 15-22
    • Rao, R.R.1    Chen, T.2    Mersereau, R.M.3
  • 16
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • Feb.
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 17
    • 0033283938
    • Shadow puppetry
    • Corfu, Greece, Sep.
    • M. Brand, "Shadow puppetry," in Proc. ICCV'99, Corfu, Greece, Sep. 1999, pp. 1237-1244.
    • (1999) Proc. ICCV'99 , pp. 1237-1244
    • Brand, M.1
  • 19
    • 16244388770
    • Master's thesis, Dept. Comput. Sci. and Eng., Wright State Univ., Dayton, OH
    • S. Fu, "Audio/Visual Mapping Based on Hidden Markov Models," Master's thesis, Dept. Comput. Sci. and Eng., Wright State Univ., Dayton, OH, 2002.
    • (2002) Audio/Visual Mapping Based on Hidden Markov Models
    • Fu, S.1
  • 21
    • 0031100269
    • Robust speech recognition based on joint model and feature space optimization of hidden Markov models
    • Mar.
    • S. Moon and J.-N. Hwang, "Robust speech recognition based on joint model and feature space optimization of hidden Markov models," IEEE Trans. Neural Netw., vol. 8, no. 2, pp. 194-204, Mar. 1997.
    • (1997) IEEE Trans. Neural Netw. , vol.8 , Issue.2 , pp. 194-204
    • Moon, S.1    Hwang, J.-N.2
  • 22
    • 84972571328
    • Growth functions for transformations on manifolds
    • L. E. Baum and G. R. Sell, "Growth functions for transformations on manifolds," Pacific J. Math., vol. 27, no. 2, pp. 211-227, 1968.
    • (1968) Pacific J. Math. , vol.27 , Issue.2 , pp. 211-227
    • Baum, L.E.1    Sell, G.R.2
  • 23
    • 0037569390
    • Learning dynamic audio/visual mapping with input-output hidden Markov models
    • Melbourne, Australia, Jan.
    • Y. Li and H.-Y. Shum, "Learning dynamic audio/visual mapping with input-output hidden Markov models," in Proc. 5th Asian Conf. on Computer Vision, Melbourne, Australia, Jan. 2002.
    • (2002) Proc. 5th Asian Conf. on Computer Vision
    • Li, Y.1    Shum, H.-Y.2
  • 24
    • 0000675167
    • Structure learning in conditional probability models via an entropic prior and parameter extinction
    • M. Brand, "Structure learning in conditional probability models via an entropic prior and parameter extinction," Neural Comput., vol. 11, no. 5, pp. 1155-1182, 1999.
    • (1999) Neural Comput. , vol.11 , Issue.5 , pp. 1155-1182
    • Brand, M.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.