메뉴 건너뛰기




Volumn 28, Issue 11, 2007, Pages 1368-1382

Audio-visual person authentication using lip-motion from orientation maps

Author keywords

Audio visual recognition; Biometric recognition; Biometrics; Gaussian Markov model; Hidden Markov model; Lip movements; Motion; Optical flow; Orientation; Person identification; Speaker authentication; Speaker verification; Structure tensor

Indexed keywords

AUDIO ACOUSTICS; BIOMETRICS; COMPUTATIONAL EFFICIENCY; DATABASE SYSTEMS; HIDDEN MARKOV MODELS; OPTICAL FLOWS;

EID: 34249752774     PISSN: 01678655     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.patrec.2007.02.017     Document Type: Article
Times cited : (45)

References (39)
  • 3
    • 0026202914 scopus 로고
    • Multidimensional orientation estimation with applications to texture analysis of optical flow
    • Bigun J., Granlund G., and Wiklund J. Multidimensional orientation estimation with applications to texture analysis of optical flow. IEEE Trans. Pattern Anal. Machine Intell. 13 8 (1991) 775-790
    • (1991) IEEE Trans. Pattern Anal. Machine Intell. , vol.13 , Issue.8 , pp. 775-790
    • Bigun, J.1    Granlund, G.2    Wiklund, J.3
  • 4
    • 84947902509 scopus 로고    scopus 로고
    • Bigun, E., Bigun, J., Duc, B., Fischer, S., 1997. Expert conciliation for multi modal person authentication systems by bayesian statistics. In: Bigun, J., Chollet, G., Borgefors, G. (Eds.), Audio and Video Based Person Authentication - AVBPA97, pp. 291-300.
  • 6
    • 85032752352 scopus 로고    scopus 로고
    • Audiovisual speech processing
    • Chen T. Audiovisual speech processing. IEEE Signal Process. Mag. 18 1 (2001) 9-21
    • (2001) IEEE Signal Process. Mag. , vol.18 , Issue.1 , pp. 9-21
    • Chen, T.1
  • 8
    • 21744455232 scopus 로고    scopus 로고
    • Dieckmann, U., Plankensteiner, P., Wagner, T., 1997. Acoustic-labial speaker verification. In: Proc. First Internat. Conf. on Audio- and Video-Based Biometric Person Authentication, LNCS 1206, pp. 301-310.
  • 10
    • 0034270644 scopus 로고    scopus 로고
    • Audio-visual speech modelling for continuous speech recognition
    • Dupont S., and Luettin J. Audio-visual speech modelling for continuous speech recognition. IEEE Trans. Multimedia 2 3 (2000) 141-151
    • (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 11
    • 33845524427 scopus 로고    scopus 로고
    • Faraj, M.I., Bigun, J., 2006. Person verification by lip-motion. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition workshop - CVPR 2006, pp. 37-45.
  • 12
    • 0037209464 scopus 로고    scopus 로고
    • Automatic facial expression analysis: A survey
    • Fasel B., and Luettin J. Automatic facial expression analysis: A survey. J. Pattern Recognition Soc. 36 1 (2003) 259-275
    • (2003) J. Pattern Recognition Soc. , vol.36 , Issue.1 , pp. 259-275
    • Fasel, B.1    Luettin, J.2
  • 13
    • 0033899298 scopus 로고    scopus 로고
    • Frischholz, R., Dieckmann, U., 2000. Bioid: A multimodal biometric identification system. IEEE-Computer Society Press, vol. 33(2), 2000, pp. 64-68.
  • 14
    • 84947940111 scopus 로고    scopus 로고
    • Furui, S., 1997. Recent advances in speaker recognition. In: Proc. First Internat. Conf. on Audio- and Video-Based Biometric Person Authentication, LNCS 1206, pp. 237-252.
  • 15
    • 0018026724 scopus 로고
    • In search of a general picture processing operator
    • Granlund G.H. In search of a general picture processing operator. Computer Graphics Image Process. 8 2 (1978) 155-173
    • (1978) Computer Graphics Image Process. , vol.8 , Issue.2 , pp. 155-173
    • Granlund, G.H.1
  • 16
    • 34047263009 scopus 로고    scopus 로고
    • Visual model structures and synchrony constraints for audio-visual speech recognition
    • Hazen T.J. Visual model structures and synchrony constraints for audio-visual speech recognition. IEEE Trans. Audio Speech Lang. Process. 14 3 (2006) 1082-1089
    • (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , Issue.3 , pp. 1082-1089
    • Hazen, T.J.1
  • 17
    • 0019597413 scopus 로고
    • Determining optical flow
    • Horn B., and Schunck B. Determining optical flow. J. Art. Intell. 17 1 (1981) 185-203
    • (1981) J. Art. Intell. , vol.17 , Issue.1 , pp. 185-203
    • Horn, B.1    Schunck, B.2
  • 18
    • 84947907880 scopus 로고    scopus 로고
    • Jourlin, P., Luettin, J., Genoud, D., Wassner, H., 1997. Acoustic-labial speaker verification. In: Proc. First Internat. Conf. on Audio- and Video-Based Biometric Person Authentication, LNCS 1206, pp. 319-326.
  • 19
    • 33750962696 scopus 로고    scopus 로고
    • Kollreider, K., Fronthaler, H., Bigun, J., 2005. Evaluating liveness by face images and the structure tensor. In: AutoID 2005: Fourth Workshop on Automatic Identification Advanced Technologies. IEEE Computer Society, pp. 75-80.
  • 20
    • 0019647180 scopus 로고
    • An iterative image registration technique with an application to stereo vision
    • Lucas B.D., and Kanade T. An iterative image registration technique with an application to stereo vision. Int. Joint Conf. Art. Intell. (1981) 674-679
    • (1981) Int. Joint Conf. Art. Intell. , pp. 674-679
    • Lucas, B.D.1    Kanade, T.2
  • 21
    • 20444375102 scopus 로고    scopus 로고
    • Integration strategies for audiovisual speech processing: Applied to text-dependent speaker recognition
    • Lucey S., Chen T., Sridharan S., and Chandran V. Integration strategies for audiovisual speech processing: Applied to text-dependent speaker recognition. IEEE Trans. Multimedia 7 3 (2005) 495-506
    • (2005) IEEE Trans. Multimedia , vol.7 , Issue.3 , pp. 495-506
    • Lucey, S.1    Chen, T.2    Sridharan, S.3    Chandran, V.4
  • 22
    • 34249664492 scopus 로고    scopus 로고
    • Luettin, J., Maitre, G., 1998. Evaluation protocol for the extended m2vts database xm2vtsdb. In: IDIAP Communication 98-054, Technical report R R-21, number = IDIAP - 1998.
  • 23
    • 0030366433 scopus 로고    scopus 로고
    • Luettin, J., Thacker, N., Beet, S., 1996. Speaker identification by lipreading. In: Proc. 4th Internat. Conf. on Spoken Language Processing ICSLP'96, pp. 62-65.
  • 24
    • 0025750892 scopus 로고
    • Automatic lip-reading by optical-flow analysis
    • Mase K., and Pentland A. Automatic lip-reading by optical-flow analysis. Systems Comput. Jpn. 22 6 (1991) 67-76
    • (1991) Systems Comput. Jpn. , vol.22 , Issue.6 , pp. 67-76
    • Mase, K.1    Pentland, A.2
  • 25
    • 34249671795 scopus 로고    scopus 로고
    • Messer, K., Matas, J., Kittler, J., Luettin, J., 1999. Xm2vtsdb: The extended m2vts database. In: Second International Conference of Audio and Video-Based Biometric Person Authentication ICSLP'96, pp. 72-77.
  • 26
    • 0029360028 scopus 로고
    • On cluster validity for the fuzzy c-means model
    • Pal N., and Bezdek J. On cluster validity for the fuzzy c-means model. IEEE Trans. Fuzzy Systems 3 3 (1995) 370-379
    • (1995) IEEE Trans. Fuzzy Systems , vol.3 , Issue.3 , pp. 370-379
    • Pal, N.1    Bezdek, J.2
  • 27
    • 4544290191 scopus 로고    scopus 로고
    • Recent advances in the automatic recognition of audiovisual speech
    • Potamianos G., Neti C., Gravier G., Garg A., and Senior A. Recent advances in the automatic recognition of audiovisual speech. Proc. IEEE 91 9 (2003) 1306-1326
    • (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.5
  • 28
    • 0029209272 scopus 로고
    • Robust text-independent speaker identification using gaussian mixture models
    • Reynolds D., and Rose R. Robust text-independent speaker identification using gaussian mixture models. IEEE Trans. Speech Audio Process. ICASSP090 3 1 (1995) 72-83
    • (1995) IEEE Trans. Speech Audio Process. ICASSP090 , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.1    Rose, R.2
  • 29
    • 84947902135 scopus 로고    scopus 로고
    • Sanchez, M.R., Matas, J., Kittler, J., 1997. Statistical chromaticity-based lip tracking with b-splines. In: Proc. First Internat. Conf. on Audio- and Video-Based Biometric Person Authentication, LNCS 1206, pp. 69-76.
  • 30
    • 0036192336 scopus 로고    scopus 로고
    • Retinal vision applied to facial features detection and face authentication
    • Smeraldi F., and Bigun J. Retinal vision applied to facial features detection and face authentication. Pattern Recognition Lett. 23 (2002) 463-475
    • (2002) Pattern Recognition Lett. , vol.23 , pp. 463-475
    • Smeraldi, F.1    Bigun, J.2
  • 31
    • 0030173862 scopus 로고    scopus 로고
    • A fuzzy-logic architecture for autonomous multisensor data fusion
    • Stover J., Hall D., and Gibson R. A fuzzy-logic architecture for autonomous multisensor data fusion. IEEE Trans. Industr. Electron. 43 3 (1996) 403-410
    • (1996) IEEE Trans. Industr. Electron. , vol.43 , Issue.3 , pp. 403-410
    • Stover, J.1    Hall, D.2    Gibson, R.3
  • 32
    • 1542572925 scopus 로고    scopus 로고
    • Multi-modal speech recognition using optical flow analysis for lip images
    • Tamura S., Iwano K., and Furui S. Multi-modal speech recognition using optical flow analysis for lip images. J. VLSI Signal Process. 36 2 (2004) 117-124
    • (2004) J. VLSI Signal Process. , vol.36 , Issue.2 , pp. 117-124
    • Tamura, S.1    Iwano, K.2    Furui, S.3
  • 33
    • 82055176921 scopus 로고    scopus 로고
    • Tang, X., Li, X., 2001. Fusion of audio-visual information integrated speech processing. In: Third Internat. Conf. on Audio- and Video-Based Biometric Person Authentication AV BPA02001, LNCS 2091, pp. 127-143.
  • 34
    • 4544270398 scopus 로고    scopus 로고
    • Tang, X., Li, X., 2004. Video based face recognition using multiple classifiers. In: Sixth IEEE Internat. Conf. on Automatic Face and Gesture Recognition FGR2004 - IEEE Computer Society, pp. 345-349.
  • 35
    • 0031341829 scopus 로고    scopus 로고
    • Multisensor data fusion
    • Varshney P. Multisensor data fusion. Electron. Commun. Eng. J. 9 6 (1997) 245-253
    • (1997) Electron. Commun. Eng. J. , vol.9 , Issue.6 , pp. 245-253
    • Varshney, P.1
  • 36
    • 33744917510 scopus 로고    scopus 로고
    • Veeravalli, A.G., Pan, W., Adhami, R., Cox, P.G., 2005. A tutorial on using hidden markov models for phoneme recognition. In: Proc. Thirty-Seventh Southeastern Symp. on System Theory, SSST 2005.
  • 37
    • 0032646868 scopus 로고    scopus 로고
    • The use of speech and lip modalities for robust speaker verification under adverse conditions
    • Wark T., Sridharan S., and Chandran V. The use of speech and lip modalities for robust speaker verification under adverse conditions. IEEE Int. Conf. Multimedia Comput. Systems 1 (1999)
    • (1999) IEEE Int. Conf. Multimedia Comput. Systems , vol.1
    • Wark, T.1    Sridharan, S.2    Chandran, V.3
  • 38
    • 0032179320 scopus 로고    scopus 로고
    • Lip movement synthesis from speech based on hidden markov models
    • Yamamoto E., Nakamura S., and Shikano K. Lip movement synthesis from speech based on hidden markov models. J. Speech Commun. 26 1 (1998) 105-115
    • (1998) J. Speech Commun. , vol.26 , Issue.1 , pp. 105-115
    • Yamamoto, E.1    Nakamura, S.2    Shikano, K.3
  • 39
    • 34249677960 scopus 로고    scopus 로고
    • Young, S., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., Woodland, P., 2000. The htk book (for htk version 3.0) http://htk.eng.cam.ac.uk/docs/docs.shtml.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.