SCOPUS Information Search Platform

Volume 28, Issue 11, 2007, Pages 1368-1382

Audio-visual person authentication using lip-motion from orientation maps

Authors (2): Faraj, Maycel Isaac; Bigun, Josef

Author keywords

Audio visual recognition; Biometric recognition; Biometrics; Gaussian Markov model; Hidden Markov model; Lip movements; Motion; Optical flow; Orientation; Person identification; Speaker authentication; Speaker verification; Structure tensor

Indexed keywords

AUDIO ACOUSTICS; BIOMETRICS; COMPUTATIONAL EFFICIENCY; DATABASE SYSTEMS; HIDDEN MARKOV MODELS; OPTICAL FLOWS;

AUDIOVISUAL RECOGNITION; BIOMETRIC RECOGNITION; GAUSSIAN MARKOV MODELS; LIP-MOVEMENTS; PERSON IDENTIFICATION; SPEAKER AUTHENTICATION; SPEAKER VERIFICATION; STRUCTURE TENSORS;

SPEECH RECOGNITION;

EID: 34249752774 PISSN: 01678655 EISSN: None Source Type: Journal
DOI: 10.1016/j.patrec.2007.02.017 Document Type: Article

Times cited: 45

References (39)

1
- 84892238142
- Springer, Heidelberg
- Bigun J. Vision with Direction (2006), Springer, Heidelberg
- (2006) Vision with Direction
- Bigun, J.¹

2
- 0023171692
- Optimal orientation detection of linear symmetry
- IEEE Computer Society
- Bigun J., and Granlund G. Optimal orientation detection of linear symmetry. First International Conference on Computer Vision, ICCV, London, June 8-11 (1987), IEEE Computer Society 433-438
- (1987) First International Conference on Computer Vision, ICCV, London, June 8-11 , pp. 433-438
- Bigun, J.¹ Granlund, G.²

3
- 0026202914
- Multidimensional orientation estimation with applications to texture analysis of optical flow
- Bigun J., Granlund G., and Wiklund J. Multidimensional orientation estimation with applications to texture analysis of optical flow. IEEE Trans. Pattern Anal. Machine Intell. 13 8 (1991) 775-790
- (1991) IEEE Trans. Pattern Anal. Machine Intell. , vol.13 , Issue.8 , pp. 775-790
- Bigun, J.¹ Granlund, G.² Wiklund, J.³

4
- 84947902509
- Bigun, E., Bigun, J., Duc, B., Fischer, S., 1997. Expert conciliation for multi modal person authentication systems by Bayesian statistics. In: Bigun, J., Chollet, G., Borgefors, G. (Eds.), Audio and Video Based Person Authentication - AVBPA97, pp. 291-300.

5
- 0029393187
- Person identification using multiple cues
- Brunelli K.R., and Falavigna D. Person identification using multiple cues. IEEE Trans. Pattern Anal. Machine Intell. 17 10 (1995) 955-966
- (1995) IEEE Trans. Pattern Anal. Machine Intell. , vol.17 , Issue.10 , pp. 955-966
- Brunelli, K.R.¹ Falavigna, D.²

6
- 85032752352
- Audiovisual speech processing
- Chen T. Audiovisual speech processing. IEEE Signal Process. Mag. 18 1 (2001) 9-21
- (2001) IEEE Signal Process. Mag. , vol.18 , Issue.1 , pp. 9-21
- Chen, T.¹

7
- 0036502797
- A review of speech-based bimodal recognition
- Chibelushi C., Deravi F., and Mason J. A review of speech-based bimodal recognition. IEEE Trans. Multimedia 4 1 (2002) 23-37
- (2002) IEEE Trans. Multimedia , vol.4 , Issue.1 , pp. 23-37
- Chibelushi, C.¹ Deravi, F.² Mason, J.³

8
- 21744455232
- Dieckmann, U., Plankensteiner, P., Wagner, T., 1997. Acoustic-labial speaker verification. In: Proc. First Internat. Conf. on Audio- and Video-Based Biometric Person Authentication, LNCS 1206, pp. 301-310.

9
- 0030674147
- Face authentication with sparse grid Gabor information
- Duc B., Fischer S., and Bigun J. Face authentication with sparse grid Gabor information. IEEE Int. Conf. Acoust. Speech Signal Process. 4 21 (1997) 3053-3056
- (1997) IEEE Int. Conf. Acoust. Speech Signal Process. , vol.4 , Issue.21 , pp. 3053-3056
- Duc, B.¹ Fischer, S.² Bigun, J.³

10
- 0034270644
- Audio-visual speech modelling for continuous speech recognition
- Dupont S., and Luettin J. Audio-visual speech modelling for continuous speech recognition. IEEE Trans. Multimedia 2 3 (2000) 141-151
- (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
- Dupont, S.¹ Luettin, J.²

11
- 33845524427
- Faraj, M.I., Bigun, J., 2006. Person verification by lip-motion. In: IEEE Computer Society Conference on Computer Vision and Pattern Recognition workshop - CVPR 2006, pp. 37-45.

12
- 0037209464
- Automatic facial expression analysis: A survey
- Fasel B., and Luettin J. Automatic facial expression analysis: A survey. J. Pattern Recognition Soc. 36 1 (2003) 259-275
- (2003) J. Pattern Recognition Soc. , vol.36 , Issue.1 , pp. 259-275
- Fasel, B.¹ Luettin, J.²

13
- 0033899298
- Frischholz, R., Dieckmann, U., 2000. BioID: A multimodal biometric identification system. IEEE-Computer Society Press, vol. 33(2), 2000, pp. 64-68.

14
- 84947940111
- Furui, S., 1997. Recent advances in speaker recognition. In: Proc. First Internat. Conf. on Audio- and Video-Based Biometric Person Authentication, LNCS 1206, pp. 237-252.

15
- 0018026724
- In search of a general picture processing operator
- Granlund G.H. In search of a general picture processing operator. Computer Graphics Image Process. 8 2 (1978) 155-173
- (1978) Computer Graphics Image Process. , vol.8 , Issue.2 , pp. 155-173
- Granlund, G.H.¹

16
- 34047263009
- Visual model structures and synchrony constraints for audio-visual speech recognition
- Hazen T.J. Visual model structures and synchrony constraints for audio-visual speech recognition. IEEE Trans. Audio Speech Lang. Process. 14 3 (2006) 1082-1089
- (2006) IEEE Trans. Audio Speech Lang. Process. , vol.14 , Issue.3 , pp. 1082-1089
- Hazen, T.J.¹

17
- 0019597413
- Determining optical flow
- Horn B., and Schunck B. Determining optical flow. J. Art. Intell. 17 1 (1981) 185-203
- (1981) J. Art. Intell. , vol.17 , Issue.1 , pp. 185-203
- Horn, B.¹ Schunck, B.²

18
- 84947907880
- Jourlin, P., Luettin, J., Genoud, D., Wassner, H., 1997. Acoustic-labial speaker verification. In: Proc. First Internat. Conf. on Audio- and Video-Based Biometric Person Authentication, LNCS 1206, pp. 319-326.

19
- 33750962696
- Kollreider, K., Fronthaler, H., Bigun, J., 2005. Evaluating liveness by face images and the structure tensor. In: AutoID 2005: Fourth Workshop on Automatic Identification Advanced Technologies. IEEE Computer Society, pp. 75-80.

20
- 0019647180
- An iterative image registration technique with an application to stereo vision
- Lucas B.D., and Kanade T. An iterative image registration technique with an application to stereo vision. Int. Joint Conf. Art. Intell. (1981) 674-679
- (1981) Int. Joint Conf. Art. Intell. , pp. 674-679
- Lucas, B.D.¹ Kanade, T.²

21
- 20444375102
- Integration strategies for audiovisual speech processing: Applied to text-dependent speaker recognition
- Lucey S., Chen T., Sridharan S., and Chandran V. Integration strategies for audiovisual speech processing: Applied to text-dependent speaker recognition. IEEE Trans. Multimedia 7 3 (2005) 495-506
- (2005) IEEE Trans. Multimedia , vol.7 , Issue.3 , pp. 495-506
- Lucey, S.¹ Chen, T.² Sridharan, S.³ Chandran, V.⁴

22
- 34249664492
- Luettin, J., Maitre, G., 1998. Evaluation protocol for the extended M2VTS database (XM2VTSDB). In: IDIAP Communication 98-054, Technical report RR-21, IDIAP, 1998.

23
- 0030366433
- Luettin, J., Thacker, N., Beet, S., 1996. Speaker identification by lipreading. In: Proc. 4th Internat. Conf. on Spoken Language Processing ICSLP'96, pp. 62-65.

24
- 0025750892
- Automatic lip-reading by optical-flow analysis
- Mase K., and Pentland A. Automatic lip-reading by optical-flow analysis. Systems Comput. Jpn. 22 6 (1991) 67-76
- (1991) Systems Comput. Jpn. , vol.22 , Issue.6 , pp. 67-76
- Mase, K.¹ Pentland, A.²

25
- 34249671795
- Messer, K., Matas, J., Kittler, J., Luettin, J., 1999. XM2VTSDB: The extended M2VTS database. In: Second International Conference of Audio and Video-Based Biometric Person Authentication AVBPA'99, pp. 72-77.

26
- 0029360028
- On cluster validity for the fuzzy c-means model
- Pal N., and Bezdek J. On cluster validity for the fuzzy c-means model. IEEE Trans. Fuzzy Systems 3 3 (1995) 370-379
- (1995) IEEE Trans. Fuzzy Systems , vol.3 , Issue.3 , pp. 370-379
- Pal, N.¹ Bezdek, J.²

27
- 4544290191
- Recent advances in the automatic recognition of audiovisual speech
- Potamianos G., Neti C., Gravier G., Garg A., and Senior A. Recent advances in the automatic recognition of audiovisual speech. Proc. IEEE 91 9 (2003) 1306-1326
- (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴ Senior, A.⁵

28
- 0029209272
- Robust text-independent speaker identification using Gaussian mixture models
- Reynolds D., and Rose R. Robust text-independent speaker identification using Gaussian mixture models. IEEE Trans. Speech Audio Process. 3 1 (1995) 72-83
- (1995) IEEE Trans. Speech Audio Process. , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.¹ Rose, R.²

29
- 84947902135
- Sanchez, M.R., Matas, J., Kittler, J., 1997. Statistical chromaticity-based lip tracking with b-splines. In: Proc. First Internat. Conf. on Audio- and Video-Based Biometric Person Authentication, LNCS 1206, pp. 69-76.

30
- 0036192336
- Retinal vision applied to facial features detection and face authentication
- Smeraldi F., and Bigun J. Retinal vision applied to facial features detection and face authentication. Pattern Recognition Lett. 23 (2002) 463-475
- (2002) Pattern Recognition Lett. , vol.23 , pp. 463-475
- Smeraldi, F.¹ Bigun, J.²

31
- 0030173862
- A fuzzy-logic architecture for autonomous multisensor data fusion
- Stover J., Hall D., and Gibson R. A fuzzy-logic architecture for autonomous multisensor data fusion. IEEE Trans. Industr. Electron. 43 3 (1996) 403-410
- (1996) IEEE Trans. Industr. Electron. , vol.43 , Issue.3 , pp. 403-410
- Stover, J.¹ Hall, D.² Gibson, R.³

32
- 1542572925
- Multi-modal speech recognition using optical flow analysis for lip images
- Tamura S., Iwano K., and Furui S. Multi-modal speech recognition using optical flow analysis for lip images. J. VLSI Signal Process. 36 2 (2004) 117-124
- (2004) J. VLSI Signal Process. , vol.36 , Issue.2 , pp. 117-124
- Tamura, S.¹ Iwano, K.² Furui, S.³

33
- 82055176921
- Tang, X., Li, X., 2001. Fusion of audio-visual information integrated speech processing. In: Third Internat. Conf. on Audio- and Video-Based Biometric Person Authentication AVBPA 2001, LNCS 2091, pp. 127-143.

34
- 4544270398
- Tang, X., Li, X., 2004. Video based face recognition using multiple classifiers. In: Sixth IEEE Internat. Conf. on Automatic Face and Gesture Recognition FGR2004 - IEEE Computer Society, pp. 345-349.

35
- 0031341829
- Multisensor data fusion
- Varshney P. Multisensor data fusion. Electron. Commun. Eng. J. 9 6 (1997) 245-253
- (1997) Electron. Commun. Eng. J. , vol.9 , Issue.6 , pp. 245-253
- Varshney, P.¹

36
- 33744917510
- Veeravalli, A.G., Pan, W., Adhami, R., Cox, P.G., 2005. A tutorial on using hidden Markov models for phoneme recognition. In: Proc. Thirty-Seventh Southeastern Symp. on System Theory, SSST 2005.

37
- 0032646868
- The use of speech and lip modalities for robust speaker verification under adverse conditions
- Wark T., Sridharan S., and Chandran V. The use of speech and lip modalities for robust speaker verification under adverse conditions. IEEE Int. Conf. Multimedia Comput. Systems 1 (1999)
- (1999) IEEE Int. Conf. Multimedia Comput. Systems , vol.1
- Wark, T.¹ Sridharan, S.² Chandran, V.³

38
- 0032179320
- Lip movement synthesis from speech based on hidden Markov models
- Yamamoto E., Nakamura S., and Shikano K. Lip movement synthesis from speech based on hidden Markov models. J. Speech Commun. 26 1 (1998) 105-115
- (1998) J. Speech Commun. , vol.26 , Issue.1 , pp. 105-115
- Yamamoto, E.¹ Nakamura, S.² Shikano, K.³

39
- 34249677960
- Young, S., Kershaw, D., Odell, J., Ollason, D., Valtchev, V., Woodland, P., 2000. The HTK Book (for HTK version 3.0). http://htk.eng.cam.ac.uk/docs/docs.shtml

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.