SCOPUS 정보 검색 플랫폼

Volumn 56, Issue 9, 2007, Pages 1169-1175

Synergy of lip-motion and acoustic features in biometric speech and speaker recognition

(2) Faraj, Maycel Isaac a Bigun, Josef a

Author keywords

Biometrics; GMM; Lip motion; Lip reading; Motion estimation; Normal image flow; Normal image velocity; Speaker recognition; Speech recognition; SVM

Indexed keywords

AUDIO SYSTEMS; BIOMETRICS; MOTION ESTIMATION; SUPPORT VECTOR MACHINES; VERIFICATION;

LIP MOTION; LIP READING; NORMAL IMAGE FLOW; NORMAL IMAGE VELOCITY; SPEAKER RECOGNITION;

SPEECH RECOGNITION;

EID: 34548205797 PISSN: 00189340 EISSN: None Source Type: Journal
DOI: 10.1109/TC.2007.1074 Document Type: Article

Times cited : (32)

References (40)

1
- 84947902509
- Expert Conciliation for Multi Modal Person Authentication Systems by Bayesian Statistics
- J. Bigun, G. Chollet, and G. Borgefors, eds, pp
- E. Bigun, J. Bigun, B. Duc, and S. Fischer, "Expert Conciliation for Multi Modal Person Authentication Systems by Bayesian Statistics," Proc. First Int'l Conf. Audio- and Video-Based Person Authentication (AVBPA '97), J. Bigun, G. Chollet, and G. Borgefors, eds., pp. 291-300, 1997.
- (1997) Proc. First Int'l Conf. Audio- and Video-Based Person Authentication (AVBPA '97) , pp. 291-300
- Bigun, E.¹ Bigun, J.² Duc, B.³ Fischer, S.⁴

2
- 0026202914
- Multidimensional Orientation Estimation with Applications to Texture Analysis of Optical Flow
- Aug
- J. Bigun, G. Granlund, and J. Wiklund, "Multidimensional Orientation Estimation with Applications to Texture Analysis of Optical Flow," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 8, pp. 775-790, Aug. 1991.
- (1991) IEEE Trans. Pattern Analysis and Machine Intelligence , vol.13 , Issue.8 , pp. 775-790
- Bigun, J.¹ Granlund, G.² Wiklund, J.³

3
- 0029393187
- Person Identification Using Multiple Cues
- Oct
- K.R. Brunelli and D. Falavigna, "Person Identification Using Multiple Cues," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 17, no. 10, pp. 955-966, Oct. 1995.
- (1995) IEEE Trans. Pattern Analysis and Machine Intelligence , vol.17 , Issue.10 , pp. 955-966
- Brunelli, K.R.¹ Falavigna, D.²

4
- 27144489164
- A Tutorial on Support Vector Machines for Pattern Recognition
- C.J. Burges, "A Tutorial on Support Vector Machines for Pattern Recognition," Data Mining and Knowledge Discovery, vol. 2, no. 2, pp. 121-167, 1998.
- (1998) Data Mining and Knowledge Discovery , vol.2 , Issue.2 , pp. 121-167
- Burges, C.J.¹

5
- 0003710380
- C.-C. Chang and C.-J. Lin, "LIBSVM_A Library for Support Vector Machines," www.csie.ntu.edu.tw/cjlin/libsvm, 2001.
- (2001) LIBSVM_A Library for Support Vector Machines
- Chang, C.-C.¹ Lin, C.-J.²

6
- 85032752352
- Audiovisual Speech Processing
- T. Chen, "Audiovisual Speech Processing," IEEE Signal Processing Magazine, vol. 18, no. 1, pp. 9-21, 2001.
- (2001) IEEE Signal Processing Magazine , vol.18 , Issue.1 , pp. 9-21
- Chen, T.¹

7
- 0036502797
- A Review of Speech-Based Bimodal Recognition
- C. Chibelushi, F. Deravi, and J. Mason, "A Review of Speech-Based Bimodal Recognition," IEEE Trans. Multimedia, vol. 4, no. 1, pp. 23-37, 2002.
- (2002) IEEE Trans. Multimedia , vol.4 , Issue.1 , pp. 23-37
- Chibelushi, C.¹ Deravi, F.² Mason, J.³

8
- 0032639886
- On the Use of Support Vector Machines for Phonetic Classification
- P. Clarkson and P. Moreno, "On the Use of Support Vector Machines for Phonetic Classification," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '99), vol. 2, pp. 585-588, 1999.
- (1999) Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '99) , vol.2 , pp. 585-588
- Clarkson, P.¹ Moreno, P.²

9
- 0019053271
- Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences
- S. Davis and P. Mermelstein, "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences," IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357-366, 1980.
- (1980) IEEE Trans. Acoustics, Speech, and Signal Processing , vol.28 , Issue.4 , pp. 357-366
- Davis, S.¹ Mermelstein, P.²

10
- 0034224204
- Optical Flow Constraints on Deformable Models with Applications to Face Tracking
- D. DeCarlo and D. Metaxas, "Optical Flow Constraints on Deformable Models with Applications to Face Tracking," Int'l J. Computer Vision vol. 38, no. 2, pp. 99-127, 2000.
- (2000) Int'l J. Computer Vision , vol.38 , Issue.2 , pp. 99-127
- DeCarlo, D.¹ Metaxas, D.²

11
- 21744455232
- Acoustic-Labial Speaker Verification
- U. Dieckmann, P. Plankensteiner, and T. Wagner, "Acoustic-Labial Speaker Verification," Proc. First Int'l Conf. Audio- and Video-Based Biometric Person Authentication (AVBPA '97), pp. 301-310, 1997.
- (1997) Proc. First Int'l Conf. Audio- and Video-Based Biometric Person Authentication (AVBPA '97) , pp. 301-310
- Dieckmann, U.¹ Plankensteiner, P.² Wagner, T.³

12
- 0030674147
- Face Authentication with Sparse Grid Gabor Information
- B. Duc, S. Fischer, and J. Bigun, "Face Authentication with Sparse Grid Gabor Information," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '97), vol. 4, no. 21, pp. 3053-3056, 1997.
- (1997) Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '97) , vol.4 , Issue.21 , pp. 3053-3056
- Duc, B.¹ Fischer, S.² Bigun, J.³

13
- 33845524427
- Person Verification by Lip-Motion
- M.I. Faraj and J. Bigun, "Person Verification by Lip-Motion," Proc. Conf. Computer Vision and Pattern Recognition Workshop (CVPRW '06), pp. 37-45, 2006.
- (2006) Proc. Conf. Computer Vision and Pattern Recognition Workshop (CVPRW '06) , pp. 37-45
- Faraj, M.I.¹ Bigun, J.²

14
- 0028204659
- Speaker Recognition Using Neural Networks and Conventional Classifiers
- K. Farrell, R. Mammone, and K. Assaleh, "Speaker Recognition Using Neural Networks and Conventional Classifiers," IEEE Trans. Speech and Audio Processing, vol. 2, no. 1, pp. 194-205, 1994.
- (1994) IEEE Trans. Speech and Audio Processing , vol.2 , Issue.1 , pp. 194-205
- Farrell, K.¹ Mammone, R.² Assaleh, K.³

15
- 20844454540
- Robust Speech Recognizer Using Multiclass SVM
- I. Gavat, G. Costache, and C. Iancu, "Robust Speech Recognizer Using Multiclass SVM," Proc. Seventh Seminar Neural Network Applications in Electrical Eng. (NEUREL '04), pp. 63-66, 2004.
- (2004) Proc. Seventh Seminar Neural Network Applications in Electrical Eng. (NEUREL '04) , pp. 63-66
- Gavat, I.¹ Costache, G.² Iancu, C.³

16
- 0018026724
- In Search of a General Picture Processing Operator
- G.H. Granlund, "In Search of a General Picture Processing Operator," Computer Graphics and Image Processing, vol. 8, no. 2, pp. 155-173, 1978.
- (1978) Computer Graphics and Image Processing , vol.8 , Issue.2 , pp. 155-173
- Granlund, G.H.¹

17
- 34047263009
- Visual Model Structures and Synchrony Constraints for Audio-Visual Speech Recognition
- T.J. Hazen, "Visual Model Structures and Synchrony Constraints for Audio-Visual Speech Recognition," IEEE Trans. Audio, Speech, and Language Processing, vol. 14, no. 3, pp. 1082-1089, 2006.
- (2006) IEEE Trans. Audio, Speech, and Language Processing , vol.14 , Issue.3 , pp. 1082-1089
- Hazen, T.J.¹

18
- 0019597413
- Determining Optical Flow
- B. Horn and B. Schunck, "Determining Optical Flow," J. Artificial Intelligence, vol. 17, no. 1, pp. 185-203, 1981.
- (1981) J. Artificial Intelligence , vol.17 , Issue.1 , pp. 185-203
- Horn, B.¹ Schunck, B.²

19
- 84947907880
- Acoustic-Labial Speaker Verification
- P. Jourlin, J. Luettin, D. Genoud, and H. Wassner, "Acoustic-Labial Speaker Verification," Proc. First Int'l Conf. Audio- and Video-Based Biometric Person Authentication (AVBPA '97), pp. 319-326, 1997.
- (1997) Proc. First Int'l Conf. Audio- and Video-Based Biometric Person Authentication (AVBPA '97) , pp. 319-326
- Jourlin, P.¹ Luettin, J.² Genoud, D.³ Wassner, H.⁴

20
- 33750962696
- Evaluating Liveness by Face Images and the Structure Tensor
- K. Kollreider, H. Fronthaler, and J. Bigun, "Evaluating Liveness by Face Images and the Structure Tensor," Proc. Fourth IEEE Workshop Automatic Identification Advanced Technologies (AutoID '05), pp. 75-80, 2005.
- (2005) Proc. Fourth IEEE Workshop Automatic Identification Advanced Technologies (AutoID '05) , pp. 75-80
- Kollreider, K.¹ Fronthaler, H.² Bigun, J.³

21
- 79952493967
- Speaker Independent Audio-Visual Continuous Speech Recognition
- L. Liang, X.L.Y. Zhao, X. Pi, and A. Nefian, "Speaker Independent Audio-Visual Continuous Speech Recognition," Proc. IEEE Int'l Conf. Multimedia and Expo (ICME '02), vol. 2, pp. 26-29, 2002.
- (2002) Proc. IEEE Int'l Conf. Multimedia and Expo (ICME '02) , vol.2 , pp. 26-29
- Liang, L.¹ Zhao, X.L.Y.² Pi, X.³ Nefian, A.⁴

22
- 0019647180
- An Iterative Image Registration Technique with an Application to Stereo Vision
- B.D. Lucas and T. Kanade, "An Iterative Image Registration Technique with an Application to Stereo Vision," Proc. Int'l Joint Conf. Artificial Intelligence, pp. 674-679, 1981.
- (1981) Proc. Int'l Joint Conf. Artificial Intelligence , pp. 674-679
- Lucas, B.D.¹ Kanade, T.²

23
- 20444375102
- Integration Strategies for Audio-Visual Speech Processing: Applied to Text-Dependent Speaker Recognition
- S. Lucey, T. Chen, S. Sridharan, and V. Chandran, "Integration Strategies for Audio-Visual Speech Processing: Applied to Text-Dependent Speaker Recognition," IEEE Trans. Multimedia, vol. 7, no. 3, pp. 495-506, 2005.
- (2005) IEEE Trans. Multimedia , vol.7 , Issue.3 , pp. 495-506
- Lucey, S.¹ Chen, T.² Sridharan, S.³ Chandran, V.⁴

24
- 34548282836
- J. Luettin and G. Maitre, Evaluation Protocol for the Extended M2VTS Database xm2vtsdb, IDIAP Communication 98-054, Technical Report R R-21, number = IDIAP - 1998, 1998.
- J. Luettin and G. Maitre, "Evaluation Protocol for the Extended M2VTS Database xm2vtsdb," IDIAP Communication 98-054, Technical Report R R-21, number = IDIAP - 1998, 1998.

25
- 0031069562
- Speechreading Using Probabilistic Models
- J. Luettin and N. Thacker, "Speechreading Using Probabilistic Models," Computer Vision and Image Understanding, vol. 65, no. 2, pp. 163-178, 1997.
- (1997) Computer Vision and Image Understanding , vol.65 , Issue.2 , pp. 163-178
- Luettin, J.¹ Thacker, N.²

26
- 0025750892
- Automatic Lip-Reading by Optical-Flow Analysis
- K. Mase and A. Pentland, "Automatic Lip-Reading by Optical-Flow Analysis," Systems and Computers in Japan, vol. 22, no. 6, pp. 67-76, 1991.
- (1991) Systems and Computers in Japan , vol.22 , Issue.6 , pp. 67-76
- Mase, K.¹ Pentland, A.²

27
- 0001935972
- Xm2vtsdb: The Extended M2VTS Database
- K. Messer, J. Matas, J. Kittler, and J. Luettin, "Xm2vtsdb: The Extended M2VTS Database," Proc. Second Int'l Conf. Audio- and Video-Based Biometric Person Authentication (AVBPA '99), pp. 72-77, 1999.
- (1999) Proc. Second Int'l Conf. Audio- and Video-Based Biometric Person Authentication (AVBPA '99) , pp. 72-77
- Messer, K.¹ Matas, J.² Kittler, J.³ Luettin, J.⁴

28
- 84893671339
- An Improved Automatic Lipreading System to Enhance Speech Recognition
- E. Petajan, B. Bischoff, D. Bodoff, and N.M. Brooke, "An Improved Automatic Lipreading System to Enhance Speech Recognition," Proc. SIGCHI Conf. Human Factors in Computing Systems (CHI '88), pp. 19-25, 1988.
- (1988) Proc. SIGCHI Conf. Human Factors in Computing Systems (CHI '88) , pp. 19-25
- Petajan, E.¹ Bischoff, B.² Bodoff, D.³ Brooke, N.M.⁴

29
- 0033884858
- Speaker Verification Using Adapted Gaussian Mixture Models
- D. Reynolds, T. Quatieri, and R.B. Dunn, "Speaker Verification Using Adapted Gaussian Mixture Models," Digital Signal Processing, vol. 10, nos. 1-3, pp. 19-41, 2000.
- (2000) Digital Signal Processing , vol.10 , Issue.1-3 , pp. 19-41
- Reynolds, D.¹ Quatieri, T.² Dunn, R.B.³

30
- 0029209272
- Robust Text-Independent Speaker Identification Using Gaussian Mixture Models
- D. Reynolds and R. Rose, "Robust Text-Independent Speaker Identification Using Gaussian Mixture Models," IEEE Trans. Speech and Audio Processing, vol. 3, no. 1, pp. 72-83, 1995.
- (1995) IEEE Trans. Speech and Audio Processing , vol.3 , Issue.1 , pp. 72-83
- Reynolds, D.¹ Rose, R.²

31
- 0029748333
- Speaker Identification via Support Vector Classifiers
- M. Schmidt and H. Gish, "Speaker Identification via Support Vector Classifiers," Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '96), pp. 105-108, 1996.
- (1996) Proc. IEEE Int'l Conf. Acoustics, Speech, and Signal Processing (ICASSP '96) , pp. 105-108
- Schmidt, M.¹ Gish, H.²

32
- 34548264480
- Fusion of Audio-Visual Information Integrated Speech Processing
- X. Tang and X. Li, "Fusion of Audio-Visual Information Integrated Speech Processing," Proc. Third Int'l Conf. Audio- and Video-Based Biometric Person Authentication (AVBPA '01), pp. 127-143, 2001.
- (2001) Proc. Third Int'l Conf. Audio- and Video-Based Biometric Person Authentication (AVBPA '01) , pp. 127-143
- Tang, X.¹ Li, X.²

33
- 4544270398
- Video Based Face Recognition Using Multiple Classifiers
- X. Tang and X. Li, "Video Based Face Recognition Using Multiple Classifiers," Proc. Sixth IEEE Int'l Conf. Automatic Face and Gesture Recognition (FGR '04), pp. 345-349, 2004.
- (2004) Proc. Sixth IEEE Int'l Conf. Automatic Face and Gesture Recognition (FGR '04) , pp. 345-349
- Tang, X.¹ Li, X.²

34
- 0003450542
- Springer
- V.N. Vapnik, The Nature of Statistical Learning Theory. Springer, 1995.
- (1995) The Nature of Statistical Learning Theory
- Vapnik, V.N.¹

35
- 0031341829
- Multisensor Data Fusion
- P. Varshney, "Multisensor Data Fusion," Electronics and Comm. Eng. J., vol. 9, no. 6, pp. 245-253, 1997.
- (1997) Electronics and Comm. Eng. J , vol.9 , Issue.6 , pp. 245-253
- Varshney, P.¹

36
- 0034505639
- Support Vector Machines for Speaker Verification and Identification
- V. Wan and W. Campbell, "Support Vector Machines for Speaker Verification and Identification," Proc. IEEE Signal Processing Soc. Workshop Neural Networks for Signal Processing X, vol. 2, pp. 775-784, 2000.
- (2000) Proc. IEEE Signal Processing Soc. Workshop Neural Networks for Signal Processing X , vol.2 , pp. 775-784
- Wan, V.¹ Campbell, W.²

37
- 0032646868
- The Use of Speech and Lip Modalities for Robust Speaker Verification under Adverse Conditions
- T. Wark, S. Sridharan, and V. Chandran, "The Use of Speech and Lip Modalities for Robust Speaker Verification under Adverse Conditions," Proc. IEEE Int'l Conf. Multimedia Computing and Systems (ICMCS '99) vol. 1, 1999.
- (1999) Proc. IEEE Int'l Conf. Multimedia Computing and Systems (ICMCS '99) , vol.1
- Wark, T.¹ Sridharan, S.² Chandran, V.³

38
- 0025474465
- Performance-Driven Facial Animation
- 90, pp
- L. Williams, "Performance-Driven Facial Animation," Proc. SIGGRAPH '90, pp. 235-242, 1990.
- (1990) Proc. SIGGRAPH , pp. 235-242
- Williams, L.¹

39
- 0032179320
- Lip Movement Synthesis from Speech Based on Hidden Markov Models
- E. Yamamoto, S. Nakamura, and K. Shikano, "Lip Movement Synthesis from Speech Based on Hidden Markov Models," J. Speech Comm., vol. 26, no. 1, pp. 105-115, 1998.
- (1998) J. Speech Comm , vol.26 , Issue.1 , pp. 105-115
- Yamamoto, E.¹ Nakamura, S.² Shikano, K.³

40
- 34548203688
- S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book for HTK Version 3.0, 2000
- S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book (for HTK Version 3.0), http://htk.eng.cam.ac.uk/docs/docs.shtml, 2000.

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.