SCOPUS 정보 검색 플랫폼

Volumn 94, Issue 11, 2006, Pages 2025-2044

Audio-visual biometrics

(2) Aleksic, Petar S a Katsaggelos, Aggelos K a

a Northwestern University (United States)

Author keywords

Audio visual biometrics; Audio visual databases; Audio visual fusion; Audio visual person recognition; Face tracking; Hidden Markov models; Multimodal recognition; Visual feature extraction

Indexed keywords

COMPUTER SIMULATION; DATABASE SYSTEMS; FACE RECOGNITION; FEATURE EXTRACTION; HIDDEN MARKOV MODELS; INFORMATION FUSION;

AUDIO-VISUAL BIOMETRICS; AUDIO-VISUAL DATABASES; AUDIO-VISUAL FUSION; AUDIO-VISUAL PERSON RECOGNITION; MULTIMODAL RECOGNITION; VISUAL FEATURE EXTRACTION;

BIOMETRICS;

EID: 33947384963 PISSN: 00189219 EISSN: None Source Type: Journal
DOI: 10.1109/JPROC.2006.886017 Document Type: Article

Times cited : (109)

References (137)

1
- 0742290133
- An introduction to biometric recognition
- Jan
- A. K. Jain, A. Ross, and S. Prabhakar, "An introduction to biometric recognition," IEEE Trans. Circuits Systems Video Technol., vol. 14, no. 1, pp. 4-20, Jan. 2004.
- (2004) IEEE Trans. Circuits Systems Video Technol , vol.14 , Issue.1 , pp. 4-20
- Jain, A.K.¹ Ross, A.² Prabhakar, S.³

2
- 84915757164
- Automated biometrics
- Rio de Janeiro, Brazil
- N. K. Ratha, A. W. Senior, and R. M. Bolle, "Automated biometrics," in Proc. Int. Conf. Advances Pattern Recognition, Rio de Janeiro, Brazil, 2001, pp. 445-474.
- (2001) Proc. Int. Conf. Advances Pattern Recognition , pp. 445-474
- Ratha, N.K.¹ Senior, A.W.² Bolle, R.M.³

3
- 33947430698
- Financial crimes report to the public, Online, Available
- Financial crimes report to the public. Fed. Bur. Investigation, Financial Crimes Section, Criminal Investigation Division. [Online]. Available: http://www.fbi.gov/publications/financial/fcs_report052005/fcs_report052005.htm
- Fed. Bur. Investigation, Financial Crimes Section, Criminal Investigation Division

4
- 0442296538
- Hiding biometric data
- Nov
- A. K. Jain and U. Uludag, "Hiding biometric data," IEEE Trans. Pattern Anal. Machine Intell., vol. 25, no. 11, pp. 1494-1498, Nov. 2003.
- (2003) IEEE Trans. Pattern Anal. Machine Intell , vol.25 , Issue.11 , pp. 1494-1498
- Jain, A.K.¹ Uludag, U.²

5
- 33947381924
- Voice and facial image integration for speaker recognition
- Southampton, U.K
- C. C. Chibelushi, F. Deravi, and J. S. Mason, "Voice and facial image integration for speaker recognition," in Proc. IEEE Int. Symp. Multimedia Technologies Future Appl., Southampton, U.K., 1993.
- (1993) Proc. IEEE Int. Symp. Multimedia Technologies Future Appl
- Chibelushi, C.C.¹ Deravi, F.² Mason, J.S.³

6
- 0029393187
- Person identification using multiple cues
- Oct
- R. Brunelli and D. Falavigna, "Person identification using multiple cues," IEEE Trans. Pattern Anal. Machine Intell., vol. 10, pp. 955-965, Oct. 1995.
- (1995) IEEE Trans. Pattern Anal. Machine Intell , vol.10 , pp. 955-965
- Brunelli, R.¹ Falavigna, D.²

7
- 0032594952
- Fusion of face and speech data for person identity verification
- S. Ben-Yacoub, Y. Abdeljaoued, and E. Mayoraz, "Fusion of face and speech data for person identity verification," IEEE Trans. Neural Networks, vol. 10, pp. 1065-1074, 1999.
- (1999) IEEE Trans. Neural Networks , vol.10 , pp. 1065-1074
- Ben-Yacoub, S.¹ Abdeljaoued, Y.² Mayoraz, E.³

8
- 4544228318
- Identity verification using speech and face information
- C. Sanderson and K. K. Paliwal, "Identity verification using speech and face information," Digital Signal Processing, vol. 14, no. 5, pp. 449-480, 2004.
- (2004) Digital Signal Processing , vol.14 , Issue.5 , pp. 449-480
- Sanderson, C.¹ Paliwal, K.K.²

9
- 21844446305
- Multi-modal face and speaker identification on a handheld device
- Santa Barbara, CA
- T. J. Hazen, E. Weinstein, R. Kabir, A. Park, and B. Heisele, "Multi-modal face and speaker identification on a handheld device," in Proc. Works. Multimodal User Authentication, Santa Barbara, CA, 2003, pp. 113-120.
- (2003) Proc. Works. Multimodal User Authentication , pp. 113-120
- Hazen, T.J.¹ Weinstein, E.² Kabir, R.³ Park, A.⁴ Heisele, B.⁵

10
- 84870683671
- Integrating acoustic and labial information for speaker identification and verification
- Rhodes, Greece
- P. Jourlin, J. Luettin, D. Genoud, and H. Wassner, "Integrating acoustic and labial information for speaker identification and verification," in Proc. 5th Eur. Conf. Speech Communication Technology, Rhodes, Greece, 1997, pp. 1603-1606.
- (1997) Proc. 5th Eur. Conf. Speech Communication Technology , pp. 1603-1606
- Jourlin, P.¹ Luettin, J.² Genoud, D.³ Wassner, H.⁴

11
- 0032638088
- Robust speaker verification via fusion of speech and lip modalities
- Phoenix, AZ
- T. Wark, S. Sridharan, and V. Chandran, "Robust speaker verification via fusion of speech and lip modalities," in Proc. Inf. Conf. Acoustics, Speech Signal Processing, Phoenix, AZ, 1999, pp. 3061-3064.
- (1999) Proc. Inf. Conf. Acoustics, Speech Signal Processing , pp. 3061-3064
- Wark, T.¹ Sridharan, S.² Chandran, V.³

12
- 0008782019
- Robust speaker verification via asynchronous fusion of speech and lip information
- Washington, DC
- _, "Robust speaker verification via asynchronous fusion of speech and lip information," in Proc. 2th Int. Conf. Audio- and Video-Based Biometric Person Authentication, Washington, DC, 1999, pp. 37-42.
- (1999) Proc. 2th Int. Conf. Audio- and Video-Based Biometric Person Authentication , pp. 37-42
- Wark, T.¹ Sridharan, S.² Chandran, V.³

13
- 0033692608
- The use of temporal speech and lip information for multi-modal speaker identification via multi-stream HMMs
- Istanbul, Turkey
- _, "The use of temporal speech and lip information for multi-modal speaker identification via multi-stream HMMs," in Proc. Int. Conf. Acoustics, Speech Signal Processing, Istanbul, Turkey, 2000, pp. 2389-2392.
- (2000) Proc. Int. Conf. Acoustics, Speech Signal Processing , pp. 2389-2392
- Wark, T.¹ Sridharan, S.² Chandran, V.³

14
- 33947355927
- An audio-visual person identification and verification system using FAPs as visual features
- Santa Barbara, CA
- P. S. Aleksic and A. K. Katsaggelos, "An audio-visual person identification and verification system using FAPs as visual features," in Proc. Works. Multimedia User Authentication, Santa Barbara, CA, 2003, pp. 80-84.
- (2003) Proc. Works. Multimedia User Authentication , pp. 80-84
- Aleksic, P.S.¹ Katsaggelos, A.K.²

15
- 26844468363
- Information fusion and decision cascading for audio-visual speaker recognition based on time-varying stream reliability prediction
- Baltimore, MD, Jul. 6-9
- U. V. Chaudhari, G. N. Ramaswamy, G. Potamianos, and C. Neti, "Information fusion and decision cascading for audio-visual speaker recognition based on time-varying stream reliability prediction," in Proc. Int. Conf. Multimedia Expo, Baltimore, MD, Jul. 6-9, 2003, pp. 9-12.
- (2003) Proc. Int. Conf. Multimedia Expo , pp. 9-12
- Chaudhari, U.V.¹ Ramaswamy, G.N.² Potamianos, G.³ Neti, C.⁴

16
- 0031223878
- SESAM: A biometric person identification system using sensor fusion
- U. Dieckmann, P. Plankensteiner, and T. Wagner, "SESAM: A biometric person identification system using sensor fusion," Pattern Recogn. Lett., vol. 18, pp. 827-833, 1997.
- (1997) Pattern Recogn. Lett , vol.18 , pp. 827-833
- Dieckmann, U.¹ Plankensteiner, P.² Wagner, T.³

17
- 0031233424
- Speaker recognition: A tutorial
- Sep
- J. P. Campbell, "Speaker recognition: A tutorial," Proc. IEEE, vol. 85, no. 9, pp. 1437-1462, Sep. 1997.
- (1997) Proc. IEEE , vol.85 , Issue.9 , pp. 1437-1462
- Campbell, J.P.¹

18
- 1842499650
- W.-Y. Zhao, R. Chellappa, P. J. J. Phillips, and A. Rosenfeld, Face recognition: A literature survey, ACM Computing Survey, pp. 399-458, 2003, Dec. Issue.
- W.-Y. Zhao, R. Chellappa, P. J. J. Phillips, and A. Rosenfeld, "Face recognition: A literature survey," ACM Computing Survey, pp. 399-458, 2003, Dec. Issue.

19
- 0026065565
- Eigenfaces for recognition
- Sep
- M. Turk and A. Pentland, "Eigenfaces for recognition," J. Cognitive Neuroscience, vol. 3, no. 1, pp. 586-591, Sep. 1991.
- (1991) J. Cognitive Neuroscience , vol.3 , Issue.1 , pp. 586-591
- Turk, M.¹ Pentland, A.²

20
- 0025236073
- Application of the Karhunen-Loeve procedure for the characterization of human faces
- Jan
- M. Kirby and L. Sirovich, "Application of the Karhunen-Loeve procedure for the characterization of human faces," IEEE Trans. Pottem Anal. Mach. Intell., vol. 12, no. 1, pp. 103-108, Jan. 1990.
- (1990) IEEE Trans. Pottem Anal. Mach. Intell , vol.12 , Issue.1 , pp. 103-108
- Kirby, M.¹ Sirovich, L.²

21
- 0031185845
- Eigenfaces versus fisherfaces: Recognition using class specific linear projection
- P. N. Belhumeur, J. P. Hespanha, and D. J. Kriegman, "Eigenfaces versus fisherfaces: Recognition using class specific linear projection," IEEE Trans. Pattern Anal. Mach. Intell., vol. 19, pp. 711-720, 1997.
- (1997) IEEE Trans. Pattern Anal. Mach. Intell , vol.19 , pp. 711-720
- Belhumeur, P.N.¹ Hespanha, J.P.² Kriegman, D.J.³

22
- 1842499650
- Face recognition: A literature survey
- W. Zhao, R. Chellappa, P. J. Phillips, and A. Rosenfeld, "Face recognition: A literature survey," Proc. ACM Computing Surverys (CSUR), vol. 35, no. 4, pp. 399-458, 2003.
- (2003) Proc. ACM Computing Surverys (CSUR) , vol.35 , Issue.4 , pp. 399-458
- Zhao, W.¹ Chellappa, R.² Phillips, P.J.³ Rosenfeld, A.⁴

23
- 33947424506
- J. Luettin, Visual speech and speaker recognition, Ph.D. dissertation, Dept. Computer Science, Univ. Sheffield, Sheffield, U.K., 1997.
- J. Luettin, "Visual speech and speaker recognition," Ph.D. dissertation, Dept. Computer Science, Univ. Sheffield, Sheffield, U.K., 1997.

24
- 0031335829
- Audio-visual person recognition: An evaluation of data fusion strategies
- London, U.K
- C. C. Chibelushi, F. Deravi, and J. S. Mason, "Audio-visual person recognition: An evaluation of data fusion strategies," in Proc. Eur. Conf. Security Detection, London, U.K., 1997, pp. 26-30.
- (1997) Proc. Eur. Conf. Security Detection , pp. 26-30
- Chibelushi, C.C.¹ Deravi, F.² Mason, J.S.³

25
- 0029527336
- Automatic person recognition using acoustic and geometric features
- R. Brunelli, D. Falavigna, T. Poggio, and L. Stringa, "Automatic person recognition using acoustic and geometric features," Machine Vision Appl., vol. 8, pp. 317-325, 1995.
- (1995) Machine Vision Appl , vol.8 , pp. 317-325
- Brunelli, R.¹ Falavigna, D.² Poggio, T.³ Stringa, L.⁴

26
- 0036487270
- Noise compensation in a person verification system using face and multiple speech features
- Feb
- C. Sanderson and K. K. Paliwal, "Noise compensation in a person verification system using face and multiple speech features," Pattern Recognition, vol. 36, no. 2, pp. 293-302, Feb. 2003.
- (2003) Pattern Recognition , vol.36 , Issue.2 , pp. 293-302
- Sanderson, C.¹ Paliwal, K.K.²

27
- 0031220766
- Acoustic-labial speaker verification
- P. Jourlin, J. Luettin, D. Genoud, and H. Wassner, "Acoustic-labial speaker verification," Pattern Recogn. Lett., vol. 18, pp. 853-858, 1997.
- (1997) Pattern Recogn. Lett , vol.18 , pp. 853-858
- Jourlin, P.¹ Luettin, J.² Genoud, D.³ Wassner, H.⁴

28
- 0141855071
- Audio-visual speaker recognition using time-varying stream reliability prediction
- Hong Kong, China
- U. V. Chaudhari, G. N. Ramaswamy, G. Potamianos, and C. Neti, "Audio-visual speaker recognition using time-varying stream reliability prediction," in Proc. Int. Conf. Acoustics, Speech Signal Processing, Hong Kong, China, 2003, pp. V-712-V-715.
- (2003) Proc. Int. Conf. Acoustics, Speech Signal Processing
- Chaudhari, U.V.¹ Ramaswamy, G.N.² Potamianos, G.³ Neti, C.⁴

29
- 33745546675
- Multimodal authentication using asynchronous HMMs
- Guildford, U.K
- S. Bengio, "Multimodal authentication using asynchronous HMMs," in Proc. 4th Int. Conf. Audio- and Video-Based Biometric Person Authentication, Guildford, U.K., 2003, pp. 770-777.
- (2003) Proc. 4th Int. Conf. Audio- and Video-Based Biometric Person Authentication , pp. 770-777
- Bengio, S.¹

30
- 35248821400
- A Bayesian approach to audio-visual speaker identification
- Guildford, U.K
- A. V. Nefian, L. H. Liang, T. Fu, and X. X. Liu, "A Bayesian approach to audio-visual speaker identification," in Proc. 4th Int. Conf. Audio- and Video-Based Biometric Person Authentication, Guildford, U.K., 2003, pp. 761-769.
- (2003) Proc. 4th Int. Conf. Audio- and Video-Based Biometric Person Authentication , pp. 761-769
- Nefian, A.V.¹ Liang, L.H.² Fu, T.³ Liu, X.X.⁴

31
- 0345565778
- An audio-visual speaker identification using coupled hidden Markov models
- Spain
- T. Fu, X. X. Liu, L. H. Liang, X. Pi, and A. V. Nefian, "An audio-visual speaker identification using coupled hidden Markov models," in Proc. Int. Conf. Image Processing, Barcelona, Spain, 2003, pp. 29-32.
- (2003) Proc. Int. Conf. Image Processing, Barcelona , pp. 29-32
- Fu, T.¹ Liu, X.X.² Liang, L.H.³ Pi, X.⁴ Nefian, A.V.⁵

32
- 33745546675
- Multimodal authentication using asynchronous HMMs
- Guildford, U.K
- S. Bengio, "Multimodal authentication using asynchronous HMMs," in Proc. 4th Int. Conf. Audio- and Video-Based Biometric Person Authentication, Guildford, U.K., 2003, pp. 770-777.
- (2003) Proc. 4th Int. Conf. Audio- and Video-Based Biometric Person Authentication , pp. 770-777
- Bengio, S.¹

33
- 1842854568
- Multimodal speech processing using asynchronous hidden Markov models
- _, "Multimodal speech processing using asynchronous hidden Markov models," Information Fusion, vol. 5, pp. 81-89, 2004.
- (2004) Information Fusion , vol.5 , pp. 81-89
- Bengio, S.¹

34
- 84921606034
- Person identification using automatic integration of speech, lip. and face experts
- Berkeley, CA
- N. A. Fox, R. Gross, P. de Chazal, J. F. Cohn, and R. B. Reilly, "Person identification using automatic integration of speech, lip. and face experts," in Proc. ACM SIGMM 2003 Multimedia Biometrics Methods and Applications Workshop (WBMA'03), Berkeley, CA, 2003, pp. 25-32.
- (2003) Proc. ACM SIGMM 2003 Multimedia Biometrics Methods and Applications Workshop (WBMA'03) , pp. 25-32
- Fox, N.A.¹ Gross, R.² de Chazal, P.³ Cohn, J.F.⁴ Reilly, R.B.⁵

35
- 35248851586
- Audio-visual speaker identification based on the use of dynamic audio and visual features
- Guildford, U.K
- N. A. Fox and R. B. Reilly, "Audio-visual speaker identification based on the use of dynamic audio and visual features," in Proc. 4th Int. Conf. Audio- and Video-Based Biometric Person Authentication, Guildford, U.K., 2003, pp. 743-751.
- (2003) Proc. 4th Int. Conf. Audio- and Video-Based Biometric Person Authentication , pp. 743-751
- Fox, N.A.¹ Reilly, R.B.²

36
- 84867083799
- Fusion of person authentication probabilities by Bayesian statistics
- Washington, DC
- Y. Abdeljaoued, "Fusion of person authentication probabilities by Bayesian statistics," in Proc. 2nd Int. Conf. Audio- and Video-Based Biometric Person Authentication, Washington, DC, 1999, pp. 172-175.
- (1999) Proc. 2nd Int. Conf. Audio- and Video-Based Biometric Person Authentication , pp. 172-175
- Abdeljaoued, Y.¹

37
- 0345565788
- Multimodal speaker identification with audio-video processing
- Barcelona, Spain
- Y. Yemez, A. Kanak, E. Erzin, and A. M. Tekalp, "Multimodal speaker identification with audio-video processing," in Proc. Int. Conf. Image Processing, Barcelona, Spain, 2003, pp. 5-8.
- (2003) Proc. Int. Conf. Image Processing , pp. 5-8
- Yemez, Y.¹ Kanak, A.² Erzin, E.³ Tekalp, A.M.⁴

38
- 0141590222
- Joint audio-video processing for biometric speaker identification
- Hong Kong, China
- A. Kanak, E. Erzin, Y. Yemez, and A. M. Tekalp, "Joint audio-video processing for biometric speaker identification," in Proc. Int. Conf. Acoustic, Speech Signal Processing, Hong Kong, China, 2003, pp. 561-564.
- (2003) Proc. Int. Conf. Acoustic, Speech Signal Processing , pp. 561-564
- Kanak, A.¹ Erzin, E.² Yemez, Y.³ Tekalp, A.M.⁴

39
- 26844533276
- Multimodal speaker identification using an adaptive classifier cascade based on modality reliability
- Oct
- E. Erzin, Y. Yemez, and A. M. Tekalp, "Multimodal speaker identification using an adaptive classifier cascade based on modality reliability," IEEE Trans. Multimedia, vol. 7, no. 5, pp. 840-852, Oct. 2005.
- (2005) IEEE Trans. Multimedia , vol.7 , Issue.5 , pp. 840-852
- Erzin, E.¹ Yemez, Y.² Tekalp, A.M.³

40
- 33947376189
- Multimodal speaker identification using canonical correlation analysis
- Toulouse, France, May
- M. E. Sargin, E. Erzin, Y. Yemez, and A. M. Tekalp, "Multimodal speaker identification using canonical correlation analysis," in IEEE Proc. Int. Conf. Acoustics, Speech Signal Processing, Toulouse, France, May 2006, pp. 613-616.
- (2006) IEEE Proc. Int. Conf. Acoustics, Speech Signal Processing , pp. 613-616
- Sargin, M.E.¹ Erzin, E.² Yemez, Y.³ Tekalp, A.M.⁴

41
- 0031223556
- Combining evidence in personal identity verification systems
- J. Kittler, J. Matas, K. Johnsson, and M. U. Ramos-Sánchez, "Combining evidence in personal identity verification systems," Pattern Recogn. Lett., vol. 18, pp. 845-852, 1997.
- (1997) Pattern Recogn. Lett , vol.18 , pp. 845-852
- Kittler, J.¹ Matas, J.² Johnsson, K.³ Ramos-Sánchez, M.U.⁴

42
- 0032021555
- On combining classifiers
- J. Kittler, M. Hatef, R. P. W. Duin, and J. Matas, "On combining classifiers," IEEE Trans. Pattern Anal. Machine Intell., vol. 20, pp. 226-239, 1998.
- (1998) IEEE Trans. Pattern Anal. Machine Intell , vol.20 , pp. 226-239
- Kittler, J.¹ Hatef, M.² Duin, R.P.W.³ Matas, J.⁴

43
- 84953735755
- Fusion of multiple experts in multimodal biometric personal identity verification systems
- Switzerland
- J. Kittler and K. Messer, "Fusion of multiple experts in multimodal biometric personal identity verification systems," in Proc. 12th IEEE Workshop Neural Networks Sig. Processing, Switzerland, 2002, pp. 3-12.
- (2002) Proc. 12th IEEE Workshop Neural Networks Sig. Processing , pp. 3-12
- Kittler, J.¹ Messer, K.²

44
- 84947902509
- Expert conciliation for multi modal person authentication systems by Bayesian statistics
- Crans-Montana, Switzerland, Mar
- E. S. Bigun, J. Bigun, B. Due, and S. Fisher, "Expert conciliation for multi modal person authentication systems by Bayesian statistics," in Proc. 1st Int. Conf. Audio- and Video-Based Biometric Person Authentication, Crans-Montana, Switzerland, Mar. 1997, pp. 291-300.
- (1997) Proc. 1st Int. Conf. Audio- and Video-Based Biometric Person Authentication , pp. 291-300
- Bigun, E.S.¹ Bigun, J.² Due, B.³ Fisher, S.⁴

45
- 0033899298
- BiolD: A multimodal biometric identification system
- R. W. Frischholz and U. Dieckmann, "BiolD: A multimodal biometric identification system," Computer, vol. 33, pp. 64-68, 2000.
- (2000) Computer , vol.33 , pp. 64-68
- Frischholz, R.W.¹ Dieckmann, U.²

46
- 33947430259
- Methods and apparatus for audio-visual speaker recognition and utterance verification,
- U.S. Patent 6 219 640
- S. Basu, H. S. M. Beigi, S. H. Maes, M. Ghislain, E. Benoit, C. Neti, and A. W. Senior, "Methods and apparatus for audio-visual speaker recognition and utterance verification," U.S. Patent 6 219 640, 1999.
- (1999)
- Basu, S.¹ Beigi, H.S.M.² Maes, S.H.³ Ghislain, M.⁴ Benoit, E.⁵ Neti, C.⁶ Senior, A.W.⁷

47
- 0032295436
- Integrating faces and fingerprints for personal identification
- L. Hong and A. Jain, "Integrating faces and fingerprints for personal identification," IEEE Trans. Pattern Anal. Machine Intell., vol. 20, pp. 1295-1307, 1998.
- (1998) IEEE Trans. Pattern Anal. Machine Intell , vol.20 , pp. 1295-1307
- Hong, L.¹ Jain, A.²

48
- 0030647922
- An approach to speaker identification using multiple classifiers
- Munich, Germany
- V. Radova and J. Psutka, "An approach to speaker identification using multiple classifiers," in Proc. IEEE Conf. Acoustics, Speech Signal Processing, Munich, Germany, 1997, vol. 2, pp. 1135-1138.
- (1997) Proc. IEEE Conf. Acoustics, Speech Signal Processing , vol.2 , pp. 1135-1138
- Radova, V.¹ Psutka, J.²

49
- 0038343934
- Information fusion in biometrics
- A. Ross and A. Jain, "Information fusion in biometrics," Pattern Rccogn. Lett., vol. 24, pp. 2115-2125, 2003.
- (2003) Pattern Rccogn. Lett , vol.24 , pp. 2115-2125
- Ross, A.¹ Jain, A.²

50
- 0001966565
- Multimodal decision-level fusion for person authentication
- Nov
- V. Chatzis, A. G. Bors, and I. Pitas, "Multimodal decision-level fusion for person authentication," IEEE Trans. Systems, Man, Cybernetics, Part A: Syst. Humans, vol. 29, no. 5, pp. 674-680, Nov. 1999.
- (1999) IEEE Trans. Systems, Man, Cybernetics, Part A: Syst. Humans , vol.29 , Issue.5 , pp. 674-680
- Chatzis, V.¹ Bors, A.G.² Pitas, I.³

51
- 33947359565
- Robust automatic human identification using face, mouth, and acoustic information
- Beijing, China, Oct
- N. Fox, R. Gross, J. Cohn, and R. B. Reilly, "Robust automatic human identification using face, mouth, and acoustic information," in Proc. Int. Workshop Analysis Modeling of Faces and Gestures, Beijing, China, Oct. 2005, pp. 263-277.
- (2005) Proc. Int. Workshop Analysis Modeling of Faces and Gestures , pp. 263-277
- Fox, N.¹ Gross, R.² Cohn, J.³ Reilly, R.B.⁴

52
- 0035791629
- Consideration of Lombard effect for speechreading
- F. J. Huang and T. Chen, "Consideration of Lombard effect for speechreading," in Proc. Works. Multimedia Signal Process, 2001, pp. 613-618.
- (2001) Proc. Works. Multimedia Signal Process , pp. 613-618
- Huang, F.J.¹ Chen, T.²

53
- 0031238278
- Biometrics: Privacy's foe or privacy's friend?
- J. D. Woodward, "Biometrics: Privacy's foe or privacy's friend?" Proc. IEEE, vol. 85, pp. 1480-1492, 1997.
- (1997) Proc. IEEE , vol.85 , pp. 1480-1492
- Woodward, J.D.¹

54
- 0031187171
- Speech recognition by machines and humans
- R. P. Lippmann, "Speech recognition by machines and humans," Speech Commun., vol. 22, no. 1, pp. 1-15, 1997.
- (1997) Speech Commun , vol.22 , Issue.1 , pp. 1-15
- Lippmann, R.P.¹

55
- 0032178592
- Quantitative association of vocal-tract and facial behavior
- H. Yehia, P. Rubin, and E. Vatikiotis-Bateson, "Quantitative association of vocal-tract and facial behavior," Speech Commun., vol. 26, no. 1-2, pp. 23-43, 1998.
- (1998) Speech Commun , vol.26 , Issue.1-2 , pp. 23-43
- Yehia, H.¹ Rubin, P.² Vatikiotis-Bateson, E.³

56
- 0036874551
- On the relationship between face movements, tongue movements, and speech acoustics
- Nov
- J. Jiang, A. Alwan, P. A. Keating, E. T. Auer, Jr., and L. E. Bernstein, "On the relationship between face movements, tongue movements, and speech acoustics," EURASIP J. Appl. Signal Processing, vol. 2002, no. 11, pp. 1174-1188, Nov. 2002.
- (2002) EURASIP J. Appl. Signal Processing , vol.2002 , Issue.11 , pp. 1174-1188
- Jiang, J.¹ Alwan, A.² Keating, P.A.³ Auer Jr., E.T.⁴ Bernstein, L.E.⁵

57
- 0012725678
- Estimation of speech acoustics from visual speech features: A comparison of linear and non-linear models
- Santa Cruz, CA
- J. P. Barker and F. Berthommier, "Estimation of speech acoustics from visual speech features: A comparison of linear and non-linear models," in Proc. Int. Conf. Auditory Visual Speech Processing, Santa Cruz, CA, 1999, pp. 112-117.
- (1999) Proc. Int. Conf. Auditory Visual Speech Processing , pp. 112-117
- Barker, J.P.¹ Berthommier, F.²

58
- 0001473062
- Using speech acoustics to drive facial motion
- San Francisco, CA
- H. C. Yehia, T. Kuratate, and E. Vatikiotis-Bateson, "Using speech acoustics to drive facial motion," in Proc. 14th Int. Congr. Phonetic Sciences, San Francisco, CA, 1999, pp. 631-634.
- (1999) Proc. 14th Int. Congr. Phonetic Sciences , pp. 631-634
- Yehia, H.C.¹ Kuratate, T.² Vatikiotis-Bateson, E.³

59
- 0034853042
- Measuring the relation between speech acoustics and 2-D facial motion
- Salt Lake City, UT
- A. V. Barbosa and H. C. Yehia, "Measuring the relation between speech acoustics and 2-D facial motion," in Proc. Int. Conf. Acoustics, Speech Signal Processing, Salt Lake City, UT, 2001, vol. 1, pp. 181-184.
- (2001) Proc. Int. Conf. Acoustics, Speech Signal Processing , vol.1 , pp. 181-184
- Barbosa, A.V.¹ Yehia, H.C.²

60
- 2542499812
- Speech-to-video synthesis using MPEG-4 compliant visual features
- May
- P. S. Aleksic and A. K. Katsaggelos, "Speech-to-video synthesis using MPEG-4 compliant visual features," IEEE Trans. CSVT, Special Issue Audio Video Analysis for Multimedia Interactive Sea-ices, pp. 682-692, May 2004.
- (2004) IEEE Trans. CSVT, Special Issue Audio Video Analysis for Multimedia Interactive Sea-ices , pp. 682-692
- Aleksic, P.S.¹ Katsaggelos, A.K.²

61
- 0002028032
- Some preliminaries to a comprehensive account of audio-visual speech perception
- R. Campbell and B. Dodd, Eds. London, U.K, Lawrence Erlbaum
- A. Q. Summerfield, "Some preliminaries to a comprehensive account of audio-visual speech perception," in Hearing by Eye: The Psychology of Lip-Reading, R. Campbell and B. Dodd, Eds. London, U.K.: Lawrence Erlbaum, 1987, pp. 3-51.
- (1987) Hearing by Eye: The Psychology of Lip-Reading , pp. 3-51
- Summerfield, A.Q.¹

62
- 0032072433
- Speech recognition and sensory integration
- D. W. Massaro and D. G. Stork, "Speech recognition and sensory integration," Amer. Scientist, vol. 86, no. 3, pp. 236-244, 1998.
- (1998) Amer. Scientist , vol.86 , Issue.3 , pp. 236-244
- Massaro, D.W.¹ Stork, D.G.²

63
- 0036650527
- An HMM-based speech-to-video synthesizer
- Jul
- J. J. Williams and A. K. Katsaggelos, "An HMM-based speech-to-video synthesizer," IEEE Trans. Neural Networks, Special Issue Intelligent Multimedia, vol. 13, no. 4, pp. 900-915, Jul. 2002.
- (2002) IEEE Trans. Neural Networks, Special Issue Intelligent Multimedia , vol.13 , Issue.4 , pp. 900-915
- Williams, J.J.¹ Katsaggelos, A.K.²

64
- 0018701386
- Use of visual information in phonetic perception
- Q. Summerfield, "Use of visual information in phonetic perception," Phonetica, vol. 36, pp. 314-331, 1979.
- (1979) Phonetica , vol.36 , pp. 314-331
- Summerfield, Q.¹

65
- 0027128576
- Lipreading and audio-visual speech perception
- _, "Lipreading and audio-visual speech perception," Phil. Trans. R. Soc. Lond. B., vol. 335, pp. 71-78, 1992.
- (1992) Phil. Trans. R. Soc. Lond. B , vol.335 , pp. 71-78
- Summerfield, Q.¹

66
- 0025767028
- Evaluating the articulation index for auditory-visual input
- Jun
- K. W. Grant and L. D. Braida, "Evaluating the articulation index for auditory-visual input," J. Acoustical Soc. Amer., vol. 89, pp. 2950-2960, Jun. 1991.
- (1991) J. Acoustical Soc. Amer , vol.89 , pp. 2950-2960
- Grant, K.W.¹ Braida, L.D.²

67
- 0003418124
- Amsterdam, The Netherlands: Mouton
- G. Fant, Acoustic Theory of Speech Production, S-Gravenhage. Amsterdam, The Netherlands: Mouton, 1960.
- (1960) Acoustic Theory of Speech Production, S-Gravenhage
- Fant, G.¹

68
- 0003757962
- Berlin, Germany: Springer-Verlag
- J. L. Flanagan, Speech Analysis Synthesis and Perception. Berlin, Germany: Springer-Verlag, 1965.
- (1965) Speech Analysis Synthesis and Perception
- Flanagan, J.L.¹

69
- 0012730684
- Articulatory-acoustic models for fricative consonants
- Jun
- S. Narayanan and A. Alwan, "Articulatory-acoustic models for fricative consonants," IEEE Trans. Speech Audio Processing, vol. 8, no. 3, pp. 328-344, Jun. 2000.
- (2000) IEEE Trans. Speech Audio Processing , vol.8 , Issue.3 , pp. 328-344
- Narayanan, S.¹ Alwan, A.²

70
- 0028259480
- Techniques for estimating vocal-tract shapes from the speech signal
- Feb
- J. Schroeter and M. Sondhi, "Techniques for estimating vocal-tract shapes from the speech signal," IEEE Trans. Speech Audio Processing, vol. 2, no. 1, pp. 133-150, Feb. 1994.
- (1994) IEEE Trans. Speech Audio Processing , vol.2 , Issue.1 , pp. 133-150
- Schroeter, J.¹ Sondhi, M.²

71
- 0017199877
- Hearing lips and seeing voices
- H. McGurk and J. MacDonald, "Hearing lips and seeing voices," Nature, vol. 264, pp. 746-748, 1976.
- (1976) Nature , vol.264 , pp. 746-748
- McGurk, H.¹ MacDonald, J.²

72
- 0032074310
- Audio-visual integration in multimodal communication
- May
- T. Chen and R. R. Rao, "Audio-visual integration in multimodal communication," Proc. IEEE, vol. 86, no. 5, pp. 837-852, May 1998.
- (1998) Proc. IEEE , vol.86 , Issue.5 , pp. 837-852
- Chen, T.¹ Rao, R.R.²

73
- 0034448810
- Designing the user interface for multimodal speech and pen-based gesture applications: State-of-the-art systems and research directions
- Aug
- S. Oviatt, P. Cohen, L. Wu, J. Vergo, L. Duncan, B. Suhm, J. Bers, T. Holzman, T, Winograd, J. Landay, J. Larson, and D. Ferro, "Designing the user interface for multimodal speech and pen-based gesture applications: State-of-the-art systems and research directions," Human-Computer Interaction, vol. 15, no. 4, pp. 263-322, Aug. 2000.
- (2000) Human-Computer Interaction , vol.15 , Issue.4 , pp. 263-322
- Oviatt, S.¹ Cohen, P.² Wu, L.³ Vergo, J.⁴ Duncan, L.⁵ Suhm, B.⁶ Bers, J.⁷ Holzman, T.⁸ Winograd, T.⁹ Landay, J.¹⁰ Larson, J.¹¹ Ferro, D.¹²

74
- 0034509487
- Multimodal Speech synthesis
- New York
- J. Schroeter, J. Ostermann, H. P. Graf, M. Beutnagel, E. Cosatto, A. Syrdal, A. Conkie, and Y. Stylianou, "Multimodal Speech synthesis," in Proc. Int. Conf. Multimedia Expo, New York, 2000, pp. 571-574.
- (2000) Proc. Int. Conf. Multimedia Expo , pp. 571-574
- Schroeter, J.¹ Ostermann, J.² Graf, H.P.³ Beutnagel, M.⁴ Cosatto, E.⁵ Syrdal, A.⁶ Conkie, A.⁷ Stylianou, Y.⁸

75
- 0036502797
- A review of speech-based bimodal recognition
- Mar
- C. C. Chibelushi, F. Deravi, and J. S. D. Mason, "A review of speech-based bimodal recognition," IEEE Trans. Multimedia, vol. 4, no. 1, pp. 23-37, Mar. 2002.
- (2002) IEEE Trans. Multimedia , vol.4 , Issue.1 , pp. 23-37
- Chibelushi, C.C.¹ Deravi, F.² Mason, J.S.D.³

76
- 0003544881
- D. G. Stork and M. E. Hennecke, Eds, Berlin, Germany: Springer
- D. G. Stork and M. E. Hennecke, Eds., Speechreading by Humans and Machines. Berlin, Germany: Springer, 1996.
- (1996) Speechreading by Humans and Machines

77
- 33947376624
- Exploiting visual information in automatic speech processing
- A. Bovik, Ed. New York: Academic, Jun
- P. S. Aleksic, G. Potamianos, and A. K. Katsaggelos, "Exploiting visual information in automatic speech processing," in Handbook of Image and Video Processing, A. Bovik, Ed. New York: Academic, Jun. 2005, pp. 1263-1289.
- (2005) Handbook of Image and Video Processing , pp. 1263-1289
- Aleksic, P.S.¹ Potamianos, G.² Katsaggelos, A.K.³

78
- 0003699540
- Automatic lipreading to enhance speech recognition,
- Ph.D. dissertation, Univ. Illinois at Urbana-Champaign, Urbana, IL
- E. Petajan, "Automatic lipreading to enhance speech recognition," Ph.D. dissertation, Univ. Illinois at Urbana-Champaign, Urbana, IL, 1984.
- (1984)
- Petajan, E.¹

79
- 0034270644
- Audio-visual speech modeling for continuous speech recognition
- Sep
- S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, no. 3, pp. 141-151, Sep. 2000.
- (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
- Dupont, S.¹ Luettin, J.²

80
- 4544290191
- Recent advances in the automatic recognition of audiovisual speech
- Sep
- G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. W. Senior, "Recent advances in the automatic recognition of audiovisual speech," Proc. IEEE, vol. 91, no. 9, pp. 1306-1326, Sep. 2003.
- (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
- Potamianos, G.¹ Neti, C.² Gravier, G.³ Garg, A.⁴ Senior, A.W.⁵

81
- 15044345504
- Audio-visual automatic speech recognition: An overview
- G. Bailly, E. Vatikiotis-Bateson, and P. Perrier, Eds. Cambridge, MA: MIT Press
- G. Potamianos, C. Neti, J. Luettin, and I. Matthews, "Audio-visual automatic speech recognition: An overview," in Issues in Visual and Audio-Visual Speech Processing, G. Bailly, E. Vatikiotis-Bateson, and P. Perrier, Eds. Cambridge, MA: MIT Press, 2004.
- (2004) Issues in Visual and Audio-Visual Speech Processing
- Potamianos, G.¹ Neti, C.² Luettin, J.³ Matthews, I.⁴

82
- 0036874915
- Audio-visual speech recognition using MPEG-4 compliant visual features
- Nov
- P. S. Aleksic, J. J. Williams, Z. Wu, and A. K. Katsaggelos, "Audio-visual speech recognition using MPEG-4 compliant visual features," EURASIP J. Appl. Signal Processing, vol. 2002, no. 11, pp. 1213-1227, Nov. 2002.
- (2002) EURASIP J. Appl. Signal Processing , vol.2002 , Issue.11 , pp. 1213-1227
- Aleksic, P.S.¹ Williams, J.J.² Wu, Z.³ Katsaggelos, A.K.⁴

83
- 85032752352
- Audiovisual speech processing. Lip reading and lip synchronization
- Jan
- T. Chen, "Audiovisual speech processing. Lip reading and lip synchronization," IEEE Signal Processing Mag., vol. 18, no. 1, pp. 9-21, Jan. 2001.
- (2001) IEEE Signal Processing Mag , vol.18 , Issue.1 , pp. 9-21
- Chen, T.¹

84
- 0003424145
- Englewood Cliffs, NJ: Macmillan
- J. R. Deller, Jr., J. G. Proakis, and J. H. L. Hansen, Discrete-Time Processing of Speech Signals. Englewood Cliffs, NJ: Macmillan, 1993.
- (1993) Discrete-Time Processing of Speech Signals
- Deller Jr., J.R.¹ Proakis, J.G.² Hansen, J.H.L.³

85
- 33947395786
- R. Campbell, B. Dodd, and D. Burnham, Eds., Hearing by Eye II: Advances in the Psychology of Speechreading and Auditory Visual Speech. Hove, U.K.: Psychology Press, 1998.
- R. Campbell, B. Dodd, and D. Burnham, Eds., Hearing by Eye II: Advances in the Psychology of Speechreading and Auditory Visual Speech. Hove, U.K.: Psychology Press, 1998.

86
- 33947426576
- S. Young, G. Evermann, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK Book. London, U.K.: Entropic, 2005.
- S. Young, G. Evermann, T. Hain, D. Kershaw, G. Moore, J. Odell, D. Ollason, D. Povey, V. Valtchev, and P. Woodland, The HTK Book. London, U.K.: Entropic, 2005.

87
- 0012745879
- Rationale for phoneme-viseme mapping and feature selection in visual speech recognition
- D. G. Stork and M. E. Hennecke, Eds. Berlin, Germany: Springer
- A. J. Goldschen, O. N. Garcia, and E. D. Petajan, "Rationale for phoneme-viseme mapping and feature selection in visual speech recognition," in Speechreading by Humans and Machines, D. G. Stork and M. E. Hennecke, Eds. Berlin, Germany: Springer, 1996, pp. 505-515.
- (1996) Speechreading by Humans and Machines , pp. 505-515
- Goldschen, A.J.¹ Garcia, O.N.² Petajan, E.D.³

88
- 0004244302
- Englewood Cliffs, NJ: Prentice Hall
- L. Rabiner and B.-H. Juang, Fundamentals of Speech Recognition. Englewood Cliffs, NJ: Prentice Hall, 1993.
- (1993) Fundamentals of Speech Recognition
- Rabiner, L.¹ Juang, B.-H.²

89
- 0032785783
- Auditory processing of speech signals for robust speech recognition in real-world noisy environments
- Jan
- D.-S. Kim, S.-Y. Lee, and R. M. Kil, "Auditory processing of speech signals for robust speech recognition in real-world noisy environments," IEEE Trans. Speech Audio Processing, vol. 7, no. 1, pp. 55-69, Jan. 1999.
- (1999) IEEE Trans. Speech Audio Processing , vol.7 , Issue.1 , pp. 55-69
- Kim, D.-S.¹ Lee, S.-Y.² Kil, R.M.³

90
- 84892178050
- Spectral subband centroids features for speech recognition
- Seattle, WA
- K. K. Paliwal, "Spectral subband centroids features for speech recognition," in Proc. Int. Conf. Acoustics, Speech and Signal Processing, Seattle, WA, 1998, vol. 2, pp. 617-620.
- (1998) Proc. Int. Conf. Acoustics, Speech and Signal Processing , vol.2 , pp. 617-620
- Paliwal, K.K.¹

91
- 0141702085
- Environmental sniffing: Noise knowledge estimation for robust speech systems
- Hong Kong, China
- M. Akbacak and J. H. L. Hansen, "Environmental sniffing: Noise knowledge estimation for robust speech systems," in Proc. Int. Conf. Acoustics, Speech and Signal Processing, Hong Kong, China, 2003, vol. 2, pp. 113-116.
- (2003) Proc. Int. Conf. Acoustics, Speech and Signal Processing , vol.2 , pp. 113-116
- Akbacak, M.¹ Hansen, J.H.L.²

92
- 0031672526
- Neutral networks-based face detection
- Jan
- H. A. Rowley, S. Baluja, and T. Kanade, "Neutral networks-based face detection," IEEE Trans. Pattern Anal. Machine Intell., vol. 20, no. 1, pp. 23-38, Jan. 1998.
- (1998) IEEE Trans. Pattern Anal. Machine Intell , vol.20 , Issue.1 , pp. 23-38
- Rowley, H.A.¹ Baluja, S.² Kanade, T.³

93
- 0002656434
- Face and feature finding for a face recognition system
- Washington, DC
- A. W. Senior, "Face and feature finding for a face recognition system," in Proc. Int. Conf. Audio Video-based Biometric Person Authentication, Washington, DC, 1999, pp. 154-159.
- (1999) Proc. Int. Conf. Audio Video-based Biometric Person Authentication , pp. 154-159
- Senior, A.W.¹

94
- 0031648023
- Example-based learning for view-based human face detection
- K. Sung and T. Poggio, "Example-based learning for view-based human face detection," IEEE Trans. Pattern Anal. Machine Intell., vol. 20, no. 1, pp. 39-51, 1998.
- (1998) IEEE Trans. Pattern Anal. Machine Intell , vol.20 , Issue.1 , pp. 39-51
- Sung, K.¹ Poggio, T.²

95
- 0035438492
- Face detection: A survey
- Sep
- E. Hjelmas and B. K. Low, "Face detection: A survey," Computer Vision and Image Understanding, vol. 83, no. 3, pp. 236-274, Sep. 2001.
- (2001) Computer Vision and Image Understanding , vol.83 , Issue.3 , pp. 236-274
- Hjelmas, E.¹ Low, B.K.²

96
- 0036223025
- Detecting faces in images: A survey
- Jan
- M.-H. Yang, D. Kriegman, and N. Ahuja, "Detecting faces in images: A survey," IEEE Trans. Pattern Anal. Machine Intell., vol. 24, no. 1, pp. 34-58, Jan. 2002.
- (2002) IEEE Trans. Pattern Anal. Machine Intell , vol.24 , Issue.1 , pp. 34-58
- Yang, M.-H.¹ Kriegman, D.² Ahuja, N.³

97
- 24644442878
- Face recognition based on frontal views generated from non-frontal images
- V. Blanz, P. Grother, P. J. Phillips, and T. Vetter, "Face recognition based on frontal views generated from non-frontal images," in Proc. Computer Vision Pattern Recognition, 2005, pp. 454-461.
- (2005) Proc. Computer Vision Pattern Recognition , pp. 454-461
- Blanz, V.¹ Grother, P.² Phillips, P.J.³ Vetter, T.⁴

98
- 27744546990
- On transforming statistical models for non-frontal face verification
- C. Sanderson, S. Bengio, and Y. Gao, "On transforming statistical models for non-frontal face verification," Pattern Recognition, vol. 39, no. 2, pp. 288-302, 2006.
- (2006) Pattern Recognition , vol.39 , Issue.2 , pp. 288-302
- Sanderson, C.¹ Bengio, S.² Gao, Y.³

99
- 27844534088
- A survey of approaches and challenges in 3-D and multi-modal 3-D face recognition
- K. W. Bowyer, K. Chang, and P. Flynn, "A survey of approaches and challenges in 3-D and multi-modal 3-D face recognition," Computer Vision Image Understanding, vol. 101, no. 1, pp. 1-15, 2006.
- (2006) Computer Vision Image Understanding , vol.101 , Issue.1 , pp. 1-15
- Bowyer, K.W.¹ Chang, K.² Flynn, P.³

100
- 11144226973
- Recent advances in visual and infrared face recognition - A review
- S. G. Kong, J. Heo, B. R. Abidi, J. Paik, and M. A. Abidi, "Recent advances in visual and infrared face recognition - A review," Computer Vision Image Understanding, vol. 97, no. 1, pp. 103-135, 2005.
- (2005) Computer Vision Image Understanding , vol.97 , Issue.1 , pp. 103-135
- Kong, S.G.¹ Heo, J.² Abidi, B.R.³ Paik, J.⁴ Abidi, M.A.⁵

101
- 0000417467
- Visionary speech: Looking ahead to practical speechreading systems
- D. G. Stork and M. E. Hennecke, Eds. Berlin, Germany: Springer
- M. E. Hennecke, D. G. Stork, and K. V. Prasad, "Visionary speech: Looking ahead to practical speechreading systems," in Speechreading by Humans and Machines, D. G. Stork and M. E. Hennecke, Eds. Berlin, Germany: Springer, 1996, pp. 331-349.
- (1996) Speechreading by Humans and Machines , pp. 331-349
- Hennecke, M.E.¹ Stork, D.G.² Prasad, K.V.³

102
- 4544329810
- Comparison of low- and high-level visual features for audio-visual continuous automatic speech recognition
- Montreal, Canada
- P. S. Aleksic and A. K. Katsaggelos, "Comparison of low- and high-level visual features for audio-visual continuous automatic speech recognition," in Proc. Jnt. Conf. Acoustics, Speech Signal Processing, Montreal, Canada, 2004, pp. 917-920.
- (2004) Proc. Jnt. Conf. Acoustics, Speech Signal Processing , pp. 917-920
- Aleksic, P.S.¹ Katsaggelos, A.K.²

103
- 84908265391
- A comparison of model and transform-based visual features for audio-visual LVCSR
- I. Matthews, G. Potamianos, C. Neti, and J. Luettin, "A comparison of model and transform-based visual features for audio-visual LVCSR," in Proc. Int. Conf. Multimedia Expo, 2001, pp. 22-25.
- (2001) Proc. Int. Conf. Multimedia Expo , pp. 22-25
- Matthews, I.¹ Potamianos, G.² Neti, C.³ Luettin, J.⁴

104
- 0031672526
- Neural network based face detection
- Jan
- H. A. Rowley, S. Baluja, and T. Kanade, "Neural network based face detection," IEEE Trans. Pattern Anal. Machine Intell., vol. 20, no. 1, pp. 23-38, Jan. 1998.
- (1998) IEEE Trans. Pattern Anal. Machine Intell , vol.20 , Issue.1 , pp. 23-38
- Rowley, H.A.¹ Baluja, S.² Kanade, T.³

105
- 0035680116
- Rapid object detection using a boosted cascade of simple features
- Kauai, HI, Dec. 11-13
- P. Viola and M. Jones, "Rapid object detection using a boosted cascade of simple features," in Proc. Conf. Computer Vision Pattern Recognition, Kauai, HI, Dec. 11-13, 2001, pp. 511-518.
- (2001) Proc. Conf. Computer Vision Pattern Recognition , pp. 511-518
- Viola, P.¹ Jones, M.²

106
- 0031361424
- Robust recognition of faces and facial features with a multi-modal system
- Orlando, FL
- H. P. Graf, E. Cosatto, and G. Potamianos, "Robust recognition of faces and facial features with a multi-modal system," in Proc. Jnt. Conf. Systems, Man, Cybernetics, Orlando, FL, 1997, pp. 2034-2039.
- (1997) Proc. Jnt. Conf. Systems, Man, Cybernetics , pp. 2034-2039
- Graf, H.P.¹ Cosatto, E.² Potamianos, G.³

107
- 84925639646
- Real-time lip tracking and bimodal continuous speech recognition
- Redondo Beach, CA
- M. T. Chan, Y. Zhang, and T. S. Huang, "Real-time lip tracking and bimodal continuous speech recognition," in Proc. Workshop Multimedia Signal Processing, Redondo Beach, CA, 1998, pp. 65-70.
- (1998) Proc. Workshop Multimedia Signal Processing , pp. 65-70
- Chan, M.T.¹ Zhang, Y.² Huang, T.S.³

108
- 84931090061
- Liveness verification in audio-video authentication
- Jeju Island, Korea
- G. Chetty and M. Wagner, '"Liveness" verification in audio-video authentication," in Proc. Int. Conf. Spoken Language Processing, Jeju Island, Korea, 2004, pp. 2509-2512.
- (2004) Proc. Int. Conf. Spoken Language Processing , pp. 2509-2512
- Chetty, G.¹ Wagner, M.²

109
- 34250090755
- Snakes: Active contour models
- M. Kass, A. Witkin, and D. Terzopoulos, "Snakes: Active contour models," Int. J. Computer Vision, vol. 4, no. 4, pp. 321-331, 1988.
- (1988) Int. J. Computer Vision , vol.4 , Issue.4 , pp. 321-331
- Kass, M.¹ Witkin, A.² Terzopoulos, D.³

110
- 0026903014
- Feature extraction from faces using deformable templates
- A. L. Yuille, P. W. Hallinan, and D. S. Cohen, "Feature extraction from faces using deformable templates," Int. J. Computer Vision, vol. 8, no. 2, pp. 99-111, 1992.
- (1992) Int. J. Computer Vision , vol.8 , Issue.2 , pp. 99-111
- Yuille, A.L.¹ Hallinan, P.W.² Cohen, D.S.³

111
- 84957810778
- Active appearance models
- Freiburg, Germany
- T. F. Cootes, G. J. Edwards, and C. J. Taylor, "Active appearance models," in Proc. Eur. Conf. Computer Vision, Freiburg, Germany, 1998, pp. 484-498.
- (1998) Proc. Eur. Conf. Computer Vision , pp. 484-498
- Cootes, T.F.¹ Edwards, G.J.² Taylor, C.J.³

112
- 0003922190
- Hoboken, NJ: Wiley
- R. O. Duda, P. E. Hart, and D. G. Stork, Pattern Classification. Hoboken, NJ: Wiley, 2001.
- (2001) Pattern Classification
- Duda, R.O.¹ Hart, P.E.² Stork, D.G.³

113
- 21244474602
- Audio-visual speaker recognition for broadcast news: Some fusion techniques
- Copenhagen, Denmark
- B. Maison, C. Neti, and A. Senior, "Audio-visual speaker recognition for broadcast news: Some fusion techniques," in Proc. Works. Multimedia Signal Processing, Copenhagen, Denmark, 1999, pp. 161-167.
- (1999) Proc. Works. Multimedia Signal Processing , pp. 161-167
- Maison, B.¹ Neti, C.² Senior, A.³

114
- 85135321224
- See me, hear me: Integrating automatic speech recognition and lip-reading
- Yokohama, Japan, Sep. 18-22
- P. Duchnowski, U. Meier, and A. Waibel, "See me, hear me: Integrating automatic speech recognition and lip-reading," in Proc. Jnt. Conf. Spoken Long. Processing, Yokohama, Japan, Sep. 18-22, 1994, pp. 547-550.
- (1994) Proc. Jnt. Conf. Spoken Long. Processing , pp. 547-550
- Duchnowski, P.¹ Meier, U.² Waibel, A.³

115
- 0032314380
- An image transform approach for HMM based automatic lipreading
- Chicago, IL, Oct. 4-7
- G. Potamianos, H. P. Graf, and E. Cosatto, "An image transform approach for HMM based automatic lipreading," in Proc. Int. Conf. Image Processing, Chicago, IL, Oct. 4-7, 1998, vol. 1, pp. 173-177.
- (1998) Proc. Int. Conf. Image Processing , vol.1 , pp. 173-177
- Potamianos, G.¹ Graf, H.P.² Cosatto, E.³

116
- 33749247429
- Comparison of MPEG-4 facial animation parameter groups with respect to audio-visual speech recognition performance
- Italy, Sep
- P. S. Aleksic and A. K. Katsaggelos, "Comparison of MPEG-4 facial animation parameter groups with respect to audio-visual speech recognition performance," in Proc. Int. Conf. Image Processing. Italy, Sep. 2005, vol. 5, pp. 501-504.
- (2005) Proc. Int. Conf. Image Processing , vol.5 , pp. 501-504
- Aleksic, P.S.¹ Katsaggelos, A.K.²

117
- 0036875048
- Automatic speechreading with applications to human-computer interfaces
- X. Zhang, C. C. Broun, R. M. Mersereau, and M. Clements, "Automatic speechreading with applications to human-computer interfaces," EURASIP J. Appl. Signal Processing, vol. 2002, no. 11, pp. 1228-1247, 2002.
- (2002) EURASIP J. Appl. Signal Processing , vol.2002 , Issue.11 , pp. 1228-1247
- Zhang, X.¹ Broun, C.C.² Mersereau, R.M.³ Clements, M.⁴

118
- 0036875002
- A support vector machine-based dynamic network for visual speech recognition applications
- M. Gordan, C. Kotropoulos, and I. Pitas, "A support vector machine-based dynamic network for visual speech recognition applications," EURASIP J. Appl. Signal Processing, vol. 2002, no. 11, pp. 1248-1259, 2002.
- (2002) EURASIP J. Appl. Signal Processing , vol.2002 , Issue.11 , pp. 1248-1259
- Gordan, M.¹ Kotropoulos, C.² Pitas, I.³

119
- 30344436680
- User authentication via adapted statistical models of face images
- Jan
- F. Cardinaux, C. Sanderson, and S. Bengio, "User authentication via adapted statistical models of face images," IEEE Trans. Signal Processing, vol. 54, no. 1, pp. 361-373, Jan. 2006.
- (2006) IEEE Trans. Signal Processing , vol.54 , Issue.1 , pp. 361-373
- Cardinaux, F.¹ Sanderson, C.² Bengio, S.³

120
- 0033738539
- The NIST speaker recognition evaluation - Overview, methodology, systems, results, perspective
- G. R. Doddington, M. A. Przybycki, A. F. Martin, and D. A. Reynolds, "The NIST speaker recognition evaluation - Overview, methodology, systems, results, perspective," Speech Commun., vol. 31, no. 2-3, pp. 225-254, 2000.
- (2000) Speech Commun , vol.31 , Issue.2-3 , pp. 225-254
- Doddington, G.R.¹ Przybycki, M.A.² Martin, A.F.³ Reynolds, D.A.⁴

121
- 47849133262
- The expected performance curve
- Bonn, Germany
- S. Bengio, J. Mariethoz, and M. Keller, "The expected performance curve," in Int. Conf. Machine Learning, Workshop ROC Analysis Machine Learning, Bonn, Germany, 2005.
- (2005) Int. Conf. Machine Learning, Workshop ROC Analysis Machine Learning
- Bengio, S.¹ Mariethoz, J.² Keller, M.³

122
- 23744485282
- The expected performance curve: A new assessment measure for person authentication
- Toledo, OH
- S. Bengio and J. Mariethoz, "The expected performance curve: A new assessment measure for person authentication," in Proc. Speaker Language Recognition Works. (Odyssey), Toledo, OH, 2004, pp. 279-284.
- (2004) Proc. Speaker Language Recognition Works. (Odyssey) , pp. 279-284
- Bengio, S.¹ Mariethoz, J.²

123
- 84875984350
- Multisensor data fusion
- D. L. Hall and J. Llinas, Eds. Boca Raton, FL: CRC
- D. L. Hall and J. Llinas, "Multisensor data fusion," in Handbook of Multisensor Data Fusion, D. L. Hall and J. Llinas, Eds. Boca Raton, FL: CRC, 2001, pp. 1-10.
- (2001) Handbook of Multisensor Data Fusion , pp. 1-10
- Hall, D.L.¹ Llinas, J.²

124
- 0028259890
- Decision combination in multiple classifier systems
- T. K. Ho, J. J. Hull, and S. N. Srihari, "Decision combination in multiple classifier systems," IEEE Trans. Pattern Anal. Machine Intell., vol. 16, pp. 66-75, 1994.
- (1994) IEEE Trans. Pattern Anal. Machine Intell , vol.16 , pp. 66-75
- Ho, T.K.¹ Hull, J.J.² Srihari, S.N.³

125
- 4544350286
- Introduction
- R. C. Luo and M. G. Kay, Eds. Norwood, NJ: Ablex
- R. C. Luo and M. G. Kay, "Introduction," in Multisensor Integration and Fusion for Intelligent Machines and Systems, R. C. Luo and M. G. Kay, Eds. Norwood, NJ: Ablex, 1995, pp. 1-26.
- (1995) Multisensor Integration and Fusion for Intelligent Machines and Systems , pp. 1-26
- Luo, R.C.¹ Kay, M.G.²

126
- 84947917954
- The M2VTS multimodal face database (release 1.00)
- Crans-Montana, Switzerland
- S. Pigeon and L. Vandendorpe, "The M2VTS multimodal face database (release 1.00)," in Proc. 1st Int. Conf. Audio- and Video-Based Biometric Person Authentication, Crans-Montana, Switzerland, 1997, pp. 403-409.
- (1997) Proc. 1st Int. Conf. Audio- and Video-Based Biometric Person Authentication , pp. 403-409
- Pigeon, S.¹ Vandendorpe, L.²

127
- 0001935972
- XM2VTSDB: Te extended M2VTS database
- Washington, DC
- K. Messer, J. Matas, J. Kittler, J. Luettin, and G. Maitre, "XM2VTSDB: Te extended M2VTS database," in Proc. 2nd Int. Conf. Audio- and Video-Based Biometric Person Authentication, Washington, DC, 1999, pp. 72-77.
- (1999) Proc. 2nd Int. Conf. Audio- and Video-Based Biometric Person Authentication , pp. 72-77
- Messer, K.¹ Matas, J.² Kittler, J.³ Luettin, J.⁴ Maitre, G.⁵

128
- 35248819751
- The BANCA database and evaluation protocol
- Guilford
- E. Bailly-Bailliere, S. Bengio, F. Bimbot, M. Hamouz, J. Kittler, J. Mariethoz, J. Matas, K. Messer, V. Popovici, F. Poree, B. Ruiz, and J.-P. Thiran, "The BANCA database and evaluation protocol," in Proc. Audio- and Video-Based Biometric Person Authentication, Guilford, 2003, pp. 625-638.
- (2003) Proc. Audio- and Video-Based Biometric Person Authentication , pp. 625-638
- Bailly-Bailliere, E.¹ Bengio, S.² Bimbot, F.³ Hamouz, M.⁴ Kittler, J.⁵ Mariethoz, J.⁶ Matas, J.⁷ Messer, K.⁸ Popovici, V.⁹ Poree, F.¹⁰ Ruiz, B.¹¹ Thiran, J.-P.¹²

129
- 33947384251
- Speech and Image Processing Research Group, Dept. of Electrical and Electronic Engineering, Univ, les Swansea
- C. C. Chibelushi, F. Deravi, and J. S. Mason, BT DAVID Database - Internal Rep., Speech and Image Processing Research Group, Dept. of Electrical and Electronic Engineering, Univ, les Swansea, 1996.
- (1996) BT DAVID Database - Internal Rep
- Chibelushi, C.C.¹ Deravi, F.² Mason, J.S.³

130
- 26444562315
- The realistic multi-modal VALID database and visual speaker identification comparison experiments
- T. Kanade, A. K. Jain, and N. K. Ratha, Eds. New York: Springer-Verlag
- N. Fox, B. O'Mullane, and R. B. Reilly, "The realistic multi-modal VALID database and visual speaker identification comparison experiments," in Lecture Notes in Computer Science, T. Kanade, A. K. Jain, and N. K. Ratha, Eds. New York: Springer-Verlag, 2005, vol. 3546, p. 777.
- (2005) Lecture Notes in Computer Science , vol.3546 , pp. 777
- Fox, N.¹ O'Mullane, B.² Reilly, R.B.³

131
- 85009135251
- AVICAR: Audio-visual speech corpus in a car environment
- Jeju, Korea
- B. Lee, M. Hasegawa-Johnson, C. Goudeseune, S. Kamdar, S. Borys, M. Liu, and T. Huang, "AVICAR: Audio-visual speech corpus in a car environment," in Proc. Conf. Spoken Language, Jeju, Korea, 2004.
- (2004) Proc. Conf. Spoken Language
- Lee, B.¹ Hasegawa-Johnson, M.² Goudeseune, C.³ Kamdar, S.⁴ Borys, S.⁵ Liu, M.⁶ Huang, T.⁷

132
- 0036299249
- CUAVE: A new audio-visual database for multimodal human-computer interface research
- Orlando, FL
- E. K. Patterson, S. Gurbuz, Z. Tufekci, and J. N. Gowdy, "CUAVE: A new audio-visual database for multimodal human-computer interface research," in Proc. Int. Conf. Acoustics, Speech and Signal Processing, Orlando, FL, 2002.
- (2002) Proc. Int. Conf. Acoustics, Speech and Signal Processing
- Patterson, E.K.¹ Gurbuz, S.² Tufekci, Z.³ Gowdy, J.N.⁴

133
- 85032752352
- Audiovisual speech processing
- Jan
- T. Chen, "Audiovisual speech processing," IEEE Signal Processing Mag., vol. 18, pp. 9-21, Jan. 2001.
- (2001) IEEE Signal Processing Mag , vol.18 , pp. 9-21
- Chen, T.¹

134
- 0000886386
- Visual speech recognition with stochastic networks
- G. Tesauro, D. Toruetzky, and T. Leen, Eds. Cambridge, MA: MIT Press
- J. R. Movellan, "Visual speech recognition with stochastic networks," in Advances in Neural Information Processing Systems, G. Tesauro, D. Toruetzky, and T. Leen, Eds. Cambridge, MA: MIT Press, 1995, vol. 7.
- (1995) Advances in Neural Information Processing Systems , vol.7
- Movellan, J.R.¹

135
- 0030366433
- Speaker identification by lipreading
- Philadelphia, PA
- J. Luettin, N. Thacker, and S. Beet, "Speaker identification by lipreading," in Proc Int. Conf. Speech and Language Processing, Philadelphia, PA, 1996, pp. 62-64.
- (1996) Proc Int. Conf. Speech and Language Processing , pp. 62-64
- Luettin, J.¹ Thacker, N.² Beet, S.³

136
- 26444587869
- A statistical significance test for person authentication
- Toledo
- S. Bengio and J. Mariethoz, "A statistical significance test for person authentication," in Proc. Speaker and Language Recognition Workshop (Odyssey), Toledo, 2004, pp. 237-244.
- (2004) Proc. Speaker and Language Recognition Workshop (Odyssey) , pp. 237-244
- Bengio, S.¹ Mariethoz, J.²

137
- 33947394973
- Biometric authentication in the e-World
- D. Zhang, Ed. Boston, MA: Kluwer, ch. 16
- N. Poh and J. Korczak, "Biometric authentication in the e-World," in Automated Authentication Using Hybrid Biometric System, D. Zhang, Ed. Boston, MA: Kluwer, 2003, ch. 16.
- (2003) Automated Authentication Using Hybrid Biometric System
- Poh, N.¹ Korczak, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.