Volume 9, Issue 4, 2007, Pages 701-713

Robust biometric person identification using automatic classifier fusion of speech, mouth, and face experts

Author keywords

Biometric fusion; Expert reliability; Hidden Markov models; Image information loss; Mouth features; Multi modal; Person recognition; Robustness; Tri expert

Indexed keywords

BIOMETRIC FUSION; EXPERT RELIABILITY; PERSON IDENTIFICATION; PERSON RECOGNITION; TRI-EXPERT;

EID: 34249312658     PISSN: 15209210     EISSN: None     Source Type: Journal    
DOI: 10.1109/TMM.2007.893339     Document Type: Article
Times cited: 57

References (52)
  • 1
  • 2
    • 4544228318
    • Identity verification using speech and face information
    • C. Sanderson and K. K. Paliwal, "Identity verification using speech and face information," Dig. Signal Process., vol. 14, no. 5, pp. 449-480, Sep. 2004.
    • (2004) Dig. Signal Process , vol.14 , Issue.5 , pp. 449-480
    • Sanderson, C.1    Paliwal, K.K.2
  • 3
    • 0032646868
    • The use of speech and lip modalities for robust speaker verification under adverse conditions
    • T. J. Wark, S. Sridharan, and V. Chandran, "The use of speech and lip modalities for robust speaker verification under adverse conditions," in Proc. IEEE Int. Conf. Multimedia Computing and Systems, Florence, Italy, Jun. 1999, vol. 1, pp. 812-816.
    • (1999) Proc. IEEE Int. Conf. Multimedia Computing and Systems , vol.1 , pp. 812-816
    • Wark, T.J.1    Sridharan, S.2    Chandran, V.3
  • 4
    • 27744546990
    • On transforming statistical models for non-frontal face verification
    • C. Sanderson, S. Bengio, and Y. Gao, "On transforming statistical models for non-frontal face verification," Pattern Recognit., vol. 39, no. 2, pp. 288-302, 2006.
    • (2006) Pattern Recognit , vol.39 , Issue.2 , pp. 288-302
    • Sanderson, C.1    Bengio, S.2    Gao, Y.3
  • 5
    • 34249279656
    • D. Blackburn, M. Bone, and P. J. Phillips, "Facial Recognition Vendor Test 2000: Evaluation Report," Tech. Rep., Feb. 2001. [Online]. Available: http://www.frvt.org
  • 8
    • 4544290191
    • Recent advances in the automatic recognition of audiovisual speech
    • G. Potamianos, C. Neti, G. Gravier, A. Garg, and A. W. Senior, "Recent advances in the automatic recognition of audiovisual speech," Proc. IEEE, vol. 91, no. 9, pp. 1306-1324, Sep. 2003.
    • (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1324
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.W.5
  • 9
    • 0036502797
    • A review of speech-based bimodal recognition
    • C. C. Chibelushi, F. Deravi, and J. S. D. Mason, "A review of speech-based bimodal recognition," IEEE Trans. Multimedia, vol. 4, no. 1, pp. 23-35, Mar. 2002.
    • (2002) IEEE Trans. Multimedia , vol.4 , Issue.1 , pp. 23-35
    • Chibelushi, C.C.1    Deravi, F.2    Mason, J.S.D.3
  • 10
    • 0031223878
    • SESAM: A biometric person identification system using sensor fusion
    • U. Dieckmann, P. Plankensteiner, and T. Wagner, "SESAM: A biometric person identification system using sensor fusion," Pattern Recognit. Lett., vol. 18, no. 9, pp. 827-833, Sept. 1997.
    • (1997) Pattern Recognit. Lett , vol.18 , Issue.9 , pp. 827-833
    • Dieckmann, U.1    Plankensteiner, P.2    Wagner, T.3
  • 11
    • 0345565788
    • Multimodal speaker identification with audio-video processing
    • Y. Yemez, A. Kanak, E. Erzin, and A. M. Tekalp, "Multimodal speaker identification with audio-video processing," in Proc. Int. Conf. Image Processing, Barcelona, Spain, Sep. 2003, vol. 3, pp. 5-8.
    • (2003) Proc. Int. Conf. Image Processing , vol.3 , pp. 5-8
    • Yemez, Y.1    Kanak, A.2    Erzin, E.3    Tekalp, A.M.4
  • 12
    • 0032594952
    • Fusion of face and speech data for person identity verification
    • S. Ben-Yacoub, Y. Abdeljaoued, and E. Mayoraz, "Fusion of face and speech data for person identity verification," IEEE Trans. Neural Netw., vol. 10, no. 5, pp. 1065-1074, May 1999.
    • (1999) IEEE Trans. Neural Netw , vol.10 , Issue.5 , pp. 1065-1074
    • Ben-Yacoub, S.1    Abdeljaoued, Y.2    Mayoraz, E.3
  • 13
    • 0035394653
    • Adaptive fusion of speech and lip information for robust speaker identification
    • T. Wark and S. Sridharan, "Adaptive fusion of speech and lip information for robust speaker identification," Dig. Sig. Process., vol. 11, no. 3, pp. 169-186, July 2001.
    • (2001) Dig. Sig. Process , vol.11 , Issue.3 , pp. 169-186
    • Wark, T.1    Sridharan, S.2
  • 15
  • 17
    • 0036448934
    • Learning user-specific parameters in a multi-biometric system
    • A. K. Jain and A. Ross, "Learning user-specific parameters in a multi-biometric system," in Proc. Int. Conf. Image Processing., Rochester, NY, Sep. 22-25, 2002, vol. 1, pp. 57-60.
    • (2002) Proc. Int. Conf. Image Processing , vol.1 , pp. 57-60
    • Jain, A.K.1    Ross, A.2
  • 18
    • 0029209272
    • Robust text-independent speaker identification using Gaussian mixture speaker models
    • D. A. Reynolds and R. C. Rose, "Robust text-independent speaker identification using Gaussian mixture speaker models," IEEE Trans. Speech Audio Process., vol. 3, no. 1, pp. 72-83, Jan. 1995.
    • (1995) IEEE Trans. Speech Audio Process , vol.3 , Issue.1 , pp. 72-83
    • Reynolds, D.A.1    Rose, R.C.2
  • 19
    • 34249338888
    • S. Young, G. Evermann, D. Kershaw, G. Moore, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book (for HTK Version 3.1). Cambridge, U.K.: Cambridge Univ. Eng. Dept.: Microsoft Corporation, 2001.
  • 20
    • 20444375102
    • Integration strategies for audio-visual speech processing: Applied to text dependent speaker recognition
    • S. Lucey, T. Chen, S. Sridharan, and V. Chandran, "Integration strategies for audio-visual speech processing: Applied to text dependent speaker recognition," IEEE Trans. Multimedia, vol. 7, no. 3, pp. 495-506, Jun. 2005.
    • (2005) IEEE Trans. Multimedia , vol.7 , Issue.3 , pp. 495-506
    • Lucey, S.1    Chen, T.2    Sridharan, S.3    Chandran, V.4
  • 21
    • 0024610919
    • A tutorial on hidden Markov models and selected applications in speech recognition
    • L. R. Rabiner, "A tutorial on hidden Markov models and selected applications in speech recognition," Proc. IEEE, vol. 77, no. 2, pp. 257-286, Feb. 1989.
    • (1989) Proc. IEEE , vol.77 , Issue.2 , pp. 257-286
    • Rabiner, L.R.1
  • 23
    • 0032314380
    • An image transform approach for HMM based automatic lipreading
    • G. Potamianos, H. Graf, and E. Cosatto, "An image transform approach for HMM based automatic lipreading," in Proc. IEEE Int. Conf. Image Processing, Chicago, IL, Oct. 1998, vol. 3, pp. 173-177.
    • (1998) Proc. IEEE Int. Conf. Image Processing , vol.3 , pp. 173-177
    • Potamianos, G.1    Graf, H.2    Cosatto, E.3
  • 24
    • 84908265391
    • A comparison of model and transform-based visual features for audio-visual LVCSR
    • I. Matthews, G. Potamianos, C. Neti, and J. Luettin, "A comparison of model and transform-based visual features for audio-visual LVCSR," in Proc. IEEE Int. Conf. Multimedia and Expo, Tokyo, Japan, Aug. 2001, pp. 825-828.
    • (2001) Proc. IEEE Int. Conf. Multimedia and Expo , pp. 825-828
    • Matthews, I.1    Potamianos, G.2    Neti, C.3    Luettin, J.4
  • 27
    • 0141515102
    • Support vector regression and classification based multi-view face detection and recognition
    • Y. Li, S. Gong, and H. Liddell, "Support vector regression and classification based multi-view face detection and recognition," in Proc. 4th IEEE Int. Conf. Automatic Face and Gesture Recognition, Grenoble, France, Mar. 2000, pp. 300-305.
    • (2000) Proc. 4th IEEE Int. Conf. Automatic Face and Gesture Recognition , pp. 300-305
    • Li, Y.1    Gong, S.2    Liddell, H.3
  • 28
    • 0031185845
    • Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection
    • P. N. Belhumeur, J. P. Hespanha, and D. J. Kriegman, "Eigenfaces vs. Fisherfaces: Recognition using class specific linear projection," IEEE Trans. Pattern Anal. Mach. Intell., vol. 19, no. 7, pp. 711-720, Jul. 1997.
    • (1997) IEEE Trans. Pattern Anal. Mach. Intell , vol.19 , Issue.7 , pp. 711-720
    • Belhumeur, P.N.1    Hespanha, J.P.2    Kriegman, D.J.3
  • 29
    • 84975551912
    • Low-dimensional procedure for the characterization of human faces
    • L. Sirovich and M. Kirby, "Low-dimensional procedure for the characterization of human faces," J. Opt. Soc. Amer. A, vol. 4, no. 3, pp. 519-524, 1987.
    • (1987) J. Opt. Soc. Amer. A , vol.4 , Issue.3 , pp. 519-524
    • Sirovich, L.1    Kirby, M.2
  • 30
    • 0026065565
    • Eigenfaces for recognition
    • M. Turk and A. Pentland, "Eigenfaces for recognition," J. Cog. Neurosci., vol. 3, no. 1, pp. 71-86, 1991.
    • (1991) J. Cog. Neurosci , vol.3 , Issue.1 , pp. 71-86
    • Turk, M.1    Pentland, A.2
  • 31
    • 0030737097
    • Face recognition: A convolutional neural-network approach
    • S. Lawrence, C. L. Giles, A. C. Tsoi, and A. D. Back, "Face recognition: A convolutional neural-network approach," IEEE Trans. Neural Netw., vol. 8, no. 1, pp. 98-113, 1997.
    • (1997) IEEE Trans. Neural Netw , vol.8 , Issue.1 , pp. 98-113
    • Lawrence, S.1    Giles, C.L.2    Tsoi, A.C.3    Back, A.D.4
  • 33
    • 0031187270
    • Automatic interpretation and coding of face images using flexible models
    • A. Lanitis, C. J. Taylor, and T. F. Cootes, "Automatic interpretation and coding of face images using flexible models," IEEE Trans. Pattern Anal. Mach. Intell., vol. 19, no. 7, pp. 743-756, Jun. 1997.
    • (1997) IEEE Trans. Pattern Anal. Mach. Intell , vol.19 , Issue.7 , pp. 743-756
    • Lanitis, A.1    Taylor, C.J.2    Cootes, T.F.3
  • 34
    • 0025972453
    • Deformable templates for face recognition
    • A. Yuille, "Deformable templates for face recognition," J. Cog. Neurosci., vol. 3, no. 1, pp. 59-70, 1991.
    • (1991) J. Cog. Neurosci , vol.3 , Issue.1 , pp. 59-70
    • Yuille, A.1
  • 38
    • 0000209334
    • Local feature analysis: A general statistical theory for object representation
    • P. Penev and J. Atick, "Local feature analysis: A general statistical theory for object representation," Network: Comput. in Neural Syst., vol. 7, no. 3, pp. 477-500, 1996.
    • (1996) Network: Comput. in Neural Syst , vol.7 , Issue.3 , pp. 477-500
    • Penev, P.1    Atick, J.2
  • 39
    • 13444309366
    • Comparing and combining depth and texture cues for face recognition
    • C. BenAbdelkader and P. Griffin, "Comparing and combining depth and texture cues for face recognition," Image Vis. Comput., vol. 23, pp. 339-352, 2005.
    • (2005) Image Vis. Comput , vol.23 , pp. 339-352
    • BenAbdelkader, C.1    Griffin, P.2
  • 43
    • 0034270644
    • Audio-visual speech modeling for continuous speech recognition
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, no. 3, pp. 141-151, Sep. 2000.
    • (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 44
    • 4544224863
    • A stream-weight optimization method for audio-visual speech recognition using multi-stream HMMs
    • S. Tamura, K. Iwano, and S. Furui, "A stream-weight optimization method for audio-visual speech recognition using multi-stream HMMs," in Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing, Montreal, Canada, May 2004, vol. 1, pp. 857-860.
    • (2004) Proc. IEEE Int. Conf. Acoustics, Speech, and Signal Processing , vol.1 , pp. 857-860
    • Tamura, S.1    Iwano, K.2    Furui, S.3
  • 45
    • 15744362948
    • Robust multi-modal person identification with tolerance of facial expression
    • N. A. Fox and R. B. Reilly, "Robust multi-modal person identification with tolerance of facial expression," in Proc. IEEE Int. Conf. Systems, Man and Cybernetics, The Hague, The Netherlands, Oct. 10-13, 2004, vol. 1, pp. 580-585.
    • (2004) Proc. IEEE Int. Conf. Systems, Man and Cybernetics , vol.1 , pp. 580-585
    • Fox, N.A.1    Reilly, R.B.2
  • 46
    • 0033899298
    • BioID: A multimodal biometric identification system
    • R. W. Frischholz and U. Dieckmann, "BioID: A multimodal biometric identification system," Computer, vol. 33, no. 2, pp. 64-68, 2000.
    • (2000) Computer , vol.33 , Issue.2 , pp. 64-68
    • Frischholz, R.W.1    Dieckmann, U.2
  • 47
    • 25144471298
    • Score normalization in multi-modal biometric systems
    • A. Jain, K. Nandakumar, and A. Ross, "Score normalization in multi-modal biometric systems," Pattern Recognit., 2005.
    • (2005) Pattern Recognit
    • Jain, A.1    Nandakumar, K.2    Ross, A.3
  • 48
    • 0036874527
    • Noise adaptive stream weighting in audio-visual speech recognition
    • M. Heckmann, F. Berthommier, and K. Kroschel, "Noise adaptive stream weighting in audio-visual speech recognition," EURASIP J. Appl. Signal Process., vol. 2002, no. 11, pp. 1260-1273, Nov. 2002.
    • (2002) EURASIP J. Appl. Signal Process , vol.2002 , Issue.11 , pp. 1260-1273
    • Heckmann, M.1    Berthommier, F.2    Kroschel, K.3
  • 51
    • 34249312819
    • A. P. Varga, H. J. M. Steeneken, M. Tomlinson, and D. Jones, "The Noisex-92 study on the effect of additive noise on automatic speech recognition," Speech Res. Unit, Defence Research Agency, Malvern, U.K., Tech. Rep., 1992.
  • 52
    • 0004524499
    • An approach to statistical lip modelling for speaker identification via chromatic feature extraction
    • T. Wark, S. Sridharan, and V. Chandran, "An approach to statistical lip modelling for speaker identification via chromatic feature extraction," in Proc. 14th Int. Conf. Pattern Recognition, Brisbane, Australia, Aug. 1998, vol. 1, pp. 123-125.
    • (1998) Proc. 14th Int. Conf. Pattern Recognition , vol.1 , pp. 123-125
    • Wark, T.1    Sridharan, S.2    Chandran, V.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.