Volume 56, Issue 9, 2007, Pages 1169-1175

Synergy of lip-motion and acoustic features in biometric speech and speaker recognition

Author keywords

Biometrics; GMM; Lip motion; Lip reading; Motion estimation; Normal image flow; Normal image velocity; Speaker recognition; Speech recognition; SVM

Indexed keywords

AUDIO SYSTEMS; BIOMETRICS; MOTION ESTIMATION; SUPPORT VECTOR MACHINES; VERIFICATION

EID: 34548205797     PISSN: 00189340     EISSN: None     Source Type: Journal    
DOI: 10.1109/TC.2007.1074     Document Type: Article
Times cited: 32

References (40)
  • 2
    • 0026202914
    • Multidimensional Orientation Estimation with Applications to Texture Analysis of Optical Flow
    • J. Bigun, G. Granlund, and J. Wiklund, "Multidimensional Orientation Estimation with Applications to Texture Analysis of Optical Flow," IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 8, pp. 775-790, Aug. 1991.
    • (1991) IEEE Trans. Pattern Analysis and Machine Intelligence, vol. 13, no. 8, pp. 775-790
    • Bigun, J.; Granlund, G.; Wiklund, J.
  • 4
    • 27144489164
    • A Tutorial on Support Vector Machines for Pattern Recognition
    • C.J. Burges, "A Tutorial on Support Vector Machines for Pattern Recognition," Data Mining and Knowledge Discovery, vol. 2, no. 2, pp. 121-167, 1998.
    • (1998) Data Mining and Knowledge Discovery, vol. 2, no. 2, pp. 121-167
    • Burges, C.J.
  • 6
    • 85032752352
    • Audiovisual Speech Processing
    • T. Chen, "Audiovisual Speech Processing," IEEE Signal Processing Magazine, vol. 18, no. 1, pp. 9-21, 2001.
    • (2001) IEEE Signal Processing Magazine, vol. 18, no. 1, pp. 9-21
    • Chen, T.
  • 7
    • 0036502797
    • A Review of Speech-Based Bimodal Recognition
    • C. Chibelushi, F. Deravi, and J. Mason, "A Review of Speech-Based Bimodal Recognition," IEEE Trans. Multimedia, vol. 4, no. 1, pp. 23-37, 2002.
    • (2002) IEEE Trans. Multimedia, vol. 4, no. 1, pp. 23-37
    • Chibelushi, C.; Deravi, F.; Mason, J.
  • 9
    • 0019053271
    • Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences
    • S. Davis and P. Mermelstein, "Comparison of Parametric Representations for Monosyllabic Word Recognition in Continuously Spoken Sentences," IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357-366, 1980.
    • (1980) IEEE Trans. Acoustics, Speech, and Signal Processing, vol. 28, no. 4, pp. 357-366
    • Davis, S.; Mermelstein, P.
  • 10
    • 0034224204
    • Optical Flow Constraints on Deformable Models with Applications to Face Tracking
    • D. DeCarlo and D. Metaxas, "Optical Flow Constraints on Deformable Models with Applications to Face Tracking," Int'l J. Computer Vision, vol. 38, no. 2, pp. 99-127, 2000.
    • (2000) Int'l J. Computer Vision, vol. 38, no. 2, pp. 99-127
    • DeCarlo, D.; Metaxas, D.
  • 14
    • 0028204659
    • Speaker Recognition Using Neural Networks and Conventional Classifiers
    • K. Farrell, R. Mammone, and K. Assaleh, "Speaker Recognition Using Neural Networks and Conventional Classifiers," IEEE Trans. Speech and Audio Processing, vol. 2, no. 1, pp. 194-205, 1994.
    • (1994) IEEE Trans. Speech and Audio Processing, vol. 2, no. 1, pp. 194-205
    • Farrell, K.; Mammone, R.; Assaleh, K.
  • 16
    • 0018026724
    • In Search of a General Picture Processing Operator
    • G.H. Granlund, "In Search of a General Picture Processing Operator," Computer Graphics and Image Processing, vol. 8, no. 2, pp. 155-173, 1978.
    • (1978) Computer Graphics and Image Processing, vol. 8, no. 2, pp. 155-173
    • Granlund, G.H.
  • 17
    • 34047263009
    • Visual Model Structures and Synchrony Constraints for Audio-Visual Speech Recognition
    • T.J. Hazen, "Visual Model Structures and Synchrony Constraints for Audio-Visual Speech Recognition," IEEE Trans. Audio, Speech, and Language Processing, vol. 14, no. 3, pp. 1082-1089, 2006.
    • (2006) IEEE Trans. Audio, Speech, and Language Processing, vol. 14, no. 3, pp. 1082-1089
    • Hazen, T.J.
  • 18
    • 0019597413
    • Determining Optical Flow
    • B. Horn and B. Schunck, "Determining Optical Flow," J. Artificial Intelligence, vol. 17, no. 1, pp. 185-203, 1981.
    • (1981) J. Artificial Intelligence, vol. 17, no. 1, pp. 185-203
    • Horn, B.; Schunck, B.
  • 22
    • 0019647180
    • An Iterative Image Registration Technique with an Application to Stereo Vision
    • B.D. Lucas and T. Kanade, "An Iterative Image Registration Technique with an Application to Stereo Vision," Proc. Int'l Joint Conf. Artificial Intelligence, pp. 674-679, 1981.
    • (1981) Proc. Int'l Joint Conf. Artificial Intelligence, pp. 674-679
    • Lucas, B.D.; Kanade, T.
  • 23
    • 20444375102
    • Integration Strategies for Audio-Visual Speech Processing: Applied to Text-Dependent Speaker Recognition
    • S. Lucey, T. Chen, S. Sridharan, and V. Chandran, "Integration Strategies for Audio-Visual Speech Processing: Applied to Text-Dependent Speaker Recognition," IEEE Trans. Multimedia, vol. 7, no. 3, pp. 495-506, 2005.
    • (2005) IEEE Trans. Multimedia, vol. 7, no. 3, pp. 495-506
    • Lucey, S.; Chen, T.; Sridharan, S.; Chandran, V.
  • 24
    • 34548282836
    • J. Luettin and G. Maitre, "Evaluation Protocol for the Extended M2VTS Database xm2vtsdb," IDIAP Communication 98-054, Technical Report R R-21, number = IDIAP - 1998, 1998.
  • 26
    • 0025750892
    • Automatic Lip-Reading by Optical-Flow Analysis
    • K. Mase and A. Pentland, "Automatic Lip-Reading by Optical-Flow Analysis," Systems and Computers in Japan, vol. 22, no. 6, pp. 67-76, 1991.
    • (1991) Systems and Computers in Japan, vol. 22, no. 6, pp. 67-76
    • Mase, K.; Pentland, A.
  • 29
    • 0033884858
    • Speaker Verification Using Adapted Gaussian Mixture Models
    • D. Reynolds, T. Quatieri, and R.B. Dunn, "Speaker Verification Using Adapted Gaussian Mixture Models," Digital Signal Processing, vol. 10, nos. 1-3, pp. 19-41, 2000.
    • (2000) Digital Signal Processing, vol. 10, nos. 1-3, pp. 19-41
    • Reynolds, D.; Quatieri, T.; Dunn, R.B.
  • 30
    • 0029209272
    • Robust Text-Independent Speaker Identification Using Gaussian Mixture Models
    • D. Reynolds and R. Rose, "Robust Text-Independent Speaker Identification Using Gaussian Mixture Models," IEEE Trans. Speech and Audio Processing, vol. 3, no. 1, pp. 72-83, 1995.
    • (1995) IEEE Trans. Speech and Audio Processing, vol. 3, no. 1, pp. 72-83
    • Reynolds, D.; Rose, R.
  • 35
    • 0031341829
    • Multisensor Data Fusion
    • P. Varshney, "Multisensor Data Fusion," Electronics and Comm. Eng. J., vol. 9, no. 6, pp. 245-253, 1997.
    • (1997) Electronics and Comm. Eng. J., vol. 9, no. 6, pp. 245-253
    • Varshney, P.
  • 38
    • 0025474465
    • Performance-Driven Facial Animation
    • L. Williams, "Performance-Driven Facial Animation," Proc. SIGGRAPH '90, pp. 235-242, 1990.
    • (1990) Proc. SIGGRAPH '90, pp. 235-242
    • Williams, L.
  • 39
    • 0032179320
    • Lip Movement Synthesis from Speech Based on Hidden Markov Models
    • E. Yamamoto, S. Nakamura, and K. Shikano, "Lip Movement Synthesis from Speech Based on Hidden Markov Models," J. Speech Comm., vol. 26, no. 1, pp. 105-115, 1998.
    • (1998) J. Speech Comm., vol. 26, no. 1, pp. 105-115
    • Yamamoto, E.; Nakamura, S.; Shikano, K.
  • 40
    • 34548203688
    • S. Young, D. Kershaw, J. Odell, D. Ollason, V. Valtchev, and P. Woodland, The HTK Book (for HTK Version 3.0), http://htk.eng.cam.ac.uk/docs/docs.shtml, 2000.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.