메뉴 건너뛰기




Volumn 15, Issue 10, 2006, Pages 2879-2891

Discriminative analysis of lip motion features for speaker identification and speech-reading

Author keywords

Bayesian discriminative feature selection; Lip motion; Speaker identification; Speech recognition; Temporal discriminative feature selection

Indexed keywords

COMPUTATIONAL GEOMETRY; FEATURE EXTRACTION; MARKOV PROCESSES; MATHEMATICAL MODELS;

EID: 33749187783     PISSN: 10577149     EISSN: None     Source Type: Journal    
DOI: 10.1109/TIP.2006.877528     Document Type: Article
Times cited : (109)

References (42)
  • 3
    • 4544290191 scopus 로고    scopus 로고
    • "Recent advances in the automatic recognition of audio-visual speech"
    • Sep
    • G. Potamianos, C. Neti, G. Gravier,A. Garg, and A. W. Senior, "Recent advances in the automatic recognition of audio-visual speech," Proc. IEEE, vol. 91, no. 9, pp. 1306-1326, Sep. 2003.
    • (2003) Proc. IEEE , vol.91 , Issue.9 , pp. 1306-1326
    • Potamianos, G.1    Neti, C.2    Gravier, G.3    Garg, A.4    Senior, A.W.5
  • 4
    • 2542475258 scopus 로고    scopus 로고
    • "Recognition of visual speech elements using adaptively boosted hidden Markov models"
    • May
    • S. W. Foo, Y. Lian, and L. Dong, "Recognition of visual speech elements using adaptively boosted hidden Markov models," IEEE Trans. Circuits Syst. Video Technol., vol. 14, no. 5, pp. 693-705, May 2004.
    • (2004) IEEE Trans. Circuits Syst. Video Technol. , vol.14 , Issue.5 , pp. 693-705
    • Foo, S.W.1    Lian, Y.2    Dong, L.3
  • 5
    • 85032752352 scopus 로고    scopus 로고
    • "Audiovisual speech processing"
    • Jan
    • T. Chen, "Audiovisual speech processing," IEEE Signal Process. Mag., vol. 18, pp. 9-21, Jan. 2001.
    • (2001) IEEE Signal Process. Mag. , vol.18 , pp. 9-21
    • Chen, T.1
  • 13
    • 0034270644 scopus 로고    scopus 로고
    • "Audio-visual speech modeling for continuous speech recognition"
    • Sep
    • S. Dupont and J. Luettin, "Audio-visual speech modeling for continuous speech recognition," IEEE Trans. Multimedia, vol. 2, no. 3, pp. 141-151, Sep. 2000.
    • (2000) IEEE Trans. Multimedia , vol.2 , Issue.3 , pp. 141-151
    • Dupont, S.1    Luettin, J.2
  • 14
    • 26844533276 scopus 로고    scopus 로고
    • "Multimodal speaker identification using an adaptive classifier cascade based on modality reliability"
    • Oct
    • E. Erzin, Y. Yemez, and A. M. Tekalp, "Multimodal speaker identification using an adaptive classifier cascade based on modality reliability," IEEE Trans. Multimedia, vol. 7, no. 5, pp. 840-852, Oct. 2005.
    • (2005) IEEE Trans. Multimedia , vol.7 , Issue.5 , pp. 840-852
    • Erzin, E.1    Yemez, Y.2    Tekalp, A.M.3
  • 15
    • 15744362948 scopus 로고    scopus 로고
    • "Robust multi-modal person identification with tolerance of facial expression"
    • N. A. Fox and R. B. Reilly, "Robust multi-modal person identification with tolerance of facial expression," in IEEE Int. Conf. Systems, Man and Cybernetics, 2004, vol. 1, pp. 580-585.
    • (2004) IEEE Int. Conf. Systems, Man and Cybernetics , vol.1 , pp. 580-585
    • Fox, N.A.1    Reilly, R.B.2
  • 18
    • 0035394653 scopus 로고    scopus 로고
    • "Adaptive fusion of speech and lip information for robust speaker identification"
    • Jul
    • T.Wark and S. Sridharan, "Adaptive fusion of speech and lip information for robust speaker identification," Digital Signal Process., vol. 11, no. 3, pp. 169-186, Jul. 2001.
    • (2001) Digital Signal Process. , vol.11 , Issue.3 , pp. 169-186
    • Wark, T.1    Sridharan, S.2
  • 21
    • 0033899298 scopus 로고    scopus 로고
    • "Bioid: A multimodal biometric identification system"
    • Feb
    • R. W. Frischholz and U. Dieckmann, "Bioid: A multimodal biometric identification system," IEEE Computer, vol. 33, no. 2, pp. 64-68, Feb. 2000.
    • (2000) IEEE Computer , vol.33 , Issue.2 , pp. 64-68
    • Frischholz, R.W.1    Dieckmann, U.2
  • 24
    • 0031233424 scopus 로고    scopus 로고
    • "Speaker recognition: A tutorial"
    • Sep
    • J. P. Campbell, "Speaker recognition: A tutorial," Proc. IEEE, vol. 85, no. 9, pp. 1437-1462, Sep. 1997.
    • (1997) Proc. IEEE , vol.85 , Issue.9 , pp. 1437-1462
    • Campbell, J.P.1
  • 25
    • 0029355999 scopus 로고
    • "Speaker identification and verification using gaussian mixture speaker models"
    • D. A. Reynolds, "Speaker identification and verification using gaussian mixture speaker models," Speech Commun., vol. 17, pp. 91-108, 1995.
    • (1995) Speech Commun. , vol.17 , pp. 91-108
    • Reynolds, D.A.1
  • 26
    • 0029489292 scopus 로고
    • "Robust multiresolution estimation of parametric motion models"
    • Image Represent., Dec
    • J.-M. Odobez and P. Bouthemy, "Robust multiresolution estimation of parametric motion models," J. Vis. Commun. Image Represent., vol. 6, no. 4, pp. 348-365, Dec. 1995.
    • (1995) J. Vis. Commun. , vol.6 , Issue.4 , pp. 348-365
    • Odobez, J.-M.1    Bouthemy, P.2
  • 27
    • 0033738345 scopus 로고    scopus 로고
    • "A quadratic motion-based object-oriented video codec"
    • Y. Yemez, B. Sankur, and E. Anarim, "A quadratic motion-based object-oriented video codec," Signal Process.: Image Commun., vol. 15, pp. 729-766, 2000.
    • (2000) Signal Process.: Image Commun. , vol.15 , pp. 729-766
    • Yemez, Y.1    Sankur, B.2    Anarim, E.3
  • 28
    • 6344240885 scopus 로고    scopus 로고
    • "Video coding using the H.264/MPEG-4 AVC compression standard"
    • A. Puri, X. Chen, and A. Luthra, "Video coding using the H.264/MPEG-4 AVC compression standard," Signal Process.: Imag Commun., vol. 19, pp. 793-849, 2004.
    • (2004) Signal Process.: Imag Commun. , vol.19 , pp. 793-849
    • Puri, A.1    Chen, X.2    Luthra, A.3
  • 29
    • 0036613897 scopus 로고    scopus 로고
    • "Modelling and segmentation of lip area in face images"
    • Jun
    • M. Sadeghi, J. Kittler, and K. Messer, "Modelling and segmentation of lip area in face images," IEE Proc. Vis. Image Signal Process., vol. 149, no. 3, pp. 179-184, Jun. 2002.
    • (2002) IEE Proc. Vis. Image Signal Process. , vol.149 , Issue.3 , pp. 179-184
    • Sadeghi, M.1    Kittler, J.2    Messer, K.3
  • 30
    • 1242263389 scopus 로고    scopus 로고
    • "Lip Iimage segmentation using fuzzy clustering incorporating an elliptic shape function"
    • Jan
    • S.-H. Leung, S.-L. Wang, and W.-H. Lau, "Lip Iimage segmentation using fuzzy clustering incorporating an elliptic shape function," IEEE Trans. Image Process., vol. 13, no. 1, pp. 51-62, Jan. 2004.
    • (2004) IEEE Trans. Image Process. , vol.13 , Issue.1 , pp. 51-62
    • Leung, S.-H.1    Wang, S.-L.2    Lau, W.-H.3
  • 37
    • 0002836012 scopus 로고
    • "An iterative image restoration technique with an application to stereo vision"
    • B. Lucas and T. Kanade, "An iterative image restoration technique with an application to stereo vision," in Proc. DARPA IU Workshop, 1981, pp. 121-130.
    • (1981) Proc. DARPA IU Workshop , pp. 121-130
    • Lucas, B.1    Kanade, T.2
  • 39
    • 0004524499 scopus 로고    scopus 로고
    • "An approach to statistical lip modeling for speaker identification via chromatic feature extraction"
    • T. J. Wark, S. Sridharan, and V. Chandran, "An approach to statistical lip modeling for speaker identification via chromatic feature extraction," in Proc. Int. Conf. Pattern Recognition, 1998, vol. 1, pp. 123-125.
    • (1998) Proc. Int. Conf. Pattern Recognition , vol.1 , pp. 123-125
    • Wark, T.J.1    Sridharan, S.2    Chandran, V.3
  • 42
    • 18844422688 scopus 로고    scopus 로고
    • New York: Springer Verlag, 2005, ch. Joint Audio-Video Processing for Robust Biometric Speaker Identification in Car
    • E. Erzin, Y. Yemez, and A. M. Tekalp, DSP in Mobile and Vehicular Systems. New York: Springer Verlag, 2005, ch. Joint Audio-Video Processing for Robust Biometric Speaker Identification in Car.
    • DSP in Mobile and Vehicular Systems
    • Erzin, E.1    Yemez, Y.2    Tekalp, A.M.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.