메뉴 건너뛰기




Volumn , Issue , 2002, Pages 303-306

Assessing face and speech consistency for monologue detection in video

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO SYSTEMS; COSINE TRANSFORMS; DATABASE SYSTEMS; FACE RECOGNITION; PROBABILITY DENSITY FUNCTION; SPEECH RECOGNITION; SYNCHRONIZATION;

EID: 0037700834     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/641007.641070     Document Type: Conference Paper
Times cited : (42)

References (10)
  • 2
    • 85013597845 scopus 로고
    • Eigenlips' for robust speech recognition
    • C. Bregler and Y. Konig. 'Eigenlips' for robust speech recognition. In Proc. ICASSP, 1994.
    • (1994) Proc. ICASSP
    • Bregler, C.1    Konig, Y.2
  • 4
    • 0037860595 scopus 로고    scopus 로고
    • Look who's talking: Speaker detection using video and audio correlation
    • R. Cutler and L. Davis. Look Who's Talking: Speaker Detection using Video and Audio Correlation. In Proc. ICME, 2000.
    • (2000) Proc. ICME
    • Cutler, R.1    Davis, L.2
  • 5
    • 84898954418 scopus 로고    scopus 로고
    • Learning joint statistical models for audio-visual fusion and segregation
    • J. Fisher III, T. Darrell, W. Freeman, and P. Viola. Learning Joint Statistical Models for Audio-Visual Fusion and Segregation. In Proc. NIPS, 2001.
    • (2001) Proc. NIPS
    • Fisher J. III1    Darrell, T.2    Freeman, W.3    Viola, P.4
  • 6
    • 0009622482 scopus 로고    scopus 로고
    • Using audio-visual synchrony to locate sounds
    • J. Hershey and J. Movellan. Using audio-visual synchrony to locate sounds. In Proc. NIPS, 1999.
    • (1999) Proc. NIPS
    • Hershey, J.1    Movellan, J.2
  • 7
    • 67649123507 scopus 로고    scopus 로고
    • Semantic indexing of multimedia using audio, text and visual cues
    • G. Iyengar, H. Nock, and C. Neti. Semantic Indexing of Multimedia using Audio, Text and Visual Cues. In Proc. ICME, 2002.
    • (2002) Proc. ICME
    • Iyengar, G.1    Nock, H.2    Neti, C.3
  • 8
    • 0034853041 scopus 로고    scopus 로고
    • Hierarchical discriminant features for audio-visual speech recognition
    • G. Potamianos, J. Luettin, and C. Neti. Hierarchical Discriminant Features for Audio-Visual Speech Recognition. In Proc. ICASSP, pages 165-168, 2001.
    • (2001) Proc. ICASSP , pp. 165-168
    • Potamianos, G.1    Luettin, J.2    Neti, C.3
  • 10
    • 84898931254 scopus 로고    scopus 로고
    • Facesync: A linear operator for measuring synchronization of video facial images and audio tracks
    • M. Slaney and M. Covell. Facesync: a linear operator for measuring synchronization of video facial images and audio tracks. In Proc. NIPS, 2001.
    • (2001) Proc. NIPS
    • Slaney, M.1    Covell, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.