SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2002, Pages 303-306

Assessing face and speech consistency for monologue detection in video

Author keywords

[No Author keywords available]

Indexed keywords

AUDIO SYSTEMS; COSINE TRANSFORMS; DATABASE SYSTEMS; FACE RECOGNITION; PROBABILITY DENSITY FUNCTION; SPEECH RECOGNITION; SYNCHRONIZATION;

DIGITAL VIDEOS;

VIDEO SIGNAL PROCESSING;

EID: 0037700834 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/641007.641070 Document Type: Conference Paper

Times cited : (42)

References (10)

1
- 0038198329
- Text retrieval conference (tree) video track. http://trec.nist.gov.
- Text Retrieval Conference (Tree) Video Track

2
- 85013597845
- Eigenlips' for robust speech recognition
- C. Bregler and Y. Konig. 'Eigenlips' for robust speech recognition. In Proc. ICASSP, 1994.
- (1994) Proc. ICASSP
- Bregler, C.¹ Konig, Y.²

3
- 0002595416
- Speaker, environment and channel change detection and clustering via the bayesian information criterion
- S. Chen and P. Gopalakrishnan. Speaker, environment and channel change detection and clustering via the bayesian information criterion. In Proc. DARPA Broadcast News Transcription & Understanding Workshop. 1998.
- (1998) Proc. DARPA Broadcast News Transcription & Understanding Workshop
- Chen, S.¹ Gopalakrishnan, P.²

4
- 0037860595
- Look who's talking: Speaker detection using video and audio correlation
- R. Cutler and L. Davis. Look Who's Talking: Speaker Detection using Video and Audio Correlation. In Proc. ICME, 2000.
- (2000) Proc. ICME
- Cutler, R.¹ Davis, L.²

5
- 84898954418
- Learning joint statistical models for audio-visual fusion and segregation
- J. Fisher III, T. Darrell, W. Freeman, and P. Viola. Learning Joint Statistical Models for Audio-Visual Fusion and Segregation. In Proc. NIPS, 2001.
- (2001) Proc. NIPS
- Fisher J. III¹ Darrell, T.² Freeman, W.³ Viola, P.⁴

6
- 0009622482
- Using audio-visual synchrony to locate sounds
- J. Hershey and J. Movellan. Using audio-visual synchrony to locate sounds. In Proc. NIPS, 1999.
- (1999) Proc. NIPS
- Hershey, J.¹ Movellan, J.²

7
- 67649123507
- Semantic indexing of multimedia using audio, text and visual cues
- G. Iyengar, H. Nock, and C. Neti. Semantic Indexing of Multimedia using Audio, Text and Visual Cues. In Proc. ICME, 2002.
- (2002) Proc. ICME
- Iyengar, G.¹ Nock, H.² Neti, C.³

8
- 0034853041
- Hierarchical discriminant features for audio-visual speech recognition
- G. Potamianos, J. Luettin, and C. Neti. Hierarchical Discriminant Features for Audio-Visual Speech Recognition. In Proc. ICASSP, pages 165-168, 2001.
- (2001) Proc. ICASSP , pp. 165-168
- Potamianos, G.¹ Luettin, J.² Neti, C.³

10
- 84898931254
- Facesync: A linear operator for measuring synchronization of video facial images and audio tracks
- M. Slaney and M. Covell. Facesync: a linear operator for measuring synchronization of video facial images and audio tracks. In Proc. NIPS, 2001.
- (2001) Proc. NIPS
- Slaney, M.¹ Covell, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.