SCOPUS 정보 검색 플랫폼

Volumn 1, Issue , 2003, Pages I329-I332

Audio-visual synchrony for detection of monologues in video archives

Author keywords

[No Author keywords available]

Indexed keywords

BIOMETRICS; INFORMATION USE; VIDEO SIGNAL PROCESSING;

AUDIO AND VIDEO; AUDIO CHANNELS; FACE-BASED BIOMETRICS; FACIAL MOVEMENTS; SPEAKER LOCALIZATION; SPEECH INFORMATION; VIDEO CHANNELS; VIDEO RETRIEVAL;

SPEECH;

EID: 84908470296 PISSN: 19457871 EISSN: 1945788X Source Type: Conference Proceeding
DOI: 10.1109/ICME.2003.1220921 Document Type: Conference Paper

Times cited : (24)

References (11)

1
- 84908473847
- "Text retrieval conference (tree) video track, " http://trec.nist.gov.
- Text Retrieval Conference (Tree) Video Track

2
- 0037860595
- Look who's talking: Speaker detection using video and audio correlation
- Ross Cutler and Larry Davis, "Look Who's Talking: Speaker Detection using Video and Audio Correlation, " in Proc. ICME, 2000.
- (2000) Proc. ICME
- Cutler, R.¹ Davis, L.²

3
- 0037700834
- Assessing face and speech consistency for monologue de- Tectionin video
- Harriet J. Nock, Giridharan Iyengar, and Chalapathy Neti, "Assessing face and speech consistency for monologue de- Tectionin video, " in Proc. ACM Multimedia, 2002.
- (2002) Proc. ACM Multimedia
- Nock, H.J.¹ Iyengar, G.² Neti, C.³

5
- 0009622482
- Using audio-visual synchrony to locate sounds
- John Hershey and Javier Movellan, "Using audio-visual synchrony to locate sounds, " in Proc. NIPS, 1999.
- (1999) Proc. NIPS
- Hershey, J.¹ Movellan, J.²

6
- 84898954418
- Learning joint statistical models for audio-visual fusion and segregation
- JW Fisher III, T Darrell, WT Freeman, and P Viola, "Learning Joint Statistical Models for Audio-Visual Fusion and Segregation, " in Proc. NIPS, 2001.
- (2001) Proc. NIPS
- Fisher, J.W.¹ Darrell, T.² Freeman, W.T.³ Viola, P.⁴

7
- 0036293478
- Informative sub- spaces for audiovisual processing: High-level function from low-level fusion
- John W Fisher III and Trevor Darrell, "Informative sub- spaces for audiovisual processing: High-level function from low-level fusion, " in Proc. ICASSP, 2002.
- (2002) Proc. ICASSP
- Fisher, J.W.¹ Darrell, T.²

8
- 84898931254
- Facesync: Alinearop- erator for measuring synchronization of video facial images and audio tracks
- Malcolm Slaney and Michele Covell, "Facesync: Alinearop- erator for measuring synchronization of video facial images and audio tracks, " in Proc. NIPS, 2001.
- (2001) Proc. NIPS
- Slaney, M.¹ Covell, M.²

10
- 85088715355
- Robust speech recognition in noisy environments: The IBM spine-2 evaluation system
- Brian Kingsbury, George Saon, Lidia Mangu, Mukund Pad- manabhan, and Ruhi Sarikaya, "Robust Speech Recognition in Noisy Environments: The IBM Spine-2 Evaluation System, " in Proc. ICASSP, 2002.
- (2002) Proc. ICASSP
- Kingsbury, B.¹ Saon, G.² Mangu, L.³ Manabhan, M.P.-⁴ Sarikaya, R.⁵

11
- 0002595416
- Speaker, environment and channel change detection and clustering via the bayesian information criterion
- Scott S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the bayesian information criterion, " Intl. Conf. On Acoust., Sp., andSig. Proc., 1998.
- (1998) Intl. Conf. on Acoust., Sp., AndSig. Proc
- Chen, S.S.¹ Gopalakrishnan, P.S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.