SCOPUS 정보 검색 플랫폼

Volumn 5, Issue , 2003, Pages 772-775

Audio-visual synchrony for detection of monologues in video archives

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; CORRELATION METHODS; NEURAL NETWORKS; SPEECH RECOGNITION;

VIDEO ARCHIVES;

COMMUNICATION CHANNELS (INFORMATION THEORY);

EID: 0141631499 PISSN: 15206149 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (16)

References (11)

1
- 84871617020
- "Text retrieval conference (trec) video track," http://trec.nist.gov.
- Text Retrieval Conference (trec) Video Track

2
- 0037860595
- Look Who's Talking: Speaker Detection using Video and Audio Correlation
- Ross Cutler and Larry Davis, "Look Who's Talking: Speaker Detection using Video and Audio Correlation," in Proc. ICME, 2000.
- (2000) Proc. ICME
- Cutler, R.¹ Davis, L.²

3
- 0037700834
- Assessing face and speech consistency for monologue detectionin video
- Harriet J. Nock, Giridharan lyengar, and Chalapathy Neti, "Assessing face and speech consistency for monologue detectionin video," in Proc. ACM Multimedia, 2002.
- (2002) Proc. ACM Multimedia
- Nock, H.J.¹ Lyengar, G.² Neti, C.³

5
- 0009622482
- Using audio-visual synchrony to locate sounds
- John Hershey and Javier Movellan, "Using audio-visual synchrony to locate sounds," in Proc. NIPS, 1999.
- (1999) Proc. NIPS
- Hershey, J.¹ Movellan, J.²

6
- 84898954418
- Learning Joint Statistical Models for Audio-Visual Fusion and Segregation
- JW Fisher III, T Darrell, WT Freeman, and P Viola, "Learning Joint Statistical Models for Audio-Visual Fusion and Segregation," in Proc. NIPS, 2001.
- (2001) Proc. NIPS
- Fisher J.W. III¹ Darrell, T.² Freeman, W.T.³ Viola, P.⁴

7
- 0036293478
- Informative sub-spaces for audiovisual processing: High-level function from low-level fusion
- John W Fisher III and Trevor Darrell, "Informative sub-spaces for audiovisual processing: High-level function from low-level fusion," in Proc. ICASSP, 2002.
- (2002) Proc. ICASSP
- Fisher J.W. III¹ Darrell, T.²

8
- 84898931254
- Facesync: A linear operator for measuring synchronization of video facial images and audio tracks
- Malcolm Slaney and Michele Covell, "Facesync: a linear operator for measuring synchronization of video facial images and audio tracks," in Proc. NIPS, 2001.
- (2001) Proc. NIPS
- Slaney, M.¹ Covell, M.²

10
- 17344389852
- Robust Speech Recognition in Noisy Environments: The IBM Spine-2 Evaluation System
- Brian Kingsbury, George Saon, Lidia Mangu, Mukund Padmanabhan, and Ruhi Sarikaya, "Robust Speech Recognition in Noisy Environments: The IBM Spine-2 Evaluation System," in Proc. ICASSP, 2002.
- (2002) Proc. ICASSP
- Kingsbury, B.¹ Saon, G.² Mangu, L.³ Padmanabhan, M.⁴ Sarikaya, R.⁵

11
- 0002595416
- Speaker, environment and channel change detection and clustering via the bayesian information criterion
- Scott S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the bayesian information criterion," Intl. Conf. On Acoust., Sp., and Sig. Proc., 1998.
- (1998) Intl. Conf. On Acoust., Sp., and Sig. Proc.
- Chen, S.S.¹ Gopalakrishnan, P.S.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.