메뉴 건너뛰기




Volumn 1, Issue , 2003, Pages I329-I332

Audio-visual synchrony for detection of monologues in video archives

Author keywords

[No Author keywords available]

Indexed keywords

BIOMETRICS; INFORMATION USE; VIDEO SIGNAL PROCESSING;

EID: 84908470296     PISSN: 19457871     EISSN: 1945788X     Source Type: Conference Proceeding    
DOI: 10.1109/ICME.2003.1220921     Document Type: Conference Paper
Times cited : (24)

References (11)
  • 2
    • 0037860595 scopus 로고    scopus 로고
    • Look who's talking: Speaker detection using video and audio correlation
    • Ross Cutler and Larry Davis, "Look Who's Talking: Speaker Detection using Video and Audio Correlation, " in Proc. ICME, 2000.
    • (2000) Proc. ICME
    • Cutler, R.1    Davis, L.2
  • 3
    • 0037700834 scopus 로고    scopus 로고
    • Assessing face and speech consistency for monologue de- Tectionin video
    • Harriet J. Nock, Giridharan Iyengar, and Chalapathy Neti, "Assessing face and speech consistency for monologue de- Tectionin video, " in Proc. ACM Multimedia, 2002.
    • (2002) Proc. ACM Multimedia
    • Nock, H.J.1    Iyengar, G.2    Neti, C.3
  • 4
    • 0141826698 scopus 로고    scopus 로고
    • Audio-visual speaker recognition for video broadcast news: Some fusion techniques
    • Denmark, September
    • Benoit Maison, Chalapathy Neti, and Andrew Senior, "Audio-visual speaker recognition for video broadcast news: Some fusion techniques, " in IEEE Multimedia Signal Processing (MMSP99), Denmark, September 1999.
    • (1999) IEEE Multimedia Signal Processing (MMSP99)
    • Maison, B.1    Neti, C.2    Andrew, S.3
  • 5
    • 0009622482 scopus 로고    scopus 로고
    • Using audio-visual synchrony to locate sounds
    • John Hershey and Javier Movellan, "Using audio-visual synchrony to locate sounds, " in Proc. NIPS, 1999.
    • (1999) Proc. NIPS
    • Hershey, J.1    Movellan, J.2
  • 6
    • 84898954418 scopus 로고    scopus 로고
    • Learning joint statistical models for audio-visual fusion and segregation
    • JW Fisher III, T Darrell, WT Freeman, and P Viola, "Learning Joint Statistical Models for Audio-Visual Fusion and Segregation, " in Proc. NIPS, 2001.
    • (2001) Proc. NIPS
    • Fisher, J.W.1    Darrell, T.2    Freeman, W.T.3    Viola, P.4
  • 7
    • 0036293478 scopus 로고    scopus 로고
    • Informative sub- spaces for audiovisual processing: High-level function from low-level fusion
    • John W Fisher III and Trevor Darrell, "Informative sub- spaces for audiovisual processing: High-level function from low-level fusion, " in Proc. ICASSP, 2002.
    • (2002) Proc. ICASSP
    • Fisher, J.W.1    Darrell, T.2
  • 8
    • 84898931254 scopus 로고    scopus 로고
    • Facesync: Alinearop- erator for measuring synchronization of video facial images and audio tracks
    • Malcolm Slaney and Michele Covell, "Facesync: Alinearop- erator for measuring synchronization of video facial images and audio tracks, " in Proc. NIPS, 2001.
    • (2001) Proc. NIPS
    • Slaney, M.1    Covell, M.2
  • 10
    • 85088715355 scopus 로고    scopus 로고
    • Robust speech recognition in noisy environments: The IBM spine-2 evaluation system
    • Brian Kingsbury, George Saon, Lidia Mangu, Mukund Pad- manabhan, and Ruhi Sarikaya, "Robust Speech Recognition in Noisy Environments: The IBM Spine-2 Evaluation System, " in Proc. ICASSP, 2002.
    • (2002) Proc. ICASSP
    • Kingsbury, B.1    Saon, G.2    Mangu, L.3    Manabhan, M.P.-4    Sarikaya, R.5
  • 11
    • 0002595416 scopus 로고    scopus 로고
    • Speaker, environment and channel change detection and clustering via the bayesian information criterion
    • Scott S. Chen and P. S. Gopalakrishnan, "Speaker, environment and channel change detection and clustering via the bayesian information criterion, " Intl. Conf. On Acoust., Sp., andSig. Proc., 1998.
    • (1998) Intl. Conf. on Acoust., Sp., AndSig. Proc
    • Chen, S.S.1    Gopalakrishnan, P.S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.