메뉴 건너뛰기




Volumn , Issue , 2009, Pages 4069-4072

Multi-modal speaker diarization of real-world meetings using compressed-domain video features

Author keywords

Compressed domain features; Multi modal; Speaker extraction

Indexed keywords

ACOUSTIC FEATURES; AUDIO TRACK; COMPRESSED DOMAIN; COMPRESSED DOMAIN FEATURES; DATA SETS; ERROR RATE; IMPROVE-A; MULTI-MODAL; MULTI-MODAL APPROACH; PRIOR KNOWLEDGE; REAL-WORLD; SPEAKER DIARIZATION; SPEAKER EXTRACTION; STANDING-UP; VIDEO FEATURES;

EID: 70349214881     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2009.4960522     Document Type: Conference Paper
Times cited : (59)

References (11)
  • 1
    • 84976926276 scopus 로고
    • Listeners' body movements and speaking turns
    • April
    • Jinni A Harrigan, "Listeners' body movements and speaking turns," Communications Research, vol. 12, no. 2, pp. 233-250, April 1985.
    • (1985) Communications Research , vol.12 , Issue.2 , pp. 233-250
    • Harrigan, J.A.1
  • 2
    • 35248827017 scopus 로고    scopus 로고
    • Speaker Localisation Using Audio-Visual Synchrony: An Empirical Study
    • Harriet J. Nock, Giridharan Iyengar, and Chalapathy Neti, "Speaker Localisation Using Audio-Visual Synchrony: An Empirical Study," Lecture Notes in Computer Science, vol. 2728, pp. 565-570, 2003.
    • (2003) Lecture Notes in Computer Science , vol.2728 , pp. 565-570
    • Nock, H.J.1    Iyengar, G.2    Neti, C.3
  • 3
    • 2642562769 scopus 로고    scopus 로고
    • Speaker association with signal-level audiovisual fusion
    • JW Fisher III and T. Darrell, "Speaker association with signal-level audiovisual fusion," Multimedia, IEEE Transactions on, vol. 6, no. 3, pp. 406-413, 2004.
    • (2004) Multimedia, IEEE Transactions on , vol.6 , Issue.3 , pp. 406-413
    • Fisher III, J.W.1    Darrell, T.2
  • 9
    • 70349219475 scopus 로고    scopus 로고
    • Chuohao Yeo and Kannan Ramchandran, Compressed domain video processing of meetings for activity estimation in dominance classification and slide transition detection, Tech. Rep. UCB/EECS-2008-79, EECS Department, University of California, Berkeley, Jun 2008.
    • Chuohao Yeo and Kannan Ramchandran, "Compressed domain video processing of meetings for activity estimation in dominance classification and slide transition detection," Tech. Rep. UCB/EECS-2008-79, EECS Department, University of California, Berkeley, Jun 2008.
  • 10
    • 0032312273 scopus 로고    scopus 로고
    • Modelling facial colour and identity with gaussian mixtures
    • S J McKenna, S Gong, and Y Raja, "Modelling facial colour and identity with gaussian mixtures," Pattern Recognition, vol. 31, no. 12, pp. 1883-1892, 1998.
    • (1998) Pattern Recognition , vol.31 , Issue.12 , pp. 1883-1892
    • McKenna, S.J.1    Gong, S.2    Raja, Y.3
  • 11
    • 34548310397 scopus 로고    scopus 로고
    • Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information
    • J. Pardo, X. Anguera, and C. Wooters, "Speaker Diarization For Multiple-Distant-Microphone Meetings Using Several Sources of Information," IEEE Transactions on Computers, vol. 56, no. 9, pp. 1189, 2007.
    • (2007) IEEE Transactions on Computers , vol.56 , Issue.9 , pp. 1189
    • Pardo, J.1    Anguera, X.2    Wooters, C.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.