2010, Pages 4942-4945

Using audio and visual cues for speaker diarisation initialisation

Author keywords

Audio visual speaker diarisation; Clustering initialisation

Indexed keywords

AUDIO AND VISUAL CUES; AUDIO VISUAL SPEAKER DIARIZATION; AUDIO-VISUAL; CLUSTERING INITIALIZATION; CLUSTERINGS; MOTION INTENSITY; SPEAKER DIARIZATION; TIME DELAY OF ARRIVAL; VISUAL FEATURE; VISUAL FOCUS OF ATTENTIONS;

EID: 78049384286     PISSN: 15206149     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICASSP.2010.5495101     Document Type: Conference Paper
Times cited: 9

References (18)
  • 3
    • 34548352841
    • Friends and Enemies: A Novel Initialization for Speaker Diarization
    • X. Anguera, C. Wooters, and J. Hernando, "Friends and Enemies: A Novel Initialization for Speaker Diarization," in Proc. ICSLP, 2006.
    • Proc. ICSLP, 2006
    • Anguera, X.1    Wooters, C.2    Hernando, J.3
  • 4
    • 70349214881
    • Multi-Modal Speaker Diarization of Real-World Meetings using Compressed Domain Video Features
    • G. Friedland, H. Hung, and C. Yeo, "Multi-Modal Speaker Diarization of Real-World Meetings using Compressed Domain Video Features," in Proc. ICASSP, 2009.
    • Proc. ICASSP, 2009
    • Friedland, G.1    Hung, H.2    Yeo, C.3
  • 6
    • 72449212152
    • A Realtime Multimodal System for Analysing Group Meetings by Combining Face Pose Tracking and Speaker Diarisation
    • K. Otsuka et al., "A Realtime Multimodal System for Analysing Group Meetings by Combining Face Pose Tracking and Speaker Diarisation," in Proc. ICMI, 2008.
    • Proc. ICMI, 2008
    • Otsuka, K.1
  • 7
    • 78049360455
    • Weighted Segmental K-Means Initialization for SOM-Based Speaker Clustering
    • O. Ben-Harush, I. Lapidot, and H. Guterman, "Weighted Segmental K-Means Initialization for SOM-Based Speaker Clustering," in Proc. ICSLP, 2008.
    • Proc. ICSLP, 2008
    • Ben-Harush, O.1    Lapidot, I.2    Guterman, H.3
  • 9
    • 78049389415
    • Speaker Diarization using Direction of Arrival Estimate and Acoustic Feature Information: The I2R-NTU Submission for the NIST RT 2007
    • E. C. W. Koh et al., "Speaker Diarization using Direction of Arrival Estimate and Acoustic Feature Information: the I2R-NTU Submission for the NIST RT 2007," in Proc. Rich Transcription Spring Meeting Recognition Evaluation, 2007.
    • Proc. Rich Transcription Spring Meeting Recognition Evaluation, 2007
    • Koh, E.C.W.1
  • 10
    • 78049400397
    • Clustering Initialization Based on Spatial Information for Speaker Diarization of Meetings
    • J. Luque, C. Segura, and J. Hernando, "Clustering Initialization Based on Spatial Information for Speaker Diarization of Meetings," in Proc. ICSLP, 2008.
    • Proc. ICSLP, 2008
    • Luque, J.1    Segura, C.2    Hernando, J.3
  • 11
    • 72449154157
    • Investigating the Use of Visual Focus of Attention for Audio-Visual Speaker Diarisation
    • G. Garau, S. Ba, H. Bourlard, and J.-M. Odobez, "Investigating the Use of Visual Focus of Attention for Audio-Visual Speaker Diarisation," in Proc. ACM Multimedia, 2009.
    • Proc. ACM Multimedia, 2009
    • Garau, G.1    Ba, S.2    Bourlard, H.3    Odobez, J.-M.4
  • 14
    • 33846265193
    • The AMI Meeting Corpus: A Pre-Announcement
    • J. Carletta et al., "The AMI Meeting Corpus: A Pre-Announcement," in Proc. MLMI, 2005.
    • Proc. MLMI, 2005
    • Carletta, J.1
  • 15
    • 33846242627
    • Speaker diarization for multi-party meetings using acoustic fusion
    • X. Anguera, C. Wooters, and J. Hernando, "Speaker diarization for multi-party meetings using acoustic fusion," in Proc. ASRU, 2005.
    • Proc. ASRU, 2005
    • Anguera, X.1    Wooters, C.2    Hernando, J.3
  • 16
    • 70449563031
    • Visual Activity Context for Focus of Attention Estimation in Dynamic Meetings
    • S.O. Ba, H. Hung, and J.-M. Odobez, "Visual Activity Context for Focus of Attention Estimation in Dynamic Meetings," in Proc. ICME, 2009.
    • Proc. ICME, 2009
    • Ba, S.O.1    Hung, H.2    Odobez, J.-M.3
  • 17
    • 20444446254
    • Action Recognition in Meeting Scenarios using Global Motion Features
    • M. Zobl, F. Wallhoff, and G. Rigoll, "Action Recognition in Meeting Scenarios using Global Motion Features," in Proc. PETS-ICVS, 2003.
    • Proc. PETS-ICVS, 2003
    • Zobl, M.1    Wallhoff, F.2    Rigoll, G.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.