메뉴 건너뛰기




Volumn , Issue , 2008, Pages 257-264

A realtime multimodal system for analyzing group meetings by combining face pose tracking and s peaker diarization

Author keywords

Face tracking; Fisheye lens; Focus of attention; Meeting analysis; Microphone array; Omnidirectional cameras; Realtime system; Speaker diarization

Indexed keywords

CAMERAS; DIRECTION OF ARRIVAL; INTERACTIVE COMPUTER SYSTEMS; LENSES; MICROPHONES; RADIO DIRECTION FINDING SYSTEMS; SENSORS; SPEECH RECOGNITION; THREE DIMENSIONAL; THREE DIMENSIONAL COMPUTER GRAPHICS; TRACKING (POSITION); VIDEO CAMERAS;

EID: 60949097180     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1145/1452392.1452446     Document Type: Conference Paper
Times cited : (69)

References (24)
  • 2
    • 0004260407 scopus 로고
    • 2nd ed. Routledge, London and New York
    • M. Argyle. Bodily Communication - 2nd ed. Routledge, London and New York, 1988.
    • (1988) Bodily Communication
    • Argyle, M.1
  • 3
    • 77249175697 scopus 로고    scopus 로고
    • A study on visual focus of attention recognition from head pose in a meeting room
    • S. O. Ba and J.-M. Odobez. A study on visual focus of attention recognition from head pose in a meeting room. In Proc. MLMI2006, pages 75-87, 2006.
    • (2006) Proc. MLMI2006 , pp. 75-87
    • Ba, S.O.1    Odobez, J.-M.2
  • 4
    • 34547535369 scopus 로고    scopus 로고
    • Real-time monitoring of participants' interaction in a meeting using audio-visual sensors
    • C. Busso, P. G. Georgiou, and S. S. Narayanan. Real-time monitoring of participants' interaction in a meeting using audio-visual sensors. In Proc. ICASSP2007, pages 685-688, 2007.
    • (2007) Proc. ICASSP2007 , pp. 685-688
    • Busso, C.1    Georgiou, P.G.2    Narayanan, S.S.3
  • 5
    • 40249085768 scopus 로고    scopus 로고
    • Robust real time face tracking for the analysis of human behaviour
    • D. Douxchamps and N. Campbell. Robust real time face tracking for the analysis of human behaviour. In Proc. MLMI2007, pages 1-10, 2007.
    • (2007) Proc. MLMI2007 , pp. 1-10
    • Douxchamps, D.1    Campbell, N.2
  • 6
    • 51449100230 scopus 로고    scopus 로고
    • A voice activity detection based on the adaptive integration of multiple speech features and a signal decision scheme
    • M. Fujimoto, K. Ishizuka, and T. Nakatani. A voice activity detection based on the adaptive integration of multiple speech features and a signal decision scheme. In Proc. ICASSP2008, pages 4441-4444, 2008.
    • (2008) Proc. ICASSP2008 , pp. 4441-4444
    • Fujimoto, M.1    Ishizuka, K.2    Nakatani, T.3
  • 9
    • 0014036537 scopus 로고
    • Some functions of gaze-direction in social interaction
    • A. Kendon. Some functions of gaze-direction in social interaction. Acta Psychologica, 26:22-63, 1967.
    • (1967) Acta Psychologica , vol.26 , pp. 22-63
    • Kendon, A.1
  • 10
    • 0016990291 scopus 로고
    • The generalized correlation method for estimation of time delay
    • C. H. Knapp and G. C. Carter. The generalized correlation method for estimation of time delay. IEEE Trans. ASSP, 24(4):320-327, 1976.
    • (1976) IEEE Trans. ASSP , vol.24 , Issue.4 , pp. 320-327
    • Knapp, C.H.1    Carter, G.C.2
  • 11
    • 44449098665 scopus 로고    scopus 로고
    • Vace multimodal meeting corpus
    • L. Chen, et al. Vace multimodal meeting corpus. In Proc. MLMI2006, pages 40-51, 2006.
    • (2006) Proc. MLMI2006 , pp. 40-51
    • Chen, L.1
  • 12
  • 13
    • 51449096171 scopus 로고    scopus 로고
    • Simultaneous and fast 3D tracking of multiple faces in video by GPU-based stream processing
    • O. Mateo Lozano and K. Otsuka. Simultaneous and fast 3D tracking of multiple faces in video by GPU-based stream processing. In Proc. ICASSP2008, pages 713-716, 2008.
    • (2008) Proc. ICASSP2008 , pp. 713-716
    • Mateo Lozano, O.1    Otsuka, K.2
  • 14
    • 63449127126 scopus 로고    scopus 로고
    • Multi human trajectory estimation using stochastic sampling and its application to meeting recognition
    • Y. Matsusaka, H. Asoh, and F. Asano. Multi human trajectory estimation using stochastic sampling and its application to meeting recognition. In Proc. MVA2007, pages 16-18, 2007.
    • (2007) Proc. MVA2007 , pp. 16-18
    • Matsusaka, Y.1    Asoh, H.2    Asano, F.3
  • 15
    • 63449091800 scopus 로고    scopus 로고
    • NIST Speech Group. Spring 2007 (RT-07) rich transcription meeting recognition evaluation plan. Technical Report rt07-meeting-eval-plan-v2, NIST, 2007.
    • NIST Speech Group. Spring 2007 (RT-07) rich transcription meeting recognition evaluation plan. Technical Report rt07-meeting-eval-plan-v2, NIST, 2007.
  • 16
    • 32344452625 scopus 로고    scopus 로고
    • A probabilistic inference of multiparty-conversation structure based on Markov-switching models of gaze patterns, head directions, and utterances
    • K. Otsuka, Y. Takemae, J. Yamato, and H. Murase. A probabilistic inference of multiparty-conversation structure based on Markov-switching models of gaze patterns, head directions, and utterances. In Proc. ICMI'05, pages 191-198, 2005.
    • (2005) Proc. ICMI'05 , pp. 191-198
    • Otsuka, K.1    Takemae, Y.2    Yamato, J.3    Murase, H.4
  • 17
    • 63449139691 scopus 로고    scopus 로고
    • Fast and robust face tracking for analyzing multiparty face-to-face meetings
    • K. Otsuka and J. Yamato. Fast and robust face tracking for analyzing multiparty face-to-face meetings. In Proc. MLMI2008, 2008.
    • (2008) Proc. MLMI2008
    • Otsuka, K.1    Yamato, J.2
  • 18
    • 34247605089 scopus 로고    scopus 로고
    • Conversation scene analysis with dynamic Bayesian network based on visual head tracking
    • K. Otsuka, J. Yamato, and H. Murase. Conversation scene analysis with dynamic Bayesian network based on visual head tracking. In Proc. ICME'06, pages 949-952, 2006.
    • (2006) Proc. ICME'06 , pp. 949-952
    • Otsuka, K.1    Yamato, J.2    Murase, H.3
  • 19
    • 50449105545 scopus 로고    scopus 로고
    • Interpretation of multiparty meetings the AMI and AMIDA projects
    • S. Renals, T. Hain, and H. Bourlard. Interpretation of multiparty meetings the AMI and AMIDA projects. In Proc. HSCMA2008, pages 115-118, 2008.
    • (2008) Proc. HSCMA2008 , pp. 115-118
    • Renals, S.1    Hain, T.2    Bourlard, H.3
  • 20
    • 77249131846 scopus 로고    scopus 로고
    • Real-time monitoring of participants' interaction in a meeting using audio-visual sensors
    • K. Smith, S. Schreiber, I. Potúcek, V. Beran, G. Rigoll, and D. Gatica-Perez. Real-time monitoring of participants' interaction in a meeting using audio-visual sensors. In Proc. MLMI2006, pages 88-101, 2006.
    • (2006) Proc. MLMI2006 , pp. 88-101
    • Smith, K.1    Schreiber, S.2    Potúcek, I.3    Beran, V.4    Rigoll, G.5    Gatica-Perez, D.6
  • 21
    • 0036650146 scopus 로고    scopus 로고
    • Modeling focus of attention for meeting index based on multiple cues
    • R. Stiefelhagen, J. Yang, and A. Waibel. Modeling focus of attention for meeting index based on multiple cues. IEEE Trans. Neural Networks, 13(4), 2002.
    • (2002) IEEE Trans. Neural Networks , vol.13 , Issue.4
    • Stiefelhagen, R.1    Yang, J.2    Waibel, A.3
  • 22
    • 2142812371 scopus 로고    scopus 로고
    • Robust real-time face detection
    • P. Viola and M. Jones. Robust real-time face detection. IJCV, 57(2):137-154, 2004.
    • (2004) IJCV , vol.57 , Issue.2 , pp. 137-154
    • Viola, P.1    Jones, M.2
  • 23
    • 34547212750 scopus 로고    scopus 로고
    • Tracking head pose and focus of attention with multiple far-field cameras
    • M. Voit and R. Stiefelhagen. Tracking head pose and focus of attention with multiple far-field cameras. In Proc. ICMI2006, pages 281-286, 2006.
    • (2006) Proc. ICMI2006 , pp. 281-286
    • Voit, M.1    Stiefelhagen, R.2
  • 24
    • 10044220516 scopus 로고    scopus 로고
    • Face tracking in meeting room scenarios using omnidirectional views
    • F. Wallhoff, M. Zobl, G. Rigoll, and I. Potucek. Face tracking in meeting room scenarios using omnidirectional views. In Proc. ICPR2004, 2004.
    • (2004) Proc. ICPR2004
    • Wallhoff, F.1    Zobl, M.2    Rigoll, G.3    Potucek, I.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.