



Volume 4, Issue 5, 2010, Pages 882-894

Audio-visual fusion and tracking with multilevel iterative decoding: Framework and experimental evaluation

Author keywords

Hierarchical frameworks; human activity analysis; human-computer interaction; iterative decoding; person tracking

Indexed keywords

AUDIO AND VISUAL CUES; AUDIO AND VISUAL INFORMATION; AUDIO APPLICATIONS; AUDIO-VISUAL; AUDIO-VISUAL FUSION; DATA SETS; EXPERIMENTAL EVALUATION; HIERARCHICAL FRAMEWORKS; HUMAN ACTIVITY ANALYSIS; HUMAN COMMUNICATIONS; HUMAN COMPUTER INTERFACES; INTELLIGENT SPACES; MICROPHONE ARRAYS; MICROPHONE CALIBRATION; NATURAL INTERFACES; NEW APPROACHES; PARTICLE FILTER; PARTICLE FILTERING; PERSON TRACKING; SENSOR CALIBRATION; SENSOR CONFIGURATIONS; VIDEO INFORMATION;

EID: 77956766546     PISSN: 19324553     EISSN: None     Source Type: Journal    
DOI: 10.1109/JSTSP.2010.2057890     Document Type: Article
Times cited: 37

References (44)
  • 1
    • 11844267204
    • Dynamic context capture and distributed video arrays for intelligent spaces
    • Jan
    • M. M. Trivedi, K. S. Huang, and I. Mikic, "Dynamic context capture and distributed video arrays for intelligent spaces," IEEE Trans. Syst., Man, Cybern., A, vol.35, no.1, pp. 145-163, Jan. 2005.
    • (2005) IEEE Trans. Syst., Man, Cybern., A , vol.35 , Issue.1 , pp. 145-163
    • Trivedi, M.M.1    Huang, K.S.2    Mikic, I.3
  • 4
    • 4944221356
    • Layered representations for learning and inferring office activity from multiple sensory channels
    • N. Oliver, A. Garg, and E. Horvitz, "Layered representations for learning and inferring office activity from multiple sensory channels," Comput. Vis. Image Understand., vol.96, no.2, pp. 163-180, 2004.
    • (2004) Comput. Vis. Image Understand. , vol.96 , Issue.2 , pp. 163-180
    • Oliver, N.1    Garg, A.2    Horvitz, E.3
  • 6
    • 0034245149
    • A Bayesian computer vision system for modeling human interactions
    • Aug
    • N. M. Oliver, B. Rosario, and A. Pentland, "A Bayesian computer vision system for modeling human interactions," IEEE Trans. Pattern Anal. Mach. Intell., vol.22, no.8, pp. 831-843, Aug. 2000.
    • (2000) IEEE Trans. Pattern Anal. Mach. Intell. , vol.22 , Issue.8 , pp. 831-843
    • Oliver, N.M.1    Rosario, B.2    Pentland, A.3
  • 10
    • 40249089621
    • Speech enhancement and recognition in meetings with an audio-visual sensor array
    • Nov
    • H. K. Maganti, D. Gatica-Perez, and I. McCowan, "Speech enhancement and recognition in meetings with an audio-visual sensor array," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.8, pp. 2257-2269, Nov. 2007.
    • (2007) IEEE Trans. Audio, Speech, Lang. Process. , vol.15 , Issue.8 , pp. 2257-2269
    • Maganti, H.K.1    Gatica-Perez, D.2    McCowan, I.3
  • 11
    • 70449556249
    • Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms
    • S. T. Shivappa, M. Trivedi, and B. D. Rao, "Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms," in Proc. IEEE CVPR Workshop: ViSU'09, 2009, pp. 107-114.
    • (2009) Proc. IEEE CVPR Workshop: ViSU'09 , pp. 107-114
    • Shivappa, S.T.1    Trivedi, M.2    Rao, B.D.3
  • 14
    • 3042551886
    • Video arrays for real-time tracking of person, head, and face in an intelligent room
    • K. S. Huang and M. M. Trivedi, "Video arrays for real-time tracking of person, head, and face in an intelligent room," Mach. Vis. Applicat., 2003.
    • (2003) Mach. Vis. Applicat
    • Huang, K.S.1    Trivedi, M.M.2
  • 15
    • 0346707503
    • Source localization in reverberant environments: Modeling and statistical analysis
    • Nov
    • T. Gustafsson, B. D. Rao, and M. M. Trivedi, "Source localization in reverberant environments: Modeling and statistical analysis," IEEE Trans. Speech Audio Process., vol.11, no.6, pp. 791-803, Nov. 2003.
    • (2003) IEEE Trans. Speech Audio Process. , vol.11 , Issue.6 , pp. 791-803
    • Gustafsson, T.1    Rao, B.D.2    Trivedi, M.M.3
  • 18
    • 0035458007
    • Robust sound localization using multi-source audiovisual information fusion
    • P. Aarabi and S. G. Zaky, "Robust sound localization using multi-source audiovisual information fusion," Information Fusion, 2001.
    • (2001) Information Fusion
    • Aarabi, P.1    Zaky, S.G.2
  • 21
    • 0042349407
    • A graphical model for audiovisual object tracking
    • Jul.
    • M. Beal, N. Jojic, and H. Attias, "A graphical model for audiovisual object tracking," IEEE Trans. Pattern Anal. Mach. Intell., vol.25, no.7, pp. 828-836, Jul. 2003.
    • (2003) IEEE Trans. Pattern Anal. Mach. Intell. , vol.25 , Issue.7 , pp. 828-836
    • Beal, M.1    Jojic, N.2    Attias, H.3
  • 23
    • 0009622481
    • Learning joint statistical models for audio-visual fusion and segregation
    • J. W. Fisher, T. Darrell, W. T. Freeman, and P. A. Viola, "Learning joint statistical models for audio-visual fusion and segregation," in Proc. NIPS, 2000.
    • (2000) Proc. NIPS
    • Fisher, J.W.1    Darrell, T.2    Freeman, W.T.3    Viola, P.A.4
  • 24
    • 84899028297
    • Audio vision: Using audiovisual synchrony to locate sounds
    • J. Hershey and J. Movellan, "Audio vision: Using audiovisual synchrony to locate sounds," in Proc. NIPS, 2000.
    • (2000) Proc. NIPS
    • Hershey, J.1    Movellan, J.2
  • 25
    • 0036874485
    • Joint audio-visual tracking using particle filters
    • D. N. Zotkin, R. Duraiswami, and L. S. Davis, "Joint audio-visual tracking using particle filters," EURASIP J. Appl. Signal Process., vol.2002, no.11, pp. 1154-1164, 2002.
    • (2002) EURASIP J. Appl. Signal Process. , vol.2002 , Issue.11 , pp. 1154-1164
    • Zotkin, D.N.1    Duraiswami, R.2    Davis, L.S.3
  • 27
    • 21244492850
    • Real-time speaker tracking using particle filter sensor fusion
    • Mar
    • Y. Chen and Y. Rui, "Real-time speaker tracking using particle filter sensor fusion," Proc. IEEE, vol.92, no.3, pp. 485-494, Mar. 2004.
    • (2004) Proc. IEEE , vol.92 , Issue.3 , pp. 485-494
    • Chen, Y.1    Rui, Y.2
  • 32
    • 37849022114
    • Audio-visual multi-person tracking and identification for smart environments
    • K. Bernardin and R. Stiefelhagen, "Audio-visual multi-person tracking and identification for smart environments," in Proc. ACM Int. Conf. Multimedia, 2007.
    • (2007) Proc. ACM Int. Conf. Multimedia
    • Bernardin, K.1    Stiefelhagen, R.2
  • 39
    • 0016990291
    • The generalized correlation method for estimation of time delay
    • Aug
    • C. H. Knapp and G. C. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-24, no.4, pp. 320-327, Aug. 1976.
    • (1976) IEEE Trans. Acoust., Speech, Signal Process. , vol.ASSP-24 , Issue.4 , pp. 320-327
    • Knapp, C.H.1    Carter, G.C.2
  • 40
    • 0030701369
    • A robust method for speech signal time-delay estimation in reverberant rooms
    • M. Brandstein and H. Silverman, "A robust method for speech signal time-delay estimation in reverberant rooms," in Proc. ICASSP, 1997, pp. 375-378.
    • (1997) Proc. ICASSP , pp. 375-378
    • Brandstein, M.1    Silverman, H.2
  • 42
    • 0016037512
    • Optimal decoding of linear codes for minimizing symbol error rate
    • Mar
    • L. Bahl, J. Cocke, F. Jelinek, and J. Raviv, "Optimal decoding of linear codes for minimizing symbol error rate," IEEE Trans. Inf. Theory, vol.IT-20, no.2, pp. 284-287, Mar. 1974.
    • (1974) IEEE Trans. Inf. Theory , vol.IT-20 , Issue.2 , pp. 284-287
    • Bahl, L.1    Cocke, J.2    Jelinek, F.3    Raviv, J.4
  • 43
    • 67650122797
    • Random projection trees for vector quantization
    • Jul.
    • S. Dasgupta and Y. Freund, "Random projection trees for vector quantization," IEEE Trans. Inf. Theory, vol.55, no.7, pp. 3229-3242, Jul. 2009.
    • (2009) IEEE Trans. Inf. Theory , vol.55 , Issue.7 , pp. 3229-3242
    • Dasgupta, S.1    Freund, Y.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.