-
1
-
-
11844267204
-
Dynamic context capture and distributed video arrays for intelligent spaces
-
Jan
-
M. M. Trivedi, K. S. Huang, and I. Mikic, "Dynamic context capture and distributed video arrays for intelligent spaces," IEEE Trans. Syst., Man, Cybern., A, vol.35, no.1, pp. 145-163, Jan. 2005.
-
(2005)
IEEE Trans. Syst., Man, Cybern., A
, vol.35
, Issue.1
, pp. 145-163
-
-
Trivedi, M.M.1
Huang, K.S.2
Mikic, I.3
-
2
-
-
4944224241
-
Active camera networks and semantic event databases for intelligent environments
-
M. M. Trivedi, I. Mikic, and S. Bhonsle, "Active camera networks and semantic event databases for intelligent environments," in Proc. IEEE CVPR Workshop Human Modeling, Anal., Synth., 2000.
-
(2000)
Proc. IEEE CVPR Workshop Human Modeling, Anal., Synth
-
-
Trivedi, M.M.1
Mikic, I.2
Bhonsle, S.3
-
3
-
-
60849111400
-
Person tracking with audio-visual cues using the iterative decoding framework
-
S. T. Shivappa, M. M. Trivedi, and B. D. Rao, "Person tracking with audio-visual cues using the iterative decoding framework," in Proc. 5th IEEE Int. Conf. Adv. Video Signal Based Surveill., 2008, pp. 260-267.
-
(2008)
Proc. 5th IEEE Int. Conf. Adv. Video Signal Based Surveill
, pp. 260-267
-
-
Shivappa, S.T.1
Trivedi, M.M.2
Rao, B.D.3
-
4
-
-
4944221356
-
Layered representations for learning and inferring office activity from multiple sensory channels
-
N. Oliver, A. Garg, and E. Horvitz, "Layered representations for learning and inferring office activity from multiple sensory channels," Comput. Vis. Image Understand., vol.96, no.2, pp. 163-180, 2004.
-
(2004)
Comput. Vis. Image Understand.
, vol.96
, Issue.2
, pp. 163-180
-
-
Oliver, N.1
Garg, A.2
Horvitz, E.3
-
5
-
-
84932605936
-
Modeling individual and group actions in meetings: A two-layer HMM framework
-
Jun
-
D. Zhang, D. Gatica-Perez, S. Bengio, I. McCowan, and G. Lathoud, "Modeling individual and group actions in meetings: A two-layer HMM framework," in Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognition, Jun. 2004, p. 117.
-
(2004)
Proc. IEEE Int. Conf. Comput. Vis. Pattern Recognition
, pp. 117
-
-
Zhang, D.1
Gatica-Perez, D.2
Bengio, S.3
McCowan, I.4
Lathoud, G.5
-
6
-
-
0034245149
-
A Bayesian computer vision system for modeling human interactions
-
Aug
-
N. M. Oliver, B. Rosario, and A. Pentland, "A Bayesian computer vision system for modeling human interactions," IEEE Trans. Pattern Anal. Mach. Intell., vol.22, no.8, pp. 831-843, Aug. 2000.
-
(2000)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.22
, Issue.8
, pp. 831-843
-
-
Oliver, N.M.1
Rosario, B.2
Pentland, A.3
-
9
-
-
70349205633
-
Role of head pose estimation in speech acquisition from distant microphones
-
S. T. Shivappa, B. D. Rao, and M. M. Trivedi, "Role of head pose estimation in speech acquisition from distant microphones," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), 2009, pp. 3557-3560.
-
(2009)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP)
, pp. 3557-3560
-
-
Shivappa, S.T.1
Rao, B.D.2
Trivedi, M.M.3
-
10
-
-
40249089621
-
Speech enhancement and recognition in meetings with an audio-visual sensor array
-
Nov
-
H. K. Maganti, D. Gatica-Perez, and I. McCowan, "Speech enhancement and recognition in meetings with an audio-visual sensor array," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.8, pp. 2257-2269, Nov. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.8
, pp. 2257-2269
-
-
Maganti, H.K.1
Gatica-Perez, D.2
McCowan, I.3
-
11
-
-
70449556249
-
Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms
-
S. T. Shivappa, M. Trivedi, and B. D. Rao, "Hierarchical audio-visual cue integration framework for activity analysis in intelligent meeting rooms," in Proc. IEEE CVPR Workshop: ViSU'09, 2009, pp. 107-114.
-
(2009)
Proc. IEEE CVPR Workshop: ViSU'09
, pp. 107-114
-
-
Shivappa, S.T.1
Trivedi, M.2
Rao, B.D.3
-
14
-
-
3042551886
-
Video arrays for real-time tracking of person, head, and face in an intelligent room
-
K. S. Huang and M. M. Trivedi, "Video arrays for real-time tracking of person, head, and face in an intelligent room," Mach. Vis. Applicat., 2003.
-
(2003)
Mach. Vis. Applicat
-
-
Huang, K.S.1
Trivedi, M.M.2
-
15
-
-
0346707503
-
Source localization in reverberant environments: Modeling and statistical analysis
-
Nov
-
T. Gustafsson, B. D. Rao, and M. M. Trivedi, "Source localization in reverberant environments: Modeling and statistical analysis," IEEE Trans. Speech Audio Process., vol.11, no.6, pp. 791-803, Nov. 2003.
-
(2003)
IEEE Trans. Speech Audio Process.
, vol.11
, Issue.6
, pp. 791-803
-
-
Gustafsson, T.1
Rao, B.D.2
Trivedi, M.M.3
-
18
-
-
0035458007
-
Robust sound localization using multi-source audiovisual information fusion
-
S. G. Z. P. Aarabi, "Robust sound localization using multi-source audiovisual information fusion," Information Fusion, 2001.
-
(2001)
Information Fusion
-
-
Aarabi, S.G.Z.P.1
-
19
-
-
0034844366
-
Sequential Monte Carlo fusion of sound and vision for speaker tracking
-
J. Vermaak, M. Gangnet, A. Blake, and P. Perez, "Sequential Monte Carlo fusion of sound and vision for speaker tracking," in Proc. 8th IEEE Int. Conf. Comput. Vis., pp. 741-746.
-
Proc. 8th IEEE Int. Conf. Comput. Vis
, pp. 741-746
-
-
Vermaak, J.1
Gangnet, M.2
Blake, A.3
Perez, P.4
-
20
-
-
84880877816
-
Real-time auditory and visual multiple-object tracking for humanoids
-
K. Nakadai, K. Hidai, H. Mizoguchi, H. G. Okuno, and H. Kitano, "Real-time auditory and visual multiple-object tracking for humanoids," in Proc. IJCAI, 2001.
-
(2001)
Proc. IJCAI
-
-
Nakadai, K.1
Hidai, K.2
Mizoguchi, H.3
Okuno, H.G.4
Kitano, H.5
-
21
-
-
0042349407
-
A graphical model for audiovisual object tracking
-
Jul.
-
M. Beal, N. Jojic, and H. Attias, "A graphical model for audiovisual object tracking," IEEE Trans. Pattern Anal. Mach. Intell., vol.25, no.7, pp. 828-836, Jul. 2003.
-
(2003)
IEEE Trans. Pattern Anal. Mach. Intell.
, vol.25
, Issue.7
, pp. 828-836
-
-
Beal, M.1
Jojic, N.2
Attias, H.3
-
23
-
-
0009622481
-
Learning joint statistical models for audio-visual fusion and segregation
-
J. W. Fisher, T. Darrell, W. T. Freeman, and P. A.Viola, "Learning joint statistical models for audio-visual fusion and segregation," in Proc. NIPS, 2000.
-
(2000)
Proc. NIPS
-
-
Fisher, J.W.1
Darrell, T.2
Freeman, W.T.3
Viola, P.A.4
-
24
-
-
84899028297
-
Audio vision: Using audiovisual synchrony to locate sounds
-
J. Hershey and J. Movellan, "Audio vision: Using audiovisual synchrony to locate sounds," in Proc. NIPS, 2000.
-
(2000)
Proc. NIPS
-
-
Hershey, J.1
Movellan, J.2
-
25
-
-
0036874485
-
Joint audio-visual tracking using particle filters
-
D. N. Zotkin, R. Duraiswami, and L. S. Davis, "Joint audio-visual tracking using particle filters," EURASIP J. Appl. Signal Process., vol.2002, no.11, pp. 1154-1164, 2002.
-
(2002)
EURASIP J. Appl. Signal Process.
, vol.2002
, Issue.11
, pp. 1154-1164
-
-
Zotkin, D.N.1
Duraiswami, R.2
Davis, L.S.3
-
26
-
-
0345565782
-
Audio-Visual Speaker Tracking with Importance Particle Filters
-
D. G. Perez, G. Lathoud, I. McCowan, J. M. Odobez, and D. Moore, "Audio-Visual Speaker Tracking With Importance Particle Filters," in Proc. ICIP, 2003.
-
(2003)
Proc. ICIP
-
-
Perez, D.G.1
Lathoud, G.2
McCowan, I.3
Odobez, J.M.4
Moore, D.5
-
27
-
-
21244492850
-
Real-time speaker tracking using particle filter sensor fusion
-
Mar
-
Y. Chen and Y. Rui, "Real-time speaker tracking using particle filter sensor fusion," Proc. IEEE, vol.92, no.3, pp. 485-494, Mar. 2004.
-
(2004)
Proc. IEEE
, vol.92
, Issue.3
, pp. 485-494
-
-
Chen, Y.1
Rui, Y.2
-
28
-
-
4544347587
-
Multiple person and speaker activity tracking with a particle filter
-
N. Checka, K. W. Wilson, M. R. Siracusa, and T. Darrell, "Multiple person and speaker activity tracking with a particle filter," in Proc. IEEE Int. Conf. Acoust., Speech, Signal Process., 2004, vol.V, pp. 881-884.
-
(2004)
Proc. IEEE Int. Conf. Acoust., Speech, Signal Process
, vol.5
, pp. 881-884
-
-
Checka, N.1
Wilson, K.W.2
Siracusa, M.R.3
Darrell, T.4
-
29
-
-
32344434992
-
A joint particle filter for audio-visual speaker tracking
-
K. Nickel, T. Gehrig, R. Stiefelhagen, and J. McDonough, "A joint particle filter for audio-visual speaker tracking," in Proc. 7th Int. Conf. Multimodal Interfaces, 2005.
-
(2005)
Proc. 7th Int. Conf. Multimodal Interfaces
-
-
Nickel, K.1
Gehrig, T.2
Stiefelhagen, R.3
McDonough, J.4
-
30
-
-
33645672078
-
Kalman filters for audio-video source localization
-
Oct
-
T. Gehrig, K. Nickel, H. K. Ekenel, U. Klee, and J. McDonough, "Kalman filters for audio-video source localization," in Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust., Oct. 2005, pp. 118-121.
-
(2005)
Proc. IEEE Workshop Applicat. Signal Process. Audio Acoust.
, pp. 118-121
-
-
Gehrig, T.1
Nickel, K.2
Ekenel, H.K.3
Klee, U.4
McDonough, J.5
-
31
-
-
64149093817
-
Audiovisual probabilistic tracking of multiple speakers in meetings
-
Feb.
-
D. Gatica-Perez, G. Lathoud, J. Odobez, and I. McCowan, "Audiovisual probabilistic tracking of multiple speakers in meetings," IEEE Trans. Audio, Speech, Lang. Process., vol.15, no.2, pp. 601-616, Feb. 2007.
-
(2007)
IEEE Trans. Audio, Speech, Lang. Process.
, vol.15
, Issue.2
, pp. 601-616
-
-
Gatica-Perez, D.1
Lathoud, G.2
Odobez, J.3
McCowan, I.4
-
32
-
-
37849022114
-
Audio-visual multi-person tracking and identification for smart environments
-
K. Bernardin and R. Stiefelhagen, "Audio-visual multi-person tracking and identification for smart environments," in Proc. ACM Int. Conf. Multimedia, 2007.
-
(2007)
Proc. ACM Int. Conf. Multimedia
-
-
Bernardin, K.1
Stiefelhagen, R.2
-
33
-
-
35348860213
-
Enabling multimodal human-robot interaction for the Karlsruhe humanoid robot
-
Oct
-
R. Stiefelhagen, H. K. Ekenel, C. Fugen, P. Gieselmann, H. Holzapfel, F. Kraft, K. Nickel, M. Voit, and A. Waibel, "Enabling multimodal human-robot interaction for the Karlsruhe humanoid robot," IEEE Trans. Robotics, vol.23, no.5, pp. 840-851, Oct. 2007.
-
(2007)
IEEE Trans. Robotics
, vol.23
, Issue.5
, pp. 840-851
-
-
Stiefelhagen, R.1
Ekenel, H.K.2
Fugen, C.3
Gieselmann, P.4
Holzapfel, H.5
Kraft, F.6
Nickel, K.7
Voit, M.8
Waibel, A.9
-
35
-
-
77956738116
-
Upc audio, video and multimodal person tracking systems in the clear evaluation campaign
-
A. Abad, C. Canton-Ferrer, C. Segura, J. L. Landabaso, D. Macho, J. R. Casas, J. Hernando, M. Pardas, and C. Nadeu, "Upc audio, video and multimodal person tracking systems in the clear evaluation campaign," in Proc. 1st Int. CLEAR Evaluation Workshop-Multimodal Technologies for Perception of Humans, 2007.
-
(2007)
Proc. 1st Int. CLEAR Evaluation Workshop-Multimodal Technologies for Perception of Humans
-
-
Abad, A.1
Canton-Ferrer, C.2
Segura, C.3
Landabaso, J.L.4
MacHo, D.5
Casas, J.R.6
Hernando, J.7
Pardas, M.8
Nadeu, C.9
-
36
-
-
44949210324
-
The ait 3D audio/visual person tracker for clear 2007
-
N. Katsarakis, F. Talantzis, A. Pnevmatikakis, and L. Polymenakos, "The ait 3D audio/visual person tracker for clear 2007," in Proc. 1st Int. CLEAR Evaluation Workshop-Multimodal Technologies for Perception of Humans, 2007.
-
(2007)
Proc. 1st Int. CLEAR Evaluation Workshop-Multimodal Technologies for Perception of Humans
-
-
Katsarakis, N.1
Talantzis, F.2
Pnevmatikakis, A.3
Polymenakos, L.4
-
39
-
-
0016990291
-
The generalized correlation method for estimation of time delay
-
Aug
-
C. H. Knapp and G. C. Carter, "The generalized correlation method for estimation of time delay," IEEE Trans. Acoust., Speech, Signal Process., vol.ASSP-24, no.4, pp. 320-327, Aug. 1976.
-
(1976)
IEEE Trans. Acoust., Speech, Signal Process.
, vol.ASSP-24
, Issue.4
, pp. 320-327
-
-
Knapp, C.H.1
Carter, G.C.2
-
40
-
-
0030701369
-
A robust method for speech signal time-delay estimation in reverberant rooms
-
M. Brandstein and H. Silverman, "A robust method for speech signal time-delay estimation in reverberant rooms," in Proc. ICASSP, 1997, pp. 375-378.
-
(1997)
Proc. ICASSP
, pp. 375-378
-
-
Brandstein, M.1
Silverman, H.2
-
42
-
-
0016037512
-
Optimal decoding of linear codes for minimizing symbol error rate
-
Mar
-
L. Bahl, J. Cocke, F. Jelinek, and J. Raviv, "Optimal decoding of linear codes for minimizing symbol error rate," IEEE Trans. Inf. Theory, vol.IT-20, no.2, pp. 284-287, Mar. 1974.
-
(1974)
IEEE Trans. Inf. Theory
, vol.IT-20
, Issue.2
, pp. 284-287
-
-
Bahl, L.1
Cocke, J.2
Jelinek, F.3
Raviv, J.4
-
43
-
-
67650122797
-
Random projection trees for vector quantization
-
Jul.
-
S. Dasgupta and Y. Freund, "Random projection trees for vector quantization," IEEE Trans. Inf. Theory, vol.55, no.7, pp. 3229-3242, Jul. 2009.
-
(2009)
IEEE Trans. Inf. Theory
, vol.55
, Issue.7
, pp. 3229-3242
-
-
Dasgupta, S.1
Freund, Y.2
-
44
-
-
41349114281
-
The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms
-
Dec
-
D. Mostefa, N. Moreau, K. Choukri, G. Potamianos, S. M. Chu, J. R. Casas, J. Turmo, L. Cristoferetti, F. Tobia, A. Pnevmatikakis, V. Mylonakis, F. Talantzis, S. Burger, R. Stiefelhagen, K. Bernardin, and C. Rochet, "The CHIL audiovisual corpus for lecture and meeting analysis inside smart rooms," J. Lang. Res. Eval., vol.41, no. (3-4), pp. 389-407, Dec. 2007.
-
(2007)
J. Lang. Res. Eval.
, vol.41
, Issue.3-4
, pp. 389-407
-
-
Mostefa, D.1
Moreau, N.2
Choukri, K.3
Potamianos, G.4
Chu, S.M.5
Casas, J.R.6
Turmo, J.7
Cristoferetti, L.8
Tobia, F.9
Pnevmatikakis, A.10
Mylonakis, V.11
Talantzis, F.12
Burger, S.13
Stiefelhagen, R.14
Bernardin, K.15
Rochet, C.16
|