-
1
-
-
83455196939
-
Finding audio-visual events in informal social gatherings
-
X. Alameda-Pineda, V. Khalidov, R. P. Horaud, and F. Forbes, "Finding audio-visual events in informal social gatherings," in Proc. of the 13th Int. Conf. on Multimodal Interfaces, November 2011, pp. 247-254.
-
Proc. of the 13th Int. Conf. on Multimodal Interfaces, November 2011
, pp. 247-254
-
-
Alameda-Pineda, X.1
Khalidov, V.2
Horaud, R.P.3
Forbes, F.4
-
2
-
-
64149093817
-
Audiovisual probabilistic tracking of multiple speakers in meetings
-
D. Gatica-Perez, G. Lathoud, J.-M. Odobez, and I. McCowan, "Audiovisual probabilistic tracking of multiple speakers in meetings," IEEE Trans. on Audio, Speech, and Language Processing, vol. 15, no. 2, pp. 601-616, 2007.
-
(2007)
IEEE Trans. on Audio, Speech, and Language Processing
, vol.15
, Issue.2
, pp. 601-616
-
-
Gatica-Perez, D.1
Lathoud, G.2
Odobez, J.-M.3
McCowan, I.4
-
3
-
-
0034844366
-
Sequential Monte Carlo fusion of sound and vision for speaker tracking
-
IEEE
-
J. Vermaak, M. Gangnet, A. Blake, and P. Pérez, "Sequential Monte Carlo fusion of sound and vision for speaker tracking," in Proc. of the 8th Int. Conf. on Computer Vision. IEEE, 2001, pp. 741-746.
-
(2001)
Proc. of the 8th Int. Conf. on Computer Vision
, pp. 741-746
-
-
Vermaak, J.1
Gangnet, M.2
Blake, A.3
Pérez, P.4
-
4
-
-
13344250690
-
Data fusion for visual tracking with particles
-
P. Pérez, J. Vermaak, and A. Blake, "Data fusion for visual tracking with particles," Proceedings of the IEEE, vol. 92, no. 3, pp. 495-513, 2004.
-
(2004)
Proceedings of the IEEE
, vol.92
, Issue.3
, pp. 495-513
-
-
Pérez, P.1
Vermaak, J.2
Blake, A.3
-
5
-
-
63449109271
-
Detection and localization of 3D audio-visual objects using unsupervised clustering
-
V. Khalidov, F. Forbes, M. Hansard, E. Arnaud, and R. Horaud, "Detection and localization of 3D audio-visual objects using unsupervised clustering," in Proc. of ICMI, 2008.
-
Proc. of ICMI, 2008
-
-
Khalidov, V.1
Forbes, F.2
Hansard, M.3
Arnaud, E.4
Horaud, R.5
-
6
-
-
56549130088
-
Structure inference for Bayesian multisensory scene understanding
-
T. Hospedales and S. Vijayakumar, "Structure inference for Bayesian multisensory scene understanding," IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 30, no. 12, pp. 2140-2157, 2008.
-
(2008)
IEEE Trans. on Pattern Analysis and Machine Intelligence
, vol.30
, Issue.12
, pp. 2140-2157
-
-
Hospedales, T.1
Vijayakumar, S.2
-
7
-
-
78651517803
-
Conjugate mixture models for clustering multimodal data
-
V. Khalidov, F. Forbes, and R. Horaud, "Conjugate mixture models for clustering multimodal data," Neural Computation, vol. 23, no. 2, pp. 517-557, 2011.
-
(2011)
Neural Computation
, vol.23
, Issue.2
, pp. 517-557
-
-
Khalidov, V.1
Forbes, F.2
Horaud, R.3
-
8
-
-
33749400336
-
A probabilistic model for binaural sound localization
-
V. Willert, J. Eggert, J. Adamy, R. Stahl, and E. Koerner, "A probabilistic model for binaural sound localization," IEEE Trans. on Systems, Man, and Cybernetics - Part B, vol. 36, no. 5, pp. 982-994, 2006.
-
(2006)
IEEE Trans. on Systems, Man, and Cybernetics - Part B
, vol.36
, Issue.5
, pp. 982-994
-
-
Willert, V.1
Eggert, J.2
Adamy, J.3
Stahl, R.4
Koerner, E.5
-
9
-
-
51449115206
-
Direct computation of sound and microphone locations from time-difference-of-arrival data
-
M. Pollefeys and D. Nister, "Direct computation of sound and microphone locations from time-difference-of-arrival data," in IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, 2008, pp. 2445-2448.
-
IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, 2008
, pp. 2445-2448
-
-
Pollefeys, M.1
Nister, D.2
-
11
-
-
4544347587
-
Multiple person and speaker activity tracking with a particle filter
-
N. Checka, K. Wilson, M. Siracusa, and T. Darrell, "Multiple person and speaker activity tracking with a particle filter," in Proc. of IEEE Conf. on Acoustics, Speech, and Signal Processing, 2004, pp. 881-884.
-
Proc. of IEEE Conf. on Acoustics, Speech, and Signal Processing, 2004
, pp. 881-884
-
-
Checka, N.1
Wilson, K.2
Siracusa, M.3
Darrell, T.4
-
13
-
-
34547554086
-
Calibration of audio-video sensors for multi-modal event indexing
-
T. Kuhnapfel, T. Tan, S. Venkatesh, and E. Lehmann, "Calibration of audio-video sensors for multi-modal event indexing," in IEEE Int. Conf. on Acoustics, Speech and Signal Processing, vol. 2, 2007, pp. 741-744.
-
(2007)
IEEE Int. Conf. on Acoustics, Speech and Signal Processing
, vol.2
, pp. 741-744
-
-
Kuhnapfel, T.1
Tan, T.2
Venkatesh, S.3
Lehmann, E.4
-
14
-
-
75649136861
-
Onsets coincidence for cross-modal analysis
-
Z. Barzelay and Y. Schechner, "Onsets coincidence for cross-modal analysis," IEEE Trans. on Multimedia, vol. 12, no. 2, pp. 108-120, 2010.
-
(2010)
IEEE Trans. on Multimedia
, vol.12
, Issue.2
, pp. 108-120
-
-
Barzelay, Z.1
Schechner, Y.2
-
15
-
-
84859997410
-
The cocktail party robot: Sound source separation and localisation with an active binaural head
-
A. Deleforge and R. P. Horaud, "The cocktail party robot: Sound source separation and localisation with an active binaural head," in IEEE/ACM Int. Conf. on Human Robot Interaction, Boston, Mass, March 2012.
-
IEEE/ACM Int. Conf. on Human Robot Interaction, Boston, Mass, March 2012
-
-
Deleforge, A.1
Horaud, R.P.2
-
17
-
-
57849093600
-
Integrating pitch and localisation cues at a speech fragment level
-
H. Christensen, N. Ma, S. Wrigley, and J. Barker, "Integrating pitch and localisation cues at a speech fragment level," in Proc. of Interspeech, 2007, pp. 2769-2772.
-
Proc. of Interspeech, 2007
, pp. 2769-2772
-
-
Christensen, H.1
Ma, N.2
Wrigley, S.3
Barker, J.4