[1] M. Heckmann, F. Berthommier, and K. Kroschel. Noise adaptive stream weighting in audio-visual speech recognition. EURASIP J. Applied Signal Proc., 11:1260-1273, 2002.
[2] M. Beal, N. Jojic, and H. Attias. A graphical model for audiovisual object tracking. IEEE Trans. PAMI, 25(7):828-836, 2003.
[3] A. Kushal, M. Rahurkar, L. Fei-Fei, J. Ponce, and T. Huang. Audio-visual speaker localization using graphical models. In Proc. 18th ICPR, pages 291-294, 2006.
[5] J. Vermaak, M. Gangnet, A. Blake, and P. Pérez. Sequential Monte Carlo fusion of sound and vision for speaker tracking. In Proc. IEEE ICCV, pages 741-746, 2001.
[6] P. Pérez, J. Vermaak, and A. Blake. Data fusion for visual tracking with particles. Proc. of IEEE, 92(3):495-513, 2004.
[7] Y. Chen and Y. Rui. Real-time speaker tracking using particle filter sensor fusion. Proc. of IEEE, 92(3):485-494, 2004.
[8] K. Nickel, T. Gehrig, R. Stiefelhagen, and J. McDonough. A joint particle filter for audio-visual speaker tracking. In Proc. 7th International Conference on Multimodal Interfaces, pages 61-68, 2005.
[10] N. Checka, K. Wilson, M. Siracusa, and T. Darrell. Multiple person and speaker activity tracking with a particle filter. In IEEE Conf. Acoust. Sp. Sign. Proc., pages 881-884, 2004.
[11] D. Gatica-Perez, G. Lathoud, J.-M. Odobez, and I. McCowan. Audiovisual probabilistic tracking of multiple speakers in meetings. IEEE Trans. on ASLP, 15(2):601-616, 2007.
[13] R. Brunelli, A. Brutti, P. Chippendale, O. Lanz, M. Omologo, P. Svaizer, and F. Tobia. A generative approach to audio-visual person tracking. In Multimodal Technologies for Perception of Humans: Proc. 1st International CLEAR Evaluation Workshop, pages 55-68, 2007.
[14] J. Fisher and T. Darrell. Speaker association with signal-level audiovisual fusion. IEEE Trans. on Multimedia, 6(3):406-413, 2004.
[16] M. Hansard and R.P. Horaud. Patterns of binocular disparity for a fixating observer. In Advances in Brain, Vision, & AI, 2nd Int. Symp., pages 308-317. Springer, 2007.
[17] J.R. Movellan and G. Chadderdon. Channel separability in the audio-visual integration of speech: A Bayesian approach. In D.G. Stork and M.E. Hennecke, editors, Speech Reading by Humans and Machines: Models, Systems and Applications, NATO ASI Series, pages 473-487. Springer, Berlin, 1996.
[18] D.W. Massaro and D.G. Stork. Speech recognition and sensory integration. American Scientist, 86(3):236-244, 1998.
[19] G. Celeux, F. Forbes, and N. Peyrard. EM procedures using mean-field approximations for Markov model-based image segmentation. Pattern Recognition, 36:131-144, 2003.
[20] A.P. Dempster, N.M. Laird, and D.B. Rubin. Maximum likelihood from incomplete data via the EM algorithm (with discussion). J. Roy. Statist. Soc. Ser. B, 39(1):1-38, 1977.
[22] G. Schwarz. Estimating the dimension of a model. The Annals of Statistics, 6(2):461-464, March 1978.
[23] E. Arnaud, H. Christensen, Y.C. Lu, J. Barker, V. Khalidov, M. Hansard, B. Holveck, H. Mathieu, R. Narasimha, F. Forbes, and R. Horaud. The CAVA corpus: Synchronized stereoscopic and binaural datasets with head movements. In Proc. of ICMI 2008, 2008.
[26] H. Christensen, N. Ma, S.N. Wrigley, and J. Barker. Integrating pitch and localisation cues at a speech fragment level. In Proc. of Interspeech 2007, pages 2769-2772, 2007.