[1] J. Hershey, J. Movellan, Audio-vision: Using audio-visual synchrony to locate sounds, in: Proceedings of NIPS, vol. 12, 1999.
[2] M. Slaney, M. Covell, FaceSync: a linear operator for measuring synchronization of video facial images and audio tracks, in: Proceedings of NIPS, vol. 13, 2000.
[3] H.J. Nock, G. Iyengar, C. Neti, Speaker localisation using audio-visual synchrony: an empirical study, in: Proceedings of the 10th ACM International Conference on Multimedia, 2002.
[4] J.W. Fisher III, T. Darrell, W.T. Freeman, P. Viola, Learning joint statistical models for audio-visual fusion and segregation, in: Proceedings of NIPS, vol. 13, 2000.
[5] J.W. Fisher III, T. Darrell, Speaker association with signal-level audiovisual fusion, IEEE Trans. Multimedia 6 (3) (2004) 406-413.
[6] T. Butz, J.-P. Thiran, From error probability to information theoretic (multi-modal) signal processing, Signal Processing 85 (5) (2005) 875-902.
[7] P. Besson, M. Kunt, T. Butz, J.-P. Thiran, A multimodal approach to extract optimized audio features for speaker detection, in: Proceedings of EUSIPCO, 2005.
[8] P. Smaragdis, M. Casey, Audio/visual independent components, in: Proceedings of ICA, 2003, pp. 709-714.
[9] Y. Wang, J. Ostermann, Y.-Q. Zhang, Digital Video Processing and Communications, Prentice-Hall, Englewood Cliffs, NJ, 2001.
[10] I. Daubechies, Time-frequency localization operators: a geometric phase space approach, IEEE Trans. Inform. Theory 34 (4) (1988) 605-612.
[11] S. Mallat, Z. Zhang, Matching pursuits with time-frequency dictionaries, IEEE Trans. Signal Process. 41 (12) (1993) 3397-3415.
[13] P.J. Burt, E.H. Adelson, The Laplacian pyramid as a compact image code, IEEE Trans. Comm. 31 (4) (1983) 532-540.
[14] L. Peotta, L. Granai, P. Vandergheynst, Very low bit rate image coding using redundant dictionaries, in: Proceedings of the SPIE, Wavelets: Applications in Signal and Image Processing X, vol. 5207, 2003, pp. 228-239.
[15] O. Divorra Escoda, Toward sparse and geometry adapted video approximations, Ph.D. Thesis, EPFL, Lausanne. Available: 〈http://lts2www.epfl.ch/〉, June 2005 (online).
[16] O. Divorra Escoda, P. Vandergheynst, A Bayesian approach to video expansions on parametric over-complete 2-D dictionaries, in: Proceedings of IEEE MMSP, 2004, pp. 490-493.
[18] R. Gribonval, E. Bacry, S. Mallat, P. Depalle, X. Rodet, Analysis of sound signals with high resolution matching pursuit, in: Proceedings of IEEE TFTS, 1996, pp. 125-128.
[20] G. Monaci, O. Divorra Escoda, P. Vandergheynst, Analysis of multimodal signals using redundant representations, in: Proceedings of IEEE ICIP, 2005.
[22] E.K. Patterson, S. Gurbuz, Z. Tufekci, J.N. Gowdy, Moving-talker, speaker-independent feature study, and baseline results using the CUAVE multimodal speech corpus, J. Appl. Signal Process. 11 (2002) 1189-1201.
[23] R. Gribonval, E. Bacry, J. Abadia, Matching pursuit software and documentation, 〈http://www.cmap.polytechnique.fr/bacry/LastWave/packages/mp/mp.html〉.
[24] G. Monaci, Multimodal web page, 〈http://lts2www.epfl.ch/monaci/multimodal.html〉.
[25] P. Jost, P. Vandergheynst, P. Frossard, Tree-based pursuit: algorithm and properties, EPFL-ITS Technical Report 2005.13, Lausanne. Available: 〈http://lts2www.epfl.ch/〉, May 2005 (online).