-
1
-
-
0011812771
-
Kernel independent component analysis
-
F. Bach and M. Jordan. 2002, "Kernel independent component analysis," J. of Mach. Learning Res. 3, pp. 1-48.
-
(2002)
J. of Mach. Learning Res.
, vol.3
, pp. 1-48
-
-
Bach, F.1
Jordan, M.2
-
2
-
-
0042349407
-
A graphical model for audiovisual object tracking
-
M. J. Beal, N. Jojic, and H. Attias, 2003, "A graphical model for audiovisual object tracking," IEEE Tran. on PAMI, 25, pp. 828-836.
-
(2003)
IEEE Tran. on PAMI
, vol.25
, pp. 828-836
-
-
Beal, M.J.1
Jojic, N.2
Attias, H.3
-
3
-
-
85013597845
-
Eigenlips for robust speech recognition
-
C. Bregler, and Y. Konig, 1994, "Eigenlips for robust speech recognition," In Proc. IEEE ICASSP, vol. 2, pp. 667-672.
-
(1994)
Proc. IEEE ICASSP
, vol.2
, pp. 667-672
-
-
Bregler, C.1
Konig, Y.2
-
4
-
-
0034507915
-
Look who's talking: Speaker detection using video and audio correlation
-
R. Cutler, and L. Davis, 2000, "Look who's talking: speaker detection using video and audio correlation," Proc. IEEE ICME, vol. 3, pp. 1589-1592.
-
(2000)
Proc. IEEE ICME
, vol.3
, pp. 1589-1592
-
-
Cutler, R.1
Davis, L.2
-
5
-
-
24644433110
-
On the regularization of canonical correlation analysis
-
T. De Bie, and B. De Moor, 2003, "On the regularization of canonical correlation analysis," Int. Sympos. ICA and BSS, pp. 785-790.
-
(2003)
Int. Sympos. ICA and BSS
, pp. 785-790
-
-
De Bie, T.1
De Moor, B.2
-
6
-
-
24644433991
-
Audio-visual speech enhancement with AVCDCN (audio-visual codebock dependent cepstral normalization)
-
S. Deligne, G. Potamianos, and C. Neti, 2002, 'Audio-visual speech enhancement with AVCDCN (audio-visual codebock dependent cepstral normalization)," IEEE Work-shop on Sensor Array and Multichannel Signal Processing., pp. 68-71.
-
(2002)
IEEE Work-shop on Sensor Array and Multichannel Signal Processing
, pp. 68-71
-
-
Deligne, S.1
Potamianos, G.2
Neti, C.3
-
8
-
-
0037745171
-
Can recent innovations in harmonic analysis explain key findings in natural image statistics?
-
D. L. Donoho, and A. G. Flesia, 2001, "Can recent innovations in harmonic analysis explain key findings in natural image statistics?," Network: Comput. Neural. Syst., 12, pp. 371-393.
-
(2001)
Network: Comput. Neural. Syst.
, vol.12
, pp. 371-393
-
-
Donoho, D.L.1
Flesia, A.G.2
-
9
-
-
0029935458
-
Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading
-
J. Driver, 1996, "Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading," Nature 381, pp. 66-68.
-
(1996)
Nature
, vol.381
, pp. 66-68
-
-
Driver, J.1
-
10
-
-
24644451695
-
A probabilistic study of the average performance of the basis pursuit
-
submitted to the
-
M. Elad, and M. Zibulevsky, 2004, "A probabilistic study of the average performance of the basis pursuit", submitted to the IEEE Trans. on IT.
-
(2004)
IEEE Trans. on IT
-
-
Elad, M.1
Zibulevsky, M.2
-
11
-
-
24644498666
-
A unified framework for bases, frames, subspace bases, and subspace frames
-
G. Farnebäck, 1999, "A unified framework for bases, frames, subspace bases, and subspace frames", Proc. Scand. Conf. Image Analysis pp. 341-349.
-
(1999)
Proc. Scand. Conf. Image Analysis
, pp. 341-349
-
-
Farnebäck, G.1
-
12
-
-
0030879469
-
An anatomical basis for visual calibration of the auditory space map in the barn owl's midbrain
-
D. E. Feldman, and E. I. Knudsen, 1996, "An anatomical basis for visual calibration of the auditory space map in the barn owl's midbrain," The J. Neuroscience 17 pp. 6820-6837.
-
(1996)
The J. Neuroscience
, vol.17
, pp. 6820-6837
-
-
Feldman, D.E.1
Knudsen, E.I.2
-
13
-
-
2642562769
-
Speaker association with signal-level audiovisual fusion
-
J. W. Fisher III, and T. Darrell, 2004, "Speaker association with signal-level audiovisual fusion," IEEE Trans. Multimedia 6, pp. 406-413.
-
(2004)
IEEE Trans. Multimedia
, vol.6
, pp. 406-413
-
-
Fisher III, J.W.1
Darrell, T.2
-
14
-
-
84898954418
-
Learning joint statistical models for audio-visual fusion and Segregation
-
J. W. Fisher III, T. Darrell, W. Freeman, and P. Viola, 2001, "Learning joint statistical models for audio-visual fusion and Segregation," Advanced in Neural Inf. Process. Syst. 13, pp. 772-778.
-
(2001)
Advanced in Neural Inf. Process. Syst.
, vol.13
, pp. 772-778
-
-
Fisher III, J.W.1
Darrell, T.2
Freeman, W.3
Viola, P.4
-
15
-
-
0347968052
-
Sparse representations in unions of bases
-
R. Gribonval, and M. Nielsen, 2003, "Sparse representations in unions of bases," IEEE Trans. IT 49, pp. 3320-3325.
-
(2003)
IEEE Trans. IT
, vol.49
, pp. 3320-3325
-
-
Gribonval, R.1
Nielsen, M.2
-
16
-
-
0037199954
-
Gated visual input to the central auditory system
-
Y. Gutfreund, W. Zheng, and E. I. Knudsen, 2002, "Gated visual input to the central auditory system," Science 297, pp. 1556-1559.
-
(2002)
Science
, vol.297
, pp. 1556-1559
-
-
Gutfreund, Y.1
Zheng, W.2
Knudsen, E.I.3
-
17
-
-
84899028297
-
Audio-vision: Using audio-visual synchrony to locate sound
-
J. Hershey, and J. Movellan, 1999, "Audio-vision: using audio-visual synchrony to locate sound," Advances in Neural Inf. Process. Syst. 12, pp. 813-819.
-
(1999)
Advances in Neural Inf. Process. Syst.
, vol.12
, pp. 813-819
-
-
Hershey, J.1
Movellan, J.2
-
18
-
-
24644517212
-
Pixels that sound
-
Dep. of Electrical Engineering, Technion
-
E. Kidron, Y. Y. Schechner, and M. Elad, 2005, "Pixels that sound," Tech. Rep. CCIT TR-524, Dep. of Electrical Engineering, Technion.
-
(2005)
Tech. Rep.
, vol.CCIT TR-524
-
-
Kidron, E.1
Schechner, Y.Y.2
Elad, M.3
-
19
-
-
34147133605
-
Learning canonical correlations
-
Computer Vision Laboratory, S-581 83 Linköping Univ., Sweden
-
H. Knutsson, M. Borga, and T. Landelius, 1995, "Learning canonical correlations," Tech. Rep. LiTH-ISY-R-1761, Computer Vision Laboratory, S-581 83 Linköping Univ., Sweden.
-
(1995)
Tech. Rep.
, vol.LITH-ISY-R-1761
-
-
Knutsson, H.1
Borga, M.2
Landelius, T.3
-
20
-
-
2342451199
-
Multimedia content processing through cross-modal association
-
D. Li, N. Dimitrova, M. Li, and I. K. Sethi, 2003, "Multimedia content processing through cross-modal association," Proc. ACM Int. Conf. Multimedia, pp. 604-611.
-
(2003)
Proc. ACM Int. Conf. Multimedia
, pp. 604-611
-
-
Li, D.1
Dimitrova, N.2
Li, M.3
Sethi, I.K.4
-
21
-
-
0038648412
-
Appearance models based on kernel canonical correlation analysis
-
T. Melzer, M. Reiter, and H. Bischof, 2003, "Appearance models based on kernel canonical correlation analysis," Patt. Rec. 36, pp. 1961-1971.
-
(2003)
Patt. Rec.
, vol.36
, pp. 1961-1971
-
-
Melzer, T.1
Reiter, M.2
Bischof, H.3
-
22
-
-
7444264756
-
Conducting audio files via computer vision
-
D. Murphy, T. H. Andersen, and K. Jensen, 2004, "Conducting audio files via computer vision," Lecture Notes in Computer Science, 2915, pp. 529-540
-
(2004)
Lecture Notes in Computer Science
, vol.2915
, pp. 529-540
-
-
Murphy, D.1
Andersen, T.H.2
Jensen, K.3
-
23
-
-
0037700834
-
Assessing face and speech consistency for monologue detection in video
-
H. J. Nock, G. Iyengar, and C. Neti, 2002, "Assessing face and speech consistency for monologue detection in video," Proc. ACM Int. Conf. Multimedia, pp. 303-306.
-
(2002)
Proc. ACM Int. Conf. Multimedia
, pp. 303-306
-
-
Nock, H.J.1
Iyengar, G.2
Neti, C.3
-
24
-
-
24644501841
-
A computational model of early auditory-visual integration
-
Proc. Patt. Rec. Sympos.
-
C. Schauer, and H. M. Gross, 2003, "A computational model of early auditory-visual integration," Proc. Patt. Rec. Sympos., Lecture Notes in Computer Science 2781 pp. 362-369.
-
(2003)
Lecture Notes in Computer Science
, vol.2781
, pp. 362-369
-
-
Schauer, C.1
Gross, H.M.2
-
25
-
-
2642557514
-
FaceSync: A linear operator for measuring synchronization of video facial images and audio tracks
-
M. Slaney, and M. Covell, 2000, "FaceSync: a linear operator for measuring synchronization of video facial images and audio tracks," Advanc. in Neural Inf. Process. Syst. 13, pp. 814-820.
-
(2000)
Advanc. in Neural Inf. Process. Syst.
, vol.13
, pp. 814-820
-
-
Slaney, M.1
Covell, M.2
-
26
-
-
5044226917
-
Audio-visual based emotion recognition-a new approach
-
M. Song, J. Bu, C. Chen, and N. Li, 2004, "Audio-visual based emotion recognition-a new approach," Proc. IEEE CVPR, vol. 2, pp. 1020-1025.
-
(2004)
Proc. IEEE CVPR
, vol.2
, pp. 1020-1025
-
-
Song, M.1
Bu, J.2
Chen, C.3
Li, N.4
-
28
-
-
0034844366
-
Sequential Monte Carlo fusion of sound and vision for speaker tracking
-
J. Vermaak, M. Gangnet, A. Blake, and P. Perez, 2001, "Sequential Monte Carlo fusion of sound and vision for speaker tracking," Proc. IEEE ICCV, vol. 1, pp. 741-746.
-
(2001)
Proc. IEEE ICCV
, vol.1
, pp. 741-746
-
-
Vermaak, J.1
Gangnet, M.2
Blake, A.3
Perez, P.4
-
29
-
-
4644322072
-
Learning over Sets using Kernel Principal Angles
-
E. Wolf, A. Shashua, 2003, "Learning over Sets using Kernel Principal Angles," J. of Mach. Learning Res. 4, pp. 913-931.
-
(2003)
J. of Mach. Learning Res.
, vol.4
, pp. 913-931
-
-
Wolf, E.1
Shashua, A.2
|