-
1
-
-
33646697511
-
Representation analysis and synthesis of lip images using dimensionality reduction
-
Aharon M., Kimmel R.: Representation analysis and synthesis of lip images using dimensionality reduction. IJCV 67:297-312, 2006.
-
(2006)
IJCV
, vol.67
, pp. 297-312
-
-
Aharon, M.1
Kimmel, R.2
-
6
-
-
84866701010
-
Blind audio-visual source separation using sparse representations
-
Casanovas A. L., Monaci G., Vandergheynst P.: Blind audio-visual source separation using sparse representations. IEEE ICIP 2007.
-
(2007)
IEEE ICIP
-
-
Casanovas, A.L.1
Monaci, G.2
Vandergheynst, P.3
-
7
-
-
84863054350
-
Multi-view 3D reconstruction for scenes under the refractive plane with known vertical direction
-
Chang Y. J., Chen T: Multi-View 3D Reconstruction for Scenes under the Refractive Plane with Known Vertical Direction. In Proc. ICCV, 2011.
-
(2011)
Proc. ICCV
-
-
Chang, Y.J.1
Chen, T.2
-
8
-
-
4544386970
-
Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection
-
Choudhury T., Rehg J., Pavlovic V., Pentland A.: Boosting and structure learning in dynamic bayesian networks for audio-visual speaker detection. In Proc. ICPR pp. 789-794, 2002.
-
(2002)
Proc. ICPR
, pp. 789-794
-
-
Choudhury, T.1
Rehg, J.2
Pavlovic, V.3
Pentland, A.4
-
9
-
-
0035500783
-
Speech enhancement for non-stationary noise environments
-
Cohen I., Berdugo B.: Speech enhancement for non-stationary noise environments. Signal Processing 81:2403-2418, 2001.
-
(2001)
Signal Processing
, vol.81
, pp. 2403-2418
-
-
Cohen, I.1
Berdugo, B.2
-
10
-
-
4344675264
-
Region filling and object removal by exemplar-based image inpainting
-
Criminisi A., Perez P., Toyama K.: Region filling and object removal by exemplar-based image inpainting. IEEE Trans. IP 13:1200-1212, 2004.
-
(2004)
IEEE Trans. IP
, vol.13
, pp. 1200-1212
-
-
Criminisi, A.1
Perez, P.2
Toyama, K.3
-
11
-
-
24644433991
-
Audio-visual speech enhancement with AVCDCN (audio-visual codebock dependent cepstral normalization
-
Deligne S., Potamianos G., Neti C.: Audio-visual speech enhancement with AVCDCN (audio-visual codebock dependent cepstral normalization). IEEEWorksh. Sensor Array & Multichannel SP, 68-71, 2002.
-
(2002)
IEEE Worksh. Sensor Array & Multichannel SP
, pp. 68-71
-
-
Deligne, S.1
Potamianos, G.2
Neti, C.3
-
12
-
-
0029935458
-
Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading
-
Driver J.: Enhancement of selective listening by illusory mislocation of speech sounds due to lip-reading. Nature 381:66-68, 1996.
-
(1996)
Nature
, vol.381
, pp. 66-68
-
-
Driver, J.1
-
13
-
-
0034270644
-
Audio-visual speech modeling for continuous speech recognition
-
Dupont S., Luettin J.:Audio-visual speech modeling for continuous speech recognition. IEEE Trans. Multimedia 2:141-151, 2000.
-
(2000)
IEEE Trans. Multimedia
, vol.2
, pp. 141-151
-
-
Dupont, S.1
Luettin, J.2
-
15
-
-
47749117221
-
Example-based regularization deployed to super-resolution reconstruction of a single image
-
Elad M., Datsenko D.: Example-based regularization deployed to super-resolution reconstruction of a single image. The Computer Journal, 50:1-16, 2007.
-
(2007)
The Computer Journal
, vol.50
, pp. 1-16
-
-
Elad, M.1
Datsenko, D.2
-
17
-
-
84898954418
-
Learning joint statistical models for audio-visual fusion and Segregation
-
Fisher III J.W., Darrell T., FreemanW. T., Viola P.: Learning joint statistical models for audio-visual fusion and Segregation. in Proc. NIPS 13, 772-778, 2001.
-
(2001)
Proc NIPS
, vol.13
, pp. 772-778
-
-
Fisher Iii, J.W.1
Darrell, T.2
Freeman, W.T.3
Viola, P.4
-
19
-
-
0037199954
-
Gated visual input to the central auditory system
-
Gutfreund Y., Zheng W., Knudsen E. I.: Gated visual input to the central auditory system. Science 297:1556-1559, 2002.
-
(2002)
Science
, vol.297
, pp. 1556-1559
-
-
Gutfreund, Y.1
Zheng, W.2
Knudsen, E.I.3
-
21
-
-
34948876301
-
Audio-visual sound separation via hidden markov models
-
Hershey J., Casey M.: Audio-visual sound separation via hidden markov models. in Proc. NIPS pp. 1173-1180, 2001.
-
(2001)
Proc NIPS
, pp. 1173-1180
-
-
Hershey, J.1
Casey, M.2
-
22
-
-
84863643959
-
Digital Video Stabilization and Rolling Shutter Correction using Gyroscopes
-
Karpenko A., Jacobs D. E.,Baek J., Levoy .M,: Digital Video Stabilization and Rolling Shutter Correction using Gyroscopes. Stanford CSTR pp. 2011-03, 2011.
-
(2011)
Stanford CSTR
, pp. 2011-03
-
-
Karpenko, A.1
Jacobs, D.E.2
Baek, J.3
Levoy, M.4
-
23
-
-
33745127045
-
Computer vision for music identification
-
Ke Y., Hoiem D., Sukthankar R.: Computer vision for music identification. Proc. IEEE CVPR pp. 597- 604, 2005
-
(2005)
Proc IEEE CVPR
, pp. 597-604
-
-
Ke, Y.1
Hoiem, D.2
Sukthankar, R.3
-
24
-
-
84866701011
-
Audio-Visual clustering for 3D speaker localization
-
Khalidov V., Forbes F., Hansard M., Arnaud E., Horaud R.: Audio-Visual clustering for 3D speaker localization. Proc. MLMI Workshop, 2008.
-
(2008)
Proc. MLMI Workshop
-
-
Khalidov, V.1
Forbes, F.2
Hansard, M.3
Arnaud, E.4
Horaud, R.5
-
26
-
-
84866679630
-
Visual localization of non-stationary sound sources
-
Liu Y., Sato Y.: Visual localization of non-stationary sound sources. In Proc. Multimedia, 2009.
-
(2009)
Proc. Multimedia
-
-
Liu, Y.1
Sato, Y.2
-
27
-
-
84866679635
-
Speechreading using Shape and Intensity Information
-
Luettin J., Thacker N. A., Beet S. W.: Speechreading using Shape and Intensity Information. In ISCA, 1996.
-
(1996)
ISCA
-
-
Luettin, J.1
Thacker, N.A.2
Beet, S.W.3
-
28
-
-
85135379452
-
An efficient algorithm to estimate the instantaneous SNR of speech signals
-
Martin R.: An efficient algorithm to estimate the instantaneous SNR of speech signals, Proc. EUROSPEECH:1093-1096, 1993.
-
(1993)
Proc. EUROSPEECH
, pp. 1093-1096
-
-
Martin, R.1
-
30
-
-
34948889993
-
Microphone arrays as generalized cameras for integrated audio visual processing
-
O'Donovan A., Duraiswami R., Neumann J.: Microphone arrays as generalized cameras for integrated audio visual processing. Proc. IEEE CVPR pp. :1-8, 2007.
-
(2007)
Proc IEEE CVPR
, pp. 1-8
-
-
O'donovan, A.1
Duraiswami, R.2
Neumann, J.3
-
31
-
-
4544290191
-
Recent advances in the automatic recognition of audio-visual speech
-
Potamianos G., Neti C., Gravier G., Garg A., Senior A.: Recent advances in the automatic recognition of audio-visual speech. Proc. IEEE, 91:1306-1326, 2003.
-
(2003)
Proc IEEE
, vol.91
, pp. 1306-1326
-
-
Potamianos, G.1
Neti, C.2
Gravier, G.3
Garg, A.4
Senior, A.5
-
33
-
-
33745936208
-
Separating transparent layers of repetitive dynamic behaviors
-
Sarel B., Irani M.: Separating transparent layers of repetitive dynamic behaviors. Proc. IEEE ICCV pp. 26-32, 2005.
-
(2005)
Proc IEEE ICCV
, pp. 26-32
-
-
Sarel, B.1
Irani, M.2
-
34
-
-
44949110218
-
Single-channel speech separation using sparse non-negative matrix factorization
-
Schmidt M. N., Olsson R. K.: Single-channel speech separation using sparse non-negative matrix factorization. Conf. Spoken Language Processing, 2006.
-
(2006)
Conf. Spoken Language Processing
-
-
Schmidt, M.N.1
Olsson, R.K.2
-
36
-
-
84858719009
-
A sparse non-parameteric approach for single channel separation of known sounds
-
Smaragdis P., Shashanka R. and Raj B.: A Sparse Non-Parameteric Approach for Single Channel Separation of Known Sounds. In Proc. NIPS pp. 1705-1713, 2009.
-
(2009)
Proc. NIPS
, pp. 1705-1713
-
-
Smaragdis, P.1
Shashanka, R.2
Raj, B.3
-
37
-
-
0032762471
-
A statistical model-based voice activitydetector
-
Sohn J., Kim N.S, Sung W.: A statistical model-based voice activitydetector. IEEE SP Lett 6:1-3, 1999.
-
(1999)
IEEE SP Lett
, vol.6
, pp. 1-3
-
-
Sohn, J.1
Kim, N.S.2
Sung, W.3
-
38
-
-
5044226917
-
Audio-visual based emotion recognition - A new approach
-
Song M., Bu J., Chen C., Li N.: Audio-visual based emotion recognition-a new approach. Proc. IEEE CVPR 2004.
-
(2004)
Proc IEEE CVPR
-
-
Song, M.1
Bu, J.2
Chen, C.3
Li, N.4
-
39
-
-
0033693215
-
Quantile based noise estimation for spectral subtraction and Wiener filtering
-
Stahl V., Fischer A., Bippus R.: Quantile based noise estimation for spectral subtraction and Wiener filtering. Proc. ICASSP pp. 1875-1878, 2000.
-
(2000)
Proc. ICASSP
, pp. 1875-1878
-
-
Stahl, V.1
Fischer, A.2
Bippus, R.3
-
40
-
-
34047223614
-
Audio segmentation and speaker localization in meeting videos
-
Vajaria H., Islam T., Sarkar S., Sankar R., Kasturi R.: Audio segmentation and speaker localization in meeting videos. In ICPR pp. 1150-1153, 2006.
-
(2006)
ICPR
, pp. 1150-1153
-
-
Vajaria, H.1
Islam, T.2
Sarkar, S.3
Sankar, R.4
Kasturi, R.5
|