-
2
-
-
85006343094
-
Joint object-material category segmentation from audio-visual cues
-
A. Arnab, M. Sapienza, S. Golodetz, J. Valentin, O. Miksik, S. Izadi, and P. H. S. Torr. Joint object-material category segmentation from audio-visual cues. In BMVC, 2015.
-
(2015)
BMVC
-
-
Arnab, A.1
Sapienza, M.2
Golodetz, S.3
Valentin, J.4
Miksik, O.5
Izadi, S.6
Torr, P.H.S.7
-
3
-
-
0037854202
-
The acquisition of physical knowledge in infancy: A summary in eight lessons
-
R. Baillargeon. The acquisition of physical knowledge in infancy: A summary in eight lessons. Blackwell handbook of childhood cognitive development, 1:46-83, 2002.
-
(2002)
Blackwell Handbook of Childhood Cognitive Development
, vol.1
, pp. 46-83
-
-
Baillargeon, R.1
-
4
-
-
84962478162
-
Material recognition in the wild with the materials in context database
-
S. Bell, P. Upchurch, N. Snavely, and K. Bala. Material recognition in the wild with the materials in context database. CoRR, abs/1412.0623, 2014.
-
(2014)
CoRR, Abs
, vol.1412
, pp. 0623
-
-
Bell, S.1
Upchurch, P.2
Snavely, N.3
Bala, K.4
-
5
-
-
27644583688
-
A tutorial on onset detection in music signals
-
J. P. Bello, L. Daudet, S. Abdallah, C. Duxbury, M. Davies, and M. B. Sandler. A tutorial on onset detection in music signals. Speech and Audio Processing, IEEE Transactions on, 13(5):1035-1047, 2005.
-
(2005)
Speech and Audio Processing, IEEE Transactions on
, vol.13
, Issue.5
, pp. 1035-1047
-
-
Bello, J.P.1
Daudet, L.2
Abdallah, S.3
Duxbury, C.4
Davies, M.5
Sandler, M.B.6
-
6
-
-
84986273500
-
Were those coconuts or horse hoofs? Visual context effects on identification and perceived veracity of everyday sounds
-
T. Bonebright. Were those coconuts or horse hoofs? visual context effects on identification and perceived veracity of everyday sounds. In International Conference on Auditory Display, 2012.
-
(2012)
International Conference on Auditory Display
-
-
Bonebright, T.1
-
8
-
-
84959239475
-
Visual vibrometry: Estimating material properties from small motion in video
-
A. Davis, K. L. Bouman, M. Rubinstein, F. Durand, and W. T. Freeman. Visual vibrometry: Estimating material properties from small motion in video. In CVPR, 2015.
-
(2015)
CVPR
-
-
Davis, A.1
Bouman, K.L.2
Rubinstein, M.3
Durand, F.4
Freeman, W.T.5
-
9
-
-
84905749657
-
The visual microphone: Passive recovery of sound from video
-
A. Davis, M. Rubinstein, N. Wadhwa, G. J. Mysore, F. Durand, and W. T. Freeman. The visual microphone: passive recovery of sound from video. ACM Transactions on Graphics (TOG), 2014.
-
(2014)
ACM Transactions on Graphics (TOG)
-
-
Davis, A.1
Rubinstein, M.2
Wadhwa, N.3
Mysore, G.J.4
Durand, F.5
Freeman, W.T.6
-
10
-
-
85198028989
-
Imagenet: A large-scale hierarchical image database
-
J. Deng, W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. Imagenet: A large-scale hierarchical image database. In CVPR, 2009.
-
(2009)
CVPR
-
-
Deng, J.1
Dong, W.2
Socher, R.3
Li, L.-J.4
Li, K.5
Fei-Fei, L.6
-
11
-
-
84973916088
-
Unsupervised visual representation learning by context prediction
-
C. Doersch, A. Gupta, and A. A. Efros. Unsupervised visual representation learning by context prediction. ICCV, 2015.
-
(2015)
ICCV
-
-
Doersch, C.1
Gupta, A.2
Efros, A.A.3
-
12
-
-
84959236502
-
Long-term recurrent convolutional networks for visual recognition and description
-
J. Donahue, L. A. Hendricks, S. Guadarrama, M. Rohrbach, S. Venugopalan, K. Saenko, and T. Darrell. Long-term recurrent convolutional networks for visual recognition and description. CVPR, 2015.
-
(2015)
CVPR
-
-
Donahue, J.1
Hendricks, L.A.2
Guadarrama, S.3
Rohrbach, M.4
Venugopalan, S.5
Saenko, K.6
Darrell, T.7
-
13
-
-
0016421071
-
The estimation of the gradient of a density function, with applications in pattern recognition
-
K. Fukunaga and L. D. Hostetler. The estimation of the gradient of a density function, with applications in pattern recognition. Information Theory, IEEE Transactions on, 21(1):32-40, 1975.
-
(1975)
Information Theory, IEEE Transactions on
, vol.21
, Issue.1
, pp. 32-40
-
-
Fukunaga, K.1
Hostetler, L.D.2
-
14
-
-
84959948834
-
What in the world do we hear?: An ecological approach to auditory event perception
-
W.W. Gaver. What in the world do we hear?: An ecological approach to auditory event perception. Ecological psychology, 1993.
-
(1993)
Ecological Psychology
-
-
Gaver, W.W.1
-
15
-
-
84911468319
-
Learning haptic representation for manipulating deformable food objects
-
M. Gemici and A. Saxena. Learning haptic representation for manipulating deformable food objects. In IROS, 2014.
-
(2014)
IROS
-
-
Gemici, M.1
Saxena, A.2
-
16
-
-
0025110885
-
Derivation of auditory filter shapes from notched-noise data
-
B. R. Glasberg and B. C. Moore. Derivation of auditory filter shapes from notched-noise data. Hearing research, 47(1):103-138, 1990.
-
(1990)
Hearing Research
, vol.47
, Issue.1
, pp. 103-138
-
-
Glasberg, B.R.1
Moore, B.C.2
-
17
-
-
85070926206
-
-
arXiv preprint arXiv 1504 02518
-
R. Goroshin, J. Bruna, J. Tompson, D. Eigen, and Y. LeCun. Unsupervised feature learning from temporal data. arXiv preprint arXiv:1504.02518, 2015.
-
(2015)
Unsupervised Feature Learning from Temporal Data
-
-
Goroshin, R.1
Bruna, J.2
Tompson, J.3
Eigen, D.4
LeCun, Y.5
-
19
-
-
84862572299
-
Spatial pattern of bold fmri activation reveals cross-modal information in auditory cortex
-
P.-J. Hsieh, J. T. Colas, and N. Kanwisher. Spatial pattern of bold fmri activation reveals cross-modal information in auditory cortex. Journal of neurophysiology, 2012.
-
(2012)
Journal of Neurophysiology
-
-
Hsieh, P.-J.1
Colas, J.T.2
Kanwisher, N.3
-
20
-
-
0742307391
-
Speech enhancement based on wavelet thresholding the multitaper spectrum
-
Y. Hu and P. C. Loizou. Speech enhancement based on wavelet thresholding the multitaper spectrum. Speech and Audio Processing, IEEE Transactions on, 12(1):59-67, 2004.
-
(2004)
Speech and Audio Processing, IEEE Transactions on
, vol.12
, Issue.1
, pp. 59-67
-
-
Hu, Y.1
Loizou, P.C.2
-
22
-
-
84973897623
-
Learning image representations tied to ego-motion
-
D. Jayaraman and K. Grauman. Learning image representations tied to ego-motion. In ICCV, December 2015.
-
(2015)
ICCV, December
-
-
Jayaraman, D.1
Grauman, K.2
-
23
-
-
84973872595
-
3d convolutional neural networks for human action recognition
-
S. Ji, W. Xu, M. Yang, and K. Yu. 3d convolutional neural networks for human action recognition. IEEE TPAMI, 2013.
-
(2013)
IEEE TPAMI
-
-
Ji, S.1
Xu, W.2
Yang, M.3
Yu, K.4
-
24
-
-
84913580146
-
Caffe: Convolutional architecture for fast feature embedding
-
ACM
-
Y. Jia, E. Shelhamer, J. Donahue, S. Karayev, J. Long, R. Girshick, S. Guadarrama, and T. Darrell. Caffe: Convolutional architecture for fast feature embedding. In Proceedings of the ACM International Conference on Multimedia, pages 675-678. ACM, 2014.
-
(2014)
Proceedings of the ACM International Conference on Multimedia
, pp. 675-678
-
-
Jia, Y.1
Shelhamer, E.2
Donahue, J.3
Karayev, S.4
Long, J.5
Girshick, R.6
Guadarrama, S.7
Darrell, T.8
-
26
-
-
84911364368
-
Large-scale video classification with convolutional neural networks
-
A. Karpathy, G. Toderici, S. Shetty, T. Leung, R. Sukthankar, and L. Fei-Fei. Large-scale video classification with convolutional neural networks. In CVPR, 2014.
-
(2014)
CVPR
-
-
Karpathy, A.1
Toderici, G.2
Shetty, S.3
Leung, T.4
Sukthankar, R.5
Fei-Fei, L.6
-
28
-
-
85032750981
-
Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends
-
Z.-H. Ling, S.-Y. Kang, H. Zen, A. Senior, M. Schuster, X.-J. Qian, H. M. Meng, and L. Deng. Deep learning for acoustic modeling in parametric speech generation: A systematic review of existing techniques and future trends. IEEE Signal Processing Magazine, 2015.
-
(2015)
IEEE Signal Processing Magazine
-
-
Ling, Z.-H.1
Kang, S.-Y.2
Zen, H.3
Senior, A.4
Schuster, M.5
Qian, X.-J.6
Meng, H.M.7
Deng, L.8
-
29
-
-
68149175636
-
Human sound source identification
-
Springer
-
R. A. Lutfi. Human sound source identification. In Auditory perception of sound sources, pages 13-42. Springer, 2008.
-
(2008)
Auditory Perception of Sound Sources
, pp. 13-42
-
-
Lutfi, R.A.1
-
30
-
-
80052406394
-
Sound texture perception via statistics of the auditory periphery: Evidence from sound synthesis
-
J. H. McDermott and E. P. Simoncelli. Sound texture perception via statistics of the auditory periphery: evidence from sound synthesis. Neuron, 71(5):926-940, 2011.
-
(2011)
Neuron
, vol.71
, Issue.5
, pp. 926-940
-
-
McDermott, J.H.1
Simoncelli, E.P.2
-
31
-
-
71149084945
-
Deep learning from temporal coherence in video
-
H. Mobahi, R. Collobert, and J. Weston. Deep learning from temporal coherence in video. In ICML, 2009.
-
(2009)
ICML
-
-
Mobahi, H.1
Collobert, R.2
Weston, J.3
-
32
-
-
80053437179
-
Multimodal deep learning
-
J. Ngiam, A. Khosla, M. Kim, J. Nam, H. Lee, and A. Y. Ng. Multimodal deep learning. In ICML, 2011.
-
(2011)
ICML
-
-
Ngiam, J.1
Khosla, A.2
Kim, M.3
Nam, J.4
Lee, H.5
Ng, A.Y.6
-
34
-
-
84862867519
-
The origins of inquiry: Inductive inference and exploration in early childhood
-
L. Schulz. The origins of inquiry: Inductive inference and exploration in early childhood. Trends in cognitive sciences, 16(7):382-389, 2012.
-
(2012)
Trends in Cognitive Sciences
, vol.16
, Issue.7
, pp. 382-389
-
-
Schulz, L.1
-
36
-
-
84878726069
-
Recognizing materials using perceptually inspired features
-
L. Sharan, C. Liu, R. Rosenholtz, and E. H. Adelson. Recognizing materials using perceptually inspired features. International journal of computer vision, 103(3):348-371, 2013.
-
(2013)
International Journal of Computer Vision
, vol.103
, Issue.3
, pp. 348-371
-
-
Sharan, L.1
Liu, C.2
Rosenholtz, R.3
Adelson, E.H.4
-
39
-
-
70350383283
-
Interactive learning of the acoustic properties of household objects
-
J. Sinapov, M. Wiemer, and A. Stoytchev. Interactive learning of the acoustic properties of household objects. In ICRA, 2009.
-
(2009)
ICRA
-
-
Sinapov, J.1
Wiemer, M.2
Stoytchev, A.3
-
40
-
-
85153941343
-
Pattern playback in the 90s
-
M. Slaney. Pattern playback in the 90s. In NIPS, pages 827-834, 1994.
-
(1994)
NIPS
, pp. 827-834
-
-
Slaney, M.1
-
41
-
-
15444371960
-
The development of embodied cognition: Six lessons from babies
-
L. Smith and M. Gasser. The development of embodied cognition: Six lessons from babies. Artificial life, 11(1-2):13-29, 2005.
-
(2005)
Artificial Life
, vol.11
, Issue.1-2
, pp. 13-29
-
-
Smith, L.1
Gasser, M.2
-
42
-
-
84904163933
-
Dropout: A simple way to prevent neural networks from overfitting
-
N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, and R. Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. The Journal of Machine Learning Research, 15(1):1929-1958, 2014.
-
(2014)
The Journal of Machine Learning Research
, vol.15
, Issue.1
, pp. 1929-1958
-
-
Srivastava, N.1
Hinton, G.2
Krizhevsky, A.3
Sutskever, I.4
Salakhutdinov, R.5
-
45
-
-
84973889989
-
Unsupervised learning of visual representations using videos
-
X. Wang and A. Gupta. Unsupervised learning of visual representations using videos. In ICCV, 2015.
-
(2015)
ICCV
-
-
Wang, X.1
Gupta, A.2
-
46
-
-
84937964578
-
Learning deep features for scene recognition using places database
-
B. Zhou, A. Lapedriza, J. Xiao, A. Torralba, and A. Oliva. Learning deep features for scene recognition using places database. In NIPS, 2014.
-
(2014)
NIPS
-
-
Zhou, B.1
Lapedriza, A.2
Xiao, J.3
Torralba, A.4
Oliva, A.5
|