SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 9908 LNCS, Issue , 2016, Pages 269-285

“What happens if…” learning to predict the effect of forces in images

(4) Mottaghi, Roozbeh a Rastegari, Mohammad a Gupta, Abhinav a,b Farhadi, Ali a,c

a Allen Institute for Artificial Intelligence (United States)

b CARNEGIE MELLON UNIVERSITY (United States)

c UNIVERSITY OF WASHINGTON (United States)

Author keywords

Forces; Motion estimation; Recurrent neural networks; Scene understanding

Indexed keywords

DEEP NEURAL NETWORKS; FORECASTING; LARGE DATASET; MOTION ESTIMATION;

EXPERIMENTAL EVALUATION; FORCES; LARGE-SCALE DATASET; NEURAL NETWORK MODEL; PHYSICAL MOVEMENTS; SCENE UNDERSTANDING; SEQUENTIAL DEPENDENCIES; SEQUENTIAL MOVEMENTS;

RECURRENT NEURAL NETWORKS;

EID: 84990038863 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-319-46493-0_17 Document Type: Conference Paper

Times cited : (63)

References (44)

1
- 84887299182
- Simulation as an engine of physical scene understanding
- Battaglia, P., Hamrick, J., Tenenbaum, J.B.: Simulation as an engine of physical scene understanding. PNAS 110, 18327-18332 (2013)
- (2013) PNAS , vol.110 , pp. 18327-18332
- Battaglia, P.¹ Hamrick, J.² Tenenbaum, J.B.³

2
- 84944397333
- Computing the physical parameters of rigid-body motion from video
- Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.), Springer, Heidelberg
- Bhat, K.S., Seitz, S.M., Popović, J., Khosla, P.K.: Computing the physical parameters of rigid-body motion from video. In: Heyden, A., Sparr, G., Nielsen, M., Johansen, P. (eds.) ECCV 2002. LNCS, vol. 2350, pp. 551-565. Springer, Heidelberg (2002). doi:10.1007/3-540-47969-4-37
- (2002) ECCV 2002. LNCS , vol.2350 , pp. 551-565
- Bhat, K.S.¹ Seitz, S.M.² Popović, J.³ Khosla, P.K.⁴

3
- 85083815548
- Estimating contact dynamics
- Brubaker, M.A., Sigal, L., Fleet, D.J.: Estimating contact dynamics. In: ICCV (2009)
- (2009) ICCV
- Brubaker, M.A.¹ Sigal, L.² Fleet, D.J.³

4
- 84887394346
- Understanding indoor scenes using 3d geometric phrases
- Choi, W., Chao, Y.W., Pantofaru, C., Savarese, S.: Understanding indoor scenes using 3d geometric phrases. In: CVPR (2013)
- (2013) CVPR
- Choi, W.¹ Chao, Y.W.² Pantofaru, C.³ Savarese, S.⁴

5
- 84937943470
- Depth map prediction from a single image using a multi-scale deep network
- Eigen, D., Puhrsch, C., Fergus, R.: Depth map prediction from a single image using a multi-scale deep network. In: NIPS (2014)
- (2014) NIPS
- Eigen, D.¹ Puhrsch, C.² Fergus, R.³

6
- 77951298115
- The pascal visual object classes (Voc) challenge
- Everingham, M., Gool, L., Williams, C.K., Winn, J., Zisserman, A.: The pascal visual object classes (voc) challenge. IJCV 88, 303-338 (2010)
- (2010) IJCV , vol.88 , pp. 303-338
- Everingham, M.¹ Gool, L.² Williams, C.K.³ Winn, J.⁴ Zisserman, A.⁵

7
- 84911394491
- Predicting object dynamics in scenes
- Fouhey, D.F., Zitnick, C.: Predicting object dynamics in scenes. In: CVPR (2014)
- (2014) CVPR
- Fouhey, D.F.¹ Zitnick, C.²

8
- 85027987438
- Learning predictive visual models of physics for playing billiards
- Fragkiadaki, K., Agrawal, P., Levine, S., Malik, J.: Learning predictive visual models of physics for playing billiards. In: ICLR (2016)
- (2016) ICLR
- Fragkiadaki, K.¹ Agrawal, P.² Levine, S.³ Malik, J.⁴

9
- 78149286912
- Blocks world revisited: Image understanding using qualitative geometry and mechanics
- Daniilidis, K., Maragos, P., Paragios, N. (eds.), Springer, Heidelberg
- Gupta, A., Efros, A.A., Hebert, M.: Blocks world revisited: image understanding using qualitative geometry and mechanics. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6314, pp. 482-496. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15561-1-35
- (2010) ECCV 2010. LNCS , vol.6314 , pp. 482-496
- Gupta, A.¹ Efros, A.A.² Hebert, M.³

10
- 85139514506
- Internal physics models guide probabilistic judgments about object dynamics
- Hamrick, J., Battaglia, P., Tenenbaum, J.B.: Internal physics models guide probabilistic judgments about object dynamics. In: Annual Meeting of the Cognitive Science Society (2011)
- (2011) Annual Meeting of the Cognitive Science Society
- Hamrick, J.¹ Battaglia, P.² Tenenbaum, J.B.³

11
- 84986274465
- Deep residual learning for image recognition
- He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR (2016)
- (2016) CVPR
- He, K.¹ Zhang, X.² Ren, S.³ Sun, J.⁴

12
- 84858763935
- Cascaded classification models: Combining models for holistic scene understanding
- Heitz, G., Gould, S., Saxena, A., Koller, D.: Cascaded classification models: Combining models for holistic scene understanding. In: NIPS (2008)
- (2008) NIPS
- Heitz, G.¹ Gould, S.² Saxena, A.³ Koller, D.⁴

13
- 84887362669
- 3d-based reasoning with blocks, support, and stability
- Jia, Z., Gallagher, A., Saxena, A., Chen, T.: 3d-based reasoning with blocks, support, and stability. In: CVPR (2013)
- (2013) CVPR
- Jia, Z.¹ Gallagher, A.² Saxena, A.³ Chen, T.⁴

14
- 84864475487
- Learning to place new objects in a scene
- Jiang, Y., Lim, M., Zheng, C., Saxena, A.: Learning to place new objects in a scene. IJRR 31, 1021-1043 (2012)
- (2012) IJRR , vol.31 , pp. 1021-1043
- Jiang, Y.¹ Lim, M.² Zheng, C.³ Saxena, A.⁴

15
- 84946734827
- Deep visual-semantic alignments for generating image descriptions
- Karpathy, A., Fei-Fei, L.: Deep visual-semantic alignments for generating image descriptions. In: CVPR (2015)
- (2015) CVPR
- Karpathy, A.¹ Fei-Fei, L.²

16
- 84867863926
- Activity forecasting. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.), Springer, Heidelberg
- Kitani, K.M., Ziebart, B.D., Bagnell, J.A., Hebert, M.: Activity forecasting. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7575, pp. 201-214. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33765-9_15
- (2012) ECCV 2012. LNCS , vol.7575 , pp. 201-214
- Kitani, K.M.¹ Ziebart, B.D.² Bagnell, J.A.³ Hebert, M.⁴

17
- 84876231242
- Imagenet classification with deep convolutional neural networks
- Krizhevsky, A., Sutskever, I., Hinton, G.E.: Imagenet classification with deep convolutional neural networks. In: NIPS (2012)
- (2012) NIPS
- Krizhevsky, A.¹ Sutskever, I.² Hinton, G.E.³

18
- 84939821079
- A simple way to initialize recurrent networks of rectified linear units
- Le, Q.V., Jaitly, N., Hinton, G.E.: A simple way to initialize recurrent networks of rectified linear units. In: ArXiv (2015)
- (2015) Arxiv
- Le, Q.V.¹ Jaitly, N.² Hinton, G.E.³

19
- 84990022919
- End-to-end training of deep visuomotor policies
- Levine, S., Finn, C., Darrell, T., Abbeel, P.: End-to-end training of deep visuomotor policies. In: ArXiv (2015)
- (2015) Arxiv
- Levine, S.¹ Finn, C.² Darrell, T.³ Abbeel, P.⁴

20
- 70450219021
- Towards total scene understanding: Classification, annotation and segmentation in an automatic framework
- Li, L.J., Socher, R., Fei-Fei, L.: Towards total scene understanding: Classification, annotation and segmentation in an automatic framework. In: CVPR (2009)
- (2009) CVPR
- Li, L.J.¹ Socher, R.² Fei-Fei, L.³

21
- 84898782715
- Holistic scene understanding for 3d object detection with rgbd cameras
- Lin, D., Fidler, S., Urtasun, R.: Holistic scene understanding for 3d object detection with rgbd cameras. In: ICCV (2013)
- (2013) ICCV
- Lin, D.¹ Fidler, S.² Urtasun, R.³

22
- 84906493406
- Microsoft COCO: Common objects in context
- Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.), Springer, Heidelberg
- Lin, T.-Y., Maire, M., Belongie, S., Hays, J., Perona, P., Ramanan, D., Dollár, P., Zitnick, C.L.: Microsoft COCO: common objects in context. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8693, pp. 740-755. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10602-1-48
- (2014) ECCV 2014. LNCS , vol.8693 , pp. 740-755
- Lin, T.-Y.¹ Maire, M.² Belongie, S.³ Hays, J.⁴ Perona, P.⁵ Ramanan, D.⁶ Dollár, P.⁷ Zitnick, C.L.⁸

23
- 84937955008
- Modeling deep temporal dependencies with recurrent grammar cells
- Michalski, V., Memisevic, R., Konda, K.: Modeling deep temporal dependencies with recurrent grammar cells. In: NIPS (2014)
- (2014) NIPS
- Michalski, V.¹ Memisevic, R.² Konda, K.³

24
- 84986275832
- Newtonian image understanding: Unfolding the dynamics of objects in static images
- Mottaghi, R., Bagherinezhad, H., Rastegari, M., Farhadi, A.: Newtonian image understanding: Unfolding the dynamics of objects in static images. In: CVPR (2016)
- (2016) CVPR
- Mottaghi, R.¹ Bagherinezhad, H.² Rastegari, M.³ Farhadi, A.⁴

25
- 24644462948
- Using the forest to see the trees: A graphical model relating features, objects, and scenes
- Murphy, K., Torralba, A., Freeman, W.T.: Using the forest to see the trees: a graphical model relating features, objects, and scenes. In: NIPS (2003)
- (2003) NIPS
- Murphy, K.¹ Torralba, A.² Freeman, W.T.³

26
- 84965178314
- Action-conditional video prediction using deep networks in atari games
- Oh, J., Guo, X., Lee, H., Lewis, R.L., Singh, S.P.: Action-conditional video prediction using deep networks in atari games. In: NIPS (2015)
- (2015) NIPS
- Oh, J.¹ Guo, X.² Lee, H.³ Lewis, R.L.⁴ Singh, S.P.⁵

27
- 84906491151
- Déjá Vu: Motion prediction in static
- Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.), Springer, Heidelberg
- Pintea, S.L., Gemert, J.C., Smeulders, A.W.M.: Déjá Vu: motion prediction in static. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8691, pp. 172-187. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10578-9_12
- (2014) ECCV 2014. LNCS , vol.8691 , pp. 172-187
- Pintea, S.L.¹ Gemert, J.C.² Smeulders, A.W.M.³

28
- 84965108042
- Video (Language) modeling: A baseline for generative models of natural videos
- Ranzato, M., Szlam, A., Bruna, J., Mathieu, M., Collobert, R., Chopra, S.: Video (language) modeling: a baseline for generative models of natural videos. In: ArXiv (2014)
- (2014) Arxiv
- Ranzato, M.¹ Szlam, A.² Bruna, J.³ Mathieu, M.⁴ Collobert, R.⁵ Chopra, S.⁶

29
- 84959207697
- Computationally bounded retrieval
- Rastegari, M., Keskin, C., Kohli, P., Izadi, S.: Computationally bounded retrieval. In: CVPR (2015)
- (2015) CVPR
- Rastegari, M.¹ Keskin, C.² Kohli, P.³ Izadi, S.⁴

30
- 84856660172
- Physically-based motion models for 3d tracking: A convex formulation
- Salzmann, M., Urtasun, R.: Physically-based motion models for 3d tracking: a convex formulation. In: ICCV (2011)
- (2011) ICCV
- Salzmann, M.¹ Urtasun, R.²

31
- 84867713871
- Indoor segmentation and support inference from RGBD images
- Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.), Springer, Heidelberg
- Silberman, N., Hoiem, D., Kohli, P., Fergus, R.: Indoor segmentation and support inference from RGBD images. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012. LNCS, vol. 7576, pp. 746-760. Springer, Heidelberg (2012). doi:10.1007/978-3-642-33715-4_54
- (2012) ECCV 2012. LNCS , vol.7576 , pp. 746-760
- Silberman, N.¹ Hoiem, D.² Kohli, P.³ Fergus, R.⁴

32
- 84957966034
- Sun rgb-d: A rgb-d scene understanding benchmark suite
- Song, S., Lichtenberg, S.P., Xiao, J.: Sun rgb-d: A rgb-d scene understanding benchmark suite. In: CVPR (2015)
- (2015) CVPR
- Song, S.¹ Lichtenberg, S.P.² Xiao, J.³

33
- 84868323622
- The recurrent temporal restricted boltzmann machine
- Sutskever, I., Hinton, G.E., Taylor, G.W.: The recurrent temporal restricted boltzmann machine. In: NIPS (2008)
- (2008) NIPS
- Sutskever, I.¹ Hinton, G.E.² Taylor, G.W.³

34
- 84946747440
- Show and tell: A neural image caption generator
- Vinyals, O., Toshev, A., Bengio, S., Erhan, D.: Show and tell: A neural image caption generator. In: CVPR (2015)
- (2015) CVPR
- Vinyals, O.¹ Toshev, A.² Bengio, S.³ Erhan, D.⁴

35
- 51949114217
- Physical simulation for probabilistic motion tracking
- Vondrak, M., Sigal, L., Jenkins, O.C.: Physical simulation for probabilistic motion tracking. In: CVPR (2008)
- (2008) CVPR
- Vondrak, M.¹ Sigal, L.² Jenkins, O.C.³

36
- 84911380009
- Patch to the future: Unsupervised visual prediction
- Walker, J., Gupta, A., Hebert, M.: Patch to the future: Unsupervised visual prediction. In: CVPR (2014)
- (2014) CVPR
- Walker, J.¹ Gupta, A.² Hebert, M.³

37
- 84973880490
- Dense optical flow prediction from a static image
- Walker, J., Gupta, A., Hebert, M.: Dense optical flow prediction from a static image. In: ICCV (2015)
- (2015) ICCV
- Walker, J.¹ Gupta, A.² Hebert, M.³

38
- 84959234840
- Designing deep networks for surface normal estimation
- Wang, X., Fouhey, D.F., Gupta, A.: Designing deep networks for surface normal estimation. In: CVPR (2015)
- (2015) CVPR
- Wang, X.¹ Fouhey, D.F.² Gupta, A.³

39
- 84965122247
- Galileo: Perceiving physical object properties by integrating a physics engine with deep learning
- Wu, J., Yildirim, I., Lim, J.J., Freeman, W.T., Tenenbaum, J.B.: Galileo: Perceiving physical object properties by integrating a physics engine with deep learning. In: NIPS (2015)
- (2015) NIPS
- Wu, J.¹ Yildirim, I.² Lim, J.J.³ Freeman, W.T.⁴ Tenenbaum, J.B.⁵

40
- 84866687133
- Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation
- Yao, J., Fidler, S., Urtasun, R.: Describing the scene as a whole: Joint object detection, scene classification and semantic segmentation. In: CVPR (2012)
- (2012) CVPR
- Yao, J.¹ Fidler, S.² Urtasun, R.³

41
- 78149325735
- A data-driven approach for event prediction
- Daniilidis, K., Maragos, P., Paragios, N. (eds.), Springer, Heidelberg
- Yuen, J., Torralba, A.: A data-driven approach for event prediction. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010. LNCS, vol. 6312, pp. 707-720. Springer, Heidelberg (2010). doi:10.1007/978-3-642-15552-9-51
- (2010) ECCV 2010. LNCS , vol.6312 , pp. 707-720
- Yuen, J.¹ Torralba, A.²

42
- 84906352874
- PanoContext: A whole-room 3D context model for panoramic scene understanding
- Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.), Springer, Heidelberg
- Zhang, Y., Song, S., Tan, P., Xiao, J.: PanoContext: a whole-room 3D context model for panoramic scene understanding. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds.) ECCV 2014. LNCS, vol. 8694, pp. 668-686. Springer, Heidelberg (2014). doi:10.1007/978-3-319-10599-4_43
- (2014) ECCV 2014. LNCS , vol.8694 , pp. 668-686
- Zhang, Y.¹ Song, S.² Tan, P.³ Xiao, J.⁴

43
- 84887337480
- Beyond point clouds: Scene understanding by reasoning geometry and physics
- Zheng, B., Zhao, Y., Yu, J.C., Ikeuchi, K., Zhu, S.C.: Beyond point clouds: Scene understanding by reasoning geometry and physics. In: CVPR (2013)
- (2013) CVPR
- Zheng, B.¹ Zhao, Y.² Yu, J.C.³ Ikeuchi, K.⁴ Zhu, S.C.⁵

44
- 84929180017
- Detecting potential falling objects by inferring human action and natural disturbance
- Zheng, B., Zhao, Y., Yu, J.C., Ikeuchi, K., Zhu, S.C.: Detecting potential falling objects by inferring human action and natural disturbance. In: ICRA (2014)
- (2014) ICRA
- Zheng, B.¹ Zhao, Y.² Yu, J.C.³ Ikeuchi, K.⁴ Zhu, S.C.⁵

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.