SCOPUS Information Retrieval Platform

Robotics: Science and Systems

Volume , Issue , 2014, Pages

State Representation Learning in Robotics: Using Prior Knowledge about Physical Interaction

(2) Jonschkowski, Rico ᵃ; Brock, Oliver ᵃ

ᵃ TECHNISCHE UNIVERSITÄT BERLIN (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

AIR NAVIGATION; REINFORCEMENT LEARNING; ROBOTICS;

APPROACHES TO LEARNING; HIGH-DIMENSIONAL; LEARN+; MOVING OBJECTS; NAVIGATION TASKS; PHYSICAL INTERACTIONS; PHYSICAL WORLD; PRIOR-KNOWLEDGE; STATE REPRESENTATION; TASK RELEVANT;

ROBOTS;

EID: 84961612003 PISSN: None EISSN: 2330765X Source Type: Conference Proceeding
DOI: 10.15607/RSS.2014.X.019 Document Type: Conference Paper

Times cited : (50)

References (30)

1
- 84879854889
- Representation learning: A review and new perspectives
- Yoshua Bengio, Aaron C. Courville, and Pascal Vincent. Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(8):1798–1828, 2013.
- (2013) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.35 , Issue.8 , pp. 1798-1828
- Bengio, Yoshua¹ Courville, Aaron C.² Vincent, Pascal³

2
- 80052249260
- Closing the learning-planning loop with predictive state representations
- Byron Boots, Sajid M. Siddiqi, and Geoffrey J. Gordon. Closing the learning-planning loop with predictive state representations. International Journal of Robotics Research, 30(7):954–966, 2011.
- (2011) International Journal of Robotics Research , vol.30 , Issue.7 , pp. 954-966
- Boots, Byron¹ Siddiqi, Sajid M.² Gordon, Geoffrey J.³

3
- 31844456089
- Action respecting embedding
- Michael Bowling, Ali Ghodsi, and Dana Wilkinson. Action respecting embedding. In 22nd International Conference on Machine Learning (ICML), pages 65–72, 2005.
- (2005) 22nd International Conference on Machine Learning (ICML) , pp. 65-72
- Bowling, Michael¹ Ghodsi, Ali² Wilkinson, Dana³

4
- 80053558787
- Natural language processing (almost) from scratch
- Ronan Collobert, Jason Weston, Léon Bottou, Michael Karlen, Koray Kavukcuoglu, and Pavel Kuksa. Natural language processing (almost) from scratch. Journal of Machine Learning Research, 12:2493–2537, 2011.
- (2011) Journal of Machine Learning Research , vol.12 , pp. 2493-2537
- Collobert, Ronan¹ Weston, Jason² Bottou, Léon³ Karlen, Michael⁴ Kavukcuoglu, Koray⁵ Kuksa, Pavel⁶

5
- 84872530807
- Solving partially observable reinforcement learning problems with recurrent neural networks
- Springer Berlin Heidelberg
- Siegmund Duell, Steffen Udluft, and Volkmar Sterzing. Solving partially observable reinforcement learning problems with recurrent neural networks. In Neural Networks: Tricks of the Trade, volume 7700 of Lecture Notes in Computer Science, pages 709–733. Springer Berlin Heidelberg, 2012.
- (2012) Neural Networks: Tricks of the Trade, volume 7700 of Lecture Notes in Computer Science , pp. 709-733
- Duell, Siegmund¹ Udluft, Steffen² Sterzing, Volkmar³

6
- 33845594569
- Dimensionality reduction by learning an invariant mapping
- Raia Hadsell, Sumit Chopra, and Yann LeCun. Dimensionality reduction by learning an invariant mapping. In IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), pages 1735–1742, 2006.
- (2006) IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR) , pp. 1735-1742
- Hadsell, Raia¹ Chopra, Sumit² LeCun, Yann³

7
- 79956157781
- Using slow feature analysis to extract behavioural manifolds related to humanoid robot postures
- Sebastian Höfer, Manfred Hild, and Matthias Kubisch. Using slow feature analysis to extract behavioural manifolds related to humanoid robot postures. In 10th International Conference on Epigenetic Robotics, pages 43–50, 2010.
- (2010) 10th International Conference on Epigenetic Robotics , pp. 43-50
- Höfer, Sebastian¹ Hild, Manfred² Kubisch, Matthias³

8
- 77954141070
- Feature reinforcement learning: Part I: Unstructured MDPs
- Marcus Hutter. Feature reinforcement learning: Part I: Unstructured MDPs. Journal of Artificial General Intelligence, 1:3–24, 2009.
- (2009) Journal of Artificial General Intelligence , vol.1 , pp. 3-24
- Hutter, Marcus¹

9
- 14344257134
- A spatio-temporal extension to Isomap nonlinear dimension reduction
- Odest Chadwicke Jenkins and Maja J. Matarić. A spatio-temporal extension to Isomap nonlinear dimension reduction. In 21st International Conference on Machine Learning (ICML), page 56, 2004.
- (2004) 21st International Conference on Machine Learning (ICML) , pp. 56
- Jenkins, Odest Chadwicke¹ Matarić, Maja J.²

10
- 84908168597
- Learning grounded relational symbols from continuous data for abstract reasoning
- Nikolay Jetchev, Tobias Lang, and Marc Toussaint. Learning grounded relational symbols from continuous data for abstract reasoning. In Autonomous Learning Workshop at the IEEE International Conference on Robotics and Automation, 2013.
- (2013) Autonomous Learning Workshop at the IEEE International Conference on Robotics and Automation
- Jetchev, Nikolay¹ Lang, Tobias² Toussaint, Marc³

11
- 85092859373
- Learning task-specific state representations by maximizing slowness and predictability
- Rico Jonschkowski and Oliver Brock. Learning task-specific state representations by maximizing slowness and predictability. In 6th International Workshop on Evolutionary and Reinforcement Learning for Autonomous Robot Systems (ERLARS), 2013.
- (2013) 6th International Workshop on Evolutionary and Reinforcement Learning for Autonomous Robot Systems (ERLARS)
- Jonschkowski, Rico¹ Brock, Oliver²

12
- 84884276459
- Reinforcement learning in robotics: A survey
- Jens Kober, J. Andrew Bagnell, and Jan Peters. Reinforcement learning in robotics: A survey. International Journal of Robotics Research, 32(11):1238–1274, 2013.
- (2013) International Journal of Robotics Research , vol.32 , Issue.11 , pp. 1238-1274
- Kober, Jens¹ Bagnell, J. Andrew² Peters, Jan³

13
- 84876231242
- ImageNet classification with deep convolutional neural networks
- Alex Krizhevsky, Ilya Sutskever, and Geoffrey E. Hinton. ImageNet classification with deep convolutional neural networks. In Advances in Neural Information Processing Systems (NIPS), pages 1106–1114, 2012.
- (2012) Advances in Neural Information Processing Systems (NIPS) , pp. 1106-1114
- Krizhevsky, Alex¹ Sutskever, Ilya² Hinton, Geoffrey E.³

14
- 0041654220
- Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis
- Joseph B. Kruskal. Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrika, 29(1):1–27, 1964.
- (1964) Psychometrika , vol.29 , Issue.1 , pp. 1-27
- Kruskal, Joseph B.¹

15
- 84865083902
- Autonomous reinforcement learning on raw visual input data in a real world application
- Sascha Lange, Martin Riedmiller, and Arne Voigtländer. Autonomous reinforcement learning on raw visual input data in a real world application. In International Joint Conference on Neural Networks (IJCNN), pages 1–8, 2012.
- (2012) International Joint Conference on Neural Networks (IJCNN) , pp. 1-8
- Lange, Sascha¹ Riedmiller, Martin² Voigtländer, Arne³

16
- 78049417739
- Reinforcement learning on slow features of high-dimensional input streams
- Robert Legenstein, Niko Wilbert, and Laurenz Wiskott. Reinforcement learning on slow features of high-dimensional input streams. PLoS Computational Biology, 6(8):e1000894, 2010.
- (2010) PLoS Computational Biology , vol.6 , Issue.8 , pp. e1000894
- Legenstein, Robert¹ Wilbert, Niko² Wiskott, Laurenz³

17
- 84898982129
- Predictive representations of state
- Michael L. Littman, Richard S. Sutton, and Satinder Singh. Predictive representations of state. In Advances in Neural Information Processing Systems (NIPS), pages 1555–1561, 2002.
- (2002) Advances in Neural Information Processing Systems (NIPS) , pp. 1555-1561
- Littman, Michael L.¹ Sutton, Richard S.² Singh, Satinder³

18
- 84867667632
- Low complexity proto-value function learning from sensory observations with incremental slow feature analysis
- Matthew Luciw and Juergen Schmidhuber. Low complexity proto-value function learning from sensory observations with incremental slow feature analysis. In 22nd International Conference on Artificial Neural Networks and Machine Learning (ICANN), pages 279–287, 2012.
- (2012) 22nd International Conference on Artificial Neural Networks and Machine Learning (ICANN) , pp. 279-287
- Luciw, Matthew¹ Schmidhuber, Juergen²

19
- 35748957806
- Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes
- Sridhar Mahadevan and Mauro Maggioni. Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes. Journal of Machine Learning Research, 8(10):2169–2231, 2007.
- (2007) Journal of Machine Learning Research , vol.8 , Issue.10 , pp. 2169-2231
- Mahadevan, Sridhar¹ Maggioni, Mauro²

20
- 17444414191
- Basis function adaptation in temporal difference reinforcement learning
- Ishai Menache, Shie Mannor, and Nahum Shimkin. Basis function adaptation in temporal difference reinforcement learning. Annals of Operations Research, 134:215–238, 2005.
- (2005) Annals of Operations Research , vol.134 , pp. 215-238
- Menache, Ishai¹ Mannor, Shie² Shimkin, Nahum³

21
- 79952136450
- Learning visual representations for perception-action systems
- Justus Piater, Sébastien Jodogne, Renaud Detry, Dirk Kraft, Norbert Krüger, Oliver Kroemer, and Jan Peters. Learning visual representations for perception-action systems. International Journal of Robotics Research, 30(3):294–307, 2011.
- (2011) International Journal of Robotics Research , vol.30 , Issue.3 , pp. 294-307
- Piater, Justus¹ Jodogne, Sébastien² Detry, Renaud³ Kraft, Dirk⁴ Krüger, Norbert⁵ Kroemer, Oliver⁶ Peters, Jan⁷

22
- 33646398129
- Neural fitted Q iteration – first experiences with a data efficient neural reinforcement learning method
- Martin Riedmiller. Neural fitted Q iteration – first experiences with a data efficient neural reinforcement learning method. In 16th European Conference on Machine Learning (ECML), pages 317–328, 2005.
- (2005) 16th European Conference on Machine Learning (ECML) , pp. 317-328
- Riedmiller, Martin¹

23
- 0034704222
- Nonlinear dimensionality reduction by locally linear embedding
- Sam T. Roweis and Lawrence K. Saul. Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500):2323–2326, 2000.
- (2000) Science , vol.290 , Issue.5500 , pp. 2323-2326
- Roweis, Sam T.¹ Saul, Lawrence K.²

24
- 84865801985
- Conversational speech transcription using context-dependent deep neural networks
- Frank Seide, Gang Li, and Dong Yu. Conversational speech transcription using context-dependent deep neural networks. In Interspeech, pages 437–440, 2011.
- (2011) Interspeech , pp. 437-440
- Seide, Frank¹ Li, Gang² Yu, Dong³

25
- 0023223978
- Toward a universal law of generalization for psychological science
- Roger N. Shepard. Toward a universal law of generalization for psychological science. Science, 237(4820):1317–1323, 1987.
- (1987) Science , vol.237 , Issue.4820 , pp. 1317-1323
- Shepard, Roger N.¹

26
- 85153965130
- Reinforcement learning with soft state aggregation
- Satinder P. Singh, Tommi Jaakkola, and Michael I. Jordan. Reinforcement learning with soft state aggregation. In Advances in Neural Information Processing Systems (NIPS), pages 361–368, 1995.
- (1995) Advances in Neural Information Processing Systems (NIPS) , pp. 361-368
- Singh, Satinder P.¹ Jaakkola, Tommi² Jordan, Michael I.³

27
- 78751696691
- Predictive projections
- Nathan Sprague. Predictive projections. In 21st International Joint Conference on Artificial Intelligence (IJCAI), pages 1223–1229, 2009.
- (2009) 21st International Joint Conference on Artificial Intelligence (IJCAI) , pp. 1223-1229
- Sprague, Nathan¹

28
- 0004102479
- Reinforcement Learning: An Introduction
- MIT Press
- Richard S. Sutton and Andrew G. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, Richard S.¹ Barto, Andrew G.²

29
- 0034704229
- A global geometric framework for nonlinear dimensionality reduction
- Joshua B. Tenenbaum, Vin De Silva, and John C. Langford. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319–2323, 2000.
- (2000) Science , vol.290 , Issue.5500 , pp. 2319-2323
- Tenenbaum, Joshua B.¹ De Silva, Vin² Langford, John C.³

30
- 0036546660
- Slow feature analysis: unsupervised learning of invariances
- Laurenz Wiskott and Terrence J. Sejnowski. Slow feature analysis: unsupervised learning of invariances. Neural Computation, 14(4):715–770, 2002.
- (2002) Neural Computation , vol.14 , Issue.4 , pp. 715-770
- Wiskott, Laurenz¹ Sejnowski, Terrence J.²

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.