SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2012, Pages

Scaling life-long off-policy learning

Author keywords

[No Author keywords available]

Indexed keywords

LIFE LONG LEARNING; LIFE-TIMES; MULTI-STEP PREDICTION; PHYSICAL ROBOTS; REAL TIME; TRAINING DATA;

FORECASTING; LEARNING ALGORITHMS; REINFORCEMENT LEARNING; ROBOTICS; ROBOTS;

PERSONNEL TRAINING;

EID: 84872849054 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/DevLrn.2012.6400860 Document Type: Conference Paper

Times cited : (23)

References (18)

1
- 0002130986
- Robot learning from demonstration
- Atkeson, C. G., Schaal, S. (1997). Robot learning from demonstration. In Proceedings of the 14th International Conference on Machine Learning,12-20.
- (1997) Proceedings of the 14th International Conference on Machine Learning , pp. 12-20
- Atkeson, C.G.¹ Schaal, S.²

2
- 80055034438
- An online spectral learning algorithm for partially observable nonlinear dynamical systems
- Boots, B., Siddiqi, S., Gordon, G. (2011). An online spectral learning algorithm for partially observable nonlinear dynamical systems. In Proceedings of the 25th Conference of the Association for the Advancement of Artificial Intelligence.
- (2011) Proceedings of the 25th Conference of the Association for the Advancement of Artificial Intelligence
- Boots, B.¹ Siddiqi, S.² Gordon, G.³

3
- 84959576016
- Highly Scalable Appearance-Only SLAM - FAB-MAP 2.0
- Cummins, M., Newman, P. (2009). Highly Scalable Appearance-Only SLAM - FAB-MAP 2.0 In The Proceedings of the 4th Conference Robotics: Science and Systems.
- (2009) The Proceedings of the 4th Conference Robotics: Science and Systems
- Cummins, M.¹ Newman, P.²

4
- 78049390740
- Policy search for motor primitives in robotics
- Kober, J., Peters, J. (2011). Policy search for motor primitives in robotics. Machine Learning 84:171-203.
- (2011) Machine Learning , vol.84 , pp. 171-203
- Kober, J.¹ Peters, J.²

5
- 84898982129
- Predictive representations of state
- Littman, M. L., Sutton, R. S., Singh, S. (2002). Predictive representations of state. In Advances in Neural Information Processing Systems 14, 1555-1561.
- (2002) Advances in Neural Information Processing Systems , vol.14 , pp. 1555-1561
- Littman, M.L.¹ Sutton, R.S.² Singh, S.³

6
- 84864655352
- PhD thesis, University of Alberta
- Maei, H. R. (2011). Gradient Temporal-Difference Learning Algorithms. PhD thesis, University of Alberta.
- (2011) Gradient Temporal-Difference Learning Algorithms
- Maei, H.R.¹

8
- 34047267520
- Intrinsic Motivation Systems for Autonomous Mental Development
- Oudeyer, P. Y., Kaplan, F., Hafner, V. (2007). Intrinsic Motivation Systems for Autonomous Mental Development. In IEEE Transactions on Evolutionary Computation 11, 265-286
- (2007) IEEE Transactions on Evolutionary Computation , vol.11 , pp. 265-286
- Oudeyer, P.Y.¹ Kaplan, F.² Hafner, V.³

9
- 84867427247
- Dynamic Switching and Real-time Machine Learning for Improved Human Control of Assistive Biomedical Robots
- Pilarski, P.M., Dawson, M.R., Degris, T.,Carey, J.P., Sutton, R.S. (2012). Dynamic Switching and Real-time Machine Learning for Improved Human Control of Assistive Biomedical Robots. In Proceedings of the 4th IEEE International Conference on Biomedical Robotics and Biomechatronics, 296-302.
- (2012) Proceedings of the 4th IEEE International Conference on Biomedical Robotics and Biomechatronics , pp. 296-302
- Pilarski, P.M.¹ Dawson, M.R.² Degris, T.³ Carey, J.P.⁴ Sutton, R.S.⁵

10
- 2442467081
- A possibility for implementing curiosity and boredom in model-building neural controllers
- Schmidhuber J. (1991). A possibility for implementing curiosity and boredom in model-building neural controllers. In Proceedings of the 1st International Conference on Simulation of Adaptive Behavior, 222-227.
- (1991) In Proceedings of the 1st International Conference on Simulation of Adaptive Behavior , pp. 222-227
- Schmidhuber, J.¹

11
- 84899031920
- Intrinsically motivated reinforcement learning
- Singh S., Barto, A. G., Chentanez, N. (2005). Intrinsically motivated reinforcement learning. In Advances in Neural Information Processing Systems 17, 1281-1288.
- (2005) Advances in Neural Information Processing Systems , vol.17 , pp. 1281-1288
- Singh, S.¹ Barto, A.G.² Chentanez, N.³

12
- 0004102479
- MIT Press
- Sutton, R. S., Barto, A. G. (1998). Reinforcement Learning: An Introduction. MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

13
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Sutton, R. S., Precup D., Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. In Artificial Intelligence 112:181-211.
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

14
- 33749265408
- Temporal abstraction in temporal-difference networks
- Sutton, R. S., Rafols, E. J., Koop, A. (2006). Temporal abstraction in temporal-difference networks. In Advances in Neural Information Processing Systems 18.
- (2006) Advances in Neural Information Processing Systems , pp. 18
- Sutton, R.S.¹ Rafols, E.J.² Koop, A.³

15
- 71149099079
- Fast gradient-descent methods for temporal-difference learning with linear function approximation
- Sutton, R. S., Maei, H. R., Precup, D., Bhatnagar, S., Silver, D., Szepesvári, Cs., Wiewiora, E. (2009). Fast gradient-descent methods for temporal-difference learning with linear function approximation. In Proceedings of the 26th International Conference on Machine Learning.
- (2009) Proceedings of the 26th International Conference on Machine Learning
- Sutton, R.S.¹ Maei, H.R.² Precup, D.³ Bhatnagar, S.⁴ Silver, D.⁵ Szepesvári, Cs.⁶ Wiewiora, E.⁷

16
- 84899464022
- Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
- Sutton, R. S., Modayil, J., Delp, M., Degris, T., Pilarski, P. M., White, A., Precup, D. (2011). Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction. In Proceedings of the10th International Conference on Autonomous Agents and Multiagent Systems.
- (2011) Proceedings of The10th International Conference on Autonomous Agents and Multiagent Systems
- Sutton, R.S.¹ Modayil, J.² Delp, M.³ Degris, T.⁴ Pilarski, P.M.⁵ White, A.⁶ Precup, D.⁷

17
- 27744518715
- MIT Press
- Thrun, S., Burgard, W., Fox, D. (2005). Probabilistic Robotics. MIT Press.
- (2005) Probabilistic Robotics
- Thrun, S.¹ Burgard, W.² Fox, D.³

18
- 0029220605
- Lifelong robot learning
- Thrun, S., Mitchell, T. (1995). Lifelong robot learning. Robotics and Autonomous Systems 15: 25-46.
- (1995) Robotics and Autonomous Systems , vol.15 , pp. 25-46
- Thrun, S.¹ Mitchell, T.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.