SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2012, Pages 85-90

RTMBA: A real-time model-based reinforcement learning architecture for robot control

Author keywords

[No Author keywords available]

Indexed keywords

DECISION MAKING; PARALLEL ARCHITECTURES; ROBOT PROGRAMMING; ROBOTICS; ROBOTS;

AUTONOMOUS VEHICLES; MODEL-BASED OPC; ONLINE LEARNING; PLANNING METHOD; PLANNING PROCESS; REAL TIME MODELING; ROBOT CONTROLS; ROBOTIC CONTROLS;

REINFORCEMENT LEARNING;

EID: 84864454010 PISSN: 10504729 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICRA.2012.6225072 Document Type: Conference Paper

Times cited : (66)

References (17)

1
- 0004102479
- Cambridge, MA: MIT Press
- R. Sutton and A. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

2
- 33747195910
- Machine learning for fast quadrupedal locomotion
- N. Kohl and P. Stone, "Machine learning for fast quadrupedal locomotion," in AAAI, 2004.
- (2004) AAAI
- Kohl, N.¹ Stone, P.²

3
- 3042583887
- Autonomous helicopter flight via reinforcement learning
- A. Ng, H. J. Kim, M. Jordan, and S. Sastry, "Autonomous helicopter flight via reinforcement learning," in NIPS 16, 2003.
- (2003) NIPS , vol.16
- Ng, A.¹ Kim, H.J.² Jordan, M.³ Sastry, S.⁴

4
- 84880854156
- R-Max - A general polynomial time algorithm for near-optimal reinforcement learning
- R. Brafman and M. Tennenholtz, "R-Max - a general polynomial time algorithm for near-optimal reinforcement learning," in IJCAI, 2001.
- (2001) IJCAI
- Brafman, R.¹ Tennenholtz, M.²

6
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- R. Sutton, "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming," in ICML, 1990.
- (1990) ICML
- Sutton, R.¹

7
- 56449110907
- Sample-based learning and search with permanent and transient memories
- D. Silver, R. Sutton, and M. Müller, "Sample-based learning and search with permanent and transient memories," in ICML, 2008.
- (2008) ICML
- Silver, D.¹ Sutton, R.² Müller, M.³

8
- 85167397400
- Integrating sample-based planning and model-based reinforcement learning
- T. Walsh, S. Goschin, and M. Littman, "Integrating sample-based planning and model-based reinforcement learning," in AAAI, 2010.
- (2010) AAAI
- Walsh, T.¹ Goschin, S.² Littman, M.³

9
- 34547975806
- Bandit based Monte-Carlo planning
- L. Kocsis and C. Szepesvári, "Bandit based Monte-Carlo planning," in ECML, 2006.
- (2006) ECML
- Kocsis, L.¹ Szepesvári, C.²

11
- 0004049893
- Ph.D. dissertation, University of Cambridge
- C. Watkins, "Learning from delayed rewards," Ph.D. dissertation, University of Cambridge, 1989.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

13
- 77957352104
- ROS: An open-source robot operating system
- M. Quigley, et al., "ROS: an open-source robot operating system," in ICRA Workshop on Open Source Software, 2009.
- ICRA Workshop on Open Source Software, 2009
- Quigley, M.¹

15
- 0003673017
- Ph.D. dissertation, Pittsburgh, PA, USA
- L.-J. Lin, "Reinforcement learning for robots using neural networks," Ph.D. dissertation, Pittsburgh, PA, USA, 1992.
- (1992) Reinforcement Learning for Robots Using Neural Networks
- Lin, L.-J.¹

16
- 4644323293
- Least-squares policy iteration
- M. Lagoudakis and R. Parr, "Least-squares policy iteration," Journal of Machine Learning Research, vol. 4, pp. 1107-1149, 2003.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.¹ Parr, R.²

17
- 84899464022
- Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
- R. Sutton, et al., "Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction," in AAMAS, 2011.
- (2011) AAMAS
- Sutton, R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.