



2012, Pages 85-90

RTMBA: A real-time model-based reinforcement learning architecture for robot control

Author keywords

[No Author keywords available]

Indexed keywords

DECISION MAKING; PARALLEL ARCHITECTURES; ROBOT PROGRAMMING; ROBOTICS; ROBOTS

EID: 84864454010     PISSN: 10504729     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICRA.2012.6225072     Document Type: Conference Paper
Times cited: 66

References (17)
  • 2
    • 33747195910
    • Machine learning for fast quadrupedal locomotion
    • N. Kohl and P. Stone, "Machine learning for fast quadrupedal locomotion," in AAAI, 2004.
    • (2004) AAAI
    • Kohl, N.1    Stone, P.2
  • 3
    • 3042583887
    • Autonomous helicopter flight via reinforcement learning
    • A. Ng, H. J. Kim, M. Jordan, and S. Sastry, "Autonomous helicopter flight via reinforcement learning," in NIPS 16, 2003.
    • (2003) NIPS , vol.16
    • Ng, A.1    Kim, H.J.2    Jordan, M.3    Sastry, S.4
  • 4
    • 84880854156
    • R-Max - A general polynomial time algorithm for near-optimal reinforcement learning
    • R. Brafman and M. Tennenholtz, "R-Max - a general polynomial time algorithm for near-optimal reinforcement learning," in IJCAI, 2001.
    • (2001) IJCAI
    • Brafman, R.1    Tennenholtz, M.2
  • 5
    • 80053441894
    • PILCO: A model-based and data-efficient approach to policy search
    • M. Deisenroth and C. Rasmussen, "PILCO: A model-based and data-efficient approach to policy search," in ICML, June 2011.
    • (2011) ICML
    • Deisenroth, M.1    Rasmussen, C.2
  • 6
    • 85132026293
    • Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
    • R. Sutton, "Integrated architectures for learning, planning, and reacting based on approximating dynamic programming," in ICML, 1990.
    • (1990) ICML
    • Sutton, R.1
  • 7
    • 56449110907
    • Sample-based learning and search with permanent and transient memories
    • D. Silver, R. Sutton, and M. Müller, "Sample-based learning and search with permanent and transient memories," in ICML, 2008.
    • (2008) ICML
    • Silver, D.1    Sutton, R.2    Müller, M.3
  • 8
    • 85167397400
    • Integrating sample-based planning and model-based reinforcement learning
    • T. Walsh, S. Goschin, and M. Littman, "Integrating sample-based planning and model-based reinforcement learning," in AAAI, 2010.
    • (2010) AAAI
    • Walsh, T.1    Goschin, S.2    Littman, M.3
  • 9
    • 34547975806
    • Bandit based Monte-Carlo planning
    • L. Kocsis and C. Szepesvári, "Bandit based Monte-Carlo planning," in ECML, 2006.
    • (2006) ECML
    • Kocsis, L.1    Szepesvári, C.2
  • 10
    • 78149247074
    • Real time targeted exploration in large domains
    • T. Hester and P. Stone, "Real time targeted exploration in large domains," in ICDL, August 2010.
    • (2010) ICDL
    • Hester, T.1    Stone, P.2
  • 12
    • 84973495235
    • Multiagent interactions in urban driving
    • P. Beeson, et al., "Multiagent interactions in urban driving," Journal of Physical Agents, vol. 2, no. 1, pp. 15-30, March 2008.
    • (2008) Journal of Physical Agents , vol.2 , Issue.1 , pp. 15-30
    • Beeson, P.1
  • 14
    • 70449370276
    • RL-Glue: Language-independent software for reinforcement-learning experiments
    • B. Tanner and A. White, "RL-Glue: Language-independent software for reinforcement-learning experiments," JMLR, vol. 10, Sep. 2009.
    • (2009) JMLR , vol.10
    • Tanner, B.1    White, A.2
  • 17
    • 84899464022
    • Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
    • R. Sutton, et al., "Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction," in AAMAS, 2011.
    • (2011) AAMAS
    • Sutton, R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.