메뉴 건너뛰기




Volumn 1, Issue , 2002, Pages 970-977

Robot behavioral selection using Q-learning

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SIMULATION; EFFICIENCY; HIERARCHICAL SYSTEMS; LEARNING ALGORITHMS; ROBOT LEARNING; ROBOTICS;

EID: 0036452535     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (24)

References (18)
  • 1
    • 0011962595 scopus 로고    scopus 로고
    • Probabilistic planning for behavior-based robots
    • Amin, A. and Koenig, S., "Probabilistic Planning for Behavior-based Robots", (2001). Proc. FLAIRS-01, Key West, FL., pp.531-535.
    • (2001) Proc. FLAIRS-01, Key West, FL , pp. 531-535
    • Amin, A.1    Koenig, S.2
  • 5
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, Thomas G. 2000. "Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition", Journal of Artificial Intelligence Research, 13, pp.227-303
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 11
    • 84976813028 scopus 로고
    • Learning to coordinate behaviors
    • San Mateo, CA
    • Maes, P. and Brooks, R. (1990). "Learning to coordinate behaviors". Proc. Eighth National Conf. on AI, pp. 896-802, San Mateo, CA.
    • (1990) Proc. Eighth National Conf. on AI , pp. 896-802
    • Maes, P.1    Brooks, R.2
  • 12
    • 0032051328 scopus 로고    scopus 로고
    • Evaluating the usability of robot programming toolsets
    • MacKenzie, D. and Arkin, R., "Evaluating the Usability of Robot Programming Toolsets", (1998). International Journal of Robotics Research, Vol. 4, No. 7, pp. 381-401.
    • (1998) International Journal of Robotics Research , vol.4 , Issue.7 , pp. 381-401
    • MacKenzie, D.1    Arkin, R.2
  • 13
    • 85158146654 scopus 로고
    • Automatic programming of behavior-based robots using reinforcement learning
    • Mahadevan, S. and Connell, J., (1991). "Automatic Programming of Behavior-Based Robots Using Reinforcement Learning", Proc. AAAI-91, pp. 768-73.
    • (1991) Proc. AAAI-91 , pp. 768-773
    • Mahadevan, S.1    Connell, J.2
  • 14
    • 0011965080 scopus 로고    scopus 로고
    • Robot behavioral selection using Q-learning
    • Technical Report GIT-CC-01-19, College of Computing, Georgia Institute of Technology, Atlanta, GA
    • Martinson, E., Stoychev, A. and Arkin, R. (2001). "Robot Behavioral Selection Using Q-Learning", Technical Report GIT-CC-01-19, College of Computing, Georgia Institute of Technology, Atlanta, GA.
    • (2001)
    • Martinson, E.1    Stoychev, A.2    Arkin, R.3
  • 15
    • 0031336564 scopus 로고    scopus 로고
    • Shaping robot behavior using principles from instrumental conditioning
    • Saksida, L.M., Raymond, S.M., and Touretzky, D.S. (1998). "Shaping robot behavior using principles from instrumental conditioning", Robotics and Autonomous Systems, 22(3/4):231-249.
    • (1998) Robotics and Autonomous Systems , vol.22 , Issue.3-4 , pp. 231-249
    • Saksida, L.M.1    Raymond, S.M.2    Touretzky, D.S.3
  • 18
    • 0004049893 scopus 로고
    • Learning from delayed rewards
    • Ph.D. Thesis, King's College, Cambridge, UK
    • Watkins, C. (1989) "Learning from Delayed Rewards", Ph.D. Thesis, King's College, Cambridge, UK.
    • (1989)
    • Watkins, C.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.