-
1
-
-
0011962595
-
Probabilistic planning for behavior-based robots
-
Amin, A. and Koenig, S., "Probabilistic Planning for Behavior-based Robots", (2001). Proc. FLAIRS-01, Key West, FL., pp.531-535.
-
(2001)
Proc. FLAIRS-01, Key West, FL
, pp. 531-535
-
-
Amin, A.1
Koenig, S.2
-
3
-
-
0028731901
-
Coordination of multiple behaviors acquired by vision-based reinforcement learning
-
Asada, M., Uchibe, E., Noda, S., Tawaratsumida, S., and Hosoda, K. (1994)., "Coordination Of Multiple Behaviors Acquired By Vision-Based Reinforcement Learning", Proc. IEEE International Conference on Intelligent Robots and Systems, pp.917-924.
-
(1994)
Proc. IEEE International Conference on Intelligent Robots and Systems
, pp. 917-924
-
-
Asada, M.1
Uchibe, E.2
Noda, S.3
Tawaratsumida, S.4
Hosoda, K.5
-
4
-
-
0029182767
-
Vision-based reinforcement learning for purposive behavior acquisition
-
Asada, M., Noda, S., Tawaratsumida, S., and Hosoda, K. (1995). "Vision-Based Reinforcement Learning for Purposive Behavior Acquisition", Proc. IEEE International Conference on Robotics and Automation, pp.146-153.
-
(1995)
Proc. IEEE International Conference on Robotics and Automation
, pp. 146-153
-
-
Asada, M.1
Noda, S.2
Tawaratsumida, S.3
Hosoda, K.4
-
5
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
Dietterich, Thomas G. 2000. "Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition", Journal of Artificial Intelligence Research, 13, pp.227-303
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
8
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, L.P., Littman, M.L., and Moore, A. W. (1996). "Reinforcement Learning: A Survey", Journal of Artificial Intelligence Research, Volume 4.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
9
-
-
0034867109
-
Learning momentum: Integration and experimentation
-
Lee, J.B. and Arkin, R.C., (2001). "Learning Momentum: Integration and Experimentation", IEEE International Conference on Robotics and Automation, Seoul, Korea.
-
(2001)
IEEE International Conference on Robotics and Automation, Seoul, Korea
-
-
Lee, J.B.1
Arkin, R.C.2
-
10
-
-
0034868612
-
Spatio-temporal case-based reasoning for behavioral selection
-
Likhachev, M. and Arkin, R.C., (2001). "Spatio-Temporal Case-based Reasoning for Behavioral Selection", IEEE International Conference on Robotics and Automation, Seoul, Korea.
-
(2001)
IEEE International Conference on Robotics and Automation, Seoul, Korea
-
-
Likhachev, M.1
Arkin, R.C.2
-
11
-
-
84976813028
-
Learning to coordinate behaviors
-
San Mateo, CA
-
Maes, P. and Brooks, R. (1990). "Learning to coordinate behaviors". Proc. Eighth National Conf. on AI, pp. 896-802, San Mateo, CA.
-
(1990)
Proc. Eighth National Conf. on AI
, pp. 896-802
-
-
Maes, P.1
Brooks, R.2
-
12
-
-
0032051328
-
Evaluating the usability of robot programming toolsets
-
MacKenzie, D. and Arkin, R., "Evaluating the Usability of Robot Programming Toolsets", (1998). International Journal of Robotics Research, Vol. 4, No. 7, pp. 381-401.
-
(1998)
International Journal of Robotics Research
, vol.4
, Issue.7
, pp. 381-401
-
-
MacKenzie, D.1
Arkin, R.2
-
13
-
-
85158146654
-
Automatic programming of behavior-based robots using reinforcement learning
-
Mahadevan, S. and Connell, J., (1991). "Automatic Programming of Behavior-Based Robots Using Reinforcement Learning", Proc. AAAI-91, pp. 768-73.
-
(1991)
Proc. AAAI-91
, pp. 768-773
-
-
Mahadevan, S.1
Connell, J.2
-
14
-
-
0011965080
-
Robot behavioral selection using Q-learning
-
Technical Report GIT-CC-01-19, College of Computing, Georgia Institute of Technology, Atlanta, GA
-
Martinson, E., Stoychev, A. and Arkin, R. (2001). "Robot Behavioral Selection Using Q-Learning", Technical Report GIT-CC-01-19, College of Computing, Georgia Institute of Technology, Atlanta, GA.
-
(2001)
-
-
Martinson, E.1
Stoychev, A.2
Arkin, R.3
-
15
-
-
0031336564
-
Shaping robot behavior using principles from instrumental conditioning
-
Saksida, L.M., Raymond, S.M., and Touretzky, D.S. (1998). "Shaping robot behavior using principles from instrumental conditioning", Robotics and Autonomous Systems, 22(3/4):231-249.
-
(1998)
Robotics and Autonomous Systems
, vol.22
, Issue.3-4
, pp. 231-249
-
-
Saksida, L.M.1
Raymond, S.M.2
Touretzky, D.S.3
-
18
-
-
0004049893
-
Learning from delayed rewards
-
Ph.D. Thesis, King's College, Cambridge, UK
-
Watkins, C. (1989) "Learning from Delayed Rewards", Ph.D. Thesis, King's College, Cambridge, UK.
-
(1989)
-
-
Watkins, C.1
|