-
1
-
-
0011962595
-
Probabilistic planning for behavior-based robots
-
Atrash, A. and Koenig, S., "Probabilistic Planning for Behavior-based Robots", (2001). Proc. FLAIRS-01, Key West, FL., pp.531-535.
-
(2001)
Proc. FLAIRS-01, Key West, FL
, pp. 531-535
-
-
Atrash, A.1
Koenig, S.2
-
2
-
-
0029182767
-
Vision-based reinforcement learning for purposive behavior acquisition
-
Asada, M., Noda, S., Tawaratsumida, S., and Hosoda, K. (1995). "Vision-Based Reinforcement Learning for Purposive Behavior Acquisition", Proc. IEEE International Conference on Robotics and Automation, pp.146-135.
-
(1995)
Proc. IEEE International Conference on Robotics and Automation
, pp. 146-153
-
-
Asada, M.1
Noda, S.2
Tawaratsumida, S.3
Hosoda, K.4
-
3
-
-
0003404197
-
Behavioral diversity in learning robot teams
-
Ph.D. Dissertation, College of Computing, Georgia Tech.
-
Balch, T. (1998). "Behavioral Diversity in Learning Robot Teams", Ph.D. Dissertation, College of Computing, Georgia Tech.
-
(1998)
-
-
Balch, T.1
-
4
-
-
23044520797
-
Big red: The cornell small league robot soccer team
-
D'Andrea, R. et al. (2000) "Big Red: The Cornell Small League Robot Soccer Team", in Lecture Note in Computer Science, v. 1856, p 657-660
-
(2000)
Lecture Note in Computer Science
, vol.1856
, pp. 657-660
-
-
D'Andrea, R.1
-
5
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
Dietterich, Thomas G. {2000}. "Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition", Journal of Artificial Intelligence Research, 13, pp. 227-303
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
6
-
-
0031365714
-
Interference as a tool for designing and evaluating multi-robot controllers
-
Dani Golberg and Maja J Mataric, "Interference as a Tool for Designing and Evaluating Multi-Robot Controllers", Proceedings, AAAI-97, Providence. Rhode Island, Jul 27-31, 1997, 637-642.
-
Proceedings, AAAI-97, Providence. Rhode Island, Jul 27-31, 1997
, pp. 637-642
-
-
Golberg, D.1
Mataric, M.J.2
-
7
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, Leslie P., Littman, Michael L., and Moore, Andrew W. (1996). "Reinforcement Learning: A Survey", Journal of Artificial Intelligence Research, Volume 4., p. 237-285
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
8
-
-
0034867109
-
Learning momentum: Integration and experimentation
-
Lee, J.B. and Arkin, R.C., (2001). "Learning Momentum: Integration and Experimentation", IEEE International Conference on Robotics and Automation, Seoul, Korea. May 2001. pp. 1975-1980
-
(2001)
IEEE International Conference on Robotics and Automation, Seoul, Korea. May 2001
, pp. 1975-1980
-
-
Lee, J.B.1
Arkin, R.C.2
-
9
-
-
0034868612
-
Spatio-temporal case-based reasoning for behavioral selection
-
Likhachev, M. and Arkin, R.C., (2001). "Spatio-Temporal Case-based Reasoning for Behavioral Selection", IEEE International Conference on Robotics and Automation, Seoul, Korea. pp. 1627-1634
-
(2001)
IEEE International Conference on Robotics and Automation, Seoul, Korea
, pp. 1627-1634
-
-
Likhachev, M.1
Arkin, R.C.2
-
10
-
-
0032051328
-
Evaluating the usability of robot programming toolsets
-
MacKenzie, D. and Arkin, R., "Evaluating the Usability of Robot Programming Toolsets", (1998). International Journal of Robotics Research, Vol. 4, No. 7, pp.381-401
-
(1998)
International Journal of Robotics Research
, vol.4
, Issue.7
, pp. 381-401
-
-
MacKenzie, D.1
Arkin, R.2
-
11
-
-
85158146654
-
Automatic programming of behavior-based robots using reinforcement learning
-
Mahadevan, S. and Connell, J., (1991). "Automatic Programming of Behavior-Based Robots Using Reinforcement Learning", Proc. AAAI-91, pp. 768-73.
-
(1991)
Proc. AAAI-91
, pp. 768-773
-
-
Mahadevan, S.1
Connell, J.2
-
13
-
-
0345183986
-
Robot behavioral selection using Q-learning
-
Martinson, E., Stoychev, A., and Arkin, R. (2002) "Robot Behavioral Selection Using Q-learning", to be appear Proc. of IROS 2002, Lausanne, Switzerland, October 2002.
-
(2002)
Proc. of IROS 2002, Lausanne, Switzerland, October 2002
-
-
Martinson, E.1
Stoychev, A.2
Arkin, R.3
-
14
-
-
84957895797
-
Reward functions for accelerated learning
-
William W. Cohen and Haym Hirsh, eds., Morgan Kaufmann Publishers, San Francisco, CA
-
Mataric′, Maja J. "Reward Functions for Accelerated Learning" in Machine Learning: Proceedings of the Eleventh International Conference, William W. Cohen and Haym Hirsh, eds., Morgan Kaufmann Publishers, San Francisco, CA, 1994, 181-189.
-
(1994)
Machine Learning: Proceedings of the Eleventh International Conference
, pp. 181-189
-
-
Mataric, M.J.1
-
15
-
-
0342398260
-
Task decomposition and dynamic role assignment for real-time strategic teamwork
-
Stone, P. and Veloso, M. (1995). "Task Decomposition and Dynamic Role Assignment for Real-Time Strategic Teamwork", Proceedings of the 5th International Workshop on Intelligent Agents V: Agent Theories, Architectures, and Languages (ATAL-98), Heidelberg, Germany. pages 293-308
-
(1995)
Proceedings of the 5th International Workshop on Intelligent Agents V: Agent Theories, Architectures, and Languages (ATAL-98), Heidelberg, Germany
, pp. 293-308
-
-
Stone, P.1
Veloso, M.2
-
17
-
-
0004049893
-
Learning from delayed rewards
-
Ph.D. Thesis, King's College, Cambridge, UK
-
Watkins, C. (1989). "Learning from Delayed Rewards", Ph.D. Thesis, King's College, Cambridge, UK.
-
(1989)
-
-
Watkins, C.1
|