-
2
-
-
0003601068
-
-
(Technical Report GIT-CC-97-11). College of Computing, Georgia Institute of Technology
-
Balch, T. (1997a). Clay: Integrating motor schemas and reinforcement learning (Technical Report GIT-CC-97-11). College of Computing, Georgia Institute of Technology.
-
(1997)
Clay: Integrating Motor Schemas and Reinforcement Learning
-
-
Balch, T.1
-
5
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
Barto, A., Bradtke, S., & Singh, S. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
-
(1995)
Artificial Intelligence
, vol.72
, pp. 81-138
-
-
Barto, A.1
Bradtke, S.2
Singh, S.3
-
7
-
-
27144476780
-
Planning is just a way of avoiding figuring out what to do next
-
R. Brooks (Ed.), Cambridge, MA: MIT Press
-
Brooks, R. (1987). Planning is just a way of avoiding figuring out what to do next. In R. Brooks (Ed.), Cambrian intelligence: The early history of the new AI (pp. 103-110). Cambridge, MA: MIT Press.
-
(1987)
Cambrian Intelligence: The Early History of the New AI
, pp. 103-110
-
-
Brooks, R.1
-
8
-
-
0010535077
-
Intelligence without representation
-
J. Haugeland (Ed.), Cambridge, MA: MIT Press
-
Brooks, R. (1991a). Intelligence without representation. In J. Haugeland (Ed.), Mind design II (pp. 395-420). Cambridge, MA: MIT Press.
-
(1991)
Mind Design II
, pp. 395-420
-
-
Brooks, R.1
-
9
-
-
0009351684
-
The role of learning in autonomous robots
-
M. K. Warmuth and L. G. Valiant (Eds.), San Francisco, CA: Morgan Kaufmann
-
Brooks, R. (1991b). The role of learning in autonomous robots. In M. K. Warmuth and L. G. Valiant (Eds.), Proceedings of the Fourth Annual Workshop on Computational Learning Theory (COLT '91) (pp. 5-10). San Francisco, CA: Morgan Kaufmann.
-
(1991)
Proceedings of the Fourth Annual Workshop on Computational Learning Theory (COLT '91)
, pp. 5-10
-
-
Brooks, R.1
-
10
-
-
35248894899
-
Modularity and specialized learning: Reexamining behavior-based artificial intelligence
-
M. Butz, P. Gérard, & O. Sigaud (Eds.), Berlin: Springer
-
Bryson, J. (2002). Modularity and specialized learning: Reexamining behavior-based artificial intelligence. In M. Butz, P. Gérard, & O. Sigaud (Eds.), Proceedings of the Workshop on Adaptive Behavior in Anticipatory Learning Systems. Berlin: Springer.
-
(2002)
Proceedings of the Workshop on Adaptive Behavior in Anticipatory Learning Systems
-
-
Bryson, J.1
-
11
-
-
0035301619
-
Topological simultaneous localization and mapping (SLAM): Towards exact localization without explicit localization
-
Choset, H., & Nagatani, K. (2001). Topological simultaneous localization and mapping (SLAM): Towards exact localization without explicit localization. IEEE Transactions on Robotics and Automation, 17(2), 125-137.
-
(2001)
IEEE Transactions on Robotics and Automation
, vol.17
, Issue.2
, pp. 125-137
-
-
Choset, H.1
Nagatani, K.2
-
13
-
-
0004782095
-
Learning hierarchical control structures for multiple tasks and changing environments
-
R. Pfeifer, B. Blumberg, J. Meyer, & S. Wilson (Eds.), Cambridge, MA: MIT Press
-
Digney, B. (1998). Learning hierarchical control structures for multiple tasks and changing environments. In R. Pfeifer, B. Blumberg, J. Meyer, & S. Wilson (Eds.), From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior (pp. 321-330). Cambridge, MA: MIT Press.
-
(1998)
From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior
, pp. 321-330
-
-
Digney, B.1
-
14
-
-
0004136810
-
Using local information in a non-local way for mapping graph-like worlds
-
R. Bajcsy (Ed.), San Francisco, CA: Morgan Kaufmann
-
Dudek, G., Freedman, P., & Hadjres, S. (1993). Using local information in a non-local way for mapping graph-like worlds. In R. Bajcsy (Ed.), Proceedings of the International Joint Conference of Artificial Intelligence (pp. 1639-1647). San Francisco, CA: Morgan Kaufmann.
-
(1993)
Proceedings of the International Joint Conference of Artificial Intelligence
, pp. 1639-1647
-
-
Dudek, G.1
Freedman, P.2
Hadjres, S.3
-
15
-
-
85152517921
-
An approach to anytime learning
-
D. H. Sleeman and P. Edwards (Eds.), San Francisco, CA: Morgan Kaufmann
-
Grefenstette, J., & Ramsey, C. (1992). An approach to anytime learning. In D. H. Sleeman and P. Edwards (Eds.), Proceedings of the Ninth International Conference on Machine Learning (pp. 189-195). San Francisco, CA: Morgan Kaufmann.
-
(1992)
Proceedings of the Ninth International Conference on Machine Learning
, pp. 189-195
-
-
Grefenstette, J.1
Ramsey, C.2
-
16
-
-
0011714199
-
-
D. Phil. thesis, School of Cognitive and Computing Sciences, University of Sussex
-
Harvey, I. (1995). The artificial evolution of adaptive behaviour. D. Phil. thesis, School of Cognitive and Computing Sciences, University of Sussex.
-
(1995)
The Artificial Evolution of Adaptive Behaviour
-
-
Harvey, I.1
-
17
-
-
0007914441
-
Action selection methods using reinforcement learning
-
P. Maes, M. Matarić, J.-A. Meyer, J. Pollack, & S. Wilson (Eds.), Cambridge, MA: MIT Press
-
Humphrys, M. (1996). Action selection methods using reinforcement learning. In P. Maes, M. Matarić, J.-A. Meyer, J. Pollack, & S. Wilson (Eds.), From Animals to Animats 4: The Fourth International Conference on the Simulation of Adaptive Behaviour (SAB-96) (pp. 135-144). Cambridge, MA: MIT Press.
-
(1996)
From Animals to Animats 4: The Fourth International Conference on the Simulation of Adaptive Behaviour (SAB-96)
, pp. 135-144
-
-
Humphrys, M.1
-
19
-
-
26444582589
-
-
Lausanne, Switzerland
-
K-Team SA (1999b). Khepera user manual. Lausanne, Switzerland.
-
(1999)
Khepera User Manual
-
-
-
21
-
-
26444470752
-
-
Master's thesis, School of Informatics, University of Edinburgh
-
Konidaris, G. (2003). Behaviour-based reinforcement learning. Master's thesis, School of Informatics, University of Edinburgh.
-
(2003)
Behaviour-based Reinforcement Learning
-
-
Konidaris, G.1
-
22
-
-
84976813028
-
Learning to coordinate behaviors
-
T. Dietterich and W. Swartout (Eds.), Cambridge, MA
-
Maes, P., & Brooks, R. (1990). Learning to coordinate behaviors. In T. Dietterich and W. Swartout (Eds.), Proceedings of the Eighth National Conference on Artificial Intelligence (pp. 796-802). Cambridge, MA.
-
(1990)
Proceedings of the Eighth National Conference on Artificial Intelligence
, pp. 796-802
-
-
Maes, P.1
Brooks, R.2
-
23
-
-
0026880130
-
Automatic programming of behavior-based robots using reinforcement learning
-
Mahadevan, S., & Connell, J. (1992). Automatic programming of behavior-based robots using reinforcement learning. Artificial Intelligence, 55(2-3), 311-365.
-
(1992)
Artificial Intelligence
, vol.55
, Issue.2-3
, pp. 311-365
-
-
Mahadevan, S.1
Connell, J.2
-
24
-
-
0036789790
-
A self-organising network that grows when required
-
Marsland, S., Shapiro, J., & Nehmzow, U. (2002). A self-organising network that grows when required. Neural Networks, 15(8-9), 1041-1058.
-
(2002)
Neural Networks
, vol.15
, Issue.8-9
, pp. 1041-1058
-
-
Marsland, S.1
Shapiro, J.2
Nehmzow, U.3
-
25
-
-
84957895797
-
Reward functions for accelerated learning
-
W. W. Cohen and H. Hirsh (Eds.), San Francisco, CA: Morgan Kaufmann
-
Matarić, M. (1994). Reward functions for accelerated learning. In W. W. Cohen and H. Hirsh (Eds.), Proceedings of the Eleventh International Conference on Machine Learning (pp. 181-189). San Francisco, CA: Morgan Kaufmann.
-
(1994)
Proceedings of the Eleventh International Conference on Machine Learning
, pp. 181-189
-
-
Matarić, M.1
-
26
-
-
0030647149
-
Reinforcement learning in the multi-robot domain
-
Matarić, M. (1997). Reinforcement learning in the multi-robot domain. Autonomous Robots, 4(1), 73-83.
-
(1997)
Autonomous Robots
, vol.4
, Issue.1
, pp. 73-83
-
-
Matarić, M.1
-
27
-
-
26444496413
-
Learning a distributed map representation based on navigation behaviors
-
R. Brooks (Ed.), Cambridge, Massachusetts: The MIT Press
-
Matarić, M., & Brooks, R. (1990). Learning a distributed map representation based on navigation behaviors. In R. Brooks (Ed.), Cambrian intelligence: The early history of the new AI. Cambridge, Massachusetts: The MIT Press.
-
(1990)
Cambrian Intelligence: The Early History of the New AI
-
-
Matarić, M.1
Brooks, R.2
-
28
-
-
0004255908
-
-
London, UK: McGraw-Hill
-
Mitchell, T. (1997). Machine learning. London, UK: McGraw-Hill.
-
(1997)
Machine Learning
-
-
Mitchell, T.1
-
29
-
-
0004156494
-
Evolutionary algorithms for reinforcement learning
-
Moriarty, D., Schultz, A., & Grefenstette, J. (1999). Evolutionary algorithms for reinforcement learning. Journal of Artificial Intelligence Research, 11.
-
(1999)
Journal of Artificial Intelligence Research
, vol.11
-
-
Moriarty, D.1
Schultz, A.2
Grefenstette, J.3
-
30
-
-
84898304094
-
Polarization compass for robot navigation
-
D. Polani, J. Kim, & T. Martinetz (Eds.), Berlin: Akademische Verlagsgesellschaft Aka
-
Schmolke, A., & Mallot, H. (2002). Polarization compass for robot navigation. In D. Polani, J. Kim, & T. Martinetz (Eds.), The Fifth German Workshop on Artificial Life (pp. 163-167). Berlin: Akademische Verlagsgesellschaft Aka.
-
(2002)
The Fifth German Workshop on Artificial Life
, pp. 163-167
-
-
Schmolke, A.1
Mallot, H.2
-
31
-
-
0001898381
-
Practical reinforcement learning in continuous spaces
-
P. Langley (Ed.), San Francisco, CA: Morgan Kaufmann
-
Smart, W., & Kaelbling, L. (2000). Practical reinforcement learning in continuous spaces. In P. Langley (Ed.), Proceedings of the Seventeenth International Conference on Machine Learning (pp. 903-910). San Francisco, CA: Morgan Kaufmann.
-
(2000)
Proceedings of the Seventeenth International Conference on Machine Learning
, pp. 903-910
-
-
Smart, W.1
Kaelbling, L.2
-
32
-
-
0036790898
-
Applications of the self-organising map to reinforcement learning
-
Smith, A. J. (2002). Applications of the self-organising map to reinforcement learning. Neural Networks, 15, 1107-1124.
-
(2002)
Neural Networks
, vol.15
, pp. 1107-1124
-
-
Smith, A.J.1
-
35
-
-
85152618928
-
Planning by incremental dynamic programming
-
L. Birnbaum and G. Collins (Eds.), San Francisco, CA: Morgan Kaufmann
-
Sutton, R. (1991). Planning by incremental dynamic programming. In L. Birnbaum and G. Collins (Eds.), Proceedings of the Ninth Conference on Machine Learning (pp. 353-357). San Francisco, CA: Morgan Kaufmann.
-
(1991)
Proceedings of the Ninth Conference on Machine Learning
, pp. 353-357
-
-
Sutton, R.1
-
37
-
-
26444556750
-
Reinforcement landmark learning
-
R. Pfeifer, B. Blumberg, J. Meyer, & S. Wilson (Eds.), Cambridge, MA: MIT Press
-
Toombs, S., Phillips, W., & Smith, L. (1998). Reinforcement landmark learning. In R. Pfeifer, B. Blumberg, J. Meyer, & S. Wilson (Eds.), From animals to animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior (pp. 205-212). Cambridge, MA: MIT Press.
-
(1998)
From Animals to Animats 5: Proceedings of the Fifth International Conference on Simulation of Adaptive Behavior
, pp. 205-212
-
-
Toombs, S.1
Phillips, W.2
Smith, L.3
-
39
-
-
1142280955
-
Concurrent layered learning
-
J. S. Rosenschein, M. Wooldridge, T. Sandholm and M. Yokoo (Eds.), New York, NY: ACM Press
-
Whiteson, S., & Stone, P. (2003). Concurrent layered learning. In J. S. Rosenschein, M. Wooldridge, T. Sandholm and M. Yokoo (Eds.), Proceedings of the Second International Joint Conference on Autonomous Agents and Multi-Agent Systems (pp. 193-200). New York, NY: ACM Press.
-
(2003)
Proceedings of the Second International Joint Conference on Autonomous Agents and Multi-Agent Systems
, pp. 193-200
-
-
Whiteson, S.1
Stone, P.2