-
1
-
-
0028731901
-
Coordination of multiple behaviors acquired by a avision-based reinforcement learning
-
Munich, Germany
-
M. Asada, E. Uchibe, S. Noda, S. Tawaratsumida and K. Hosoda, Coordination of multiple behaviors acquired by a avision-based reinforcement learning, Proc. IEEE/RSJ/GI Int. Conf. on Intelligent Robots and Systems, Munich, Germany (1994).
-
(1994)
Proc. IEEE/RSJ/GI Int. Conf. on Intelligent Robots and Systems
-
-
Asada, M.1
Uchibe, E.2
Noda, S.3
Tawaratsumida, S.4
Hosoda, K.5
-
3
-
-
0023206906
-
A hardware retargetable distributed layered architecture for mobile robot control
-
Raleigh, NC
-
R.A. Brooks, A hardware retargetable distributed layered architecture for mobile robot control, IEEE Int. Conf. on Robotics and Automation, Raleigh, NC (1987) 106-110.
-
(1987)
IEEE Int. Conf. on Robotics and Automation
, pp. 106-110
-
-
Brooks, R.A.1
-
5
-
-
84949961376
-
Opponent modeling in multi-agent systems
-
G. Weiss and S. Sen, eds., Lecture Notes in Artificial Intelligence, Springer, Berlin
-
D. Carmel and S. Markovitch, Opponent modeling in multi-agent systems, in: G. Weiss and S. Sen, eds., Adaptation and Learning in Multi-Agent Systems, Lecture Notes in Artificial Intelligence, Vol. 1042 (Springer, Berlin, 1996) 40-52.
-
(1996)
Adaptation and Learning in Multi-agent Systems
, vol.1042
, pp. 40-52
-
-
Carmel, D.1
Markovitch, S.2
-
6
-
-
0002192119
-
Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
-
Sydney, Australia
-
D. Chapman and L.P. Kaelbling, Input generalization in delayed reinforcement learning: An algorithm and performance comparisons, Proc. IJCAI-91, Sydney, Australia (1991).
-
(1991)
Proc. IJCAI-91
-
-
Chapman, D.1
Kaelbling, L.P.2
-
7
-
-
0020234371
-
Dynamics of hierarchy formation: The sequential development of dominance relationships
-
I.D. Chase, Dynamics of hierarchy formation: The sequential development of dominance relationships, Behaviour 80 (1982) 218-240.
-
(1982)
Behaviour
, vol.80
, pp. 218-240
-
-
Chase, I.D.1
-
8
-
-
0001443421
-
Two methods for quantifying the development of dominance hierarchies in large groups with application to Harris' sparrows
-
I.D. Chase and S. Rohwer, Two methods for quantifying the development of dominance hierarchies in large groups with application to Harris' sparrows, Animal Behavior 35 (1987) 1113-1128.
-
(1987)
Animal Behavior
, vol.35
, pp. 1113-1128
-
-
Chase, I.D.1
Rohwer, S.2
-
10
-
-
0006495925
-
Imitation: A review and critique
-
Bateson and Klopfer, eds., Plenum Press, New York
-
J.M. Davis, Imitation: A review and critique, in: Bateson and Klopfer, eds., Perspectives in Ethology, Vol. 1 (Plenum Press, New York, 1973).
-
(1973)
Perspectives in Ethology
, vol.1
-
-
Davis, J.M.1
-
11
-
-
0001595196
-
Self-organization mechanisms in ant societies, II: Learning in foraging and division of labor
-
J.L. Deneubourg, S. Goss, J.M. Pasteels, D. Fresneau and J.P. Lachaud, Self-organization mechanisms in ant societies, II: Learning in foraging and division of labor, From Individual to Collective Behavior in Social Insects 54 (1987) 177-196.
-
(1987)
From Individual to Collective Behavior in Social Insects
, vol.54
, pp. 177-196
-
-
Deneubourg, J.L.1
Goss, S.2
Pasteels, J.M.3
Fresneau, D.4
Lachaud, J.P.5
-
12
-
-
0027706284
-
Overeager reciprocal rationality and mixed strategy equilibria
-
Washington, DC
-
E.H. Durfee, J. Lee and P.J. Gmytrasiewicz, Overeager reciprocal rationality and mixed strategy equilibria, in: Proc. AAAI-93, Washington, DC (1993) 225-230.
-
(1993)
Proc. AAAI-93
, pp. 225-230
-
-
Durfee, E.H.1
Lee, J.2
Gmytrasiewicz, P.J.3
-
14
-
-
84949968554
-
Mutually supervised learning in multiagent systems
-
G. Weiss and S. Sen, eds., Lecture Notes in Artificial Intelligence, Springer, Berlin
-
C.V. Goldman and J.S. Rosenschein, Mutually supervised learning in multiagent systems, in: G. Weiss and S. Sen, eds., Adaptation and Learning in Multi-Agent Systems, Lecture Notes in Artificial Intelligence, Vol. 1042 (Springer, Berlin, 1996) 85-96.
-
(1996)
Adaptation and Learning in Multi-agent Systems
, vol.1042
, pp. 85-96
-
-
Goldman, C.V.1
Rosenschein, J.S.2
-
16
-
-
84949933346
-
A framework for distributed reinforcement learning
-
G. Weiss and S. Sen, eds., Lecture Notes in Artificial Intelligence, Springer, Berlin
-
P. Gu and A.B. Maddox, A framework for distributed reinforcement learning, in: G. Weiss and S. Sen, eds., Adaptation and Learning in Multi-Agent Systems, Lecture Notes in Artificial Intelligence, Vol. 1042 (Springer, Berlin, 1996) 97-112.
-
(1996)
Adaptation and Learning in Multi-agent Systems
, vol.1042
, pp. 97-112
-
-
Gu, P.1
Maddox, A.B.2
-
17
-
-
84949940071
-
Evolving behavioral strategies in predators and prey
-
G. Weiss and S. Sen, eds., Lecture Notes in Artificial Intelligence, Springer, Berlin
-
T. Haynes and S. Sen, Evolving behavioral strategies in predators and prey, in: G. Weiss and S. Sen, eds., Adaptation and Learning in Multi-Agent Systems, Lecture Notes in Artificial Intelligence, Vol. 1042 (Springer, Berlin, 1996) 113-126.
-
(1996)
Adaptation and Learning in Multi-agent Systems
, vol.1042
, pp. 113-126
-
-
Haynes, T.1
Sen, S.2
-
21
-
-
85151437138
-
Programming robots using reinforcement learning and teaching
-
Pittsburgh, PA
-
L.-J. Lin, Programming robots using reinforcement learning and teaching, in: Proc. AAAI-91, Pittsburgh, PA (1991) 781-786.
-
(1991)
Proc. AAAI-91
, pp. 781-786
-
-
Lin, L.-J.1
-
22
-
-
0344050557
-
Self-improving reactive agents: Case studies of reinforcement learning frameworks
-
MIT Press, Cambridge, MA
-
L.-J. Lin, Self-improving reactive agents: Case studies of reinforcement learning frameworks, in: From Animals to Animals: Int. Conf. on Simulation of Adaptive Behavior (MIT Press, Cambridge, MA, 1991).
-
(1991)
From Animals to Animals: Int. Conf. on Simulation of Adaptive Behavior
-
-
Lin, L.-J.1
-
24
-
-
4243545005
-
-
Technical Report, Computer Science Department, CS-90-104, University of Tennessee
-
B.J. MacLennan, Evolution of communication in a population of simple machines, Technical Report, Computer Science Department, CS-90-104, University of Tennessee, 1990.
-
(1990)
Evolution of Communication in a Population of Simple Machines
-
-
MacLennan, B.J.1
-
25
-
-
0027133962
-
Syntethic ecology and the evolution of cooperative communication
-
B.J. MacLennan and G.M. Burghardt, Syntethic ecology and the evolution of cooperative communication, Adaptive Behavior 2 (2) (1994) 161-188.
-
(1994)
Adaptive Behavior
, vol.2
, Issue.2
, pp. 161-188
-
-
MacLennan, B.J.1
Burghardt, G.M.2
-
26
-
-
84976813028
-
Learning to coordinate behaviors
-
Boston, MA
-
P. Maes and R.A. Brooks, Learning to coordinate behaviors, in: Proc. AAAI-91, Boston, MA (1990) 796-802.
-
(1990)
Proc. AAAI-91
, pp. 796-802
-
-
Maes, P.1
Brooks, R.A.2
-
27
-
-
0002386181
-
Automatic programming of behavior-based robots using reinforcement learning
-
Pittsburgh, PA
-
S. Mahadevan and J. Connell, Automatic programming of behavior-based robots using reinforcement learning, Proc. AAAI-91, Pittsburgh, PA (1991) 8-14.
-
(1991)
Proc. AAAI-91
, pp. 8-14
-
-
Mahadevan, S.1
Connell, J.2
-
28
-
-
79955966783
-
Scaling reinforcement learning to robotics by exploiting the subsumption architecture
-
Morgan Kaufmann, Los Altos, CA
-
S. Mahadevan and J. Connell, Scaling reinforcement learning to robotics by exploiting the subsumption architecture, Proc. 8th Int. Workshop on Machine Learning (Morgan Kaufmann, Los Altos, CA, 1991) 328-337.
-
(1991)
Proc. 8th Int. Workshop on Machine Learning
, pp. 328-337
-
-
Mahadevan, S.1
Connell, J.2
-
29
-
-
30244450052
-
Designing emergent behaviors: From local interactions to collective intelligence
-
J.-A. Meyer, H. Roitblat and S. Wilson, eds.
-
M.J. Matarić, Designing emergent behaviors: From local interactions to collective intelligence, in: J.-A. Meyer, H. Roitblat and S. Wilson, eds., From Animals to Animals: Int. Conf. on Simulation of Adaptive Behavior.
-
From Animals to Animals: Int. Conf. on Simulation of Adaptive Behavior
-
-
Matarić, M.J.1
-
30
-
-
0000824463
-
Kin recognition, similarity, and group behavior
-
Boulder, Colorado
-
M.J. Matarić, Kin recognition, similarity, and group behavior, Proc. 15th Annual Conf. on Cognitive Science Society, Boulder, Colorado (1993) 705-710.
-
(1993)
Proc. 15th Annual Conf. on Cognitive Science Society
, pp. 705-710
-
-
Matarić, M.J.1
-
31
-
-
84957895797
-
Reward functions for accelerated learning
-
W.W. Cohen and H. Hirsh, eds., Morgan Kauffman, New Brunswick, NJ
-
M.J. Matarić, Reward functions for accelerated learning, in: W.W. Cohen and H. Hirsh, eds., Proc. 11th Int. Conf. on Machine Learning (ML-94), (Morgan Kauffman, New Brunswick, NJ, 1994) 181-189.
-
(1994)
Proc. 11th Int. Conf. on Machine Learning (ML-94)
, pp. 181-189
-
-
Matarić, M.J.1
-
32
-
-
79957721960
-
Designing and understanding adaptive group behavior
-
M.J. Matarić, Designing and understanding adaptive group behavior, Adaptive Behavior 4 (1) (1995) 50-81.
-
(1995)
Adaptive Behavior
, vol.4
, Issue.1
, pp. 50-81
-
-
Matarić, M.J.1
-
33
-
-
0000434835
-
Cooperative multi-robot box-pushing
-
IEEE Computer Society Press, Los Alamitos, CA
-
M.J. Matarić, M. Nilsson and K.T. Simsarian, Cooperative multi-robot box-pushing, Proc. IROS-95 (IEEE Computer Society Press, Los Alamitos, CA, 1995).
-
(1995)
Proc. IROS-95
-
-
Matarić, M.J.1
Nilsson, M.2
Simsarian, K.T.3
-
34
-
-
0003514044
-
-
Benjamin Cummings, Menlo Park, CA
-
D. McFarland, Animal Behavior (Benjamin Cummings, Menlo Park, CA, 1985).
-
(1985)
Animal Behavior
-
-
McFarland, D.1
-
36
-
-
0004558935
-
Strategic social planning: Looking for willingness in multi-agent domains
-
Boulder, Colorado
-
M. Miceli and A. Cesta, Strategic social planning: Looking for willingness in multi-agent domains, Proc. 15th Annual Conf. on Cognitive Science Society, Boulder, Colorado (1993) 741-746.
-
(1993)
Proc. 15th Annual Conf. on Cognitive Science Society
, pp. 741-746
-
-
Miceli, M.1
Cesta, A.2
-
37
-
-
0010527819
-
Learning reactive sequences from basic reflexes
-
MIT Press, Brighton
-
J.D.R. Millán, Learning reactive sequences from basic reflexes, Proc. Simulation of Adaptive Behavior, SAB-94 (MIT Press, Brighton, 1994) 266-274.
-
(1994)
Proc. Simulation of Adaptive Behavior, Sab-94
, pp. 266-274
-
-
Millán, J.D.R.1
-
38
-
-
84949966497
-
Learn your opponent's strategy (in polynomial time)!
-
G. Weiss and S. Sen, eds., Lecture Notes in Artificial Intelligence, Springer, Berlin
-
Y. Mor, C.V. Goldman and J.S. Rosenschein, Learn your opponent's strategy (in polynomial time)!, in: G. Weiss and S. Sen, eds., Adaptation and Learning in Multi-Agent Systems, Lecture Notes in Artificial Intelligence, Vol. 1042 (Springer, Berlin, 1996) 164-176.
-
(1996)
Adaptation and Learning in Multi-agent Systems
, vol.1042
, pp. 164-176
-
-
Mor, Y.1
Goldman, C.V.2
Rosenschein, J.S.3
-
39
-
-
84949948828
-
Learning to reduce communication cost on task negotiation among multiple autonomous mobile robots
-
G. Weiss and S. Sen, eds., Lecture Notes in Artificial Intelligence, Springer, Berlin
-
T. Ohko, K. Hiraki and Y. Anzai, Learning to reduce communication cost on task negotiation among multiple autonomous mobile robots, in: G. Weiss and S. Sen, eds., Adaptation and Learning in Multi-Agent Systems, Lecture Notes in Artificial Intelligence, Vol. 1042 (Springer, Berlin, 1996) 177-190.
-
(1996)
Adaptation and Learning in Multi-agent Systems
, vol.1042
, pp. 177-190
-
-
Ohko, T.1
Hiraki, K.2
Anzai, Y.3
-
43
-
-
84949966897
-
On multiagent Q-learning in a semi-competitive domain
-
G. Weiss and S. Sen, eds., Lecture Notes in Artificial Intelligence, Springer, Berlin
-
T.W. Sandholm and R.H. Crites, On multiagent Q-learning in a semi-competitive domain, in: G. Weiss and S. Sen, eds., Adaptation and Learning in Multi-Agent Systems, Lecture Notes in Artificial Intelligence, Vol. 1042 (Springer, Berlin, 1996) 191-217.
-
(1996)
Adaptation and Learning in Multi-agent Systems
, vol.1042
, pp. 191-217
-
-
Sandholm, T.W.1
Crites, R.H.2
-
44
-
-
84949993441
-
Multiagent coordination with learning classifier systems
-
G. Weiss and S. Sen, eds., Lecture Notes in Artificial Intelligence, Springer, Berlin
-
S. Sen and M. Sekaran, Multiagent coordination with learning classifier systems, in: G. Weiss and S. Sen, eds., Adaptation and Learning in Multi-Agent Systems, Lecture Notes in Artificial Intelligence, Vol. 1042 (Springer, Berlin, 1996) 218-233.
-
(1996)
Adaptation and Learning in Multi-agent Systems
, vol.1042
, pp. 218-233
-
-
Sen, S.1
Sekaran, M.2
-
46
-
-
33847202724
-
Learning to predict by method of temporal differences
-
R. Sutton, Learning to predict by method of temporal differences, Machine Learning 3 (1) (1988) 9-44.
-
(1988)
Machine Learning
, vol.3
, Issue.1
, pp. 9-44
-
-
Sutton, R.1
-
47
-
-
85152198941
-
Multi-agent reinforcement learning: Independent vs. Cooperative agents
-
Amherst, MA
-
M. Tan, Multi-agent reinforcement learning: Independent vs. cooperative agents, Proc. 10th Int. Conf. on Machine Learning, Amherst, MA (1993) 330-337.
-
(1993)
Proc. 10th Int. Conf. on Machine Learning
, pp. 330-337
-
-
Tan, M.1
-
48
-
-
0007651192
-
Integrating inductive neural network learning and explanation-based learning
-
Chambery, France
-
S.B. Thrun and T.M. Mitchell, Integrating inductive neural network learning and explanation-based learning, Proc. IJCAI-93, Chambery, France (1993).
-
(1993)
Proc. IJCAI-93
-
-
Thrun, S.B.1
Mitchell, T.M.2
|