-
1
-
-
0036874366
-
The complexity of decentralized control of Markov decision processes
-
Daniel S. Bernstein, Robert Givan, Neil Immerman, and Shlomo Zilberstein. The complexity of decentralized control of markov decision processes. Mathematics of Operations Research, 27(4):819-840, 2002.
-
(2002)
Mathematics of Operations Research
, vol.27
, Issue.4
, pp. 819-840
-
-
Bernstein, D.S.1
Givan, R.2
Immerman, N.3
Zilberstein, S.4
-
2
-
-
0000719863
-
Packet routing in dynamically changing networks: A reinforcement learning approach
-
Justin A. Boyan and Michael L. Littman. Packet routing in dynamically changing networks: A reinforcement learning approach. In NIPS'94, Volume 6, pages 671-678, 1994.
-
(1994)
NIPS'94
, vol.6
, pp. 671-678
-
-
Boyan, J.A.1
Littman, M.L.2
-
3
-
-
84899033169
-
Using collective intelligence to route internet traffic
-
David H. Wolpert, Kagan Turner, and Jeremy Frank. Using collective intelligence to route internet traffic. In NIPS'99, pages 952-958, 1999.
-
(1999)
NIPS'99
, pp. 952-958
-
-
Wolpert, D.H.1
Turner, K.2
Frank, J.3
-
4
-
-
44949217895
-
Information-directed routing in sensor networks using real-time reinforcement learning
-
Ying Zhang, Juan Liu, and Feng Zhao. Information-directed routing in sensor networks using real-time reinforcement learning. Combinatorial Optimization in Communication Networks, 18:259-288, 2006.
-
(2006)
Combinatorial Optimization in Communication Networks
, vol.18
, pp. 259-288
-
-
Zhang, Y.1
Liu, J.2
Zhao, F.3
-
5
-
-
78751684474
-
A multi-agent learning approach to online distributed resource allocation
-
Chongjie Zhang, Victor Lesser, and Prashant Shenoy. A Multi-Agent Learning Approach to Online Distributed Resource Allocation. In IJCAI'09, 2009.
-
(2009)
IJCAI'09
-
-
Zhang, C.1
Lesser, V.2
Shenoy, P.3
-
6
-
-
84899413300
-
A reinforcement learning based distributed search algorithm for hierarchical content sharing systems
-
Haizheng Zhang and Victor Lesser. A reinforcement learning based distributed search algorithm for hierarchical content sharing systems. In AAMAS'07, 2007.
-
(2007)
AAMAS'07
-
-
Zhang, H.1
Lesser, V.2
-
7
-
-
85156187730
-
Improving elevator performance using reinforcement learning
-
Robert Crites and Andrew Barto. Improving elevator performance using reinforcement learning. In NIPS'96, pages 1017-1023, 1996.
-
(1996)
NIPS'96
, pp. 1017-1023
-
-
Crites, R.1
Barto, A.2
-
8
-
-
84899884456
-
Integrating organizational control into multi-agent learning
-
Chongjie Zhang, Sherief Abdallah, and Victor Lesser. Integrating organizational control into multi-agent learning. In AAMAS'09, 2009.
-
(2009)
AAMAS'09
-
-
Zhang, C.1
Abdallah, S.2
Lesser, V.3
-
9
-
-
84878960852
-
Multiagent reinforcement learning and self-organization in a network of agents
-
Sherief Abdallah and Victor Lesser. Multiagent reinforcement learning and self-organization in a network of agents. In AAMAS'07, 2007.
-
(2007)
AAMAS'07
-
-
Abdallah, S.1
Lesser, V.2
-
11
-
-
39149136585
-
Losing quantitative models to search for appropriate organizational designs
-
Bryan Horling and Victor Lesser. Losing quantitative models to search for appropriate organizational designs. Autonomous Agents and Multi-Agent Systems, 16(2):95-149, 2008.
-
(2008)
Autonomous Agents and Multi-Agent Systems
, vol.16
, Issue.2
, pp. 95-149
-
-
Horling, B.1
Lesser, V.2
-
12
-
-
1142304657
-
Self-organization through bottom-up coalition formation
-
Mark Sims, Claudia Goldman, and Victor Lesser. Self-Organization through Bottom-up Coalition Formation. In AAMAS'03, pages 867-874, 2003.
-
(2003)
AAMAS'03
, pp. 867-874
-
-
Sims, M.1
Goldman, C.2
Lesser, V.3
-
13
-
-
14344256227
-
-
PhD thesis, Stanford University, Stanford, CA, USA
-
Carlos Ernesto Guestrin. Planning under uncertainty in complex structured environments. PhD thesis, Stanford University, Stanford, CA, USA, 2003.
-
(2003)
Planning Under Uncertainty in Complex Structured Environments
-
-
Guestrin, C.E.1
-
14
-
-
36348956362
-
Average-reward decentralized Markov decision processes
-
Marek Petrik and Shlomo Zilberstein. Average-reward decentralized markov decision processes. In IJCAI, pages 1997-2002, 2007.
-
(2007)
IJCAI
, pp. 1997-2002
-
-
Petrik, M.1
Zilberstein, S.2
-
15
-
-
4544301377
-
Decentralized Markov decision processes with event-driven interactions
-
Raphen Becker, Victor Lesser, and Shlomo Zilberstein. Decentralized Markov Decision Processes with Event-Driven Interactions. In AAMAS'04, Volume 1, pages 302-309, 2004.
-
(2004)
AAMAS'04
, vol.1
, pp. 302-309
-
-
Becker, R.1
Lesser, V.2
Zilberstein, S.3
-
16
-
-
29344437834
-
Networked distributed pomdps: A synthesis of distributed constraint optimization and pomdps
-
Ranjit Nair, Pradeep Varakantham, Milind Tambe, and Makoto Yokoo. Networked distributed pomdps: a synthesis of distributed constraint optimization and pomdps. In AAAI'05, pages 133-139, 2005.
-
(2005)
AAAI'05
, pp. 133-139
-
-
Nair, R.1
Varakantham, P.2
Tambe, M.3
Yokoo, M.4
-
17
-
-
44149093158
-
An application of automated negotiation to distributed task allocation
-
Michael Krainin, Bo An, and Victor Lesser. An Application of Automated Negotiation to Distributed Task Allocation. In IAT'07, pages 138-145, 2007.
-
(2007)
IAT'07
, pp. 138-145
-
-
Krainin, M.1
An, B.2
Lesser, V.3
-
18
-
-
44149121550
-
Learning the task allocation game
-
Sherief Abdallah and Victor Lesser. Learning the task allocation game. In AAMAS'06, 2006.
-
(2006)
AAMAS'06
-
-
Abdallah, S.1
Lesser, V.2
|