[1] Mohit Aron, Peter Druschel, and Willy Zwaenepoel. Cluster reserves: A mechanism for resource management in cluster-based network servers. In Measurement and Modeling of Computer Systems, pages 90-101, 2000. (record ID: 0034441439)
[3] Michael Bowling. Convergence and no-regret in multiagent learning. In NIPS'05, pages 209-216, 2005. (record ID: 84899027977)
[4] Justin A. Boyan and Michael L. Littman. Packet routing in dynamically changing networks: A reinforcement learning approach. In NIPS'94, volume 6, pages 671-678, 1994. (record ID: 0000719863)
[5] Caroline Claus and Craig Boutilier. The dynamics of reinforcement learning in cooperative multiagent systems. In AAAI'98, pages 746-752. AAAI Press, 1998. (record ID: 0031630561)
[6] Shailesh Kumar and Risto Miikkulainen. Confidence based dual reinforcement Q-routing: An adaptive online network routing algorithm. In IJCAI '99, pages 758-763, 1999. (record ID: 18544386026)
[7] K. Ramamritham, J. A. Stankovic, and W. Zhao. Distributed scheduling of tasks with deadlines and resource requirements. IEEE Trans. Comput., 38(8):1110-1123, 1989. (record ID: 0024716426)
[9] Satinder P. Singh, Tommi Jaakkola, Michael L. Littman, and Csaba Szepesvári. Convergence results for single-step on-policy reinforcement-learning algorithms. Machine Learning, 38(3):287-308, 2000. (record ID: 0033901602)
[10] Gerald Tesauro. Online resource allocation using decompositional reinforcement learning. In Manuela M. Veloso and Subbarao Kambhampati, editors, AAAI, pages 886-891. AAAI Press / The MIT Press, 2005. (record ID: 29344462255)
[12] Martin Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In ICML'03, pages 928-936, 2003. (record ID: 1942484421)