-
3
-
-
0000719863
-
Packet routing in dynamically changing networks: A reinforcement learning approach
-
J. Boyan and M. Littman, "Packet routing in dynamically changing networks: A reinforcement learning approach," Adv. Neural Inf. Process. Syst., vol. 6, pp. 671-678, 1994.
-
(1994)
Adv. Neural Inf. Process. Syst.
, vol.6
, pp. 671-678
-
-
Boyan, J.1
Littman, M.2
-
4
-
-
0011847654
-
Distributed reinforcement learning, load-based routing: A case study
-
Stockholm, Sweden
-
A. Nowé and K. Verbeeck, "Distributed reinforcement learning, load-based routing: A case study," in Notes of the Neural, Symbolic, and Reinforcement Methods for Sequence Learning Workshop at IJCAI, Stockholm, Sweden, 1999, pp. 85-91.
-
(1999)
Notes of the Neural, Symbolic, and Reinforcement Methods for Sequence Learning Workshop at IJCAI
, pp. 85-91
-
-
Nowé, A.1
Verbeeck, K.2
-
6
-
-
0022738693
-
Decentralized learning in finite Markov chains
-
June
-
R.M. Wheeler and K.S. Narendra, "Decentralized learning in finite Markov chains," IEEE Trans. Automat. Contr., vol. AC-31, pp. 519-526, June 1986.
-
(1986)
IEEE Trans. Automat. Contr.
, vol.AC-31
, pp. 519-526
-
-
Wheeler, R.M.1
Narendra, K.S.2
-
7
-
-
0030082551
-
The ant system: Optimization by a colony of cooperating agents
-
Feb.
-
M. Dorigo, V. Maniezzo, and A. Colorni, "The ant system: Optimization by a colony of cooperating agents," IEEE Trans. Syst., Man, Cybern. B, vol. 26, pp. 29-41, Feb. 1996.
-
(1996)
IEEE Trans. Syst., Man, Cybern. B
, vol.26
, pp. 29-41
-
-
Dorigo, M.1
Maniezzo, V.2
Colorni, A.3
-
8
-
-
0033084695
-
Ant algorithms for discrete optimization
-
M. Dorigo, G. Di Caro, and L.M. Gambardella, "Ant algorithms for discrete optimization," Artif. Life, vol. 5, no. 2, pp. 137-172, 1999.
-
(1999)
Artif. Life
, vol.5
, Issue.2
, pp. 137-172
-
-
Dorigo, M.1
Di Caro, G.2
Gambardella, L.M.3
-
9
-
-
0000873984
-
AntNet: Stigmergetic control for communications networks
-
G. Di Caro and M. Dorigo, "AntNet: Stigmergetic control for communications networks," J. Artif. Intell. Res., vol. 9, pp. 317-365, 1998.
-
(1998)
J. Artif. Intell. Res.
, vol.9
, pp. 317-365
-
-
Di Caro, G.1
Dorigo, M.2
-
10
-
-
0029292154
-
Adaptive coordination in distributed systems with delayed communication
-
Apr.
-
E.A. Billard and J.C. Pasquale, "Adaptive coordination in distributed systems with delayed communication," IEEE Trans. Syst., Man, Cybern., vol. 25, pp. 546-554, Apr. 1995.
-
(1995)
IEEE Trans. Syst., Man, Cybern.
, vol.25
, pp. 546-554
-
-
Billard, E.A.1
Pasquale, J.C.2
-
11
-
-
0027602378
-
Coadaptive behavior in a simple distributed job scheduling system
-
May/June
-
A. Glockner and J.C. Pasquale, "Coadaptive behavior in a simple distributed job scheduling system," IEEE Trans. Syst., Man, Cybern., vol. 23, pp. 902-907, May/June 1993.
-
(1993)
IEEE Trans. Syst., Man, Cybern.
, vol.23
, pp. 902-907
-
-
Glockner, A.1
Pasquale, J.C.2
-
12
-
-
84880690163
-
Sequential optimality and coordination in multi-agent systems
-
Stockholm, Sweden
-
C. Boutilier, "Sequential optimality and coordination in multi-agent systems," in Proc. IJCAI, Stockholm, Sweden, 1999, pp. 478-485.
-
(1999)
Proc. IJCAI
, pp. 478-485
-
-
Boutilier, C.1
-
13
-
-
0000929496
-
Multi-agent reinforcement learning: Theoretical framework and an algorithm
-
J. Hu and M.P. Wellman, "Multi-agent reinforcement learning: Theoretical framework and an algorithm," in Proc. 15th Int. Conf. Machine Learning, 1998, pp. 242-250.
-
(1998)
Proc. 15th Int. Conf. Machine Learning
, pp. 242-250
-
-
Hu, J.1
Wellman, M.P.2
-
14
-
-
0011807751
-
ACO strategies for dynamic TSP
-
M. Guntsch, J. Branke, M. Middendorf, and H. Schmek, "ACO strategies for dynamic TSP," in Proc. 2nd Int. Workshop Ant Algorithms, 2000, pp. 59-62.
-
(2000)
Proc. 2nd Int. Workshop Ant Algorithms
, pp. 59-62
-
-
Guntsch, M.1
Branke, J.2
Middendorf, M.3
Schmek, H.4
-
15
-
-
0031122887
-
Ant colony system: A cooperative learning approach to the travelling salesman problem
-
Jan.
-
M. Dorigo and L.M. Gambardella, "Ant colony system: A cooperative learning approach to the travelling salesman problem," IEEE Trans. Evol. Comput., vol. 1, pp. 53-66, Jan. 1997.
-
(1997)
IEEE Trans. Evol. Comput.
, vol.1
, pp. 53-66
-
-
Dorigo, M.1
Gambardella, L.M.2
-
17
-
-
0028497630
-
Asynchronous stochastic approximation and q-learning
-
J.N. Tsitsiklis, "Asynchronous stochastic approximation and q-learning," Mach. Learn., vol. 16, pp. 185-202, 1994.
-
(1994)
Mach. Learn.
, vol.16
, pp. 185-202
-
-
Tsitsiklis, J.N.1
-
19
-
-
85149834820
-
Markov games as a framework for multiagent reinforcement learning
-
M.L. Litmann, "Markov games as a framework for multiagent reinforcement learning," in Proc. 11th Int. Conf. Machine Learning, 1994, pp. 157-163.
-
(1994)
Proc. 11th Int. Conf. Machine Learning
, pp. 157-163
-
-
Litmann, M.L.1
-
20
-
-
34249833101
-
Q-learning
-
C. Watkins and P. Dayan, "Q-learning," Mach. Learn., vol. 8, no. 3, pp. 279-292, 1992.
-
(1992)
Mach. Learn.
, vol.8
, Issue.3
, pp. 279-292
-
-
Watkins, C.1
Dayan, P.2
-
22
-
-
0032687842
-
Learning in multilevel games with incomplete information-Part I
-
June
-
E.A. Billard and S. Lakshmivarahan, "Learning in multilevel games with incomplete information-Part I," IEEE Trans. Syst., Man, Cybern. B, vol. 29, pp. 329-339, June 1999.
-
(1999)
IEEE Trans. Syst., Man, Cybern. B
, vol.29
, pp. 329-339
-
-
Billard, E.A.1
Lakshmivarahan, S.2
|