-
2
-
-
33749888666
-
-
1995.
-
S. Benson, "Reacting, Planning and Learning in an Autonomous Agent," Ph.D. thesis, Comput. Sci. Dept., Stanford Univ., Stanford, CA, 1995.
-
"Reacting, Planning and Learning in an Autonomous Agent," Ph.D. Thesis, Comput. Sci. Dept., Stanford Univ., Stanford, CA
-
-
Benson, S.1
-
6
-
-
33749935265
-
-
1995.
-
G. A. Rummery, "Problem Solving with Reinforcement Learning," Ph.D. dissertation, Eng. Dept., Cambridge Univ., Cambridge, U.K., 1995.
-
"Problem Solving with Reinforcement Learning," Ph.D. Dissertation, Eng. Dept., Cambridge Univ., Cambridge, U.K.
-
-
Rummery, G.A.1
-
7
-
-
33749920435
-
-
1996.
-
T. W. Sandholm and R. H. Crites, "On multiagent Q-learning in a semi-competitive domain," in Adaption and Learning in Multi-Agent Systems, G. Weiss and S. Sen, Eds. Berlin, Germany: Springer-Verlag, 1996.
-
"On Multiagent Q-learning in a Semi-competitive Domain," in Adaption and Learning in Multi-Agent Systems, G. Weiss and S. Sen, Eds. Berlin, Germany: Springer-Verlag
-
-
Sandholm, T.W.1
Crites, R.H.2
-
8
-
-
34249833101
-
-
1992.
-
C. J. C. H. Watkins and P. Dayan, "Q-learning," Mach. Learn., vol. 8, pp. 279-292, 1992.
-
"Q-learning," Mach. Learn., Vol. 8, Pp. 279-292
-
-
Watkins, C.J.C.H.1
Dayan, P.2
-
9
-
-
0029679044
-
-
1996.
-
L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," J. Artif. Intell. Res., vol. 4, pp. 237-285, 1996.
-
"Reinforcement Learning: a Survey," J. Artif. Intell. Res., Vol. 4, Pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
11
-
-
33749914374
-
-
1997.
-
J. W. Sheppard, "Multi-Agent Reinforcement Learning in Markov Games," Ph.D. dissertation, Johns Hopkins Univ., Baltimore, MD, 1997.
-
"Multi-Agent Reinforcement Learning in Markov Games," Ph.D. Dissertation, Johns Hopkins Univ., Baltimore, MD
-
-
Sheppard, J.W.1
-
12
-
-
33749940305
-
-
1994.
-
S. P. Singh, "Learning to Solve Markov Decision Processes," Ph.D. dissertation, Dept. Comput. Sci., Univ. Mass., Boston, 1994.
-
"Learning to Solve Markov Decision Processes," Ph.D. Dissertation, Dept. Comput. Sci., Univ. Mass., Boston
-
-
Singh, S.P.1
-
15
-
-
33749874773
-
-
pp. 175-204.
-
D. J. C. MacKay, "Introduction to Monte Carlo methods," in Learning in Graphical Models, M. I. Jordan, Ed. Cambridge, MA: MIT Press, 1999, pp. 175-204.
-
"Introduction to Monte Carlo Methods," in Learning in Graphical Models, M. I. Jordan, Ed. Cambridge, MA: MIT Press, 1999
-
-
MacKay, D.J.C.1
-
16
-
-
0000123778
-
-
1992.
-
L. J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Mach. Learn., vol. 8, pp. 293-321, 1992.
-
"Self-improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching," Mach. Learn., Vol. 8, Pp. 293-321
-
-
Lin, L.J.1
-
17
-
-
33749951855
-
-
1997.
-
P. Cichosz, "Reinforcement learning by truncating temporal differences," Ph.D. dissertation, Dept. Electron. Inform. Technol., Warsaw Univ. Technol., Warsaw, Poland, 1997.
-
"Reinforcement Learning by Truncating Temporal Differences," Ph.D. Dissertation, Dept. Electron. Inform. Technol., Warsaw Univ. Technol., Warsaw, Poland
-
-
Cichosz, P.1
-
18
-
-
33749953200
-
-
1992.
-
J. A. Boyan, "Modular neural networks for learning context-dependent game strategies," M.Sc. thesis, Dept. Eng., Cambridge Univ., Cambridge, U.K., 1992.
-
"Modular Neural Networks for Learning Context-dependent Game Strategies," M.Sc. Thesis, Dept. Eng., Cambridge Univ., Cambridge, U.K.
-
-
Boyan, J.A.1
-
20
-
-
33749946976
-
-
pp. 1017-1023.
-
R. H. Crites and A. G. Barto, "Improving elevator performance using reinforcement learning," in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 1996, vol. 8, pp. 1017-1023.
-
"Improving Elevator Performance Using Reinforcement Learning," in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 1996, Vol. 8
-
-
Crites, R.H.1
Barto, A.G.2
-
21
-
-
33749981849
-
-
1995.
-
L. Gambardella and M. Dorigo, "Ant-Q: A reinforcement learning approach to the traveling salesman problem," IEEE Trans. Syst., Man, Cybern. B, vol. 26, no. 1, pp. 29-41, 1995.
-
"Ant-Q: a Reinforcement Learning Approach to the Traveling Salesman Problem," IEEE Trans. Syst., Man, Cybern. B, Vol. 26, No. 1, Pp. 29-41
-
-
Gambardella, L.1
Dorigo, M.2
-
24
-
-
0012228023
-
-
pp. 189-196.
-
V. Miagkikh and W. Punch, "Global search in combinatorial optimization using reinforcement learning algorithms," in Proc. 1999 Congr. Evolutionary Computation, vol. 1, 1999, pp. 189-196.
-
"Global Search in Combinatorial Optimization Using Reinforcement Learning Algorithms," in Proc. 1999 Congr. Evolutionary Computation, Vol. 1, 1999
-
-
Miagkikh, V.1
Punch, W.2
-
30
-
-
33749918390
-
-
1995.
-
A. Schaerf, Y. Shoham, and M. Tennenholtz, "Adaptive load balancing: A study in multi-agent learning," J. Artif. Intell. Res., vol. 2, pp. 475-500, 1995.
-
"Adaptive Load Balancing: a Study in Multi-agent Learning," J. Artif. Intell. Res., Vol. 2, Pp. 475-500
-
-
Schaerf, A.1
Shoham, Y.2
Tennenholtz, M.3
-
32
-
-
33749959707
-
-
1997.
-
S. Sen and T. Haynes, "Co-adaptation in a team," Int. J. Comput. Intell. Organ., vol. 1, no. 4, 1997.
-
"Co-adaptation in a Team," Int. J. Comput. Intell. Organ., Vol. 1, No. 4
-
-
Sen, S.1
Haynes, T.2
-
36
-
-
33749953771
-
-
vol. 1042.
-
C. V. Goldman and J. S. Rosenschein, "Mutually supervised learning in multiagent systems," in Proceedings of Adaptation and Learning in Multi-Agent Systems IJCAI95 Workshop, Lecture Notes in Artificial Intelligence, G. Weiss and S. Sen, Eds. Berlin, Germany: Springer-Verlag, 1995, vol. 1042.
-
"Mutually Supervised Learning in Multiagent Systems," in Proceedings of Adaptation and Learning in Multi-Agent Systems IJCAI95 Workshop, Lecture Notes in Artificial Intelligence, G. Weiss and S. Sen, Eds. Berlin, Germany: Springer-Verlag, 1995
-
-
Goldman, C.V.1
Rosenschein, J.S.2