SCOPUS Information Retrieval Platform

IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews

Volume 30, Issue 4, 2000, Pages 485-497

Multiagent reinforcement learning using function approximation

(3) Abul, Osman (a); Polat, Faruk (a); Alhajj, Reda (a)

(a) Middle East Technical University (Turkey)

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; ARTIFICIAL INTELLIGENCE; LEARNING ALGORITHMS; MULTI AGENT SYSTEMS; MULTIAGENT REINFORCEMENT LEARNING; LEARNING SYSTEMS

EID: 0034313638; PISSN: 10946977; EISSN: None; Source Type: Journal
DOI: 10.1109/5326.897075; Document Type: Article

Times cited: 90

References (38)

1
- 33749932965
- 1994. LNAI 130.
- F. Polat and A. Guvenir, "A conflict resolution based decentralized multi-agent problem solving model," in Artificial Social Systems. Berlin, Germany: Springer-Verlag, 1994. LNAI 130.
- "A Conflict Resolution Based Decentralized Multi-agent Problem Solving Model," in Artificial Social Systems. Berlin, Germany: Springer-Verlag
- Polat, F.¹ Guvenir, A.²

2
- 33749888666
- 1995.
- S. Benson, "Reacting, Planning and Learning in an Autonomous Agent," Ph.D. thesis, Comput. Sci. Dept., Stanford Univ., Stanford, CA, 1995.
- "Reacting, Planning and Learning in an Autonomous Agent," Ph.D. Thesis, Comput. Sci. Dept., Stanford Univ., Stanford, CA
- Benson, S.¹

3
- 33749937079
- 1995.
- L. Baird, "Residual algorithms: Reinforcement learning with function approximation," in Proc. Int. Conf. Machine Learning, 1995.
- "Residual Algorithms: Reinforcement Learning with Function Approximation," in Proc. Int. Conf. Machine Learning
- Baird, L.¹

4
- 0000723997
- 1996.
- R. S. Sutton, "Generalization in reinforcement learning: Successful examples using sparse coarse coding," in Advances in Neural Information Processing Systems, 1996.
- "Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding," in Advances in Neural Information Processing Systems
- Sutton, R.S.¹

5
- 0004007508
- 1998.
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- Reinforcement Learning: an Introduction. Cambridge, MA: MIT Press
- Sutton, R.S.¹ Barto, A.G.²

6
- 33749935265
- 1995.
- G. A. Rummery, "Problem Solving with Reinforcement Learning," Ph.D. dissertation, Eng. Dept., Cambridge Univ., Cambridge, U.K., 1995.
- "Problem Solving with Reinforcement Learning," Ph.D. Dissertation, Eng. Dept., Cambridge Univ., Cambridge, U.K.
- Rummery, G.A.¹

7
- 33749920435
- 1996.
- T. W. Sandholm and R. H. Crites, "On multiagent Q-learning in a semi-competitive domain," in Adaption and Learning in Multi-Agent Systems, G. Weiss and S. Sen, Eds. Berlin, Germany: Springer-Verlag, 1996.
- "On Multiagent Q-learning in a Semi-competitive Domain," in Adaption and Learning in Multi-Agent Systems, G. Weiss and S. Sen, Eds. Berlin, Germany: Springer-Verlag
- Sandholm, T.W.¹ Crites, R.H.²

8
- 34249833101
- 1992.
- C. J. C. H. Watkins and P. Dayan, "Q-learning," Mach. Learn., vol. 8, pp. 279-292, 1992.
- "Q-learning," Mach. Learn., Vol. 8, Pp. 279-292
- Watkins, C.J.¹ Dayan, P.²

9
- 0029679044
- 1996.
- L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," J. Artif. Intell. Res., vol. 4, pp. 237-285, 1996.
- "Reinforcement Learning: a Survey," J. Artif. Intell. Res., Vol. 4, Pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

10
- 33749883843
- 1996.
- P. Langley, Elements of Machine Learning. San Mateo, CA: Morgan Kaufmann, 1996.
- Elements of Machine Learning. San Mateo, CA: Morgan Kaufmann
- Langley, P.¹

11
- 33749914374
- 1997.
- J. W. Sheppard, "Multi-Agent Reinforcement Learning in Markov Games," Ph.D. dissertation, Johns Hopkins Univ., Baltimore, MD, 1997.
- "Multi-Agent Reinforcement Learning in Markov Games," Ph.D. Dissertation, Johns Hopkins Univ., Baltimore, MD
- Sheppard, J.W.¹

12
- 33749940305
- 1994.
- S. P. Singh, "Learning to Solve Markov Decision Processes," Ph.D. dissertation, Dept. Comput. Sci., Univ. Mass., Boston, 1994.
- "Learning to Solve Markov Decision Processes," Ph.D. Dissertation, Dept. Comput. Sci., Univ. Mass., Boston
- Singh, S.P.¹

13
- 0032073263
- 1998.
- L. P. Kaelbling et al., "Planning and acting in partially observable stochastic domains," Artif. Intell., vol. 101, 1998.
- "Planning and Acting in Partially Observable Stochastic Domains," Artif. Intell., Vol. 101
- Kaelbling, L.P.¹

14
- 33749888665
- 1999.
- N. Meuleau et al., "Learning finite-state controllers for partially observable environments," in Proc. Int. Conf. Uncertainty Artificial Intelligence, 1999.
- "Learning Finite-state Controllers for Partially Observable Environments," in Proc. Int. Conf. Uncertainty Artificial Intelligence
- Meuleau, N.¹

15
- 33749874773
- pp. 175-204.
- D. J. C. MacKay, "Introduction to Monte Carlo methods," in Learning in Graphical Models, M. I. Jordan, Ed. Cambridge, MA: MIT Press, 1999, pp. 175-204.
- "Introduction to Monte Carlo Methods," in Learning in Graphical Models, M. I. Jordan, Ed. Cambridge, MA: MIT Press, 1999
- MacKay, D.J.C.¹

16
- 0000123778
- 1992.
- L. J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Mach. Learn., vol. 8, pp. 293-321, 1992.
- "Self-improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching," Mach. Learn., Vol. 8, Pp. 293-321
- Lin, L.J.¹

17
- 33749951855
- 1997.
- P. Cichosz, "Reinforcement learning by truncating temporal differences," Ph.D. dissertation, Dept. Electron. Inform. Technol., Warsaw Univ. Technol., Warsaw, Poland, 1997.
- "Reinforcement Learning by Truncating Temporal Differences," Ph.D. Dissertation, Dept. Electron. Inform. Technol., Warsaw Univ. Technol., Warsaw, Poland
- Cichosz, P.¹

18
- 33749953200
- 1992.
- J. A. Boyan, "Modular neural networks for learning context-dependent game strategies," M.Sc. thesis, Dept. Eng., Cambridge Univ., Cambridge, U.K., 1992.
- "Modular Neural Networks for Learning Context-dependent Game Strategies," M.Sc. Thesis, Dept. Eng., Cambridge Univ., Cambridge, U.K.
- Boyan, J.A.¹

19
- 0003433293
- 1993.
- M. J. Zurada, Introduction to Artificial Neural Systems. New York: West, 1993.
- Introduction to Artificial Neural Systems. New York: West
- Zurada, M.J.¹

20
- 33749946976
- pp. 1017-1023.
- R. H. Crites and A. G. Barto, "Improving elevator performance using reinforcement learning," in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 1996, vol. 8, pp. 1017-1023.
- "Improving Elevator Performance Using Reinforcement Learning," in Advances in Neural Information Processing Systems. Cambridge, MA: MIT Press, 1996, Vol. 8
- Crites, R.H.¹ Barto, A.G.²

21
- 33749981849
- 1995.
- L. Gambardella and M. Dorigo, "Ant-Q: A reinforcement learning approach to the traveling salesman problem," IEEE Trans. Syst., Man, Cybern. B, vol. 26, no. 1, pp. 29-41, 1995.
- "Ant-Q: a Reinforcement Learning Approach to the Traveling Salesman Problem," IEEE Trans. Syst., Man, Cybern. B, Vol. 26, No. 1, Pp. 29-41
- Gambardella, L.¹ Dorigo, M.²

22
- 33749905629
- 1998.
- J. Hu and M. P. Wellman, "Multiagent reinforcement learning and stochastic games," Games Econ. Behav., 1998.
- "Multiagent Reinforcement Learning and Stochastic Games," Games Econ. Behav.
- Hu, J.¹ Wellman, M.P.²

23
- 33749890637
- 1994.
- M. L. Littman, "Markov games as a framework for multi-agent reinforcement learning," in Proc. Int. Conf. Machine Learning, 1994.
- "Markov Games as a Framework for Multi-agent Reinforcement Learning," in Proc. Int. Conf. Machine Learning
- Littman, M.L.¹

24
- 0012228023
- pp. 189-196.
- V. Miagkikh and W. Punch, "Global search in combinatorial optimization using reinforcement learning algorithms," in Proc. 1999 Congr. Evolutionary Computation, vol. 1, 1999, pp. 189-196.
- "Global Search in Combinatorial Optimization Using Reinforcement Learning Algorithms," in Proc. 1999 Congr. Evolutionary Computation, Vol. 1, 1999
- Miagkikh, V.¹ Punch, W.²

25
- 85194539042
- pp. 147-166.
- T. W. Sandholm and R. H. Crites, "Multiagent reinforcement learning in the iterated prisoner's dilemma," Biosystems, vol. 37, pp. 147-166.
- "Multiagent Reinforcement Learning in the Iterated Prisoner's Dilemma," Biosystems, Vol. 37
- Sandholm, T.W.¹ Crites, R.H.²

26
- 84898972974
- pp. 974-980.
- S. P. Singh and D. Bertsekas, "Reinforcement learning for dynamic channel allocation in cellular telephone systems," in Proc. Advanced Neural Information Processing Systems, 1996, pp. 974-980.
- "Reinforcement Learning for Dynamic Channel Allocation in Cellular Telephone Systems," in Proc. Advanced Neural Information Processing Systems, 1996
- Singh, S.P.¹ Bertsekas, D.²

27
- 33749938224
- 1995.
- M. Asada et al., "Agents that learn from other competitive agents," in Proc. Machine Learning Workshop Agents That Learn from Other Agents, 1995.
- "Agents that Learn from Other Competitive Agents," in Proc. Machine Learning Workshop Agents that Learn from Other Agents
- Asada, M.¹

28
- 84965539827
- 1993.
- F. Polat, "A negotiation platform for cooperating multi-agent systems," Int. J. Concurrent Eng., no. 3, 1993.
- "A Negotiation Platform for Cooperating Multi-agent Systems," Int. J. Concurrent Eng., No. 3
- Polat, F.¹

29
- 33749939303
- 1997.
- D. C. Parkes and L. H. Ungar, "Learning and adaption in multiagent systems," in Proc. AAAI Multiagent Learning Workshop, 1997.
- "Learning and Adaption in Multiagent Systems," in Proc. AAAI Multiagent Learning Workshop
- Parkes, D.C.¹ Ungar, L.H.²

30
- 33749918390
- 1995.
- A. Schaerf, Y. Shoham, and M. Tennenholtz, "Adaptive load balancing: A study in multi-agent learning," J. Artif. Intell. Res., vol. 2, pp. 475-500, 1995.
- "Adaptive Load Balancing: a Study in Multi-agent Learning," J. Artif. Intell. Res., Vol. 2, Pp. 475-500
- Schaerf, A.¹ Shoham, Y.² Tennenholtz, M.³

31
- 33749948588
- pp. 84-89.
- S. Sen and M. Sekaran, "Multiagent coordination with learning classifier systems," in Proc. IJCAI Workshop Adaptation Learning Multi-Agent Systems, 1995, pp. 84-89.
- "Multiagent Coordination with Learning Classifier Systems," in Proc. IJCAI Workshop Adaptation Learning Multi-Agent Systems, 1995
- Sen, S.¹ Sekaran, M.²

32
- 33749959707
- 1997.
- S. Sen and T. Haynes, "Co-adaptation in a team," Int. J. Comput. Intell. Organ., vol. 1, no. 4, 1997.
- "Co-adaptation in a Team," Int. J. Comput. Intell. Organ., Vol. 1, No. 4
- Sen, S.¹ Haynes, T.²

33
- 85152198941
- pp. 330-337.
- M. Tan, "Multi-agent reinforcement learning: Independent vs. cooperative agents," in Proc. Int. Conf. Machine Learning, 1993, pp. 330-337.
- "Multi-agent Reinforcement Learning: Independent Vs. Cooperative Agents," in Proc. Int. Conf. Machine Learning, 1993
- Tan, M.¹

34
- 33749934091
- 1994.
- G. Weiss, "Learning to coordinate actions in multi-agent systems," in Proc. Nat. Conf. AI, 1994.
- "Learning to Coordinate Actions in Multi-agent Systems," in Proc. Nat. Conf. AI
- Weiss, G.¹

35
- 32144443210
- 1995.
- J. Carbonell et al., "Integrating planning and learning: The PRODIGY architecture," J. Theoret. Exper. Artif. Intell., vol. 7, no. 1, 1995.
- "Integrating Planning and Learning: the PRODIGY Architecture," J. Theoret. Exper. Artif. Intell., Vol. 7, No. 1
- Carbonell, J.¹

36
- 33749953771
- vol. 1042.
- C. V. Goldman and J. S. Rosenschein, "Mutually supervised learning in multiagent systems," in Proceedings of Adaptation and Learning in Multi-Agent Systems IJCAI95 Workshop, Lecture Notes in Artificial Intelligence, G. Weiss and S. Sen, Eds. Berlin, Germany: Springer Verlag, 1995, vol. 1042.
- "Mutually Supervised Learning in Multiagent Systems," in Proceedings of Adaptation and Learning in Multi-Agent Systems IJCAI95 Workshop, Lecture Notes in Artificial Intelligence, G. Weiss and S. Sen, Eds. Berlin, Germany: Springer Verlag, 1995
- Goldman, C.V.¹ Rosenschein, J.S.²

37
- 33749981848
- pp. 252-258.
- N. Ono and K. Fukumoto, "Multi-agent reinforcement learning: A modular approach," in Proc. Int. Conf. Multi-Agent Systems, 1996, pp. 252-258.
- "Multi-agent Reinforcement Learning: a Modular Approach," in Proc. Int. Conf. Multi-Agent Systems, 1996
- Ono, N.¹ Fukumoto, K.²

38
- 33749964779
- 1997.
- M. Veloso and W. Uther, Adversarial Reinforcement Learning, 1997.
- Adversarial Reinforcement Learning
- Veloso, M.¹ Uther, W.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.