SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 2457, Issue , 2002, Pages 264-272

Minimax fuzzy Q-learning in cooperative multi-agent systems

(2) Kilic, Alper a Arslan, Ahmet a

a FIRAT UNIVERSITY (Turkey)

Author keywords

[No Author keywords available]

Indexed keywords

INFORMATION SYSTEMS; INFORMATION USE; MACHINE LEARNING; OPTIMIZATION; REINFORCEMENT LEARNING;

FUNCTION MAPPING; FUZZY STATE; FUZZY-Q-LEARNING; LEARNING METHODS; MINIMAX-Q LEARNING; MULTI AGENT COOPERATION; MULTI-AGENT REINFORCEMENT LEARNING; OPTIMAL POLICIES;

MULTI AGENT SYSTEMS;

EID: 80053654030 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-36077-8_27 Document Type: Conference Paper

Times cited : (3)

References (28)

1
- 84951728431
- PhD Thesis, University of Cambridge, England
- Waduns, C. J. C. H., "Learnzng pom delayed rewards", PhD Thesis, University of Cambridge, England, 1989.
- (1989) Learnzng Pom Delayed Rewards
- Waduns, C.¹

2
- 84951788107
- Machlne Learnhg, 3, 9 4 4
- Sutton, R. S., "Learnzng to predzct by the methods of temporal dzferences", Machlne Learnhg, 3, 9 4 4
- Learnzng to Predzct by the Methods of Temporal Dzferences
- Sutton, R.S.¹

3
- 84951735656
- MS Thesis, Middle East Technical University
- Kuter, U., "S-Learnzng: A Multz-Agent Reznforcement Learnzng Method', MS Thesis, Middle East Technical University, 2000.
- (2000) S-Learnzng: A Multz-Agent Reznforcement Learnzng Method
- Kuter, U.¹

4
- 84951751027
- PhD thesis, John Hopkins University
- Sheppard J. W., "Multz agentreznforcement learnzng & Markov Games", PhD thesis, John Hopkins University, 1997.
- (1997) Multz Agentreznforcement Learnzng & Markov Games
- Sheppard, J.W.¹

5
- 84951791360
- Draft, Submitted to AHRL Workshop
- Andre, D., "Learnzng Hzerarchzcal Behaviors", Draft, Submitted to AHRL Workshop, 1998.
- (1998) Learnzng Hzerarchzcal Behaviors
- Andre, D.¹

6
- 1842453586
- Fzndzng Sub-optzmal Policies Faster zn Multz-Agent Systems: FQ-Learnzng
- California, USA, March 25-27
- Kilic, A, Kaya, M., Arslan, A., "Fzndzng Sub-optzmal Policies Faster zn Multz-Agent Systems: FQ-Learnzng", the 7th International Conference on Intelligent Autonomous System, California, USA, March 25-27, 2002
- (2002) 7Th International Conference on Intelligent Autonomous System
- Kilic, A.¹ Kaya, M.² Arslan, A.³

7
- 0002335248
- Multz-agent reznforcement learning: A modular approach
- Ono, N., Fukomoto, K., "Multz-agent reznforcement learning: A modular approach". In Proceedings of the Second International Conference on Multi-Agent System (ICMAS96), pp: 252-258, 1996.
- (1996) Proceedings of the Second International Conference on Multi-Agent System (ICMAS96) , pp. 252-258
- Ono, N.¹ Fukomoto, K.²

8
- 85152198941
- Multz-agent reznforcement learnzng: Zndependent vs, cooperatzve agents
- Tan, M., "Multz-agent reznforcement learnzng: zndependent vs, cooperatzve agents" In Proceedings of the Tenth International Conference on Macline Learnhg (ICML-93), pp: 330-337, 1993.
- (1993) Proceedings of the Tenth International Conference on Macline Learnhg (ICML-93) , pp. 330-337
- Tan, M.¹

9
- 85149834820
- Markov games as a framework for multi agent reinforcement learning
- San Francisco, CA
- Littman M. L., “Markov games as a framework for multi agent reinforcement learning”, Proceedings of the Eleventh International Conference on Machine Learning, pp. 157-163. San Francisco, CA 1994.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

10
- 0030050933
- Multi agent reinforcement learning in the Iterated Prisoners Dilemma”
- Sandholm, T.W.; Crites, R. H., “Multi agent reinforcement learning in the Iterated Prisoner’s Dilemma”, Biosystems, 37:147–166, 1995.
- (1995) Biosystems , vol.37 , pp. 147-166
- Sandholm, T.W.¹ Crites, R.H.²

11
- 0000929496
- Multi-agent reinforcement learning: Theoretical framework and an algorithm
- Hu, J.; Wellman, M. P., “Multi-agent reinforcement learning: theoretical framework and an algorithm” In Proceedings of the Fifteenth International Conference on Machine Learning (ICML-98), pp: 242–250, 1998.
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning (ICML-98) , pp. 242-250
- Hu, J.¹ Wellman, M.P.²

12
- 0028555752
- Learning to coordinate without sharing information
- Sen, S.; Sekeran, M.; Hale, J., “Learning to coordinate without sharing information”. In proceedings of the Twelfth National Conference on Artificial Intelligence (AAAI-94), pp: 426–431, 1994.
- (1994) Proceedings of the Twelfth National Conference on Artificial Intelligence (AAAI-94) , pp. 426-431
- Sen, S.¹ Sekeran, M.² Hale, J.³

13
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- Lin, L. J. ,“Self-improving reactive agents based on reinforcement learning, planning and teaching“, Machine Learning, Vol: 8, pp: 293-321, 1992.
- (1992) Machine Learning , vol.8 , pp. 293-332
- Lin, L.J.¹

14
- 1842610379
- Neural reinforcement learning for behavior synthesis
- Lille, July
- Touzet, P., “Neural reinforcement learning for behavior synthesis”, In Proceedings of CESA’96 IMACS Multi-conference, Lille, July 1996.
- Proceedings of CESA’96 IMACS Multi-Conference , pp. 1996
- Touzet, P.¹

15
- 0033280134
- Cooperation and coordination between fuzzy reinforcement learning agents in continuous state partially observable markov decision processes
- Berenji, H.; Vengerov, D., “Cooperation and coordination between fuzzy reinforcement learning agents in continuous state partially observable markov decision processes”, In Proceedings of the 8th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE'99) 1999.
- (1999) Proceedings of the 8Th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE'99)
- Berenji, H.¹ Vengerov, D.²

16
- 0033685787
- Advantage of cooperation between reinforcement learning agents in difficult stochastic problems
- Berenji, H.; Vengerov, D., “Advantage of cooperation between reinforcement learning agents in difficult stochastic problems”, In Proceedings of the 9th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE'00) 2000.
- (2000) Proceedings of the 9Th IEEE International Conference on Fuzzy Systems (FUZZ-IEEE'00)
- Berenji, H.¹ Vengerov, D.²

17
- 1842610384
- Fuzzy-Reinforcement Learning in Cooperative Multi-Agent Systems
- Turkey, November 5–7
- Kaya, M.; Kilic, A., “Fuzzy-Reinforcement Learning in Cooperative Multi-Agent Systems”, International Symposium on Computer and Information Sciences (ISCIS 2001), Turkey, November 5–7, 2001
- (2001) International Symposium on Computer and Information Sciences (ISCIS 2001)
- Kaya, M.¹ Kilic, A.²

18
- 34249833101
- Technical Note: Q-Learning
- Watkins, C. J. C. H.; Dayan P., “Technical Note: Q-Learning” Machine Learning, 8:279-292, 1992.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

19
- 0001547175
- Value-function reinforcement learning in Markov games
- Littman, M. L., “Value-function reinforcement learning in Markov games”, Journal of Cognitive Systems Research, vol: 2, pp: 55–66, 2001.
- (2001) Journal of Cognitive Systems Research , vol.2 , pp. 55-66
- Littman, M.L.¹

20
- 84951730143
- Fuzzy Q-learning
- Milano, Italy, September, 18–19
- Glorennec, P. Y.; Jouffe, L., “Fuzzy Q-learning”, Second European Workshop on Reinforcement Learning, Milano, Italy, September, 18–19, 1995.
- (1995) Second European Workshop on Reinforcement Learning
- Glorennec, P.Y.¹ Jouffe, L.²

21
- 0026923465
- Learning and tuning fuzzy logic controllers through reinforcement
- Sept
- Berenji, H.; Khedkar, P., “Learning and tuning fuzzy logic controllers through reinforcement”, IEEE Trans. on Neural Networks, 3(5), Sept. 1992.
- (1992) IEEE Trans. On Neural Networks , vol.3 , Issue.5
- Berenji, H.¹ Khedkar, P.²

22
- 84951808510
- Reinforcement learning for autonomous robots
- Aachen, Germany, Sept
- Glorennec, P. Y.; Jouffe, L., “Reinforcement learning for autonomous robots”, Proc. of EUFIT, Aachen, Germany, Sept., 1996.
- (1996) Proc. Of EUFIT
- Glorennec, P.Y.¹ Jouffe, L.²

23
- 0029287724
- Fuzzy logic controllers are universal approximators
- April
- Castro, J. L., “Fuzzy logic controllers are universal approximators”, IEEE Transaction on SMC, vol: 25/4, April, 1995.
- (1995) IEEE Transaction on SMC , vol.25 , Issue.4
- Castro, J.L.¹

24
- 11744283659
- Fuzzy Q-learning and evolutionary Strategy for adaptive fuzzy control
- Aachen, Germany, Sept
- Glorennec, P. Y., “Fuzzy Q-learning and evolutionary Strategy for adaptive fuzzy control”, Proc. of EUFIT, ELITE Foundation, pp: 35-40, Aachen, Germany, Sept., 1994.
- (1994) Proc. Of EUFIT, ELITE Foundation , pp. 35-40
- Glorennec, P.Y.¹

25
- 0028731609
- Fuzzy Q-learning: A new approach for fuzzy dynamic programming
- IEEE Computer Press, Piscataway, NJ
- Berenji, H. R., “Fuzzy Q-learning: a new approach for fuzzy dynamic programming”, Proc. Third IEEE Int. Conf. on Fuzzy Systems. IEEE Computer Press, Piscataway, NJ, pp: 486–491, 1994.
- (1994) Proc. Third IEEE Int. Conf. On Fuzzy Systems , pp. 486-549
- Berenji, H.R.¹

26
- 0003330984
- Delayed reinforcement, Fuzzy Q-learning and Fuzzy Logic Controllers
- Physica Verlag (Springer Verlag), Heidelberg, Germany
- Bonarini, A., “Delayed reinforcement, Fuzzy Q-learning and Fuzzy Logic Controllers”, Genetic Algorithms and Soft Computing, Physica Verlag (Springer Verlag), Heidelberg, Germany, pp: 447–466, 1996b.
- (1996) Genetic Algorithms and Soft Computing , pp. 447-466
- Bonarini, A.¹

27
- 0001435241
- Multi-agent Reinforcement Learning: An Approach Based On The Other Agent’s Internal Model
- 215–221, Los Alamitos, IEEE Computer Society
- Nagayuki, Y.; Ishii, S.; Kenji, D., “Multi-agent Reinforcement Learning: An Approach Based On The Other Agent’s Internal Model”, Fourth International Conference on Multiagent Systems (ICMAS), 215–221, Los Alamitos, IEEE Computer Society, 2000.
- (2000) Fourth International Conference on Multiagent Systems (ICMAS)
- Nagayuki, Y.¹ Ishii, S.² Kenji, D.³

28
- 0033697232
- A Fuzzy Reinforcement Function for the Intelligent Agent to process Vague Goals
- Seo, H. S.; Youn, S. J.; Oh, K. W.,” A Fuzzy Reinforcement Function for the Intelligent Agent to process Vague Goals”, The 19th International Meeting of the North American Fuzzy Information Processing, NAFIPS, 2000.
- (2000) The 19Th International Meeting of the North American Fuzzy Information Processing, NAFIPS
- Seo, H.S.¹ Youn, S.J.² Oh, K.W.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.