-
1
-
-
84951728431
-
-
PhD Thesis, University of Cambridge, England
-
Waduns, C. J. C. H., "Learnzng pom delayed rewards", PhD Thesis, University of Cambridge, England, 1989.
-
(1989)
Learnzng Pom Delayed Rewards
-
-
Waduns, C.1
-
6
-
-
1842453586
-
Fzndzng Sub-optzmal Policies Faster zn Multz-Agent Systems: FQ-Learnzng
-
California, USA, March 25-27
-
Kilic, A, Kaya, M., Arslan, A., "Fzndzng Sub-optzmal Policies Faster zn Multz-Agent Systems: FQ-Learnzng", the 7th International Conference on Intelligent Autonomous System, California, USA, March 25-27, 2002
-
(2002)
7Th International Conference on Intelligent Autonomous System
-
-
Kilic, A.1
Kaya, M.2
Arslan, A.3
-
10
-
-
0030050933
-
Multi agent reinforcement learning in the Iterated Prisoners Dilemma”
-
Sandholm, T.W.; Crites, R. H., “Multi agent reinforcement learning in the Iterated Prisoner’s Dilemma”, Biosystems, 37:147–166, 1995.
-
(1995)
Biosystems
, vol.37
, pp. 147-166
-
-
Sandholm, T.W.1
Crites, R.H.2
-
12
-
-
0028555752
-
Learning to coordinate without sharing information
-
Sen, S.; Sekeran, M.; Hale, J., “Learning to coordinate without sharing information”. In proceedings of the Twelfth National Conference on Artificial Intelligence (AAAI-94), pp: 426–431, 1994.
-
(1994)
Proceedings of the Twelfth National Conference on Artificial Intelligence (AAAI-94)
, pp. 426-431
-
-
Sen, S.1
Sekeran, M.2
Hale, J.3
-
13
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning, planning and teaching
-
Lin, L. J. ,“Self-improving reactive agents based on reinforcement learning, planning and teaching“, Machine Learning, Vol: 8, pp: 293-321, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 293-332
-
-
Lin, L.J.1
-
14
-
-
1842610379
-
Neural reinforcement learning for behavior synthesis
-
Lille, July
-
Touzet, P., “Neural reinforcement learning for behavior synthesis”, In Proceedings of CESA’96 IMACS Multi-conference, Lille, July 1996.
-
Proceedings of CESA’96 IMACS Multi-Conference
, pp. 1996
-
-
Touzet, P.1
-
17
-
-
1842610384
-
Fuzzy-Reinforcement Learning in Cooperative Multi-Agent Systems
-
Turkey, November 5–7
-
Kaya, M.; Kilic, A., “Fuzzy-Reinforcement Learning in Cooperative Multi-Agent Systems”, International Symposium on Computer and Information Sciences (ISCIS 2001), Turkey, November 5–7, 2001
-
(2001)
International Symposium on Computer and Information Sciences (ISCIS 2001)
-
-
Kaya, M.1
Kilic, A.2
-
18
-
-
34249833101
-
Technical Note: Q-Learning
-
Watkins, C. J. C. H.; Dayan P., “Technical Note: Q-Learning” Machine Learning, 8:279-292, 1992.
-
(1992)
Machine Learning
, vol.8
, pp. 279-292
-
-
Watkins, C.1
Dayan, P.2
-
19
-
-
0001547175
-
Value-function reinforcement learning in Markov games
-
Littman, M. L., “Value-function reinforcement learning in Markov games”, Journal of Cognitive Systems Research, vol: 2, pp: 55–66, 2001.
-
(2001)
Journal of Cognitive Systems Research
, vol.2
, pp. 55-66
-
-
Littman, M.L.1
-
20
-
-
84951730143
-
Fuzzy Q-learning
-
Milano, Italy, September, 18–19
-
Glorennec, P. Y.; Jouffe, L., “Fuzzy Q-learning”, Second European Workshop on Reinforcement Learning, Milano, Italy, September, 18–19, 1995.
-
(1995)
Second European Workshop on Reinforcement Learning
-
-
Glorennec, P.Y.1
Jouffe, L.2
-
21
-
-
0026923465
-
Learning and tuning fuzzy logic controllers through reinforcement
-
Sept
-
Berenji, H.; Khedkar, P., “Learning and tuning fuzzy logic controllers through reinforcement”, IEEE Trans. on Neural Networks, 3(5), Sept. 1992.
-
(1992)
IEEE Trans. On Neural Networks
, vol.3
, Issue.5
-
-
Berenji, H.1
Khedkar, P.2
-
22
-
-
84951808510
-
Reinforcement learning for autonomous robots
-
Aachen, Germany, Sept
-
Glorennec, P. Y.; Jouffe, L., “Reinforcement learning for autonomous robots”, Proc. of EUFIT, Aachen, Germany, Sept., 1996.
-
(1996)
Proc. Of EUFIT
-
-
Glorennec, P.Y.1
Jouffe, L.2
-
23
-
-
0029287724
-
Fuzzy logic controllers are universal approximators
-
April
-
Castro, J. L., “Fuzzy logic controllers are universal approximators”, IEEE Transaction on SMC, vol: 25/4, April, 1995.
-
(1995)
IEEE Transaction on SMC
, vol.25
, Issue.4
-
-
Castro, J.L.1
-
24
-
-
11744283659
-
Fuzzy Q-learning and evolutionary Strategy for adaptive fuzzy control
-
Aachen, Germany, Sept
-
Glorennec, P. Y., “Fuzzy Q-learning and evolutionary Strategy for adaptive fuzzy control”, Proc. of EUFIT, ELITE Foundation, pp: 35-40, Aachen, Germany, Sept., 1994.
-
(1994)
Proc. Of EUFIT, ELITE Foundation
, pp. 35-40
-
-
Glorennec, P.Y.1
-
25
-
-
0028731609
-
Fuzzy Q-learning: A new approach for fuzzy dynamic programming
-
IEEE Computer Press, Piscataway, NJ
-
Berenji, H. R., “Fuzzy Q-learning: a new approach for fuzzy dynamic programming”, Proc. Third IEEE Int. Conf. on Fuzzy Systems. IEEE Computer Press, Piscataway, NJ, pp: 486–491, 1994.
-
(1994)
Proc. Third IEEE Int. Conf. On Fuzzy Systems
, pp. 486-549
-
-
Berenji, H.R.1
-
26
-
-
0003330984
-
Delayed reinforcement, Fuzzy Q-learning and Fuzzy Logic Controllers
-
Physica Verlag (Springer Verlag), Heidelberg, Germany
-
Bonarini, A., “Delayed reinforcement, Fuzzy Q-learning and Fuzzy Logic Controllers”, Genetic Algorithms and Soft Computing, Physica Verlag (Springer Verlag), Heidelberg, Germany, pp: 447–466, 1996b.
-
(1996)
Genetic Algorithms and Soft Computing
, pp. 447-466
-
-
Bonarini, A.1
-
27
-
-
0001435241
-
Multi-agent Reinforcement Learning: An Approach Based On The Other Agent’s Internal Model
-
215–221, Los Alamitos, IEEE Computer Society
-
Nagayuki, Y.; Ishii, S.; Kenji, D., “Multi-agent Reinforcement Learning: An Approach Based On The Other Agent’s Internal Model”, Fourth International Conference on Multiagent Systems (ICMAS), 215–221, Los Alamitos, IEEE Computer Society, 2000.
-
(2000)
Fourth International Conference on Multiagent Systems (ICMAS)
-
-
Nagayuki, Y.1
Ishii, S.2
Kenji, D.3
-
28
-
-
0033697232
-
A Fuzzy Reinforcement Function for the Intelligent Agent to process Vague Goals
-
Seo, H. S.; Youn, S. J.; Oh, K. W.,” A Fuzzy Reinforcement Function for the Intelligent Agent to process Vague Goals”, The 19th International Meeting of the North American Fuzzy Information Processing, NAFIPS, 2000.
-
(2000)
The 19Th International Meeting of the North American Fuzzy Information Processing, NAFIPS
-
-
Seo, H.S.1
Youn, S.J.2
Oh, K.W.3
|