SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 847 LNAI, Issue , 1994, Pages 1-9

Fuzzy reinforcement learning and dynamic programming

(1) Berenji, Hamid R a

a NASA AMES RESEARCH CENTER (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER CIRCUITS; DECISION MAKING; FUZZY LOGIC; MACHINE LEARNING; REINFORCEMENT LEARNING;

ALTERNATIVE SOLUTIONS; DECISION PROCESS; FUZZY DYNAMIC PROGRAMMING; FUZZY REINFORCEMENT LEARNING; FUZZY-CONSTRAINT; FUZZY-Q-LEARNING; MULTISTAGE DECISION MAKING PROBLEMS; Q-LEARNING METHOD;

DYNAMIC PROGRAMMING;

EID: 0004987125 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-58409-9_1 Document Type: Conference Paper

Times cited : (5)

References (14)

1
- 85028465863
- Learning to act using real-time dynamic programming
- A. G. Barto, S. Bradtke, and S. Singh. Learning to act using real-time dynamic programming. Submitted to AI Journal special issue on Computational Theories of Interaction and Agency, 1993.
- (1993) Submitted to AI Journal Special Issue on Computational Theories of Interaction and Agency
- Barto, A.G.¹ Bradt, S.² Kesingh, S.³

2
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- A. G. Barto, R. S. Sutton, and C. W. Anderson. Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics, 13:834-846, 1983.
- (1983) IEEE Transactions on Systems, Man, and Cybernetics , vol.13 , pp. 834-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

3
- 0003787146
- Princeton University Press, Princeton, NJ
- R. Bellman. Dynamic Programming. Princeton University Press, Princeton, NJ, 1957.
- (1957) Dynamic Programming
- Bellman, R.¹

4
- 0346636333
- Decision-making in a fuzzy environment
- R.E. Bellman and L.A. Zadeh. Decision-making in a fuzzy environment. Management Science, 17(4):B-141:B-164, 1970.
- (1970) Management Science , vol.17 , Issue.4
- Bellman, R.E.¹ Zadeh, L.A.²

5
- 0026923465
- IEEE Transactions on Neural Networks
- H.R. Berenji and P. Khedkar. Learning and tuning fuzzy logic controllers through reinforcements. IEEE Transactions on Neural Networks, 3(5), 1992.
- (1992) Learning and Tuning Fuzzy Logic Controllers through Reinforcements , vol.3 , Issue.5
- Berenji, H.R.¹ Khedkar, P.²

6
- 85028459433
- Space shuttle attitude control by fuzzy logic and reinforcement learning
- San Francisco, CA, March
- H.R. Berenji, Y. Jani R.N Lea, P. Khedkar, A. Malkani, and J. Hoblit. Space shuttle attitude control by fuzzy logic and reinforcement learning. In Second IEEE International conference on Fuzzy Systems, San Francisco, CA, March 1993.
- Second IEEE International Conference on Fuzzy Systems , pp. 1993
- Berenji, H.R.¹ Lea, Y.J.² Khedkar, P.³ Malkani, A.⁴ Hoblit, J.⁵

7
- 85151437138
- Programming robots using reinforcement learning and teaching
- L.J. Lin. Programming robots using reinforcement learning and teaching. In Proceedings of the Ninth National Conference on Artificial Intelligence, 1991.
- (1991) Proceedings of the Ninth National Conference on Artificial Intelligence
- Lin, L.J.¹

8
- 85027124419
- Prioritized sweeping: Reinforcement learning with less data and less real time
- page to appear
- A. Moore and C. Atkeson. Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning, page to appear.
- Machine Learning
- Moore, A.¹ Atkeson, C.²

9
- 33847202724
- Learning to predict by the methods of temporal differences
- R.S. Sutton. Learning to predict by the methods of temporal differences. Machine Learning, 3:9-44, 1988.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

10
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- R.S. Sutton. Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. In Proceedings of the Seventh International Conference on Machine Learning, 1990.
- (1990) Proceedings of the Seventh International Conference on Machine Learning
- Sutton, R.S.¹

11
- 0001046225
- Practical issues in temporal difference learning
- G. Tesauro. Practical issues in temporal difference learning. Machine Learning, (8):257-277, 1992.
- (1992) Machine Learning , Issue.8 , pp. 257-277
- Tesauro, G.¹

12
- 0000985504
- Td-gammon, a self-teaching backgammon program, achieves master-level play
- G. Tesauro. Td-gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215-219, 1994.
- (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
- Tesauro, G.¹

13
- 34249833101
- C. Watkins and P. Dayan. Q-learning. Machine Learning, (8):279-292, 1992.
- (1992) Q-Learning. Machine Learning , Issue.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

14
- 0004049895
- PhD thesis, Cambridge University, Psychology Department
- C.J.C.H. Watkins. Learning with Delayed Rewards. PhD thesis, Cambridge University, Psychology Department, 1989.
- (1989) Learning with Delayed Rewards
- Watkins, C.J.C.H.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.