SCOPUS 정보 검색 플랫폼

Volumn 17, Issue 4, 1996, Pages 89-97

The National Science Foundation Workshop on reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

EID: 17144419347 PISSN: 07384602 EISSN: None Source Type: Journal
DOI: None Document Type: Article

Times cited : (7)

References (9)

3
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A.; Bradkte, S.; and Singh, S. 1995. Learning to Act Using Real-Time Dynamic Programming. Artificial Intelligence 72:81-138.
- (1995) Artificial Intelligence , vol.72 , pp. 81-138
- Barto, A.¹ Bradkte, S.² Singh, S.³

4
- 0020970738
- Neuronlike adaptive elements that can solve difficult learning control problems
- Barto, A.; Sutton, R.; and Anderson, C. 1983. Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems. IEEE Transactions on Systems, Man, and Cybernetics 13(5): 834-846.
- (1983) IEEE Transactions on Systems, Man, and Cybernetics , vol.13 , Issue.5 , pp. 834-846
- Barto, A.¹ Sutton, R.² Anderson, C.³

5
- 0003565783
- New York: Athena Scientific
- Bertsekas, D. 1995. Dynamic Programming and Optimal Control. New York: Athena Scientific.
- (1995) Dynamic Programming and Optimal Control
- Bertsekas, D.¹

6
- 0003487482
- New York: Athena Scientific
- Bertsekas, D., and Tsitsiklis, J. 1996. Neuro-Dynamic Programming. New York: Athena Scientific.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

7
- 0029752592
- Average reward reinforcement learning: Foundations, algorithms, and empirical results
- Mahadevan, S. 1996b. Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results. Machine Learning 22:159-196.
- (1996) Machine Learning , vol.22 , pp. 159-196
- Mahadevan, S.¹

9
- 0026880130
- Automatic programming of behavior-based robots using reinforcement learning
- Mahadevan, S., and Connell, J. 1992. Automatic Programming of Behavior-Based Robots Using Reinforcement Learning. Artificial Intelligence 55:311-365.
- (1992) Artificial Intelligence , vol.55 , pp. 311-365
- Mahadevan, S.¹ Connell, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.