메뉴 건너뛰기




Volumn 17, Issue 4, 1996, Pages 89-97

The National Science Foundation Workshop on reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords


EID: 17144419347     PISSN: 07384602     EISSN: None     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (7)

References (9)
  • 1
    • 0003477315 scopus 로고
    • Reinforcement learning with high-dimensional continuous actions
    • Wright Laboratory, Wright-Patterson Air Force Base
    • Baird, L., and Klopf, H. 1993. Reinforcement Learning with High-Dimensional Continuous Actions, Technical Report WL-TR-93-1147, Wright Laboratory, Wright-Patterson Air Force Base.
    • (1993) Technical Report , vol.WL-TR-93-1147
    • Baird, L.1    Klopf, H.2
  • 3
    • 0029210635 scopus 로고
    • Learning to act using real-time dynamic programming
    • Barto, A.; Bradkte, S.; and Singh, S. 1995. Learning to Act Using Real-Time Dynamic Programming. Artificial Intelligence 72:81-138.
    • (1995) Artificial Intelligence , vol.72 , pp. 81-138
    • Barto, A.1    Bradkte, S.2    Singh, S.3
  • 4
  • 7
    • 0029752592 scopus 로고    scopus 로고
    • Average reward reinforcement learning: Foundations, algorithms, and empirical results
    • Mahadevan, S. 1996b. Average Reward Reinforcement Learning: Foundations, Algorithms, and Empirical Results. Machine Learning 22:159-196.
    • (1996) Machine Learning , vol.22 , pp. 159-196
    • Mahadevan, S.1
  • 8
    • 17144430819 scopus 로고    scopus 로고
    • Sensitive-discount optimality: Unifying average-reward and discounted reinforcement learning
    • San Francisco, Calif.: Morgan Kaufmann
    • Mahadevan, S. 1996c. Sensitive-Discount Optimality: Unifying Average-Reward and Discounted Reinforcement Learning. In Proceedings of the Thirteenth International Conference on Machine Learning, 328-336. San Francisco, Calif.: Morgan Kaufmann.
    • (1996) Proceedings of the Thirteenth International Conference on Machine Learning , pp. 328-336
    • Mahadevan, S.1
  • 9
    • 0026880130 scopus 로고
    • Automatic programming of behavior-based robots using reinforcement learning
    • Mahadevan, S., and Connell, J. 1992. Automatic Programming of Behavior-Based Robots Using Reinforcement Learning. Artificial Intelligence 55:311-365.
    • (1992) Artificial Intelligence , vol.55 , pp. 311-365
    • Mahadevan, S.1    Connell, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.