Volume 4, 1996, Pages 237-285

Reinforcement learning: A survey

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER SCIENCE; DECISION THEORY; HUMAN COMPUTER INTERACTION; MARKOV PROCESSES; MATHEMATICAL MODELS;

EID: 0029679044     PISSN: 10769757     EISSN: None     Source Type: Journal    
DOI: 10.1613/jair.301     Document Type: Article
Times cited: 6201

References (7)
  • 1
    • 0000244881
    • Generalization and scaling in reinforcement learning
    • Touretzky, D. S. (Ed.), San Mateo, CA. Morgan Kaufmann
    • Ackley, D. H., & Littman, M. L. (1990). Generalization and scaling in reinforcement learning. In Touretzky, D. S. (Ed.), Advances in Neural Information Processing Systems 2, pp. 550-557. San Mateo, CA. Morgan Kaufmann.
    • (1990) Advances in Neural Information Processing Systems , vol.2 , pp. 550-557
    • Ackley, D.H.1    Littman, M.L.2
  • 2
    • 0016556021
    • A new approach to manipulator control: Cerebellar model articulation controller (CMAC)
    • Albus, J. S. (1975). A new approach to manipulator control: Cerebellar model articulation controller (CMAC). Journal of Dynamic Systems, Measurement and Control, 97, 220-227.
    • (1975) Journal of Dynamic Systems, Measurement and Control , vol.97 , pp. 220-227
    • Albus, J.S.1
  • 3
    • 0003942195
    • BYTE Books, Subsidiary of McGraw-Hill, Peterborough, New Hampshire
    • Albus, J. S. (1981). Brains, Behavior, and Robotics. BYTE Books, Subsidiary of McGraw-Hill, Peterborough, New Hampshire.
    • (1981) Brains, Behavior, and Robotics
    • Albus, J.S.1
  • 6
    • 85151728371
    • Residual algorithms: Reinforcement learning with function approximation
    • Prieditis, A., & Russell, S. (Eds.), San Francisco, CA. Morgan Kaufmann
    • Baird, L. (1995). Residual algorithms: Reinforcement learning with function approximation. In Prieditis, A., & Russell, S. (Eds.), Proceedings of the Twelfth International Conference on Machine Learning, pp. 30-37. San Francisco, CA. Morgan Kaufmann.
    • (1995) Proceedings of the Twelfth International Conference on Machine Learning , pp. 30-37
    • Baird, L.1
  • 7
    • 0003477315
    • Reinforcement learning with high-dimensional, continuous actions
    • Wright-Patterson Air Force Base, Ohio: Wright Laboratory
    • Baird, L. C., & Klopf, A. H. (1993). Reinforcement learning with high-dimensional, continuous actions. Tech. rep. WL-TR-93-1147, Wright-Patterson Air Force Base, Ohio: Wright Laboratory.
    • (1993) Tech. Rep. WL-TR-93-1147
    • Baird, L.C.1    Klopf, A.H.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.