Volume , Issue , 1992, Pages 316-321

Using Transitional Proximity for Faster Reinforcement Learning

Author keywords

[No Author keywords available]

Indexed keywords

CURRENT; CURRENT PATHS; FAST LEARNING; INTERNAL STATE; KOHONEN NETWORK; LEARN+; LEARNING NETWORK; OPTIMAL POLICIES; Q-LEARNING; REINFORCEMENT LEARNINGS

EID: 84957622922     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1016/B978-1-55860-247-2.50045-0     Document Type: Conference Paper
Times cited : (13)

References (12)
  • 1
    • 0003787146
    • [Bellman, 1957] Princeton University Press, Princeton, NJ
    • [Bellman, 1957] R. E. Bellman. Dynamic Programming. Princeton University Press, Princeton, NJ, 1957.
    • (1957) Dynamic Programming
    • Bellman, R. E.
  • 2
    • 0002192119
    • Learning from delayed reinforcement in a complex domain
    • [Chapman and Kaelbling, 1991]
    • [Chapman and Kaelbling, 1991] David Chapman and Leslie Pack Kaelbling. Learning from delayed reinforcement in a complex domain. In Proceedings of IJCAI, 1991.
    • (1991) Proceedings of IJCAI
    • Chapman, David; Kaelbling, Leslie Pack
  • 3
    • 0003882939
    • [Chapman, 1990] PhD thesis, MIT Artificial Intelligence Laboratory
    • [Chapman, 1990] David Chapman. Vision, Instruction, and Action. PhD thesis, MIT Artificial Intelligence Laboratory, 1990.
    • (1990) Vision, Instruction, and Action
    • Chapman, David
  • 6
    • 85151437138
    • Programming robots using reinforcement learning and teaching
    • [Lin, 1991]
    • [Lin, 1991] Long-Ji Lin. Programming robots using reinforcement learning and teaching. AAAI, 1991.
    • (1991) AAAI
    • Lin, Long-Ji
  • 10
    • 84916567474
    • Cost sensitive reinforcement learning for adaptive classification and control
    • [Tan, 1991]
    • [Tan, 1991] Ming Tan. Cost sensitive reinforcement learning for adaptive classification and control. In AAAI, 1991.
    • (1991) AAAI
    • Tan, Ming


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.