메뉴 건너뛰기




Volumn 49, Issue 2-3, 2002, Pages 247-265

Continuous-action Q-learning

Author keywords

Continuous domains; Incremental topology preserving maps; Real time operation; Reinforcement learning

Indexed keywords

AUTONOMOUS AGENTS; ESTIMATION; OPTIMIZATION; REAL TIME SYSTEMS; TOPOLOGY;

EID: 0036832960     PISSN: 08856125     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1017988514716     Document Type: Article
Times cited : (116)

References (23)
  • 6
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • (1992) Machine Learning , vol.8 , pp. 293-321
    • Lin, L.-J.1
  • 7
    • 0029752592 scopus 로고    scopus 로고
    • Average reward reinforcement learning: Foundations, algorithms, and empirical results
    • (1996) Machine Learning , vol.22 , pp. 159-195
    • Mahadevan, S.1
  • 10
    • 0008852323 scopus 로고
    • A reinforcement connectionist learning approach to robot path finding
    • Ph.D. Thesis, Software Dept., Universitat Politècnica de Catalunya, Barcelona, Spain
    • (1992)
    • Millán, J.D.R.1
  • 23
    • 0004049895 scopus 로고
    • Learning with delayed rewards
    • Ph.D. Thesis, Cambridge University, England, UK
    • (1989)
    • Watkins, C.J.C.H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.