Volume 49, Issue 1, 2002, Pages 5-37

The lagging anchor algorithm: Reinforcement learning in two-player zero-sum games with imperfect information

Author keywords

Imperfect information; Neural net; Reinforcement learning; Two player zero sum game

Indexed keywords

COMPUTATIONAL COMPLEXITY; CONVERGENCE OF NUMERICAL METHODS; GAME THEORY; LEARNING ALGORITHMS; NEURAL NETWORKS

EID: 0036778915     PISSN: 08856125     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1014063505958     Document Type: Article
Times cited: 12

References (33)
  • 3
  • 7
    • 25344458514
    • Three games designed for the study of human and automated decision making
    • Definitions and properties of the games Campaign, Operation Lucid and Operation Opaque. FFI/Rapport-98/02799, Norwegian Defence Research Establishment (FFI), Kjeller, Norway
    • (1998)
    • Dahl, F.A.; Halck, O.M.
  • 32
    • 0004049893
    • Learning from delayed rewards
    • PhD thesis, Psychology Department, Cambridge University, Cambridge, UK
    • (1989)
    • Watkins, C.J.C.H.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.