Volume 3, 2008, Pages 1357-1360

Social reward shaping in the Prisoner's dilemma

Author keywords

Game theory; Iterated prisoner's dilemma; Leader follower strategies; Reinforcement learning; Subgame perfect equilibrium

Indexed keywords

AUTONOMOUS AGENTS; INTELLIGENT AGENTS; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING

EID: 84899963942     PISSN: 15488403     EISSN: 15582914     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 55

References (9)
  • 3
    • 9544234477
    • A polynomial-time Nash equilibrium algorithm for repeated games
    • M. L. Littman and P. Stone. A polynomial-time Nash equilibrium algorithm for repeated games. Decision Support Systems, 39(1):55-66, 2005.
    • (2005) Decision Support Systems , vol.39 , Issue.1 , pp. 55-66
    • Littman, M.L.1    Stone, P.2
  • 7
    • 34147161536
    • If multi-agent learning is the answer, what is the question?
    • Special issue on the foundations of research in multi-agent learning
    • Y. Shoham, R. Powers, and T. Grenager. If multi-agent learning is the answer, what is the question? Artificial Intelligence, 2007. Special issue on the foundations of research in multi-agent learning.
    • (2007) Artificial Intelligence
    • Shoham, Y.1    Powers, R.2    Grenager, T.3
  • 9
    • 27344453198
    • Potential-based shaping and Q-value initialization are equivalent
    • E. Wiewiora. Potential-based shaping and Q-value initialization are equivalent. J. Artif. Intell. Res. (JAIR), 19:205-208, 2003.
    • (2003) J. Artif. Intell. Res. (JAIR) , vol.19 , pp. 205-208
    • Wiewiora, E.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.