SCOPUS 정보 검색 플랫폼

Volumn 3, Issue , 2008, Pages 1357-1360

Social reward shaping in the Prisoner's dilemma

Author keywords

Game theory; Iterated prisoner's dilemma; Leader follower strategies; Reinforcement learning; Subgame perfect equilibrium

Indexed keywords

AUTONOMOUS AGENTS; INTELLIGENT AGENTS; MULTI AGENT SYSTEMS; REINFORCEMENT LEARNING;

ITERATED PRISONER'S DILEMMA; LEADER/FOLLOWER STRATEGIES; NEAR-OPTIMAL; PRISONER'S DILEMMA; REINFORCEMENT-LEARNING AGENTS;

GAME THEORY;

EID: 84899963942 PISSN: 15488403 EISSN: 15582914 Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (55)

References (9)

1
- 84898960502
- Playing is believing: The role of beliefs in multi-agent learning
- Y.-H. Chang and L. P. Kaelbling. Playing is believing: The role of beliefs in multi-agent learning. In Advances in Neural Information Processing Systems 14, 2002.
- (2002) Advances in Neural Information Processing Systems , vol.14
- Chang, Y.-H.¹ Kaelbling, L.P.²

2
- 80053136974
- Implicit negotiation in repeated games
- M. L. Littman and P. Stone. Implicit negotiation in repeated games. In Eighth International Workshop on Agent Theories, Architectures, and Languages (ATAL-2001), pages 393-404, 2001.
- (2001) Eighth International Workshop on Agent Theories, Architectures, and Languages (ATAL-2001) , pp. 393-404
- Littman, M.L.¹ Stone, P.²

3
- 9544234477
- A polynomial-time Nash equilibrium algorithm for repeated games
- M. L. Littman and P. Stone. A polynomial-time Nash equilibrium algorithm for repeated games. Decision Support Systems, 39(1):55-66, 2005.
- (2005) Decision Support Systems , vol.39 , Issue.1 , pp. 55-66
- Littman, M.L.¹ Stone, P.²

4
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- A. Y. Ng, D. Harada, and S. Russell. Policy invariance under reward transformations: Theory and application to reward shaping. In Proceedings of the Sixteenth International Conference on Machine Learning, pages 278-287, 1999.
- (1999) Proceedings of the Sixteenth International Conference on Machine Learning , pp. 278-287
- Ng, A.Y.¹ Harada, D.² Russell, S.³

5
- 0003427725
- The MIT Press
- M. J. Osborne and A. Rubinstein. A Course in Game Theory. The MIT Press, 1994.
- (1994) A Course in Game Theory
- Osborne, M.J.¹ Rubinstein, A.²

6
- 0003998452
- John Wiley & Sons, Inc., New York, NY
- M. L. Puterman. Markov Decision Processes-Discrete Stochastic Dynamic Programming. John Wiley & Sons, Inc., New York, NY, 1994.
- (1994) Markov Decision Processes-Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

8
- 0004102479
- The MIT Press
- R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. The MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

9
- 27344453198
- Potential-based shaping and Q-value initialization are equivalent
- E. Wiewiora. Potential-based shaping and Q-value initialization are equivalent. J. Artif. Intell. Res. (JAIR), 19:205-208, 2003.
- (2003) J. Artif. Intell. Res. (JAIR) , vol.19 , pp. 205-208
- Wiewiora, E.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.