SCOPUS 정보 검색 플랫폼

Volumn 2, Issue 1, 2001, Pages 55-66

Value-function reinforcement learning in Markov games

Author keywords

Game theory; Markov games; Nash equilibria; Q learning; Reinforcement learning; Temporal difference learning; Value functions

Indexed keywords

EID: 0001547175 PISSN: 13890417 EISSN: None Source Type: Journal
DOI: 10.1016/S1389-0417(01)00015-8 Document Type: Article

Times cited : (390)

References (5)

2
- 0031630561
- The dynamics of reinforcement learning in cooperative multiagent systems
- Claus, C., & Boutilier, C. (1998). The dynamics of reinforcement learning in cooperative multiagent systems. In: Proceedings of the Fifteenth National Conference on Artificial Intelligence.
- (1998) Proceedings of the Fifteenth National Conference on Artificial Intelligence
- Claus, C.¹ Boutilier, C.²

3
- 0003989209
- Springer-Verlag
- Filar, J., & Vrieze, K. (1997). Competitive Markov decision processes, Springer-Verlag. Szepesvári, C., & Littman, M. L. (1999). A unified analysis of value-function-based reinforcement-learning algorithms. Neural Comput. 11(8), 2017-2059.
- (1997) Competitive Markov Decision Processes
- Filar, J.¹ Vrieze, K.²

4
- 0033570798
- A unified analysis of value-function-based reinforcement-learning algorithms
- Filar, J., & Vrieze, K. (1997). Competitive Markov decision processes, Springer-Verlag. Szepesvári, C., & Littman, M. L. (1999). A unified analysis of value-function-based reinforcement-learning algorithms. Neural Comput. 11(8), 2017-2059.
- (1999) Neural Comput. , vol.11 , Issue.8 , pp. 2017-2059
- Szepesvári, C.¹ Littman, M.L.²

5
- 34249833101
- Q-learning
- Watkins, C. J. C. H., & Dayan, P. (1992). Q-learning. Machine Learn. 8(3), 279-292.
- (1992) Machine Learn. , vol.8 , Issue.3 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.