메뉴 건너뛰기




Volumn 2, Issue 1, 2001, Pages 55-66

Value-function reinforcement learning in Markov games

Author keywords

Game theory; Markov games; Nash equilibria; Q learning; Reinforcement learning; Temporal difference learning; Value functions

Indexed keywords


EID: 0001547175     PISSN: 13890417     EISSN: None     Source Type: Journal    
DOI: 10.1016/S1389-0417(01)00015-8     Document Type: Article
Times cited : (390)

References (5)
  • 1
    • 0003602259 scopus 로고
    • Learning and sequential decision making
    • Gabriel, M., & Moore, J. (Eds.) Learning and computational neuroscience: foundations of adaptive networks, MIT Press, Cambridge, MA, Department of Computer and Information Science, University of Massachusetts, Amherst, MA, 1989.
    • Barto, A. G., Sutton, R. S., & Watkins, C. J. C. H. (1991). Learning and sequential decision making. In: Gabriel, M., & Moore, J. (Eds.), Learning and computational neuroscience: foundations of adaptive networks, MIT Press, Cambridge, MA, Tech. Rep. 89-95, Department of Computer and Information Science, University of Massachusetts, Amherst, MA, 1989.
    • (1991) Tech. Rep. , vol.89-95
    • Barto, A.G.1    Sutton, R.S.2    Watkins, C.J.C.H.3
  • 3
    • 0003989209 scopus 로고    scopus 로고
    • Springer-Verlag
    • Filar, J., & Vrieze, K. (1997). Competitive Markov decision processes, Springer-Verlag. Szepesvári, C., & Littman, M. L. (1999). A unified analysis of value-function-based reinforcement-learning algorithms. Neural Comput. 11(8), 2017-2059.
    • (1997) Competitive Markov Decision Processes
    • Filar, J.1    Vrieze, K.2
  • 4
    • 0033570798 scopus 로고    scopus 로고
    • A unified analysis of value-function-based reinforcement-learning algorithms
    • Filar, J., & Vrieze, K. (1997). Competitive Markov decision processes, Springer-Verlag. Szepesvári, C., & Littman, M. L. (1999). A unified analysis of value-function-based reinforcement-learning algorithms. Neural Comput. 11(8), 2017-2059.
    • (1999) Neural Comput. , vol.11 , Issue.8 , pp. 2017-2059
    • Szepesvári, C.1    Littman, M.L.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.