SCOPUS Information Retrieval Platform

Volume 1810, 2000, Pages 117-128

Minimax TD-learning with neural nets in a Markov game

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING SYSTEMS; LINEAR PROGRAMMING; NEURAL NETWORKS; REINFORCEMENT LEARNING;

EVALUATION CRITERIA; FICTITIOUS PLAY; GAME PLAYING; MARKOV GAMES; MINIMAX-Q LEARNING; SOLVING MATRICES; TEMPORAL DIFFERENCE LEARNING; ZERO-SUM GAME;

GAME THEORY;

EID: 84974693459 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-45164-1_13 Document Type: Conference Paper

Times cited: 9

References (13)

1
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R.S.: Learning to predict by the methods of temporal differences, Machine Learning 3 (1988) 9-44.
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

2
- 0001046225
- Practical issues in temporal difference learning
- Tesauro, G.J.: Practical issues in temporal difference learning, Machine Learning 8 (1992) 257-277.
- (1992) Machine Learning , vol.8 , pp. 257-277
- Tesauro, G.J.¹

3
- 0031192989
- Representations and solutions for game-theoretic problems
- Koller, D., Pfeffer, A.: Representations and solutions for game-theoretic problems. Artificial Intelligence 94(1) (1997) 167-215.
- (1997) Artificial Intelligence , vol.94 , Issue.1 , pp. 167-215
- Koller, D.¹ Pfeffer, A.²

7
- 0004145762
- Wiley, New York
- Luce, R.D., Raiffa, H.: Games and Decisions. Wiley, New York (1957).
- (1957) Games and Decisions
- Luce, R.D.¹ Raiffa, H.²

8
- 0003449348
- Second Edition. Harcourt Brace Jovanovich, Orlando
- Strang, G.: Linear Algebra and Its Applications. Second Edition. Harcourt Brace Jovanovich, Orlando (1980).
- (1980) Linear Algebra and Its Applications
- Strang, G.¹

11
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A.G., Bradtke, S.J., Singh, S.P.: Learning to act using real-time dynamic programming. Artificial Intelligence 72 (1995) 81-138.
- (1995) Artificial Intelligence , vol.72 , pp. 81-138
- Barto, A.G.¹ Bradtke, S.J.² Singh, S.P.³

12
- 0003474751
- Cambridge University Press, Cambridge, UK
- Press, W.H., Flannery, B.P., Teukolsky, S.A., Vetterling, W.T.: Numerical Recipes in C. The Art of Scientific Computing. Cambridge University Press, Cambridge, UK (1988).
- (1988) Numerical Recipes in C. The Art of Scientific Computing
- Press, W.H.¹ Flannery, B.P.² Teukolsky, S.A.³ Vetterling, W.T.⁴

13
- 0033570798
- A unified analysis of value-function-based reinforcement-learning algorithms
- Szepesvari, C., Littman, M.L.: A unified analysis of value-function-based reinforcement-learning algorithms. Neural Computation 11 (1999) 2017-2060.
- (1999) Neural Computation , vol.11 , pp. 2017-2060
- Szepesvari, C.¹ Littman, M.L.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.