SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Machine Learning

Volumn 49, Issue 1, 2002, Pages 5-37

The lagging anchor algorithm: Reinforcement learning in two-player zero-sum games with imperfect information

(1) Dahl, Fredrik A a

a NORWEGIAN DEFENCE RESEARCH ESTABLISHMENT FFI (Norway)

Author keywords

Imperfect information; Neural net; Reinforcement learning; Two player zero sum game

Indexed keywords

COMPUTATIONAL COMPLEXITY; CONVERGENCE OF NUMERICAL METHODS; GAME THEORY; LEARNING ALGORITHMS; NEURAL NETWORKS;

IMPERFECT INFORMATION; REINFORCEMENT LEARNING; TWO PLAYER ZERO SUM GAME;

LEARNING SYSTEMS;

EID: 0036778915 PISSN: 08856125 EISSN: None Source Type: Journal
DOI: 10.1023/A:1014063505958 Document Type: Article

Times cited : (12)

References (33)

1
- 0008851724
- Experimental studies of neural net training and human learning in a military air campaign game
- University of Central Florida, Orlando, Florida, Institute for Simulation and Training
- (1998) Proceedings of the Seventh Conference on Computer Generated Forces and Behavioral Representation , pp. 263-274
- Bakken, B.T.¹ Dahl, F.A.²

2
- 0008839797
- The tactical air game: A multimove game with mixed strategy solution
- In J. D. Grote (Ed.)
- (1975) The Theory and Application of Differential Games , pp. 169-177
- Berkovitz, L.D.¹

3
- 0004268210
- Interscience tracts in pure and applied mathematics, No. 6. New York: Interscience Publishers
- (1958) Convex Surfaces
- Busemann, H.¹

4
- 38248998837
- Adaptation in games-2 solutions to the Crawford puzzle
- (1993) Journal of Economic Behavior and Organizations , vol.22 , pp. 25-50
- Conlisk, J.¹

5
- 38248999440
- Adaptive tactics in games-Further solutions to the Crawford puzzle
- (1993) Journal of Economic Behavior and Organizations , vol.22 , pp. 51-68
- Conlisk, J.¹

6
- 0000379898
- Learning the optimal strategy in a zero-sum game
- (1974) Econometrica , vol.42 , pp. 885-891
- Crawford, V.P.¹

7
- 25344458514
- Three games designed for the study of human and automated decision making
- Definitions and properties of the games campaign, Operation lucid and operation opaque. FFI/Rapport-98/02799, Norwegian Defence Research Establishment (FFI), Kjeller, Norway
- (1998)
- Dahl, F.A.¹ Halck, O.M.²

8
- 84974693459
- Minimax TD-learning with neural nets in a Markov game
- In R. Lopez de Mantaras & E. Plaza (Eds.); Lecture Notes in Computer Science; Berlin: Springer-Verlag
- (2000) ECML 2000. Proceedings of the 11th European Conference on Machine Learning , vol.1810
- Dahl, F.A.¹ Halck, O.M.²

9
- 0008830480
- FFI/Rapport-2000/04400, Norwegian Defence Research Establishment (FFI), Kjeller, Norway
- (2000) Machine Learning in the Game of Campaign
- Dahl, F.A.¹ Halck, O.M.² Braathen, S.³

10
- 0038829878
- Predicting how people play games: Reinforcement learning in experimental games with unique, mixed strategy equilibria
- (1998) The American Economic Review , vol.88 , pp. 848-881
- Erev, I.¹ Roth, A.E.²

11
- 0004247096
- Cambridge: MIT Press
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, D.K.²

12
- 0008815685
- On classification of games and evaluation of players-with some sweeping generalizations about the literature
- In: J. Fürnkranz, & M. Kubat (Eds.); Ljubljana, Slovenia: Jozef Stefan Institute
- (1999) Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing
- Halck, O.M.¹ Dahl, F.A.²

13
- 79957749002
- Reinforcement learning applied to a differential game
- Cambridge, MA: MIT Press
- (1995) Adaptive Behavior , vol.4
- Harmon, M.E.¹ Baird, L.C.² Klopf, A.H.³

14
- 0003591521
- Cambridge, MA: MIT Press
- (1995) Fundamentals of Artificial Neural Networks
- Hassoun, M.H.¹

15
- 0030170957
- Efficient solutions of extensive two-person games
- (1996) Games and Economic Behavior , vol.14 , pp. 247-259
- Koller, D.¹ Megiddo, N.² Von Stengel, B.³

16
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- New Brunswick: Morgan Kaufmann
- (1994) Proceedings of the 11th International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

17
- 0004145762
- New York: Wiley
- (1957) Games and Decisions
- Luce, R.D.¹ Raiffa, H.²

18
- 0003881809
- New York: Wiley
- (1979) Introduction to Dynamic Systems. Theory, Models, & Applications
- Luenberger, D.G.¹

19
- 0003488911
- Reading, MA: Addison-Wesley
- (1984) Linear and Nonlinear Programming
- Luenberger, D.G.¹

20
- 0002380992
- Game-playing and game-learning automata
- In L. Fox (Ed.); New York, Pergamon
- (1966) Advances in Programming and Non-Numerical Computation , pp. 183-200
- Michie, D.¹

21
- 0003424571
- Berlin: Springer-Verlag
- (1995) Linear Optimization and Extensions
- Padberg, M.¹

22
- 0003897694
- Reading, MA: Addison Wesley
- (1994) Computational Complexity
- Papadimitriou, C.H.¹

23
- 0001201756
- Some studies in machine learning using the game of checkers
- (1959) IBM J Res. Develop. , vol.3 , pp. 210-229
- Samuel, A.L.¹

24
- 0008831290
- Learning to play strong poker
- In: J. Furnkranz, & M. Kubat (Eds.); Ljubljana, Slovenia: Jozef Stefan Institute
- (1999) Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing
- Schaeffer, J.¹ Billings, D.² Peña, L.³ Szafron, D.⁴

25
- 0001826509
- Anticipatory learning in two-person games
- In R. Selten (Ed.); Berlin: Springer-Verlag
- (1991) Game Equilibrium Models (Vol. I: Evolution and Game Dynamics)
- Selten, R.¹

26
- 0003449348
- London: Harcourt Brace
- (1980) Linear Algebra and Its Applications
- Strang, G.¹

27
- 33847202724
- Learning to predict by the methods of temporal differences
- (1988) Machine Learning , vol.3 , pp. 9-44
- Sutton, R.S.¹

28
- 0033570798
- A unified analysis of value-function-based reinforcement-learning algorithms
- (1999) Neural Computation , vol.11 , pp. 2017-2060
- Szepesvari, C.¹ Littman, M.L.²

29
- 0001046225
- Practical issues in temporal difference learning
- (1992) Machine Learning , vol.8 , pp. 257-277
- Tesauro, G.J.¹

30
- 0024702037
- A parallel network that learns to play backgammon
- (1989) Artificial Intelligence , vol.39 , pp. 357-390
- Tesauro, G.J.¹ Sejnowski, T.J.²

31
- 0003792179
- New York: Wiley
- (1953) Theory of Games and Economic Behavior, 3rd Ed.
- Von Neumann, J.¹ Morgenstern, O.²

32
- 0004049893
- Learning from delayed rewards
- PhD thesis, Psychology Department, Cambridge University, Cambridge, UK
- (1989)
- Watkins, C.J.C.H.¹

33
- 0004202581
- Cambridge: MIT Press
- (1995) Evolutionary Game Theory
- Weibull, J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.