SCOPUS Information Search Platform

Volume 2167, 2001, Pages 85-96

A reinforcement learning algorithm applied to simplified two-player Texas Hold’em poker

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; PROBABILITY DISTRIBUTIONS; REINFORCEMENT LEARNING;

GRADIENT SEARCH; IMPERFECT INFORMATION; NON-TRIVIAL; PARAMETER SPACES; Q-LEARNING; VALUE-BASED;

LEARNING ALGORITHMS;

EID: 33745139542
PISSN: 03029743
EISSN: 16113349
Source Type: Book Series
DOI: None
Document Type: Conference Paper

Times cited: 17

References (15)

1
- 84948141808
- The lagging anchor algorithm. Reinforcement learning in two-player zero-sum games with imperfect information
- Dahl, F. A.: The lagging anchor algorithm. Reinforcement learning in two-player zero-sum games with imperfect information. Machine Learning (to appear).
- Machine Learning
- Dahl, F.A.¹

2
- 0004260006
- 3rd ed. Academic Press, San Diego
- Owen, G.: Game Theory. 3rd ed. Academic Press, San Diego (1995).
- (1995) Game Theory
- Owen, G.¹

3
- 33847202724
- Learning to predict by the methods of temporal differences
- Sutton, R. S.: Learning to predict by the methods of temporal differences. Machine Learning 3 (1988) 9–44.
- (1988) Machine Learning, vol. 3, pp. 9-44
- Sutton, R.S.¹

4
- 0004049893
- PhD thesis, University of Cambridge, UK
- Watkins, C. J. C. H.: Learning from Delayed Rewards. PhD thesis, University of Cambridge, UK (1989).
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

5
- 0033570798
- A unified analysis of value-function-based reinforcement-learning algorithms
- Szepesvari, C., Littman, M. L.: A unified analysis of value-function-based reinforcement-learning algorithms. Neural Computation 11 (1999) 2017–2060.
- (1999) Neural Computation, vol. 11, pp. 2017-2060
- Szepesvari, C.¹ Littman, M.L.²

6
- 0001046225
- Practical issues in temporal difference learning
- Tesauro, G. J.: Practical issues in temporal difference learning. Machine Learning 8 (1992) 257–277.
- (1992) Machine Learning, vol. 8, pp. 257-277
- Tesauro, G.J.¹

9
- 0030170957
- Efficient computation of equilibria for extensive two-person games
- Koller, D., Megiddo, N., von Stengel, B.: Efficient computation of equilibria for extensive two-person games. Games and Economic Behavior 14 (1996) 247–259.
- (1996) Games and Economic Behavior, vol. 14, pp. 247-259
- Koller, D.¹ Megiddo, N.² Von Stengel, B.³

10
- 0004145762
- Wiley, New York
- Luce, R. D., Raiffa, H.: Games and Decisions. Wiley, New York (1957).
- (1957) Games and Decisions
- Luce, R.D.¹ Raiffa, H.²

11
- 0031192989
- Representations and solutions for game-theoretic problems
- Koller, D., Pfeffer, A.: Representations and solutions for game-theoretic problems. Artificial Intelligence 94 (1997) 167–215.
- (1997) Artificial Intelligence, vol. 94, pp. 167-215
- Koller, D.¹ Pfeffer, A.²

13
- 0003591521
- MIT Press, Cambridge, Massachusetts
- Hassoun, M. H.: Fundamentals of Artificial Neural Networks. MIT Press, Cambridge, Massachusetts (1995).
- (1995) Fundamentals of Artificial Neural Networks
- Hassoun, M.H.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.