메뉴 건너뛰기




Volumn 2167, Issue , 2001, Pages 85-96

A reinforcement learning algorithm applied to simplified two-player Texas Hold’em poker

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; ARTIFICIAL INTELLIGENCE; LEARNING SYSTEMS; PROBABILITY DISTRIBUTIONS; REINFORCEMENT LEARNING;

EID: 33745139542     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: None     Document Type: Conference Paper
Times cited : (17)

References (15)
  • 1
    • 84948141808 scopus 로고    scopus 로고
    • The lagging anchor algorithm. Reinforcement learning in two-player zero-sum games with imperfect information
    • Dahl, F. A.: The lagging anchor algorithm. Reinforcement learning in two-player zero-sum games with imperfect information. Machine Learning (to appear).
    • Machine Learning
    • Dahl, F.A.1
  • 2
    • 0004260006 scopus 로고
    • 3rd ed. Academic Press, San Diego
    • Owen, G.: Game Theory. 3rd ed. Academic Press, San Diego (1995).
    • (1995) Game Theory
    • Owen, G.1
  • 3
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Sutton, R. S.: Learning to predict by the methods of temporal differences. Machine Learning 3 (1988) 9–44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 5
    • 0033570798 scopus 로고    scopus 로고
    • A unified analysis of value-function-based reinforcementlearning algorithms
    • Szepesvari, C., Littman, M. L.: A unified analysis of value-function-based reinforcementlearning algorithms. Neural Computation 11 (1999) 2017–2060.
    • (1999) Neural Computation , vol.11 , pp. 2017-2060
    • Szepesvari, C.1    Littman, M.L.2
  • 6
    • 0001046225 scopus 로고
    • Practical issues in temporal difference learning
    • Tesauro, G. J.: Practical issues in temporal difference learning. Machine Learning 8 (1992) 257–277.
    • (1992) Machine Learning , vol.8 , pp. 257-277
    • Tesauro, G.J.1
  • 7
    • 85149834820 scopus 로고
    • Markov games as a framework for multi-agent reinforcement learning
    • Morgan Kaufmann, New Brunswick
    • Littman, M. L.: Markov games as a framework for multi-agent reinforcement learning. In: Proceedings of the 11th International Conference on Machine Learning, Morgan Kaufmann, New Brunswick (1994) 157–163.
    • (1994) Proceedings of the 11th International Conference on Machine Learning , pp. 157-163
    • Littman, M.L.1
  • 8
    • 84974693459 scopus 로고    scopus 로고
    • Minimax TD-learning with neural nets in a Markov game
    • : Lopez de Mantaras, R., Plaza, E. (eds.): ECML 2000, Springer-Verlag, Berlin–Heidelberg–New York
    • Dahl F. A., Halck O. M.: Minimax TD-learning with neural nets in a Markov game. In: Lopez de Mantaras, R., Plaza, E. (eds.): ECML 2000. Proceedings of the 11th European Conference on Machine Learning. Lecture Notes in Computer Science Vol. 1810, Springer-Verlag, Berlin–Heidelberg–New York (2000) 117–128.
    • (2000) Proceedings of the 11th European Conference on Machine Learning. Lecture Notes in Computer Science , vol.1810 , pp. 117-128
    • Dahl, F.A.1    Halck, O.M.2
  • 9
    • 0030170957 scopus 로고    scopus 로고
    • Efficient computation of equilibria for extensive two-person games
    • Koller, D., Megiddo, N., von Stengel, B.: Efficient computation of equilibria for extensive two-person games. Games and Economic Behavior 14 (1996) 247–259.
    • (1996) Games and Economic Behavior , vol.14 , pp. 247-259
    • Koller, D.1    Megiddo, N.2    Von Stengel, B.3
  • 11
    • 0031192989 scopus 로고    scopus 로고
    • Representations and solutions for game-theoretic problems
    • Koller, D., Pfeffer, A.: Representations and solutions for game-theoretic problems. Artificial Intelligence 94 (1997) 167–215.
    • (1997) Artificial Intelligence , vol.94 , pp. 167-215
    • Koller, D.1    Pfeffer, A.2
  • 14
    • 0001826509 scopus 로고
    • Anticipatory learning in two-person games
    • : Selten, R. (ed.), Evolution and game dynamics, Springer-Verlag, Berlin
    • Selten R. (1991). Anticipatory learning in two-person games, in: Selten, R. (ed.): Game equilibrium models, vol. I: Evolution and game dynamics, Springer-Verlag, Berlin.
    • (1991) Game Equilibrium Models , vol.1
    • Selten, R.1
  • 15
    • 0008815685 scopus 로고    scopus 로고
    • On classification of games and evaluation of players – with some sweeping generalizations about the literature
    • : Fürnkranz, J., Kubat, M. (eds.), Jozef Stefan Institute, Ljubljana
    • Halck, O. M., Dahl, F. A.: On classification of games and evaluation of players – with some sweeping generalizations about the literature. In: Fürnkranz, J., Kubat, M. (eds.): Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing, Jozef Stefan Institute, Ljubljana (1999).
    • (1999) Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing
    • Halck, O.M.1    Dahl, F.A.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.