



Volume 1810, 2000, Pages 117-128

Minimax TD-learning with neural nets in a Markov game

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING SYSTEMS; LINEAR PROGRAMMING; NEURAL NETWORKS; REINFORCEMENT LEARNING

EID: 84974693459     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/3-540-45164-1_13     Document Type: Conference Paper
Times cited: 9

References (13)
  • 1
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • Sutton, R.S.: Learning to predict by the methods of temporal differences, Machine Learning 3 (1988) 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 2
    • 0001046225
    • Practical issues in temporal difference learning
    • Tesauro, G.J.: Practical issues in temporal difference learning, Machine Learning 8 (1992) 257-277.
    • (1992) Machine Learning , vol.8 , pp. 257-277
    • Tesauro, G.J.1
  • 3
    • 0031192989
    • Representations and solutions for game-theoretic problems
    • Koller, D., Pfeffer, A.: Representations and solutions for game-theoretic problems. Artificial Intelligence 94(1) (1997) 167-215.
    • (1997) Artificial Intelligence , vol.94 , Issue.1 , pp. 167-215
    • Koller, D.1    Pfeffer, A.2
  • 4
    • 0008815685
    • On classification of games and evaluation of players - With some sweeping generalizations about the literature
    • Fürnkranz, J., Kubat, M. (eds.), Jozef Stefan Institute, Ljubljana
    • Halck, O.M., Dahl, F.A.: On classification of games and evaluation of players - with some sweeping generalizations about the literature. In: Fürnkranz, J., Kubat, M. (eds.): Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing, Jozef Stefan Institute, Ljubljana (1999).
    • (1999) Proceedings of the ICML-99 Workshop on Machine Learning in Game Playing
    • Halck, O.M.1    Dahl, F.A.2
  • 5
    • 84876767994
    • Poker as a testbed for machine intelligence research
    • Mercer, R., Neufeld, E. (eds.), Springer-Verlag, Berlin-Heidelberg-New York
    • Billings, D., Papp, D., Schaeffer, J., Szafron, D.: Poker as a testbed for machine intelligence research. In: Mercer, R., Neufeld, E. (eds.): Advances in Artificial Intelligence, Springer-Verlag, Berlin-Heidelberg-New York (1998) 228-238.
    • (1998) Advances in Artificial Intelligence , pp. 228-238
    • Billings, D.1    Papp, D.2    Schaeffer, J.3    Szafron, D.4
  • 6
    • 85149834820
    • Markov games as a framework for multi-agent reinforcement learning
    • Morgan Kaufmann, New Brunswick
    • Littman, M.L.: Markov games as a framework for multi-agent reinforcement learning. In: Proceedings of the 11th International Conference on Machine Learning, Morgan Kaufmann, New Brunswick (1994) 157-163.
    • (1994) Proceedings of the 11th International Conference on Machine Learning , pp. 157-163
    • Littman, M.L.1
  • 9
    • 84945311957
    • Three games designed for the study of human and automated decision making
    • FFI/RAPPORT-98/02799, Norwegian Defence Research Establishment (FFI), Kjeller, Norway
    • Dahl, F.A., Halck, O.M.: Three games designed for the study of human and automated decision making. Definition and properties of the games Campaign, Operation Lucid and Operation Opaque. FFI/RAPPORT-98/02799, Norwegian Defence Research Establishment (FFI), Kjeller, Norway (1998).
    • (1998) Definition and Properties of the Games Campaign, Operation Lucid and Operation Opaque
    • Dahl, F.A.1    Halck, O.M.2
  • 10
    • 0008839797
    • The Tactical Air Game: A multimove game with mixed strategy solution
    • Grote, J.D. (ed.), Dordrecht, The Netherlands
    • Berkovitz, L.D.: The Tactical Air Game: A multimove game with mixed strategy solution. In: Grote, J.D. (ed.): The Theory and Application of Differential Games, Reidel Publishing Company, Dordrecht, The Netherlands (1975) 169-177.
    • (1975) The Theory and Application of Differential Games, Reidel Publishing Company , pp. 169-177
    • Berkovitz, L.D.1
  • 13
    • 0033570798
    • A unified analysis of value-function-based reinforcement-learning algorithms
    • Szepesvari, C., Littman, M.L.: A unified analysis of value-function-based reinforcement-learning algorithms. Neural Computation 11 (1999) 2017-2060.
    • (1999) Neural Computation , vol.11 , pp. 2017-2060
    • Szepesvari, C.1    Littman, M.L.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.