메뉴 건너뛰기




Volumn , Issue , 1998, Pages 64-71

How to explore your opponent's strategy (almost) optimally

Author keywords

[No Author keywords available]

Indexed keywords

EXPECTED UTILITY; EXPLORATION METHODS; EXPLORATION STRATEGIES; ITERATED PRISONER'S DILEMMA; LEARNING METHODS; MODEL-BASED OPC; MODELING-BASED LEARNING; OPPONENT MODELING;

EID: 0011717041     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICMAS.1998.699033     Document Type: Conference Paper
Times cited : (13)

References (14)
  • 2
    • 0011471586 scopus 로고
    • The complexity of computing a best response automaton in repeated games with mixed strategies
    • E. Ben-Porath. The complexity of computing a best response automaton in repeated games with mixed strategies. Games and Economic Behavior, 2:1-12, 1990.
    • (1990) Games and Economic Behavior , vol.2 , pp. 1-12
    • Ben-Porath, E.1
  • 6
    • 0003328374 scopus 로고
    • Neural network exploration using optimal experimental design
    • J. D. Cowan, G. Tesauro, and J. Alspector, editors Morgan Caufmann
    • D. A. Cohn. Neural network exploration using optimal experimental design. In J. D. Cowan, G. Tesauro, and J. Alspector, editors, Advances in Neural Information Processing Systems 6, pages 679-686. Morgan Caufmann, 1994.
    • (1994) Advances in Neural Information Processing Systems , vol.6 , pp. 679-686
    • Cohn, D.A.1
  • 9
    • 38249029225 scopus 로고
    • The complexity of computing best response Automata in repeated games
    • I. Gilboa. The complexity of computing best response Automata in repeated games. Journal of economic theory, 45:342-352,1988.
    • (1988) Journal of Economic Theory , vol.45 , pp. 342-352
    • Gilboa, I.1
  • 10
    • 38249006045 scopus 로고
    • Bounded versus unbounded rationality: The tyranny of the weak
    • I. Gilboa and D. Samet. Bounded versus unbounded rationality: The tyranny of the weak. Games and Economic Behavior, 1:213-221,1989.
    • (1989) Games and Economic Behavior , vol.1 , pp. 213-221
    • Gilboa, I.1    Samet, D.2
  • 12
    • 46149134052 scopus 로고
    • Finite automata play the repeated Prisoner's Dilemma
    • A Rubinstein. Finite automata play the repeated Prisoner's Dilemma. Journal of Economic Theory, 39:83-96,1986.
    • (1986) Journal of Economic Theory , vol.39 , pp. 83-96
    • Rubinstein, A.1
  • 13
    • 0030050933 scopus 로고
    • Multiagent reinforcement learning and the iterated Prisoner's Dilemma
    • T. W. Sandholm and R. H. Crites. Multiagent reinforcement learning and the iterated Prisoner's Dilemma. Biosystems Journal, 37:147-166,1995.
    • (1995) Biosystems Journal , vol.37 , pp. 147-166
    • Sandholm, T.W.1    Crites, R.H.2
  • 14
    • 0002210775 scopus 로고
    • The role of exploration in learning control
    • D. A. White and D. Sopfge, editors Multiscience Press Inc
    • S. B. Thrun. The role of exploration in learning control. In D. A. White and D. Sopfge, editors, Handbookfor Intelligent Control. Multiscience Press Inc., 1992.
    • (1992) Handbookfor Intelligent Control
    • Thrun, S.B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.