메뉴 건너뛰기




Volumn 28, Issue 2, 2003, Pages 327-345

The empirical bayes envelope and regret minimization in competitive Markov decision processes

Author keywords

Approachability; Bayes envelope; Controlled Markov processes; Regret minimization; Stochastic games

Indexed keywords

DECISION MAKING; GAME THEORY; MATRIX ALGEBRA; STOCHASTIC PROGRAMMING;

EID: 0038386340     PISSN: 0364765X     EISSN: None     Source Type: Journal    
DOI: 10.1287/moor.28.2.327.14483     Document Type: Article
Times cited : (29)

References (31)
  • 3
    • 84972545864 scopus 로고
    • An analog of the minimax theorem for vector payoffs
    • Blackwell, D. 1956a. An analog of the minimax theorem for vector payoffs. Pacific J. Math. 6(1) 1-8.
    • (1956) Pacific J. Math. , vol.6 , Issue.1 , pp. 1-8
    • Blackwell, D.1
  • 4
    • 0013371249 scopus 로고
    • Controlled random walks
    • North-Holland, Amsterdam, The Netherlands
    • _. 1956b. Controlled random walks. Proc. Internat. Congress of Mathematicians, 1954, Vol. III. North-Holland, Amsterdam, The Netherlands, 336-338.
    • (1956) Proc. Internat. Congress of Mathematicians, 1954 , vol.3 , pp. 336-338
  • 7
    • 0032122807 scopus 로고    scopus 로고
    • Simplifying optimal strategies in stochastic games
    • Flesch, J., F. Thuijsman, O. J. Vrieze. 1998. Simplifying optimal strategies in stochastic games. SIAM J. Control Optim. 36(4) 1331-1347.
    • (1998) SIAM J. Control Optim. , vol.36 , Issue.4 , pp. 1331-1347
    • Flesch, J.1    Thuijsman, F.2    Vrieze, O.J.3
  • 8
    • 0002267135 scopus 로고    scopus 로고
    • Adaptive game playing using multiplicative weights
    • Freund, Y., R. Schapire. 1999. Adaptive game playing using multiplicative weights. Games and Econom. Behavior 29 79-103.
    • (1999) Games and Econom. Behavior , vol.29 , pp. 79-103
    • Freund, Y.1    Schapire, R.2
  • 9
    • 0000668347 scopus 로고
    • Universal consistency and cautious fictitious play
    • Fudenberg, D., D. Levine. 1995. Universal consistency and cautious fictitious play. J. Econom. Dynamics and Control 19 1065-1990.
    • (1995) J. Econom. Dynamics and Control , vol.19 , pp. 1065-1990
    • Fudenberg, D.1    Levine, D.2
  • 11
    • 0001976283 scopus 로고
    • Approximation to Bayes risk in repeated play
    • M. Dresher, A. W. Tucker, P. Wolde, eds. Princeton University Press, Princeton, NJ
    • Hannan, J. 1957. Approximation to Bayes risk in repeated play. M. Dresher, A. W. Tucker, P. Wolde, eds. Contribution to the Theory of Games, III. Princeton University Press, Princeton, NJ, 97-139.
    • (1957) Contribution to the Theory of Games , vol.3 , pp. 97-139
    • Hannan, J.1
  • 12
    • 0000908510 scopus 로고    scopus 로고
    • A simple adaptive procedure leading to correlated equilibrium
    • Hart, S., A. Mas-Colell. 2000. A simple adaptive procedure leading to correlated equilibrium. Econometrica 68 1127-1150.
    • (2000) Econometrica , vol.68 , pp. 1127-1150
    • Hart, S.1    Mas-Colell, A.2
  • 13
    • 0013327463 scopus 로고    scopus 로고
    • A general class of adaptive strategies
    • _, _. 2001. A general class of adaptive strategies. J. Econom. Theory 98 26-54.
    • (2001) J. Econom. Theory , vol.98 , pp. 26-54
  • 16
    • 0032182921 scopus 로고    scopus 로고
    • Reliable communication under channel uncertainty
    • Lapidoth, A., P. Narayan. 1998. Reliable communication under channel uncertainty. IEEE Trans. on Inform. Theory 44 2148-2177.
    • (1998) IEEE Trans. on Inform. Theory , vol.44 , pp. 2148-2177
    • Lapidoth, A.1    Narayan, P.2
  • 18
    • 0038634234 scopus 로고    scopus 로고
    • The empirical Bayes envelope approach to regret minimization in stochastic games
    • Faculty of Electrical Engineering, Technion, Haifa, Israel
    • Mannor, S., N. Shimkin. 2000a. The empirical Bayes envelope approach to regret minimization in stochastic games. Technical report No. EE-1262, Faculty of Electrical Engineering, Technion, Haifa, Israel.
    • (2000) Technical Report No. EE-1262 , vol.EE-1262
    • Mannor, S.1    Shimkin, N.2
  • 19
    • 85039666935 scopus 로고    scopus 로고
    • Generalized approachability results for stochastic games witha single communicating state
    • Faculty of Electrical Engineering, Technion, Haifa, Israel
    • _, _. 2000b. Generalized approachability results for stochastic games with a single communicating state. Technical report No. EE-1263, Faculty of Electrical Engineering, Technion, Haifa, Israel.
    • (2000) Technical Report No. EE-1263
  • 21
    • 0004278770 scopus 로고
    • CORE Reprint Nos. Discussion Papers 9420, 9421, and 9422. Center for Operations Research and Economics, Universite Catholique de Louvain, Louvain, Belgium
    • _, S. Sorin, S. Zamir. 1994. Repeated games. CORE Reprint Nos. Discussion Papers 9420, 9421, and 9422. Center for Operations Research and Economics, Universite Catholique de Louvain, Louvain, Belgium.
    • (1994) Repeated Games
    • Sorin, S.1    Zamir, S.2
  • 23
    • 0008192018 scopus 로고    scopus 로고
    • Unpublished Ph.D., Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA
    • Pated, S. 1997. Stochastic shortest path games. Unpublished Ph.D., Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA.
    • (1997) Stochastic Shortest Path Games
    • Pated, S.1
  • 25
    • 0004267646 scopus 로고
    • Princeton University Press, Princeton, NJ
    • Rockafellar, R. 1970. Convex Analysis. Princeton University Press, Princeton, NJ.
    • (1970) Convex Analysis
    • Rockafellar, R.1
  • 26
    • 0013327190 scopus 로고    scopus 로고
    • Minimizing regret: The general case
    • Rustichini, A. 1999. Minimizing regret: The general case. Games and Econom. Behavior 29 224-243.
    • (1999) Games and Econom. Behavior , vol.29 , pp. 224-243
    • Rustichini, A.1
  • 27
    • 0027201360 scopus 로고
    • Guaranteed performance regions in Markovian systems with competing decision makers
    • Shimkin, N., A. Shwartz. 1993. Guaranteed performance regions in Markovian systems with competing decision makers. IEEE Trans. on Automatic Control 38(1) 84-95.
    • (1993) IEEE Trans. on Automatic Control , vol.38 , Issue.1 , pp. 84-95
    • Shimkin, N.1    Shwartz, A.2
  • 28
    • 85039665016 scopus 로고    scopus 로고
    • An approachability condition for general sets
    • Ecole Polytechnique, Paris, France
    • Spinat, X. 1999. An approachability condition for general sets. Technical Report No. 496, Ecole Polytechnique, Paris, France.
    • (1999) Technical Report No. 496 , vol.496
    • Spinat, X.1
  • 31
    • 0032047115 scopus 로고    scopus 로고
    • A game of prediction with expert advice
    • Vovk, V. 1998. A game of prediction with expert advice. J. Comput. Systems Sci. 56(2) 153-173.
    • (1998) J. Comput. Systems Sci. , vol.56 , Issue.2 , pp. 153-173
    • Vovk, V.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.