SCOPUS Information Retrieval Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 2111, 2001, Pages 128-142

Adaptive strategies and regret minimization in arbitrarily varying Markov environments

Authors (2): Mannor, Shie (a); Shimkin, Nahum (a)

(a) Technion Israel Institute of Technology (Israel)

Author keywords

[No Author keywords available]

Indexed keywords

GAME THEORY; STOCHASTIC MODELS; STOCHASTIC SYSTEMS;

ADAPTIVE STRATEGY; ATTAINABLE SOLUTIONS; CONTROL STRATEGIES; GUARANTEED PERFORMANCE; MARKOVIAN DYNAMICS; REGRET MINIMIZATION; SINGLE CONTROLLERS; STATE TRANSITIONS;

COMPUTATION THEORY;

EID: 84943237201 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-44581-1_9 Document Type: Conference Paper

Times cited : (1)

References (22)

1
- 84943335811
- Special issue on learning in games, November
- Special issue on learning in games. Games and Economic Behavior, 29(1), November 1999.
- (1999) Games and Economic Behavior , vol.29 , Issue.1

2
- 0029513526
- Gambling in a rigged casino: The adversarial multi-armed bandit problem
- IEEE Computer Society Press
- P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire. Gambling in a rigged casino: The adversarial multi-armed bandit problem. In Proc. 36th Annual Symposium on Foundations of Computer Science, pages 322–331. IEEE Computer Society Press, 1995.
- (1995) Proc. 36th Annual Symposium on Foundations of Computer Science , pp. 322-331
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

3
- 0003487482
- Athena Scientific
- D.P. Bertsekas and J.N. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, 1995.
- (1995) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

4
- 84972545864
- An analog of the minimax theorem for vector payoffs
- D. Blackwell. An analog of the minimax theorem for vector payoffs. Pacific J. Math., 6(1):1–8, 1956.
- (1956) Pacific J. Math. , vol.6 , Issue.1 , pp. 1-8
- Blackwell, D.¹

5
- 0013371249
- Controlled random walks
- North-Holland
- D. Blackwell. Controlled random walks. In Proc. International Congress of Mathematicians, 1954, volume 3, pages 336–338. North-Holland, 1956.
- (1956) Proc. International Congress of Mathematicians, 1954 , vol.3 , pp. 336-338
- Blackwell, D.¹

6
- 0003989209
- Springer Verlag
- J. Filar and K. Vrieze. Competitive Markov Decision Processes. Springer Verlag, 1996.
- (1996) Competitive Markov Decision Processes
- Filar, J.¹ Vrieze, K.²

7
- 0002267135
- Adaptive game playing using multiplicative weights
- November
- Y. Freund and R. Schapire. Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29:79–103, November 1999.
- (1999) Games and Economic Behavior , vol.29 , pp. 79-103
- Freund, Y.¹ Schapire, R.²

8
- 0000668347
- Universal consistency and cautious fictitious play
- D. Fudenberg and D. Levine. Universal consistency and cautious fictitious play. Journal of Economic Dynamics and Control, 19:1065–1089, 1995.
- (1995) Journal of Economic Dynamics and Control , vol.19 , pp. 1065-1089
- Fudenberg, D.¹ Levine, D.²

9
- 0001976283
- Approximation to Bayes risk in repeated play
- M. Dresher, A. W. Tucker, and P. Wolfe, editors, Princeton University Press
- J. Hannan. Approximation to Bayes risk in repeated play. In M. Dresher, A. W. Tucker, and P. Wolfe, editors, Contributions to the Theory of Games, III, pages 97–139. Princeton University Press, 1957.
- (1957) Contributions to the Theory of Games, III , pp. 97-139
- Hannan, J.¹

10
- 0003665818
- DP 166, The Hebrew University of Jerusalem, Center for Rationality
- S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. DP 166, The Hebrew University of Jerusalem, Center for Rationality, 1998.
- (1998) A Simple Adaptive Procedure Leading to Correlated Equilibrium
- Hart, S.¹ Mas-Colell, A.²

11
- 0038295434
- Preprint, May
- E. Lehrer. Approachability in infinite dimensional spaces and an application: A universal algorithm for generating extended normal numbers. Preprint, May 1998.
- (1998) Approachability in Infinite Dimensional Spaces and an Application: A Universal Algorithm for Generating Extended Normal Numbers
- Lehrer, E.¹

12
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Morgan Kaufmann
- M.L. Littman. Markov games as a framework for multi-agent reinforcement learning. In Eleventh International Conference on Machine Learning, pages 157–163. Morgan Kaufmann, 1994.
- (1994) Eleventh International Conference on Machine Learning , pp. 157-163
- Littman, M.L.¹

13
- 0038634234
- Technical report EE-1262, Faculty of Electrical Engineering, Technion, Israel, October
- S. Mannor and N. Shimkin. The empirical Bayes envelope approach to regret minimization in stochastic games. Technical report EE-1262, Faculty of Electrical Engineering, Technion, Israel, October 2000. Available from: http://tiger.technion.ac.il/~shie/Public/drmOct23techreport.ps.gz.
- (2000) The Empirical Bayes Envelope Approach to Regret Minimization in Stochastic Games
- Mannor, S.¹ Shimkin, N.²

14
- 9444223591
- Technical report EE-1242, Faculty of Electrical Engineering, Technion, Israel, March
- S. Mannor and N. Shimkin. Regret minimization in signal space for repeated matrix games with partial observations. Technical report EE-1242, Faculty of Electrical Engineering, Technion, Israel, March 2000. Available from: http://tiger.technion.ac.il/~shie/Public/beMar16.ps.gz.
- (2000) Regret Minimization in Signal Space for Repeated Matrix Games with Partial Observations
- Mannor, S.¹ Shimkin, N.²

15
- 0002282886
- Markov games - a survey
- T. Parthasarathy and M. Stern. Markov games - a survey. Differential Games and Control Theory, 1977.
- (1977) Differential Games and Control Theory
- Parthasarathy, T.¹ Stern, M.²

16
- 0008192018
- PhD thesis, LIDS MIT, January
- S.D. Patek. Stochastic Shortest Path Games. PhD thesis, LIDS MIT, January 1997.
- (1997) Stochastic Shortest Path Games
- Patek, S.D.¹

17
- 0003998452
- Wiley-Interscience
- M. Puterman. Markov Decision Processes. Wiley-Interscience, 1994.
- (1994) Markov Decision Processes
- Puterman, M.¹

18
- 0003582821
- Blackwell
- E. Rasmusen. Games and Information: An Introduction to Game Theory. Blackwell, 1994.
- (1994) Games and Information: An Introduction to Game Theory
- Rasmusen, E.¹

19
- 0013327190
- Minimizing regret: The general case
- November
- A. Rustichini. Minimizing regret: the general case. Games and Economic Behavior, 29:224–243, November 1999.
- (1999) Games and Economic Behavior , vol.29 , pp. 224-243
- Rustichini, A.¹

20
- 0027201360
- Guaranteed performance regions in Markovian systems with competing decision makers
- January
- N. Shimkin and A. Shwartz. Guaranteed performance regions in Markovian systems with competing decision makers. IEEE Trans. on Automatic Control, 38(1):84–95, January 1993.
- (1993) IEEE Trans. on Automatic Control , vol.38 , Issue.1 , pp. 84-95
- Shimkin, N.¹ Shwartz, A.²

21
- 84943268461
- Technical Report 496, Ecole Polytechnique, Paris
- X. Spinat. An approachability condition for general sets. Technical Report 496, Ecole Polytechnique, Paris, 1999.
- (1999) An Approachability Condition for General Sets
- Spinat, X.¹

22
- 0032047115
- A game of prediction with expert advice
- April
- V. Vovk. A game of prediction with expert advice. Journal of Computer and System Sciences, 56(2):153–173, April 1998.
- (1998) Journal of Computer and System Sciences , vol.56 , Issue.2 , pp. 153-173
- Vovk, V.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.