SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Journal of Machine Learning Research

Volumn 12, Issue , 2011, Pages 1896-1921

Internal regret with partial monitoring: Calibration-based optimal algorithms

(1) Perchet, Vianney a

a Paris Descartes University (France)

Author keywords

Calibration; On line learning; Partial monitoring; Regret; Repeated games; Vorono and Laguerre diagrams

Indexed keywords

DECISION MAKING;

LAGUERRE; ONLINE LEARNING; OPTIMAL ALGORITHM; RANDOM ALGORITHMS; RANDOM FEEDBACKS; REGRET; REPEATED GAMES; SEQUENTIAL DECISIONS;

CALIBRATION;

EID: 79960129843 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Article

Times cited : (23)

References (32)

1
- 0037709910
- The nonstochastic multiarmed bandit problem
- electronic
- P. Auer, N. Cesa-Bianchi, Y. Freund, and R. E. Schapire. The nonstochastic multiarmed bandit problem. SIAM J. Comput., 32:48-77 (electronic), 2002/03.
- (2002) SIAM J. Comput. , vol.32 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

2
- 0037520770
- d+1
- F. Aurenhammer. A criterion for the affine equivalence of cell complexes in Rd and convex polyhedra in Rd+1. Discrete Comput. Geom., 2:49-64, 1987.
- (1987) Discrete Comput. Geom. , vol.2 , pp. 49-64
- Aurenhammer, F.¹

3
- 84972574511
- Weighted sums of certain dependent random variables
- K. Azuma. Weighted sums of certain dependent random variables. Tôhoku Math. J. (2), 19:357-367, 1967.
- (1967) Tôhoku Math. J. , vol.2 , Issue.19 , pp. 357-367
- Azuma, K.¹

4
- 0000672715
- Fiber polytopes
- L. J. Billera and B. Sturmfels. Fiber polytopes. The Annals of Mathematics, 135(3):pp. 527-549, 1992.
- (1992) The Annals of Mathematics , vol.135 , Issue.3 , pp. 527-549
- Billera, L.J.¹ Sturmfels, B.²

5
- 84972545864
- An analog of the minimax theorem for vector payoffs
- D. Blackwell. An analog of the minimax theorem for vector payoffs. Pacific J. Math., 6:1-8, 1956a.
- (1956) Pacific J. Math. , vol.6 , pp. 1-8
- Blackwell, D.¹

6
- 0013371249
- Controlled random walks
- D. Blackwell. Controlled random walks. In Proceedings of the International Congress of Mathematicians, 1954, Amsterdam, vol. III, pages 336-338, 1956b.
- (1956) Proceedings of the International Congress of Mathematicians 1954, Amsterdam , vol.3 , pp. 336-338
- Blackwell, D.¹

7
- 34547254640
- From external to internal regret
- A. Blum and Y. Mansour. From external to internal regret. J. Mach. Learn. Res., 8:1307-1324 (electronic), 2007. (Pubitemid 47143711)
- (2007) Journal of Machine Learning Research , vol.8 , pp. 1307-1324
- Blum, A.¹ Mansour, Y.²

8
- 0039956166
- Partition of space
- R. C. Buck. Partition of space. Amer. Math. Monthly, 50:541-544, 1943.
- (1943) Amer. Math. Monthly , vol.50 , pp. 541-544
- Buck, R.C.¹

9
- 84926078662
- Cambridge University Press, Cambridge
- N. Cesa-Bianchi and G. Lugosi. Prediction, Learning, and Games. Cambridge University Press, Cambridge, 2006.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

10
- 20544462399
- Minimizing regret with label efficient prediction
- DOI 10.1109/TIT.2005.847729
- N. Cesa-Bianchi, G. Lugosi, and G. Stoltz. Minimizing regret with label efficient prediction. IEEE Trans. Inform. Theory, 51:2152-2162, 2005. (Pubitemid 40843632)
- (2005) IEEE Transactions on Information Theory , vol.51 , Issue.6 , pp. 2152-2162
- Cesa-Bianchi, N.¹ Lugosi, G.² Stoltz, G.³

11
- 0030544315
- Laws of large numbers for Hilbert space-valued mixingales with applications
- X. Chen and H. White. Laws of large numbers for Hilbert space-valued mixingales with applications. Econometric Theory, 12:284-304, 1996.
- (1996) Econometric Theory , vol.12 , pp. 284-304
- Chen, X.¹ White, H.²

12
- 84950454029
- The well-calibrated Bayesian
- A. P. Dawid. The well-calibrated Bayesian. J. Amer. Statist. Assoc., 77:605-613, 1982.
- (1982) J. Amer. Statist. Assoc. , vol.77 , pp. 605-613
- Dawid, A.P.¹

13
- 0031256578
- Calibrated learning and correlated equilibrium
- DOI 10.1006/game.1997.0595, PII S0899825697905959
- D. P. Foster and R. V. Vohra. Calibrated learning and correlated equilibrium. Games Econom. Behav., 21:40-55, 1997. (Pubitemid 127175523)
- (1997) Games and Economic Behavior , vol.21 , Issue.1-2 , pp. 40-55
- Foster, D.P.¹ Vohra, R.V.²

14
- 0037539108
- Asymptotic calibration
- D. P. Foster and R. V. Vohra. Asymptotic calibration. Biometrika, 85:379-390, 1998.
- (1998) Biometrika , vol.85 , pp. 379-390
- Foster, D.P.¹ Vohra, R.V.²

15
- 0002384441
- On tail probabilities for martingales
- D. A. Freedman. On tail probabilities for martingales. Ann. Probability, 3:100-118, 1975.
- (1975) Ann. Probability , vol.3 , pp. 100-118
- Freedman, D.A.¹

16
- 0008547696
- Conditional universal consistency
- D. Fudenberg and D. K. Levine. Conditional universal consistency. Games Econom. Behav., 29: 104-130, 1999.
- (1999) Games Econom. Behav. , vol.29 , pp. 104-130
- Fudenberg, D.¹ Levine, D.K.²

17
- 0001976283
- Approximation to Bayes risk in repeated play
- Princeton University Press, Princeton, N. J.
- J. Hannan. Approximation to Bayes risk in repeated play. In Contributions to the Theory of Games, volume 3 of Annals ofMathematics Studies, pages 97-139. Princeton University Press, Princeton, N. J., 1957.
- (1957) Contributions to the Theory of Games Volume 3 of Annals OfMathematics Studies , pp. 97-139
- Hannan, J.¹

18
- 0000908510
- A simple adaptive procedure leading to correlated equilibrium
- S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127-1150, 2000.
- (2000) Econometrica , vol.68 , pp. 1127-1150
- Hart, S.¹ Mas-Colell, A.²

19
- 84947403595
- Probability inequalities for sums of bounded random variables
- W. Hoeffding. Probability inequalities for sums of bounded random variables. J. Amer. Statist. Assoc., 58:13-30, 1963.
- (1963) J. Amer. Statist. Assoc. , vol.58 , pp. 13-30
- Hoeffding, W.¹

20
- 77951952841
- Near-optimal regret bounds for reinforcement learning
- T. Jaksch, R. Ortner, and P. Auer. Near-optimal regret bounds for reinforcement learning. J. Mach. Learn. Res., 11:1563-1600, 2010.
- (2010) J. Mach. Learn. Res. , vol.11 , pp. 1563-1600
- Jaksch, T.¹ Ortner, R.² Auer, P.³

21
- 77952055124
- E. Lehrer and E. Solan. Learning to play partially-specified equilibrium. manuscript, 2007.
- (2007) Learning to Play Partially-specified Equilibrium Manuscript
- Lehrer, E.¹ Solan, E.²

22
- 0000176346
- Equilibrium points of bimatrix games
- C. E. Lemke and J. T. Howson, Jr. Equilibrium points of bimatrix games. J. Soc. Indust. Appl. Math., 12:413-423, 1964.
- (1964) J. Soc. Indust. Appl. Math. , vol.12 , pp. 413-423
- Lemke, C.E.¹ Howson Jr., J.T.²

23
- 61349116274
- Strategies for prediction under imperfect monitoring
- G. Lugosi, S. Mannor, and G. Stoltz. Strategies for prediction under imperfect monitoring. Math. Oper. Res., 33:513-528, 2008.
- (2008) Math. Oper. Res. , vol.33 , pp. 513-528
- Lugosi, G.¹ Mannor, S.² Stoltz, G.³

24
- 77952069205
- Calibration and internal no-regret with random signals
- V. Perchet. Calibration and internal no-regret with random signals. Proceedings of the 20th International Conference on Algorithmic Learning Theory, pages 68-82, 2009.
- (2009) Proceedings of the 20th International Conference on Algorithmic Learning Theory , pp. 68-82
- Perchet, V.¹

25
- 0030523539
- Projections of polytopes and the generalized baues conjecture
- J. Rambau and G. M. Ziegler. Projections of polytopes and the generalized Baues conjecture. Discrete Comput. Geom., 16:215-237, 1996. (Pubitemid 126317943)
- (1996) Discrete and Computational Geometry , vol.16 , Issue.3 , pp. 215-237
- Rambau, J.¹ Ziegler, G.M.²

26
- 0004031920
- Princeton University Press, Princeton, N.J.
- R. T. Rockafellar. Convex Analysis. Princeton Mathematical Series, No. 28. Princeton University Press, Princeton, N.J., 1970.
- (1970) Convex Analysis. Princeton Mathematical Series , Issue.28
- Rockafellar, R.T.¹

27
- 0013327190
- Minimizing regret: The general case
- A. Rustichini. Minimizing regret: the general case. Games Econom. Behav., 29:224-243, 1999.
- (1999) Games Econom. Behav. , vol.29 , pp. 224-243
- Rustichini, A.¹

28
- 0003570325
- Springer Series in Statistics. Springer-Verlag, New York, second edition
- E. Seneta. Nonnegative Matrices and Markov Chains. Springer Series in Statistics. Springer-Verlag, New York, second edition, 1981.
- (1981) Nonnegative Matrices and Markov Chains
- Seneta, E.¹

29
- 0040104631
- Supergames
- Econom. Theory Econometrics Math. Econom., Academic Press, San Diego, CA
- S. Sorin. Supergames. In Game theory and applications (Columbus, OH, 1987), Econom. Theory Econometrics Math. Econom., pages 46-63. Academic Press, San Diego, CA, 1990.
- (1990) Game Theory and Applications (Columbus, OH 1987) , pp. 46-63
- Sorin, S.¹

30
- 77952029796
- Unpublished Lecture Notes
- S. Sorin. Lectures on Dynamics in Games. Unpublished Lecture Notes, 2008.
- (2008) Lectures on Dynamics in Games
- Sorin, S.¹

31
- 0000836223
- Exponential inequalities for sums of random vectors
- V. Yurinskii. Exponential inequalities for sums of random vectors. Journal of Multivariate Analysis, 6:473-499, 1976.
- (1976) Journal of Multivariate Analysis , vol.6 , pp. 473-499
- Yurinskii, V.¹

32
- 0003589701
- Springer-Verlag, New York
- G. Ziegler. Lectures on Polytopes, volume 152 of Graduate Texts in Mathematics. Springer-Verlag, New York, 1995.
- (1995) Lectures on Polytopes Volume 152 of Graduate Texts in Mathematics
- Ziegler, G.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.