SCOPUS 정보 검색 플랫폼

Journal of Machine Learning Research

Volumn 8, Issue , 2007, Pages 1307-1324

From external to internal regret

(2) Blum, Avrim a Mansour, Yishay b

a Carnegie Mellon University (United States)

b TEL AVIV UNIVERSITY (Israel)

Author keywords

External regret; Internal regret; Multi arm bandit; Online learning; Reductions; Sleeping experts

Indexed keywords

EXTERNAL REGRET; INTERNAL REGRET; MULTI-ARM BANDIT; ONLINE LEARNING; SLEEPING EXPERTS;

MATHEMATICAL MODELS; ONLINE SYSTEMS; PROBLEM SOLVING;

ALGORITHMS;

EID: 34547254640 PISSN: 15324435 EISSN: 15337928 Source Type: Journal
DOI: None Document Type: Article

Times cited : (274)

References (29)

1
- 0037709910
- The nonstochastic multiarmed bandit problem
- P. Auer, N. Cesa-Bianchi, Y. Freund, and R.E. Schapire. The nonstochastic multiarmed bandit problem. SIAM Journal on Computing, 32(1):48-77, 2002a.
- (2002) SIAM Journal on Computing , vol.32 , Issue.1 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

2
- 0036477185
- Adaptive and self-confident on-line learning algorithms
- P. Auer, N. Cesa-Bianchi, and C. Gentile. Adaptive and self-confident on-line learning algorithms. Journal of Computing and System Sciences, 64(1):48-75, 2002b.
- (2002) Journal of Computing and System Sciences , vol.64 , Issue.1 , pp. 48-75
- Auer, P.¹ Cesa-Bianchi, N.² Gentile, C.³

3
- 0002430114
- Subjectivity and correlation in randomized strategies
- R. J. Aumann. Subjectivity and correlation in randomized strategies. Journal of Mathematical Economics, 1:67-96, 1974.
- (1974) Journal of Mathematical Economics , vol.1 , pp. 67-96
- Aumann, R.J.¹

4
- 84972545864
- An analog of the mimimax theorem for vector payoffs
- D. Blackwell. An analog of the mimimax theorem for vector payoffs. Pacific Journal of Mathematics, 6:1-8, 1956.
- (1956) Pacific Journal of Mathematics , vol.6 , pp. 1-8
- Blackwell, D.¹

5
- 0030819669
- Empirical support for Winnow and Weighted-Majority based algorithms: Results on a calendar scheduling domain
- A. Blum. Empirical support for Winnow and Weighted-Majority based algorithms: Results on a calendar scheduling domain. Machine Learning, 26:5-23, 1997.
- (1997) Machine Learning , vol.26 , pp. 5-23
- Blum, A.¹

6
- 0031140246
- How to use expert advice
- N. Cesa-Bianchi, Y. Freund, D.P. Helmbold, D. Haussler, R.E. Schapire, and M.K. Warmuth. How to use expert advice. Journal of the Association for Computing Machinery (JACM), 44(3):427-485, 1997.
- (1997) Journal of the Association for Computing Machinery (JACM) , vol.44 , Issue.3 , pp. 427-485
- Cesa-Bianchi, N.¹ Freund, Y.² Helmbold, D.P.³ Haussler, D.⁴ Schapire, R.E.⁵ Warmuth, M.K.⁶

7
- 0037614825
- Potential-based algorithms in on-line prediction and game theory
- N. Cesa-Bianchi and G. Lugosi. Potential-based algorithms in on-line prediction and game theory. Machine Learning, 51(3):239-261, 2003.
- (2003) Machine Learning , vol.51 , Issue.3 , pp. 239-261
- Cesa-Bianchi, N.¹ Lugosi, G.²

8
- 33748442333
- Regret minimization under partial monitoring
- N. Cesa-Bianchi, G. Lugosi, and G. Stoltz. Regret minimization under partial monitoring. Mathematics of Operations Research, 31:562-580, 2006.
- (2006) Mathematics of Operations Research , vol.31 , pp. 562-580
- Cesa-Bianchi, N.¹ Lugosi, G.² Stoltz, G.³

9
- 26944464957
- Improved second-order bounds for prediction with expert advice
- N. Cesa-Bianchi, Y. Mansour, and G. Stoltz. Improved second-order bounds for prediction with expert advice. In Proceedings of the Eighteenth Annual Conference on Computational Learning Theory, pages 217-232, 2005.
- (2005) Proceedings of the Eighteenth Annual Conference on Computational Learning Theory , pp. 217-232
- Cesa-Bianchi, N.¹ Mansour, Y.² Stoltz, G.³

10
- 26944455042
- Learning to query the web
- W. Cohen and Y. Singer. Learning to query the web. In AAAI Workshop on Internet-Based Information Systems, 1996.
- (1996) AAAI Workshop on Internet-Based Information Systems
- Cohen, W.¹ Singer, Y.²

11
- 0001345686
- Context-sensitive learning methods for text categorization
- W. Cohen and Y. Singer. Context-sensitive learning methods for text categorization. ACM Transactions on Information Systems, 17(2):141-173, 1999.
- (1999) ACM Transactions on Information Systems , vol.17 , Issue.2 , pp. 141-173
- Cohen, W.¹ Singer, Y.²

12
- 0003421261
- Wiley
- W. Feller. An Introduction to Probability Theory and its Applications. - Vol. 1. Wiley, 1968.
- (1968) An Introduction to Probability Theory and its Applications , vol.1
- Feller, W.¹

13
- 0003421261
- Wiley
- W. Feller. An Introduction to Probability Theory and its Applications. - Vol. 2. Wiley, 1971.
- (1971) An Introduction to Probability Theory and its Applications , vol.2
- Feller, W.¹

14
- 0002095886
- A randomization rule for selecting forecasts
- July-August
- D. Foster and R. Vohra. A randomization rule for selecting forecasts. Operations Research, 41(4): 704-709, July-August 1993.
- (1993) Operations Research , vol.41 , Issue.4 , pp. 704-709
- Foster, D.¹ Vohra, R.²

15
- 0031256578
- Calibrated learning and correlated equilibrium
- D. Foster and R. Vohra. Calibrated learning and correlated equilibrium. Games and Economic Behavior, 21:40-55, 1997.
- (1997) Games and Economic Behavior , vol.21 , pp. 40-55
- Foster, D.¹ Vohra, R.²

16
- 0037539108
- Asymptotic calibration
- D. Foster and R. Vohra. Asymptotic calibration. Biometrika, 85:379-390, 1998.
- (1998) Biometrika , vol.85 , pp. 379-390
- Foster, D.¹ Vohra, R.²

17
- 0002476325
- Regret in the on-line decision problem
- D. Foster and R. Vohra. Regret in the on-line decision problem. Games and Economic Behavior, 29:7-36, 1999.
- (1999) Games and Economic Behavior , vol.29 , pp. 7-36
- Foster, D.¹ Vohra, R.²

18
- 0031211090
- A decision-theoretic generalization of on-line learning and an application to boosting
- Y. Freund and R.E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119-139, 1997.
- (1997) Journal of Computer and System Sciences , vol.55 , Issue.1 , pp. 119-139
- Freund, Y.¹ Schapire, R.E.²

19
- 0002267135
- Adaptive game playing using multiplicative weights
- Y. Freund and R.E. Schapire. Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29:79-103, 1999.
- (1999) Games and Economic Behavior , vol.29 , pp. 79-103
- Freund, Y.¹ Schapire, R.E.²

20
- 0030643068
- Using and combining predictors that specialize
- Y. Freund, R.E. Schapire, Y. Singer, and M.K. Warmuth. Using and combining predictors that specialize. In Proceedings of the 29th Annual Symposium on Theory of Computing, pages 334-343, 1997.
- (1997) Proceedings of the 29th Annual Symposium on Theory of Computing , pp. 334-343
- Freund, Y.¹ Schapire, R.E.² Singer, Y.³ Warmuth, M.K.⁴

21
- 0001976283
- Approximation to Bayes risk in repeated plays
- M. Dresher, A. Tucker, and P. Wolfe, editors, Princeton University Press
- J. Hannan. Approximation to Bayes risk in repeated plays. In M. Dresher, A. Tucker, and P. Wolfe, editors, Contributions to the Theory of Games, volume 3, pages 97-139. Princeton University Press, 1957.
- (1957) Contributions to the Theory of Games , vol.3 , pp. 97-139
- Hannan, J.¹

22
- 0000908510
- A simple adaptive procedure leading to correlated equilibrium
- S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127-1150, 2000.
- (2000) Econometrica , vol.68 , pp. 1127-1150
- Hart, S.¹ Mas-Colell, A.²

23
- 0242684983
- A reinforcement procedure leading to correlated equilibrium
- Wilhelm Neuefeind Gerard Debreu and Walter Trockel, editors, Springer
- S. Hart and A. Mas-Colell. A reinforcement procedure leading to correlated equilibrium. In Wilhelm Neuefeind Gerard Debreu and Walter Trockel, editors, Economic Essays, pages 181-200. Springer, 2001.
- (2001) Economic Essays , pp. 181-200
- Hart, S.¹ Mas-Colell, A.²

24
- 0038404996
- A wide range no-regret theorem
- E. Lehrer. A wide range no-regret theorem. Games and Economic Behavior, 42:101-115, 2003.
- (2003) Games and Economic Behavior , vol.42 , pp. 101-115
- Lehrer, E.¹

25
- 34250091945
- Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm
- N. Littlestone. Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm. Machine Learning, 2:285-318, 1988.
- (1988) Machine Learning , vol.2 , pp. 285-318
- Littlestone, N.¹

26
- 35148838877
- The weighted majority algorithm
- N. Littlestone and M.K. Warmuth. The weighted majority algorithm. Information and Computation, 108:212-261, 1994.
- (1994) Information and Computation , vol.108 , pp. 212-261
- Littlestone, N.¹ Warmuth, M.K.²

27
- 34547275330
- PhD thesis, Dept. of Mathematics, University Paris XI, ORSAY
- G. Stoltz. Incomplete Information and Internal Regret in Prediction of Individual Sequences. PhD thesis, Dept. of Mathematics, University Paris XI, ORSAY, 2005.
- (2005) Incomplete Information and Internal Regret in Prediction of Individual Sequences
- Stoltz, G.¹

28
- 21244487467
- Internal regret in on-line portfolio selection
- G. Stoltz and G. Lugosi. Internal regret in on-line portfolio selection. Machine Learning, 59(1-2): 125-159, 2005.
- (2005) Machine Learning , vol.59 , Issue.1-2 , pp. 125-159
- Stoltz, G.¹ Lugosi, G.²

29
- 33947600544
- Learning correlated equilibria in games with compact sets of strategies
- G. Stoltz and G. Lugosi. Learning correlated equilibria in games with compact sets of strategies. Games and Economic Behavior, 59:187-209, 2007.
- (2007) Games and Economic Behavior , vol.59 , pp. 187-209
- Stoltz, G.¹ Lugosi, G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.