메뉴 건너뛰기




Volumn 8, Issue , 2007, Pages 1307-1324

From external to internal regret

Author keywords

External regret; Internal regret; Multi arm bandit; Online learning; Reductions; Sleeping experts

Indexed keywords

EXTERNAL REGRET; INTERNAL REGRET; MULTI-ARM BANDIT; ONLINE LEARNING; SLEEPING EXPERTS;

EID: 34547254640     PISSN: 15324435     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Article
Times cited : (274)

References (29)
  • 3
    • 0002430114 scopus 로고
    • Subjectivity and correlation in randomized strategies
    • R. J. Aumann. Subjectivity and correlation in randomized strategies. Journal of Mathematical Economics, 1:67-96, 1974.
    • (1974) Journal of Mathematical Economics , vol.1 , pp. 67-96
    • Aumann, R.J.1
  • 4
    • 84972545864 scopus 로고
    • An analog of the mimimax theorem for vector payoffs
    • D. Blackwell. An analog of the mimimax theorem for vector payoffs. Pacific Journal of Mathematics, 6:1-8, 1956.
    • (1956) Pacific Journal of Mathematics , vol.6 , pp. 1-8
    • Blackwell, D.1
  • 5
    • 0030819669 scopus 로고    scopus 로고
    • Empirical support for Winnow and Weighted-Majority based algorithms: Results on a calendar scheduling domain
    • A. Blum. Empirical support for Winnow and Weighted-Majority based algorithms: Results on a calendar scheduling domain. Machine Learning, 26:5-23, 1997.
    • (1997) Machine Learning , vol.26 , pp. 5-23
    • Blum, A.1
  • 7
    • 0037614825 scopus 로고    scopus 로고
    • Potential-based algorithms in on-line prediction and game theory
    • N. Cesa-Bianchi and G. Lugosi. Potential-based algorithms in on-line prediction and game theory. Machine Learning, 51(3):239-261, 2003.
    • (2003) Machine Learning , vol.51 , Issue.3 , pp. 239-261
    • Cesa-Bianchi, N.1    Lugosi, G.2
  • 11
    • 0001345686 scopus 로고    scopus 로고
    • Context-sensitive learning methods for text categorization
    • W. Cohen and Y. Singer. Context-sensitive learning methods for text categorization. ACM Transactions on Information Systems, 17(2):141-173, 1999.
    • (1999) ACM Transactions on Information Systems , vol.17 , Issue.2 , pp. 141-173
    • Cohen, W.1    Singer, Y.2
  • 14
    • 0002095886 scopus 로고
    • A randomization rule for selecting forecasts
    • July-August
    • D. Foster and R. Vohra. A randomization rule for selecting forecasts. Operations Research, 41(4): 704-709, July-August 1993.
    • (1993) Operations Research , vol.41 , Issue.4 , pp. 704-709
    • Foster, D.1    Vohra, R.2
  • 15
    • 0031256578 scopus 로고    scopus 로고
    • Calibrated learning and correlated equilibrium
    • D. Foster and R. Vohra. Calibrated learning and correlated equilibrium. Games and Economic Behavior, 21:40-55, 1997.
    • (1997) Games and Economic Behavior , vol.21 , pp. 40-55
    • Foster, D.1    Vohra, R.2
  • 16
    • 0037539108 scopus 로고    scopus 로고
    • Asymptotic calibration
    • D. Foster and R. Vohra. Asymptotic calibration. Biometrika, 85:379-390, 1998.
    • (1998) Biometrika , vol.85 , pp. 379-390
    • Foster, D.1    Vohra, R.2
  • 17
  • 18
    • 0031211090 scopus 로고    scopus 로고
    • A decision-theoretic generalization of on-line learning and an application to boosting
    • Y. Freund and R.E. Schapire. A decision-theoretic generalization of on-line learning and an application to boosting. Journal of Computer and System Sciences, 55(1):119-139, 1997.
    • (1997) Journal of Computer and System Sciences , vol.55 , Issue.1 , pp. 119-139
    • Freund, Y.1    Schapire, R.E.2
  • 19
    • 0002267135 scopus 로고    scopus 로고
    • Adaptive game playing using multiplicative weights
    • Y. Freund and R.E. Schapire. Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29:79-103, 1999.
    • (1999) Games and Economic Behavior , vol.29 , pp. 79-103
    • Freund, Y.1    Schapire, R.E.2
  • 21
    • 0001976283 scopus 로고
    • Approximation to Bayes risk in repeated plays
    • M. Dresher, A. Tucker, and P. Wolfe, editors, Princeton University Press
    • J. Hannan. Approximation to Bayes risk in repeated plays. In M. Dresher, A. Tucker, and P. Wolfe, editors, Contributions to the Theory of Games, volume 3, pages 97-139. Princeton University Press, 1957.
    • (1957) Contributions to the Theory of Games , vol.3 , pp. 97-139
    • Hannan, J.1
  • 22
    • 0000908510 scopus 로고    scopus 로고
    • A simple adaptive procedure leading to correlated equilibrium
    • S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127-1150, 2000.
    • (2000) Econometrica , vol.68 , pp. 1127-1150
    • Hart, S.1    Mas-Colell, A.2
  • 23
    • 0242684983 scopus 로고    scopus 로고
    • A reinforcement procedure leading to correlated equilibrium
    • Wilhelm Neuefeind Gerard Debreu and Walter Trockel, editors, Springer
    • S. Hart and A. Mas-Colell. A reinforcement procedure leading to correlated equilibrium. In Wilhelm Neuefeind Gerard Debreu and Walter Trockel, editors, Economic Essays, pages 181-200. Springer, 2001.
    • (2001) Economic Essays , pp. 181-200
    • Hart, S.1    Mas-Colell, A.2
  • 24
    • 0038404996 scopus 로고    scopus 로고
    • A wide range no-regret theorem
    • E. Lehrer. A wide range no-regret theorem. Games and Economic Behavior, 42:101-115, 2003.
    • (2003) Games and Economic Behavior , vol.42 , pp. 101-115
    • Lehrer, E.1
  • 25
    • 34250091945 scopus 로고
    • Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm
    • N. Littlestone. Learning quickly when irrelevant attributes abound: A new linear-threshold algorithm. Machine Learning, 2:285-318, 1988.
    • (1988) Machine Learning , vol.2 , pp. 285-318
    • Littlestone, N.1
  • 28
    • 21244487467 scopus 로고    scopus 로고
    • Internal regret in on-line portfolio selection
    • G. Stoltz and G. Lugosi. Internal regret in on-line portfolio selection. Machine Learning, 59(1-2): 125-159, 2005.
    • (2005) Machine Learning , vol.59 , Issue.1-2 , pp. 125-159
    • Stoltz, G.1    Lugosi, G.2
  • 29
    • 33947600544 scopus 로고    scopus 로고
    • Learning correlated equilibria in games with compact sets of strategies
    • G. Stoltz and G. Lugosi. Learning correlated equilibria in games with compact sets of strategies. Games and Economic Behavior, 59:187-209, 2007.
    • (2007) Games and Economic Behavior , vol.59 , pp. 187-209
    • Stoltz, G.1    Lugosi, G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.