Volume 12, 2011, Pages 1896-1921

Internal regret with partial monitoring: Calibration-based optimal algorithms

Author keywords

Calibration; Online learning; Partial monitoring; Regret; Repeated games; Voronoi and Laguerre diagrams

Indexed keywords

DECISION MAKING

EID: 79960129843     PISSN: 15324435     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Article
Times cited: 23

References (32)
  • 2
    • 0037520770
    • A criterion for the affine equivalence of cell complexes in R^d and convex polyhedra in R^{d+1}
    • F. Aurenhammer. A criterion for the affine equivalence of cell complexes in R^d and convex polyhedra in R^{d+1}. Discrete Comput. Geom., 2:49-64, 1987.
    • (1987) Discrete Comput. Geom. , vol.2 , pp. 49-64
    • Aurenhammer, F.1
  • 3
    • 84972574511
    • Weighted sums of certain dependent random variables
    • K. Azuma. Weighted sums of certain dependent random variables. Tôhoku Math. J. (2), 19:357-367, 1967.
    • (1967) Tôhoku Math. J. (2) , vol.19 , pp. 357-367
    • Azuma, K.1
  • 5
    • 84972545864
    • An analog of the minimax theorem for vector payoffs
    • D. Blackwell. An analog of the minimax theorem for vector payoffs. Pacific J. Math., 6:1-8, 1956a.
    • (1956) Pacific J. Math. , vol.6 , pp. 1-8
    • Blackwell, D.1
  • 8
    • 0039956166
    • Partition of space
    • R. C. Buck. Partition of space. Amer. Math. Monthly, 50:541-544, 1943.
    • (1943) Amer. Math. Monthly , vol.50 , pp. 541-544
    • Buck, R.C.1
  • 11
    • 0030544315
    • Laws of large numbers for Hilbert space-valued mixingales with applications
    • X. Chen and H. White. Laws of large numbers for Hilbert space-valued mixingales with applications. Econometric Theory, 12:284-304, 1996.
    • (1996) Econometric Theory , vol.12 , pp. 284-304
    • Chen, X.1    White, H.2
  • 12
    • 84950454029
    • The well-calibrated Bayesian
    • A. P. Dawid. The well-calibrated Bayesian. J. Amer. Statist. Assoc., 77:605-613, 1982.
    • (1982) J. Amer. Statist. Assoc. , vol.77 , pp. 605-613
    • Dawid, A.P.1
  • 13
    • 0031256578
    • Calibrated learning and correlated equilibrium
    • DOI 10.1006/game.1997.0595, PII S0899825697905959
    • D. P. Foster and R. V. Vohra. Calibrated learning and correlated equilibrium. Games Econom. Behav., 21:40-55, 1997. (Pubitemid 127175523)
    • (1997) Games and Economic Behavior , vol.21 , Issue.1-2 , pp. 40-55
    • Foster, D.P.1    Vohra, R.V.2
  • 14
    • 0037539108
    • Asymptotic calibration
    • D. P. Foster and R. V. Vohra. Asymptotic calibration. Biometrika, 85:379-390, 1998.
    • (1998) Biometrika , vol.85 , pp. 379-390
    • Foster, D.P.1    Vohra, R.V.2
  • 15
    • 0002384441
    • On tail probabilities for martingales
    • D. A. Freedman. On tail probabilities for martingales. Ann. Probability, 3:100-118, 1975.
    • (1975) Ann. Probability , vol.3 , pp. 100-118
    • Freedman, D.A.1
  • 16
  • 18
    • 0000908510
    • A simple adaptive procedure leading to correlated equilibrium
    • S. Hart and A. Mas-Colell. A simple adaptive procedure leading to correlated equilibrium. Econometrica, 68:1127-1150, 2000.
    • (2000) Econometrica , vol.68 , pp. 1127-1150
    • Hart, S.1    Mas-Colell, A.2
  • 19
    • 84947403595
    • Probability inequalities for sums of bounded random variables
    • W. Hoeffding. Probability inequalities for sums of bounded random variables. J. Amer. Statist. Assoc., 58:13-30, 1963.
    • (1963) J. Amer. Statist. Assoc. , vol.58 , pp. 13-30
    • Hoeffding, W.1
  • 20
    • 77951952841
    • Near-optimal regret bounds for reinforcement learning
    • T. Jaksch, R. Ortner, and P. Auer. Near-optimal regret bounds for reinforcement learning. J. Mach. Learn. Res., 11:1563-1600, 2010.
    • (2010) J. Mach. Learn. Res. , vol.11 , pp. 1563-1600
    • Jaksch, T.1    Ortner, R.2    Auer, P.3
  • 23
    • 61349116274
    • Strategies for prediction under imperfect monitoring
    • G. Lugosi, S. Mannor, and G. Stoltz. Strategies for prediction under imperfect monitoring. Math. Oper. Res., 33:513-528, 2008.
    • (2008) Math. Oper. Res. , vol.33 , pp. 513-528
    • Lugosi, G.1    Mannor, S.2    Stoltz, G.3
  • 25
    • 0030523539
    • Projections of polytopes and the generalized Baues conjecture
    • J. Rambau and G. M. Ziegler. Projections of polytopes and the generalized Baues conjecture. Discrete Comput. Geom., 16:215-237, 1996. (Pubitemid 126317943)
    • (1996) Discrete and Computational Geometry , vol.16 , Issue.3 , pp. 215-237
    • Rambau, J.1    Ziegler, G.M.2
  • 27
    • 0013327190
    • Minimizing regret: The general case
    • A. Rustichini. Minimizing regret: the general case. Games Econom. Behav., 29:224-243, 1999.
    • (1999) Games Econom. Behav. , vol.29 , pp. 224-243
    • Rustichini, A.1
  • 28
    • 0003570325
    • Springer Series in Statistics. Springer-Verlag, New York, second edition
    • E. Seneta. Nonnegative Matrices and Markov Chains. Springer Series in Statistics. Springer-Verlag, New York, second edition, 1981.
    • (1981) Nonnegative Matrices and Markov Chains
    • Seneta, E.1
  • 29
    • 0040104631
    • Supergames
    • Econom. Theory Econometrics Math. Econom., Academic Press, San Diego, CA
    • S. Sorin. Supergames. In Game theory and applications (Columbus, OH, 1987), Econom. Theory Econometrics Math. Econom., pages 46-63. Academic Press, San Diego, CA, 1990.
    • (1990) Game Theory and Applications (Columbus, OH 1987) , pp. 46-63
    • Sorin, S.1
  • 31
    • 0000836223
    • Exponential inequalities for sums of random vectors
    • V. Yurinskii. Exponential inequalities for sums of random vectors. Journal of Multivariate Analysis, 6:473-499, 1976.
    • (1976) Journal of Multivariate Analysis , vol.6 , pp. 473-499
    • Yurinskii, V.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.