메뉴 건너뛰기




Volumn 31, Issue 3, 2006, Pages 562-580

Regret minimization under partial monitoring

Author keywords

Hannan consistency; Imperfect monitoring; Internal regret; Repeated games

Indexed keywords

CONDITION MONITORING; CONVERGENCE OF NUMERICAL METHODS; FEEDBACK; GAME THEORY; PROBABILITY;

EID: 33748442333     PISSN: 0364765X     EISSN: 15265471     Source Type: Journal    
DOI: 10.1287/moor.1060.0206     Document Type: Article
Times cited : (113)

References (44)
  • 1
    • 0041966002 scopus 로고    scopus 로고
    • Using confidence bounds for exploitation-exploration trade-offs
    • Auer, P. 2002. Using confidence bounds for exploitation-exploration trade-offs. J. Machine Learn. Res. 3 397-422.
    • (2002) J. Machine Learn. Res. , vol.3 , pp. 397-422
    • Auer, P.1
  • 3
    • 0036477185 scopus 로고    scopus 로고
    • Adaptive and self-confident on-line learning algorithms
    • Auer, P., N. Cesa-Bianchi, C. Gentile. 2002. Adaptive and self-confident on-line learning algorithms. J. Comput. System Sci. 64 48-75.
    • (2002) J. Comput. System Sci. , vol.64 , pp. 48-75
    • Auer, P.1    Cesa-Bianchi, N.2    Gentile, C.3
  • 5
    • 84972574511 scopus 로고
    • Weighted sums of certain dependent random variables
    • Azuma, K. 1967. Weighted sums of certain dependent random variables. Tohoku Math. J. 68 357-367.
    • (1967) Tohoku Math. J. , vol.68 , pp. 357-367
    • Azuma, K.1
  • 6
    • 0038623721 scopus 로고
    • On pseudo-games
    • Baños, A. 1968. On pseudo-games. Ann. Math. Statist. 39 1932-1945.
    • (1968) Ann. Math. Statist. , vol.39 , pp. 1932-1945
    • Baños, A.1
  • 7
    • 84972545864 scopus 로고
    • An analog of the minimax theorem for vector payoffs
    • Blackwell, D. 1956. An analog of the minimax theorem for vector payoffs. Pacific J. Math. 6 1-8.
    • (1956) Pacific J. Math. , vol.6 , pp. 1-8
    • Blackwell, D.1
  • 11
    • 0033234631 scopus 로고    scopus 로고
    • On prediction of individual sequences
    • Cesa-Bianchi, N., G. Lugosi. 1999. On prediction of individual sequences. Ann. Statist. 27 1865-1895.
    • (1999) Ann. Statist. , vol.27 , pp. 1865-1895
    • Cesa-Bianchi, N.1    Lugosi, G.2
  • 12
    • 0037614825 scopus 로고    scopus 로고
    • Potential-based algorithms in on-line prediction and game theory
    • Cesa-Bianchi, N., G. Lugosi. 2003. Potential-based algorithms in on-line prediction and game theory. Machine Learn. 51 239-261.
    • (2003) Machine Learn. , vol.51 , pp. 239-261
    • Cesa-Bianchi, N.1    Lugosi, G.2
  • 18
    • 0031256578 scopus 로고    scopus 로고
    • Calibrated learning and correlated equilibrium
    • Foster, D., R. Vohra. 1997. Calibrated learning and correlated equilibrium. Games Econom. Behav. 21 40-55.
    • (1997) Games Econom. Behav. , vol.21 , pp. 40-55
    • Foster, D.1    Vohra, R.2
  • 19
    • 0037539108 scopus 로고    scopus 로고
    • Asymptotic calibration
    • Foster, D., R. Vohra. 1998. Asymptotic calibration. Biometrika 85 379-390.
    • (1998) Biometrika , vol.85 , pp. 379-390
    • Foster, D.1    Vohra, R.2
  • 20
    • 0002476325 scopus 로고    scopus 로고
    • Regret in the on-line decision problem
    • Foster, D., R. Vohra. 1999. Regret in the on-line decision problem. Games Econom. Behav. 29 7-36.
    • (1999) Games Econom. Behav. , vol.29 , pp. 7-36
    • Foster, D.1    Vohra, R.2
  • 21
    • 0002384441 scopus 로고
    • On tail probabilities for martingales
    • Freedman, D. A. 1975. On tail probabilities for martingales. Ann. Probab. 3 100-118.
    • (1975) Ann. Probab. , vol.3 , pp. 100-118
    • Freedman, D.A.1
  • 22
    • 0000668347 scopus 로고
    • Universal consistency and cautious fictitious play
    • Fudenberg, D., D. K. Levine. 1995. Universal consistency and cautious fictitious play. J. Econom. Dynam. Control 19 1065-1089.
    • (1995) J. Econom. Dynam. Control , vol.19 , pp. 1065-1089
    • Fudenberg, D.1    Levine, D.K.2
  • 24
    • 0001976283 scopus 로고
    • Approximation to Bayes risk in repeated play
    • M. Dresher, A. W. Tucker, P. Wolfe, eds. Princeton University Press, Princeton, NJ
    • Hannan, J. 1957. Approximation to Bayes risk in repeated play. M. Dresher, A. W. Tucker, P. Wolfe, eds. Contributions to the Theory of Games, Vol. 3. Princeton University Press, Princeton, NJ, 97-139.
    • (1957) Contributions to the Theory of Games , vol.3 , pp. 97-139
    • Hannan, J.1
  • 25
    • 0000908510 scopus 로고    scopus 로고
    • A simple adaptive procedure leading to correlated equilibrium
    • Hart, S., A. Mas-Colell. 2000. A simple adaptive procedure leading to correlated equilibrium. Econometrica 68 1127-1150.
    • (2000) Econometrica , vol.68 , pp. 1127-1150
    • Hart, S.1    Mas-Colell, A.2
  • 26
    • 0013327463 scopus 로고    scopus 로고
    • A general class of adaptive strategies
    • Hart, S., A. Mas-Colell. 2001. A general class of adaptive strategies. J. Econom. Theory 98 26-54.
    • (2001) J. Econom. Theory , vol.98 , pp. 26-54
    • Hart, S.1    Mas-Colell, A.2
  • 27
    • 0242684983 scopus 로고    scopus 로고
    • A reinforcement procedure leading to correlated equilibrium
    • G. Debreu, W. Neuefeind, W. Trockel, eds. Springer, New York
    • Hart, S., A. Mas-Colell. 2002. A reinforcement procedure leading to correlated equilibrium. G. Debreu, W. Neuefeind, W. Trockel, eds. Economic Essays: A Festschrift for Werner Hildenbrand. Springer, New York, 181-200.
    • (2002) Economic Essays: A Festschrift for Werner Hildenbrand , pp. 181-200
    • Hart, S.1    Mas-Colell, A.2
  • 30
    • 84947403595 scopus 로고
    • Probability inequalities for sums of bounded random variables
    • Hoeffding, W. 1963. Probability inequalities for sums of bounded random variables. J. Amer. Statist. Assoc. 58 13-30.
    • (1963) J. Amer. Statist. Assoc. , vol.58 , pp. 13-30
    • Hoeffding, W.1
  • 31
    • 0345412655 scopus 로고    scopus 로고
    • The value of knowing a demand curve: Bounds on regret for on-line posted-price auctions
    • IEEE Press, Piscataway, NJ
    • Kleinberg, R., T. Leighton. 2003. The value of knowing a demand curve: Bounds on regret for on-line posted-price auctions. Proc. 44th Annual IEEE Sympos. Foundations Comput. Sci. IEEE Press, Piscataway, NJ, 594-605.
    • (2003) Proc. 44th Annual IEEE Sympos. Foundations Comput. Sci. , pp. 594-605
    • Kleinberg, R.1    Leighton, T.2
  • 32
  • 33
    • 9444255069 scopus 로고    scopus 로고
    • On-line learning with imperfect monitoring
    • Springer, New York
    • Mannor, S., N. Shimkin. 2003. On-line learning with imperfect monitoring. Proc. 16th Annual Conf. Learn. Theory, Springer, New York, 552-567.
    • (2003) Proc. 16th Annual Conf. Learn. Theory , pp. 552-567
    • Mannor, S.1    Shimkin, N.2
  • 35
    • 0038675791 scopus 로고
    • On repeated games with incomplete information played by non-Bayesian players
    • Megiddo, N. 1980. On repeated games with incomplete information played by non-Bayesian players. Internat. J. Game Theory 9 157-167.
    • (1980) Internat. J. Game Theory , vol.9 , pp. 157-167
    • Megiddo, N.1
  • 37
    • 0003351019 scopus 로고
    • Repeated games
    • 9421, CORE, Louvain-la-Neuve, Belgium
    • Mertens, J.-F., S. Sorin, S. Zamir. 1994. Repeated games. Discussion Paper 9420, 9421, 9422, CORE, Louvain-la-Neuve, Belgium.
    • (1994) Discussion Paper , vol.9420-9422
    • Mertens, J.-F.1    Sorin, S.2    Zamir, S.3
  • 39
    • 0013327190 scopus 로고    scopus 로고
    • Minimizing regret: The general case
    • Rustichini, A. 1999. Minimizing regret: The general case. Games Econom. Behav. 29 224-243.
    • (1999) Games Econom. Behav. , vol.29 , pp. 224-243
    • Rustichini, A.1
  • 40
    • 21244487467 scopus 로고    scopus 로고
    • Internal regret in on-line portfolio selection
    • Stoltz, G., G. Lugosi. 2005. Internal regret in on-line portfolio selection. Machine Learn. 59 125-159.
    • (2005) Machine Learn. , vol.59 , pp. 125-159
    • Stoltz, G.1    Lugosi, G.2
  • 42
    • 0035413537 scopus 로고    scopus 로고
    • Competitive on-line statistics
    • Vovk, V. G. 2001. Competitive on-line statistics. Internat. Statist. Rev. 69 213-248.
    • (2001) Internat. Statist. Rev. , vol.69 , pp. 213-248
    • Vovk, V.G.1
  • 43
    • 0035443342 scopus 로고    scopus 로고
    • Universal prediction of binary individual sequences in the presence of noise
    • Weissman, T., N. Merhav. 2001. Universal prediction of binary individual sequences in the presence of noise. IEEE Trans. Inform. Theory 47 2151-2173.
    • (2001) IEEE Trans. Inform. Theory , vol.47 , pp. 2151-2173
    • Weissman, T.1    Merhav, N.2
  • 44
    • 0035397523 scopus 로고    scopus 로고
    • Twofold universal prediction schemes for achieving the finite state predictability of a noisy individual binary sequence
    • Weissman, T., N. Merhav, A. Somekh-Baruch. 2001. Twofold universal prediction schemes for achieving the finite state predictability of a noisy individual binary sequence. IEEE Trans. Inform. Theory 47 1849-1866.
    • (2001) IEEE Trans. Inform. Theory , vol.47 , pp. 1849-1866
    • Weissman, T.1    Merhav, N.2    Somekh-Baruch, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.