SCOPUS Information Retrieval Platform

Mathematics of Operations Research

Volume 31, Issue 3, 2006, Pages 562-580

Regret minimization under partial monitoring

(3) Cesa-Bianchi, Nicolò a; Lugosi, Gábor b; Stoltz, Gilles c

a UNIVERSITY OF MILAN (Italy)

b UNIVERSITAT POMPEU FABRA (Spain)

c ECOLE NORMALE SUPÉRIEURE (France)

Author keywords

Hannan consistency; Imperfect monitoring; Internal regret; Repeated games

Indexed keywords

CONDITION MONITORING; CONVERGENCE OF NUMERICAL METHODS; FEEDBACK; GAME THEORY; PROBABILITY;

HANNAN CONSISTENCY; IMPERFECT MONITORING; INTERNAL REGRET; REPEATED GAMES;

PROFESSIONAL ASPECTS;

EID: 33748442333 PISSN: 0364765X EISSN: 15265471 Source Type: Journal
DOI: 10.1287/moor.1060.0206 Document Type: Article

Times cited: (113)

References (44)

1
- 0041966002
- Using confidence bounds for exploitation-exploration trade-offs
- Auer, P. 2002. Using confidence bounds for exploitation-exploration trade-offs. J. Machine Learn. Res. 3 397-422.
- (2002) J. Machine Learn. Res. , vol.3 , pp. 397-422
- Auer, P.¹

2
- 33748450056
- A preliminary version appeared
- A preliminary version appeared in Proc. 41st Annual Sympos. Foundations Comput. Sci.
- Proc. 41st Annual Sympos. Foundations Comput. Sci.

3
- 0036477185
- Adaptive and self-confident on-line learning algorithms
- Auer, P., N. Cesa-Bianchi, C. Gentile. 2002. Adaptive and self-confident on-line learning algorithms. J. Comput. System Sci. 64 48-75.
- (2002) J. Comput. System Sci. , vol.64 , pp. 48-75
- Auer, P.¹ Cesa-Bianchi, N.² Gentile, C.³

4
- 0037709910
- The nonstochastic multiarmed bandit problem
- Auer, P., N. Cesa-Bianchi, Y. Freund, R. E. Schapire. 2002. The nonstochastic multiarmed bandit problem. SIAM J. Comput. 32 48-77.
- (2002) SIAM J. Comput. , vol.32 , pp. 48-77
- Auer, P.¹ Cesa-Bianchi, N.² Freund, Y.³ Schapire, R.E.⁴

5
- 84972574511
- Weighted sums of certain dependent random variables
- Azuma, K. 1967. Weighted sums of certain dependent random variables. Tohoku Math. J. 19 357-367.
- (1967) Tohoku Math. J. , vol.19 , pp. 357-367
- Azuma, K.¹

6
- 0038623721
- On pseudo-games
- Baños, A. 1968. On pseudo-games. Ann. Math. Statist. 39 1932-1945.
- (1968) Ann. Math. Statist. , vol.39 , pp. 1932-1945
- Baños, A.¹

7
- 84972545864
- An analog of the minimax theorem for vector payoffs
- Blackwell, D. 1956. An analog of the minimax theorem for vector payoffs. Pacific J. Math. 6 1-8.
- (1956) Pacific J. Math. , vol.6 , pp. 1-8
- Blackwell, D.¹

8
- 20744457866
- Near-optimal online auctions
- Blum, A., J. Hartline. 2005. Near-optimal online auctions. Proc. 16th ACM-SIAM Sympos. Discrete Algorithms, 1156-1163.
- (2005) Proc. 16th ACM-SIAM Sympos. Discrete Algorithms , pp. 1156-1163
- Blum, A.¹ Hartline, J.²

9
- 26944476270
- From external to internal regret
- Blum, A., Y. Mansour. 2005. From external to internal regret. Proc. 18th Annual Conf. Comput. Learn. Theory, Springer, 621-636.
- (2005) Proc. 18th Annual Conf. Comput. Learn. Theory , pp. 621-636
- Blum, A.¹ Mansour, Y.²

10
- 4444253732
- Online learning in online auctions
- Blum, A., V. Kumar, A. Rudra, F. Wu. 2004. Online learning in online auctions. Theoret. Comput. Sci. 324 137-146.
- (2004) Theoret. Comput. Sci. , vol.324 , pp. 137-146
- Blum, A.¹ Kumar, V.² Rudra, A.³ Wu, F.⁴

11
- 0033234631
- On prediction of individual sequences
- Cesa-Bianchi, N., G. Lugosi. 1999. On prediction of individual sequences. Ann. Statist. 27 1865-1895.
- (1999) Ann. Statist. , vol.27 , pp. 1865-1895
- Cesa-Bianchi, N.¹ Lugosi, G.²

12
- 0037614825
- Potential-based algorithms in on-line prediction and game theory
- Cesa-Bianchi, N., G. Lugosi. 2003. Potential-based algorithms in on-line prediction and game theory. Machine Learn. 51 239-261.
- (2003) Machine Learn. , vol.51 , pp. 239-261
- Cesa-Bianchi, N.¹ Lugosi, G.²

13
- 84926078662
- Cambridge University Press, Cambridge, UK
- Cesa-Bianchi, N., G. Lugosi. 2006. Prediction, Learning, and Games. Cambridge University Press, Cambridge, UK.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

14
- 20544462399
- Minimizing regret with label efficient prediction
- Cesa-Bianchi, N., G. Lugosi, G. Stoltz. 2005. Minimizing regret with label efficient prediction. IEEE Trans. Inform. Theory 51 2152-2162.
- (2005) IEEE Trans. Inform. Theory , vol.51 , pp. 2152-2162
- Cesa-Bianchi, N.¹ Lugosi, G.² Stoltz, G.³

15
- 0031140246
- How to use expert advice
- Cesa-Bianchi, N., Y. Freund, D. P. Helmbold, D. Haussler, R. Schapire, M. K. Warmuth. 1997. How to use expert advice. J. ACM 44 427-485.
- (1997) J. ACM , vol.44 , pp. 427-485
- Cesa-Bianchi, N.¹ Freund, Y.² Helmbold, D.P.³ Haussler, D.⁴ Schapire, R.⁵ Warmuth, M.K.⁶

16
- 84889281816
- John Wiley, New York
- Cover, T. M., J. A. Thomas. 1991. Elements of Information Theory. John Wiley, New York.
- (1991) Elements of Information Theory
- Cover, T.M.¹ Thomas, J.A.²

17
- 84882281845
- Universal prediction of individual sequences
- Feder, M., N. Merhav, M. Gutman. 1992. Universal prediction of individual sequences. IEEE Trans. Inform. Theory 38 1258-1270.
- (1992) IEEE Trans. Inform. Theory , vol.38 , pp. 1258-1270
- Feder, M.¹ Merhav, N.² Gutman, M.³

18
- 0031256578
- Calibrated learning and correlated equilibrium
- Foster, D., R. Vohra. 1997. Calibrated learning and correlated equilibrium. Games Econom. Behav. 21 40-55.
- (1997) Games Econom. Behav. , vol.21 , pp. 40-55
- Foster, D.¹ Vohra, R.²

19
- 0037539108
- Asymptotic calibration
- Foster, D., R. Vohra. 1998. Asymptotic calibration. Biometrika 85 379-390.
- (1998) Biometrika , vol.85 , pp. 379-390
- Foster, D.¹ Vohra, R.²

20
- 0002476325
- Regret in the on-line decision problem
- Foster, D., R. Vohra. 1999. Regret in the on-line decision problem. Games Econom. Behav. 29 7-36.
- (1999) Games Econom. Behav. , vol.29 , pp. 7-36
- Foster, D.¹ Vohra, R.²

21
- 0002384441
- On tail probabilities for martingales
- Freedman, D. A. 1975. On tail probabilities for martingales. Ann. Probab. 3 100-118.
- (1975) Ann. Probab. , vol.3 , pp. 100-118
- Freedman, D.A.¹

22
- 0000668347
- Universal consistency and cautious fictitious play
- Fudenberg, D., D. K. Levine. 1995. Universal consistency and cautious fictitious play. J. Econom. Dynam. Control 19 1065-1089.
- (1995) J. Econom. Dynam. Control , vol.19 , pp. 1065-1089
- Fudenberg, D.¹ Levine, D.K.²

23
- 0004247096
- MIT Press, Boston, MA
- Fudenberg, D., D. K. Levine. 1998. The Theory of Learning in Games. MIT Press, Boston, MA.
- (1998) The Theory of Learning in Games
- Fudenberg, D.¹ Levine, D.K.²

24
- 0001976283
- Approximation to Bayes risk in repeated play
- M. Dresher, A. W. Tucker, P. Wolfe, eds. Princeton University Press, Princeton, NJ
- Hannan, J. 1957. Approximation to Bayes risk in repeated play. M. Dresher, A. W. Tucker, P. Wolfe, eds. Contributions to the Theory of Games, Vol. 3. Princeton University Press, Princeton, NJ, 97-139.
- (1957) Contributions to the Theory of Games , vol.3 , pp. 97-139
- Hannan, J.¹

25
- 0000908510
- A simple adaptive procedure leading to correlated equilibrium
- Hart, S., A. Mas-Colell. 2000. A simple adaptive procedure leading to correlated equilibrium. Econometrica 68 1127-1150.
- (2000) Econometrica , vol.68 , pp. 1127-1150
- Hart, S.¹ Mas-Colell, A.²

26
- 0013327463
- A general class of adaptive strategies
- Hart, S., A. Mas-Colell. 2001. A general class of adaptive strategies. J. Econom. Theory 98 26-54.
- (2001) J. Econom. Theory , vol.98 , pp. 26-54
- Hart, S.¹ Mas-Colell, A.²

27
- 0242684983
- A reinforcement procedure leading to correlated equilibrium
- G. Debreu, W. Neuefeind, W. Trockel, eds. Springer, New York
- Hart, S., A. Mas-Colell. 2002. A reinforcement procedure leading to correlated equilibrium. G. Debreu, W. Neuefeind, W. Trockel, eds. Economic Essays: A Festschrift for Werner Hildenbrand. Springer, New York, 181-200.
- (2002) Economic Essays: A Festschrift for Werner Hildenbrand , pp. 181-200
- Hart, S.¹ Mas-Colell, A.²

28
- 0030707345
- Some label efficient learning results
- ACM Press, New York
- Helmbold, D. P., S. Panizza. 1997. Some label efficient learning results. Proc. 10th Annual Conf. Comput. Learn. Theory, ACM Press, New York, 218-230.
- (1997) Proc. 10th Annual Conf. Comput. Learn. Theory , pp. 218-230
- Helmbold, D.P.¹ Panizza, S.²

29
- 0034666805
- Apple tasting
- Helmbold, D. P., N. Littlestone, P. M. Long. 2000. Apple tasting. Inform. Comput. 161 85-139.
- (2000) Inform. Comput. , vol.161 , pp. 85-139
- Helmbold, D.P.¹ Littlestone, N.² Long, P.M.³

30
- 84947403595
- Probability inequalities for sums of bounded random variables
- Hoeffding, W. 1963. Probability inequalities for sums of bounded random variables. J. Amer. Statist. Assoc. 58 13-30.
- (1963) J. Amer. Statist. Assoc. , vol.58 , pp. 13-30
- Hoeffding, W.¹

31
- 0345412655
- The value of knowing a demand curve: Bounds on regret for on-line posted-price auctions
- IEEE Press, Piscataway, NJ
- Kleinberg, R., T. Leighton. 2003. The value of knowing a demand curve: Bounds on regret for on-line posted-price auctions. Proc. 44th Annual IEEE Sympos. Foundations Comput. Sci. IEEE Press, Piscataway, NJ, 594-605.
- (2003) Proc. 44th Annual IEEE Sympos. Foundations Comput. Sci. , pp. 594-605
- Kleinberg, R.¹ Leighton, T.²

32
- 35148838877
- The weighted majority algorithm
- Littlestone, N., M. K. Warmuth. 1994. The weighted majority algorithm. Inform. Comput. 108 212-261.
- (1994) Inform. Comput. , vol.108 , pp. 212-261
- Littlestone, N.¹ Warmuth, M.K.²

33
- 9444255069
- On-line learning with imperfect monitoring
- Springer, New York
- Mannor, S., N. Shimkin. 2003. On-line learning with imperfect monitoring. Proc. 16th Annual Conf. Learn. Theory, Springer, New York, 552-567.
- (2003) Proc. 16th Annual Conf. Learn. Theory , pp. 552-567
- Mannor, S.¹ Shimkin, N.²

34
- 9444248338
- Concentration inequalities and model selection
- Springer, New York
- Massart, P. 2003. Concentration inequalities and model selection. Lectures on Probability Theory and Statistics (Saint-Flour, 2003), Lecture Notes in Mathematics. Springer, New York.
- (2003) Lectures on Probability Theory and Statistics (Saint-Flour, 2003), Lecture Notes in Mathematics
- Massart, P.¹

35
- 0038675791
- On repeated games with incomplete information played by non-Bayesian players
- Megiddo, N. 1980. On repeated games with incomplete information played by non-Bayesian players. Internat. J. Game Theory 9 157-167.
- (1980) Internat. J. Game Theory , vol.9 , pp. 157-167
- Megiddo, N.¹

36
- 0032184197
- Universal prediction
- Merhav, N., M. Feder. 1998. Universal prediction. IEEE Trans. Inform. Theory 44 2124-2147.
- (1998) IEEE Trans. Inform. Theory , vol.44 , pp. 2124-2147
- Merhav, N.¹ Feder, M.²

37
- 0003351019
- Repeated games
- Discussion Paper 9420, 9421, 9422, CORE, Louvain-la-Neuve, Belgium
- Mertens, J.-F., S. Sorin, S. Zamir. 1994. Repeated games. Discussion Paper 9420, 9421, 9422, CORE, Louvain-la-Neuve, Belgium.
- (1994) Discussion Paper , vol.9420-9422
- Mertens, J.-F.¹ Sorin, S.² Zamir, S.³

38
- 84898041886
- Discrete prediction games with arbitrary feedback and loss
- Piccolboni, A., C. Schindelhauer. 2001. Discrete prediction games with arbitrary feedback and loss. Proc. 14th Annual Conf. Comput. Learn. Theory, 208-223.
- (2001) Proc. 14th Annual Conf. Comput. Learn. Theory , pp. 208-223
- Piccolboni, A.¹ Schindelhauer, C.²

39
- 0013327190
- Minimizing regret: The general case
- Rustichini, A. 1999. Minimizing regret: The general case. Games Econom. Behav. 29 224-243.
- (1999) Games Econom. Behav. , vol.29 , pp. 224-243
- Rustichini, A.¹

40
- 21244487467
- Internal regret in on-line portfolio selection
- Stoltz, G., G. Lugosi. 2005. Internal regret in on-line portfolio selection. Machine Learn. 59 125-159.
- (2005) Machine Learn. , vol.59 , pp. 125-159
- Stoltz, G.¹ Lugosi, G.²

41
- 85048665932
- Aggregating strategies
- Vovk, V. G. 1990. Aggregating strategies. Proc. 3rd Annual Workshop Comput. Learn. Theory, 372-383.
- (1990) Proc. 3rd Annual Workshop Comput. Learn. Theory , pp. 372-383
- Vovk, V.G.¹

42
- 0035413537
- Competitive on-line statistics
- Vovk, V. G. 2001. Competitive on-line statistics. Internat. Statist. Rev. 69 213-248.
- (2001) Internat. Statist. Rev. , vol.69 , pp. 213-248
- Vovk, V.G.¹

43
- 0035443342
- Universal prediction of binary individual sequences in the presence of noise
- Weissman, T., N. Merhav. 2001. Universal prediction of binary individual sequences in the presence of noise. IEEE Trans. Inform. Theory 47 2151-2173.
- (2001) IEEE Trans. Inform. Theory , vol.47 , pp. 2151-2173
- Weissman, T.¹ Merhav, N.²

44
- 0035397523
- Twofold universal prediction schemes for achieving the finite state predictability of a noisy individual binary sequence
- Weissman, T., N. Merhav, A. Somekh-Baruch. 2001. Twofold universal prediction schemes for achieving the finite state predictability of a noisy individual binary sequence. IEEE Trans. Inform. Theory 47 1849-1866.
- (2001) IEEE Trans. Inform. Theory , vol.47 , pp. 1849-1866
- Weissman, T.¹ Merhav, N.² Somekh-Baruch, A.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.