SCOPUS 정보 검색 플랫폼

Volumn 4005 LNAI, Issue , 2006, Pages 529-543

Online learning with constraints

Author keywords

[No Author keywords available]

Indexed keywords

CONSTRAINT THEORY; DECISION MAKING; FORECASTING; ONLINE SYSTEMS; OPTIMIZATION;

CONVEX HULL; ONLINE LEARNING; REWARD-IN-HINDSIGHT FUNCTION;

LEARNING ALGORITHMS;

EID: 33746094276 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/11776420_39 Document Type: Conference Paper

Times cited : (12)

References (11)

1
- 0001976283
- Approximation to Bayes Risk in Repeated Play. Princeton University Press
- J. Hannan. Approximation to Bayes Risk in Repeated Play, volume III of Contribution to The Theory of Games, pages 97-139. Princeton University Press, 1957.
- (1957) Contribution to the Theory of Games , vol.3 , pp. 97-139
- Hannan, J.¹

2
- 79960013704
- A geometric approach to multi-criterion reinforcement learning
- S. Mannor and N. Shimkin. A geometric approach to multi-criterion reinforcement learning. Journal of Machine Learning Research, 5:325-360, 2004.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 325-360
- Mannor, S.¹ Shimkin, N.²

3
- 0003989208
- Chapman and Hall
- E. Altman. Constrained Markov Decision Processes. Chapman and Hall, 1999.
- (1999) Constrained Markov Decision Processes
- Altman, E.¹

5
- 84972545864
- An analog of the minimax theorem for vector payoffs
- D. Blackwell. An analog of the minimax theorem for vector payoffs. Pacific J. Math., 6(1):1-8, 1956.
- (1956) Pacific J. Math. , vol.6 , Issue.1 , pp. 1-8
- Blackwell, D.¹

6
- 0036474456
- A necessary and sufficient condition for approachability
- X. Spinat. A necessary and sufficient condition for approachability. Mathematics of Operations Research, 27(1):31-44, 2002.
- (2002) Mathematics of Operations Research , vol.27 , Issue.1 , pp. 31-44
- Spinat, X.¹

8
- 0038386340
- The empirical Bayes envelope and regret minimization in competitive Markov decision processes
- S. Mannor and N. Shimkin. The empirical Bayes envelope and regret minimization in competitive Markov decision processes. Mathematics of Operations Research, 28(2):327-345, 2003.
- (2003) Mathematics of Operations Research , vol.28 , Issue.2 , pp. 327-345
- Mannor, S.¹ Shimkin, N.²

9
- 0004278770
- CORE Reprint Dps 9420, 9421 and 9422, Center for Operation Research and Econometrics, Universite Catholique De Louvain, Belgium
- J. F. Mertens, S. Sorin, and S. Zamir. Repeated games. CORE Reprint Dps 9420, 9421 and 9422, Center for Operation Research and Econometrics, Universite Catholique De Louvain, Belgium, 1994.
- (1994) Repeated Games
- Mertens, J.F.¹ Sorin, S.² Zamir, S.³

10
- 0031256578
- Calibrated learning and correlated equilibrium
- D. P. Foster and R. V. Vohra. Calibrated learning and correlated equilibrium. Games and Economic Behavior, 21:40-55, 1997.
- (1997) Games and Economic Behavior , vol.21 , pp. 40-55
- Foster, D.P.¹ Vohra, R.V.²

11
- 84926078662
- Cambridge University Press, New York
- N. Cesa-Bianchi and G. Lugosi. Prediction, Learning, and Games. Cambridge University Press, New York, 2006.
- (2006) Prediction, Learning, and Games
- Cesa-Bianchi, N.¹ Lugosi, G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.