메뉴 건너뛰기




Volumn 4005 LNAI, Issue , 2006, Pages 529-543

Online learning with constraints

Author keywords

[No Author keywords available]

Indexed keywords

CONSTRAINT THEORY; DECISION MAKING; FORECASTING; ONLINE SYSTEMS; OPTIMIZATION;

EID: 33746094276     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/11776420_39     Document Type: Conference Paper
Times cited : (12)

References (11)
  • 1
    • 0001976283 scopus 로고
    • Approximation to Bayes Risk in Repeated Play. Princeton University Press
    • J. Hannan. Approximation to Bayes Risk in Repeated Play, volume III of Contribution to The Theory of Games, pages 97-139. Princeton University Press, 1957.
    • (1957) Contribution to the Theory of Games , vol.3 , pp. 97-139
    • Hannan, J.1
  • 2
    • 79960013704 scopus 로고    scopus 로고
    • A geometric approach to multi-criterion reinforcement learning
    • S. Mannor and N. Shimkin. A geometric approach to multi-criterion reinforcement learning. Journal of Machine Learning Research, 5:325-360, 2004.
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 325-360
    • Mannor, S.1    Shimkin, N.2
  • 4
    • 0011853562 scopus 로고
    • Stochastic games with average cost constraints
    • T. Basar and A. Haurie, editors. Birkhauser
    • N. Shimkin. Stochastic games with average cost constraints. In T. Basar and A. Haurie, editors, Advances in Dynamic Games and Applications, pages 219-230. Birkhauser, 1994.
    • (1994) Advances in Dynamic Games and Applications , pp. 219-230
    • Shimkin, N.1
  • 5
    • 84972545864 scopus 로고
    • An analog of the minimax theorem for vector payoffs
    • D. Blackwell. An analog of the minimax theorem for vector payoffs. Pacific J. Math., 6(1):1-8, 1956.
    • (1956) Pacific J. Math. , vol.6 , Issue.1 , pp. 1-8
    • Blackwell, D.1
  • 6
    • 0036474456 scopus 로고    scopus 로고
    • A necessary and sufficient condition for approachability
    • X. Spinat. A necessary and sufficient condition for approachability. Mathematics of Operations Research, 27(1):31-44, 2002.
    • (2002) Mathematics of Operations Research , vol.27 , Issue.1 , pp. 31-44
    • Spinat, X.1
  • 7
    • 0013371249 scopus 로고
    • Controlled random walks
    • North Holland, Amsterdam
    • D. Blackwell. Controlled random walks. In Proc. Int. Congress of Mathematicians 1954, volume 3, pages 336-338. North Holland, Amsterdam, 1956.
    • (1956) Proc. Int. Congress of Mathematicians 1954 , vol.3 , pp. 336-338
    • Blackwell, D.1
  • 8
    • 0038386340 scopus 로고    scopus 로고
    • The empirical Bayes envelope and regret minimization in competitive Markov decision processes
    • S. Mannor and N. Shimkin. The empirical Bayes envelope and regret minimization in competitive Markov decision processes. Mathematics of Operations Research, 28(2):327-345, 2003.
    • (2003) Mathematics of Operations Research , vol.28 , Issue.2 , pp. 327-345
    • Mannor, S.1    Shimkin, N.2
  • 9
    • 0004278770 scopus 로고
    • CORE Reprint Dps 9420, 9421 and 9422, Center for Operation Research and Econometrics, Universite Catholique De Louvain, Belgium
    • J. F. Mertens, S. Sorin, and S. Zamir. Repeated games. CORE Reprint Dps 9420, 9421 and 9422, Center for Operation Research and Econometrics, Universite Catholique De Louvain, Belgium, 1994.
    • (1994) Repeated Games
    • Mertens, J.F.1    Sorin, S.2    Zamir, S.3
  • 10
    • 0031256578 scopus 로고    scopus 로고
    • Calibrated learning and correlated equilibrium
    • D. P. Foster and R. V. Vohra. Calibrated learning and correlated equilibrium. Games and Economic Behavior, 21:40-55, 1997.
    • (1997) Games and Economic Behavior , vol.21 , pp. 40-55
    • Foster, D.P.1    Vohra, R.V.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.