메뉴 건너뛰기




Volumn 2111, Issue , 2001, Pages 128-142

Adaptive strategies and regret minimization in arbitrarily varying Markov environments

Author keywords

[No Author keywords available]

Indexed keywords

GAME THEORY; STOCHASTIC MODELS; STOCHASTIC SYSTEMS;

EID: 84943237201     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/3-540-44581-1_9     Document Type: Conference Paper
Times cited : (1)

References (22)
  • 1
    • 84943335811 scopus 로고    scopus 로고
    • Special issue on learning in games, November
    • Special issue on learning in games. Games and Economic Behavior, 29(1), November 1999.
    • (1999) Games and Economic Behavior , vol.29 , Issue.1
  • 4
    • 84972545864 scopus 로고
    • An analog of the minimax theorem for vector payoffs
    • D. Blackwell. An analog of the minimax theorem for vector payoffs. Pacific J. Math., 6(1):1–8, 1956.
    • (1956) Pacific J. Math. , vol.6 , Issue.1 , pp. 1-8
    • Blackwell, D.1
  • 7
    • 0002267135 scopus 로고    scopus 로고
    • Adaptive game playing using multiplicative weights
    • November
    • Y. Freund and R. Schapire. Adaptive game playing using multiplicative weights. Games and Economic Behavior, 29:79–103, November 1999.
    • (1999) Games and Economic Behavior , vol.29 , pp. 79-103
    • Freund, Y.1    Schapire, R.2
  • 9
    • 0001976283 scopus 로고
    • Approximation to bayes risk in repeated play
    • M. Dresher, A. W. Tucker, and P. Wolde, editors, Princeton University Press
    • J. Hannan. Approximation to bayes risk in repeated play. In M. Dresher, A. W. Tucker, and P. Wolde, editors, Contribution to The Theory of Games, III, pages 97–139. Princeton University Press, 1957.
    • (1957) Contribution to the Theory of Games, III , pp. 97-139
    • Hannan, J.1
  • 12
    • 85149834820 scopus 로고
    • Markov games as a framework for multi-agent reinforcement learning
    • Morgan Kaufman, editor
    • M.L. Littman. Markov games as a framework for multi-agent reinforcement learning. In Morgan Kaufman, editor, Eleventh International Conference on Machine Learning, pages 157–163, 1994.
    • (1994) Eleventh International Conference on Machine Learning , pp. 157-163
    • Littman, M.L.1
  • 19
    • 0013327190 scopus 로고    scopus 로고
    • Minimizing regret: The general case
    • November
    • A. Rustichini. Minimizing regret: the general case. Games and Economic Behavior, 29:224–243, November 1999.
    • (1999) Games and Economic Behavior , vol.29 , pp. 224-243
    • Rustichini, A.1
  • 20
    • 0027201360 scopus 로고
    • Guaranteed performance regions in markovian systems with competing decision makers
    • January
    • N. Shimkin and A. Shwartz. Guaranteed performance regions in markovian systems with competing decision makers. IEEE Trans. on Automatic Control, 38(1):84–95, January 1993.
    • (1993) IEEE Trans. On Automatic Control , vol.38 , Issue.1 , pp. 84-95
    • Shimkin, N.1    Shwartz, A.2
  • 22
    • 0032047115 scopus 로고    scopus 로고
    • A game of prediction with experts advice
    • April
    • V. Vovk. A game of prediction with experts advice. Journal of Computer and Systems Sciences, 56(2):153–173, April 1998.
    • (1998) Journal of Computer and Systems Sciences , vol.56 , Issue.2 , pp. 153-173
    • Vovk, V.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.