메뉴 건너뛰기




Volumn 2375, Issue , 2002, Pages 364-379

Self-optimizing and Pareto-optimal policies in general environments based on Bayes-mixtures

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN NETWORKS; COMPUTATION THEORY; MARKOV PROCESSES; MIXTURES; PARETO PRINCIPLE; PROBABILITY DISTRIBUTIONS; REINFORCEMENT LEARNING;

EID: 84937417436     PISSN: 03029743     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1007/3-540-45435-7_25     Document Type: Conference Paper
Times cited : (37)

References (14)
  • 1
    • 85012688561 scopus 로고
    • [Bel57] Princeton University Press, New Jersey
    • [Bel57] R. Bellman. Dynamic Programming. Princeton University Press, New Jersey, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 3
    • 0034247018 scopus 로고    scopus 로고
    • A near-optimal polynomial time algorithm for learning in certain classes of stochastic games
    • [BT00]
    • [BT00] R. I. Brafman and M. Tennenholtz. A near-optimal polynomial time algorithm for learning in certain classes of stochastic games. Artificial Intelligence, 121(1-2):31-47, 2000.
    • (2000) Artificial Intelligence , vol.121 , Issue.1-2 , pp. 31-47
    • Brafman, R.I.1    Tennenholtz, M.2
  • 4
    • 0003954462 scopus 로고
    • [Doo53] John Wiley & Sons, New York
    • [Doo53] J. L. Doob. Stochastic Processes. John Wiley & Sons, New York, 1953.
    • (1953) Stochastic Processes
    • Doob, J.L.1
  • 8
    • 0012257655 scopus 로고    scopus 로고
    • Near-optimal reinforcement learning in polynomial time
    • [KS98] Morgan Kaufmann, San Francisco, CA
    • [KS98] M. Kearns and S. Singh. Near-optimal reinforcement learning in polynomial time. In Proc. 15th International Conf. on Machine Learning, pages 260-268. Morgan Kaufmann, San Francisco, CA, 1998.
    • (1998) Proc. 15th International Conf. on Machine Learning , pp. 260-268
    • Kearns, M.1    Singh, S.2
  • 14
    • 0017996595 scopus 로고
    • Complexity-based induction systems: Comparisons and convergence theorems
    • [Sol78]
    • [Sol78] R. J. Solomonoff. Complexity-based induction systems: comparisons and convergence theorems. IEEE Trans. Inform. Theory, IT-24:422-432, 1978.
    • (1978) IEEE Trans. Inform. Theory , vol.IT-24 , pp. 422-432
    • Solomonoff, R.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.