메뉴 건너뛰기




Volumn , Issue , 2007, Pages 103-110

Model-based reinforcement learning in factored-state MDPs

Author keywords

[No Author keywords available]

Indexed keywords

DATA STRUCTURES; LEARNING ALGORITHMS; OPTIMAL CONTROL SYSTEMS; POLYNOMIAL APPROXIMATION; PROBLEM SOLVING;

EID: 34548763246     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2007.368176     Document Type: Conference Paper
Times cited : (25)

References (21)
  • 1
    • 0036832954 scopus 로고    scopus 로고
    • Near-optimal reinforcement learning in polynomial time
    • M. J. Kearns and S. P. Singh, "Near-optimal reinforcement learning in polynomial time," Machine Learning, vol. 49, no. 2-3, pp. 209-232, 2002.
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 209-232
    • Kearns, M.J.1    Singh, S.P.2
  • 2
    • 0041965975 scopus 로고    scopus 로고
    • R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning
    • R. I. Brafman and M. Tennenholtz, "R-MAX - a general polynomial time algorithm for near-optimal reinforcement learning," Journal of Machine Learning Research, vol. 3, pp. 213-231, 2002.
    • (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
    • Brafman, R.I.1    Tennenholtz, M.2
  • 3
    • 23244466805 scopus 로고    scopus 로고
    • Ph.D. dissertation, Gatsby Computational Neuroscience Unit, University College London
    • S. M. Kakade, "On the sample complexity of reinforcement learning," Ph.D. dissertation, Gatsby Computational Neuroscience Unit, University College London, 2003.
    • (2003) On the sample complexity of reinforcement learning
    • Kakade, S.M.1
  • 10
    • 0346942368 scopus 로고    scopus 로고
    • Decision-theoretic planning: Structural assumptions and computational leverage
    • C. Boutilier, T. Dean, and S. Hanks, "Decision-theoretic planning: Structural assumptions and computational leverage," Journal of Artificial Intelligence Research, vol. 11, pp. 1-94, 1999.
    • (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 1-94
    • Boutilier, C.1    Dean, T.2    Hanks, S.3
  • 15
    • 0021518106 scopus 로고
    • A theory of the learnable
    • November
    • L. G. Valiant, "A theory of the learnable," Communications of the ACM, vol. 27, no. 11, pp. 1134-1142, November 1984.
    • (1984) Communications of the ACM , vol.27 , Issue.11 , pp. 1134-1142
    • Valiant, L.G.1
  • 18
    • 0031369472 scopus 로고    scopus 로고
    • Probabilistic propositional planning: Representations and complexity
    • AAAI Press/The MIT Press, Online, Available
    • M. L. Littman, "Probabilistic propositional planning: Representations and complexity," in Proceedings of the Fourteenth National Conference on Artificial Intelligence. AAAI Press/The MIT Press, 1997, pp. 748-754. [Online], Available: http://www.cs.rutgers.edu/ mlittman/papers/aaai97-planning.ps
    • (1997) Proceedings of the Fourteenth National Conference on Artificial Intelligence , pp. 748-754
    • Littman, M.L.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.