메뉴 건너뛰기




Volumn 2, Issue 2, 2008, Pages 684-694

Algebraic results and bottom-up algorithm for policies generalization in reinforcement learning using concept lattices

Author keywords

Galois lattices; Reinforcement learning

Indexed keywords


EID: 41249094659     PISSN: 1751570X     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.nahs.2006.12.001     Document Type: Article
Times cited : (2)

References (16)
  • 2
    • 0033170372 scopus 로고    scopus 로고
    • Between mdps and semi-mdps: A framework for temporal abstraction in reinforcement learning
    • Sutton R.S., Precup D., and Singh S. Between mdps and semi-mdps: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112 1-2 (1999) 181-211
    • (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 3
    • 14344251007 scopus 로고    scopus 로고
    • M.R. James, S. Singh, Learning and discovery of predictive state representations in dynamical systems with reset, in: ICML 2004, Dpt of Computer Science and Engineering, University of Michigan, Ann Arbor, 2004, pp. 417-424
    • M.R. James, S. Singh, Learning and discovery of predictive state representations in dynamical systems with reset, in: ICML 2004, Dpt of Computer Science and Engineering, University of Michigan, Ann Arbor, 2004, pp. 417-424
  • 4
    • 1942516880 scopus 로고    scopus 로고
    • R. Munos, Error bounds for approximate policy iteration, in: International Conference on Machine Learning ICML 2003, Centre de Mathématiques Appliquées, Ecole Polytechnique, Palaiseau, France, 2003, pp. 560-567
    • R. Munos, Error bounds for approximate policy iteration, in: International Conference on Machine Learning ICML 2003, Centre de Mathématiques Appliquées, Ecole Polytechnique, Palaiseau, France, 2003, pp. 560-567
  • 5
    • 41249085418 scopus 로고    scopus 로고
    • A. McCallum, Reinforcement learning with selective perception and hidden state, Ph.D. Thesis, 1996
    • A. McCallum, Reinforcement learning with selective perception and hidden state, Ph.D. Thesis, 1996
  • 6
    • 84880771557 scopus 로고    scopus 로고
    • B. Ravindran, A.G. Barto, SMDP homomorphisms: An algebraic approach to abstraction in semi Markov decision processes, in: IJCAI 2003, AAAI Press edition, Dpt of Computer Science, University of Massachussetts, Amherst, 2003, pp. 1011-1016
    • B. Ravindran, A.G. Barto, SMDP homomorphisms: An algebraic approach to abstraction in semi Markov decision processes, in: IJCAI 2003, AAAI Press edition, Dpt of Computer Science, University of Massachussetts, Amherst, 2003, pp. 1011-1016
  • 7
    • 0038517214 scopus 로고    scopus 로고
    • Equivalence notions and model minimization in Markov decision processes
    • Givan R., Dean T., and Greig M. Equivalence notions and model minimization in Markov decision processes. Artificial Intelligence 147 1-2 (2003) 163-223
    • (2003) Artificial Intelligence , vol.147 , Issue.1-2 , pp. 163-223
    • Givan, R.1    Dean, T.2    Greig, M.3
  • 8
    • 0002278788 scopus 로고    scopus 로고
    • State abstraction in maxq hierachical reinforcement learning
    • Dietterich T.G. State abstraction in maxq hierachical reinforcement learning. Artificial Intelligence Research 13 (2000) 227-303
    • (2000) Artificial Intelligence Research , Issue.13 , pp. 227-303
    • Dietterich, T.G.1
  • 10
    • 0344666744 scopus 로고    scopus 로고
    • M. Ricordeau, Q-concept-learning: Generalization with concept lattice representation in reinforcement learning, in: I.C. Society (Ed.), International Conference on Tools with Artificial Intelligence, ICTAI 03, Lirmm, Montpellier, 2003, pp. 316-323
    • M. Ricordeau, Q-concept-learning: Generalization with concept lattice representation in reinforcement learning, in: I.C. Society (Ed.), International Conference on Tools with Artificial Intelligence, ICTAI 03, Lirmm, Montpellier, 2003, pp. 316-323
  • 14
    • 84861810840 scopus 로고    scopus 로고
    • M. Liquière, J. Sallantin, Structural machine learning with Galois lattice and graphs, in: M.K. Ed (Ed.), ICML 1998, Lirmm, Montpellier, Morgan Kaufmann Ed, 1998, pp. 305-313
    • M. Liquière, J. Sallantin, Structural machine learning with Galois lattice and graphs, in: M.K. Ed (Ed.), ICML 1998, Lirmm, Montpellier, Morgan Kaufmann Ed, 1998, pp. 305-313
  • 15
    • 41249095897 scopus 로고    scopus 로고
    • R. Munos, Finite-element methods with local triangulation refinement for continuous reinforcement learning problems, 1997
    • R. Munos, Finite-element methods with local triangulation refinement for continuous reinforcement learning problems, 1997
  • 16
    • 41249095783 scopus 로고    scopus 로고
    • A. McCallum, Efficiently inducing features of conditional random fields, in: Conference on Uncertainty in Articifical Intelligence, UAI, 2003, Dpt of Computer Science, University of Massachussetts, Amherst, 2003
    • A. McCallum, Efficiently inducing features of conditional random fields, in: Conference on Uncertainty in Articifical Intelligence, UAI, 2003, Dpt of Computer Science, University of Massachussetts, Amherst, 2003


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.