2007, Pages 2054-2059

Effective control knowledge transfer through learning skill and representation hierarchies

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL LEARNING; HIERARCHICAL REINFORCEMENT LEARNING; KNOWLEDGE TRANSFER; LEARNING ARCHITECTURES; LEARNING CAPABILITIES; LEARNING SKILLS; LEARNING TASKS; STATE SPACE REPRESENTATION;

EID: 84880906451     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 37

References (11)
  • 3
    • 0346942368
    • Decision-Theoretic Planning: Structural Assumptions and Computational Leverage
    • C. Boutilier, T. Dean, and S. Hanks. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage. Journal of Artificial Intelligence Research, 11:1-94, 1999. (Pubitemid 129628760)
    • (1999) Journal of Artificial Intelligence Research , vol.11 , pp. 1-94
    • Boutilier, C.1    Dean, T.2    Hanks, S.3
  • 4
    • 0002278788
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • T. G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 5
    • 0000746330
    • Model Reduction Techniques for Computing Approximately Optimal Solutions for Markov Decision Processes
    • San Francisco, CA, Morgan Kaufmann Publishers
    • R. Givan, T. Dean, and S. Leach. Model Reduction Techniques for Computing Approximately Optimal Solutions for Markov Decision Processes. In Proceedings of the 13th Annual Conference on Uncertainty in Artificial Intelligence (UAI-97), pages 124-131, San Francisco, CA, 1997. Morgan Kaufmann Publishers.
    • (1997) Proceedings of the 13th Annual Conference on Uncertainty in Artificial Intelligence (UAI-97) , pp. 124-131
    • Givan, R.1    Dean, T.2    Leach, S.3
  • 6
    • 29344435556
    • Subgoal Discovery for Hierarchical Reinforcement Learning Using Learned Policies
    • AAAI
    • S. Goel and M. Huber. Subgoal Discovery for Hierarchical Reinforcement Learning Using Learned Policies. In Proceedings of the 16th International FLAIRS Conference, pages 346-350. AAAI, 2003.
    • (2003) Proceedings of the 16th International FLAIRS Conference , pp. 346-350
    • Goel, S.1    Huber, M.2
  • 7
    • 0038178323
    • Solving Factored MDPs using Non-Homogeneous Partitions
    • K. Kim and T. Dean. Solving Factored MDPs using Non-Homogeneous Partitions. Artificial Intelligence, 147:225-251, 2003.
    • (2003) Artificial Intelligence , vol.147 , pp. 225-251
    • Kim, K.1    Dean, T.2
  • 10
    • 0033170372
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • DOI 10.1016/S0004-3702(99)00052-1
    • R.S. Sutton, D. Precup, and S. Singh. Between MDPs and Semi-MDPs: Learning, Planning, and Representing Knowledge at Multiple Temporal Scales. Artificial Intelligence, 112:181-211, 1999. (Pubitemid 32079890)
    • (1999) Artificial Intelligence , vol.112 , Issue.1 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 11
    • 27544473171
    • Behavior transfer for value-function-based reinforcement learning
    • Frank Dignum, Virginia Dignum, Sven Koenig, Sarit Kraus, Munindar P. Singh, and Michael Wooldridge, editors, New York, NY, July ACM Press
    • Matthew E. Taylor and Peter Stone. Behavior transfer for value-function-based reinforcement learning. In Frank Dignum, Virginia Dignum, Sven Koenig, Sarit Kraus, Munindar P. Singh, and Michael Wooldridge, editors, The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems, pages 53-59, New York, NY, July 2005. ACM Press.
    • (2005) The Fourth International Joint Conference on Autonomous Agents and Multiagent Systems , pp. 53-59
    • Taylor, M.E.1    Stone, P.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.