



Volume WS-07-02, 2007, Pages 52-56

Hierarchical strategy learning with hybrid representations

Author keywords

[No Author keywords available]

Indexed keywords

HIERARCHICAL REPRESENTATION; NOVICE LEARNERS; PLANNING ALGORITHMS; STRATEGIC LEVELS; STRATEGY LEARNING; SUPERIOR PERFORMANCE; TECHNICAL REPORTS; VALUE FUNCTIONS;

EID: 51849088509     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 6

References (19)
  • 1
    • 14344251217
    • Apprenticeship learning via inverse reinforcement learning
    • Abbeel, P., and Ng, A. Y. 2004. Apprenticeship learning via inverse reinforcement learning. In Proc. ICML.
    • (2004) Proc. ICML
    • Abbeel, P.1    Ng, A.Y.2
  • 5
    • 0002278788
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, T. G. 2000. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research 13:227-303.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 8
    • 17444432872
    • CaMeL: Learning method preconditions for HTN planning
    • Ilghami, O.; Nau, D. S.; and Muñoz-Avila, H. 2002. CaMeL: Learning method preconditions for HTN planning. In AIPS-02.
    • (2002) AIPS-02
    • Ilghami, O.1    Nau, D.S.2    Muñoz-Avila, H.3
  • 11
    • 0027574520
    • Taxonomic syntax for first-order inference
    • McAllester, D., and Givan, R. 1993. Taxonomic syntax for first-order inference. Journal of the ACM 40:246-283.
    • (1993) Journal of the ACM , vol.40 , pp. 246-283
    • McAllester, D.1    Givan, R.2
  • 12
    • 14344250066
    • Learning to fly by combining reinforcement learning with behavioural cloning
    • Morales, E., and Sammut, C. 2004. Learning to fly by combining reinforcement learning with behavioural cloning. In ICML.
    • (2004) ICML
    • Morales, E.1    Sammut, C.2
  • 14
    • 0001070375
    • Reinforcement learning with hierarchies of machines
    • Parr, R., and Russell, S. 1997. Reinforcement learning with hierarchies of machines. In Jordan, M. I.; Kearns, M. J.; and Solla, S. A., eds., Advances in Neural Information Processing Systems, volume 10. The MIT Press.
    • (1997) Advances in Neural Information Processing Systems , vol.10
    • Parr, R.1    Russell, S.2
  • 15
    • 29144443664
    • Minority report in fraud detection: Classification of skewed data
    • Phua, C.; Alahakoon, D.; and Lee, V. 2004. Minority report in fraud detection: Classification of skewed data. SIGKDD Explor. Newsl. 6(1):50-59.
    • (2004) SIGKDD Explor. Newsl. , vol.6 , Issue.1 , pp. 50-59
    • Phua, C.1    Alahakoon, D.2    Lee, V.3
  • 16
    • 84899003140
    • Multi-time models for temporally abstract planning
    • Precup, D., and Sutton, R. S. 1998. Multi-time models for temporally abstract planning. In Jordan, M. I.; Kearns, M. J.; and Solla, S. A., eds., Advances in Neural Information Processing Systems, volume 10. The MIT Press.
    • (1998) Advances in Neural Information Processing Systems , vol.10
    • Precup, D.1    Sutton, R.S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.