



Volume , Issue , 2003, Pages 1003-1010

Generalizing plans to new environments in relational MDPs

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATE VALUE FUNCTION; DIFFERENT CLASS; LOSS OF PERFORMANCE; MARKOV DECISION PROCESSES; MULTI-AGENT SETTING; PLANNING TIME; POLYNOMIAL NUMBER; PROBLEM-BASED;

EID: 84880803349     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (151)

References (17)
  • 1
    • 84880891360
    • Symbolic dynamic programming for first-order MDPs
    • C. Boutilier, R. Reiter, and B. Price. Symbolic dynamic programming for first-order MDPs. In IJCAI-01, 2001.
    • (2001) IJCAI-01
    • Boutilier, C.1    Reiter, R.2    Price, B.3
  • 2
    • 13444255536
    • On constraint sampling for the linear programming approach to approximate dynamic programming
    • Submitted to
    • D.P. de Farias and B. Van Roy. On constraint sampling for the linear programming approach to approximate dynamic programming. Submitted to Math. of Operations Research, 2001.
    • (2001) Math. of Operations Research
    • De Farias, D.P.1    Van Roy, B.2
  • 3
    • 85055854784
    • Probabilistic temporal reasoning
    • T. Dean and K. Kanazawa. Probabilistic temporal reasoning. In AAAI-88, 1988.
    • (1988) AAAI-88
    • Dean, T.1    Kanazawa, K.2
  • 4
    • 0002278788
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • T. G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 5
    • 0015440625
    • Learning and executing generalized robot plans
    • R. E. Fikes, P. E. Hart, and N. J. Nilsson. Learning and executing generalized robot plans. Artf. Intel, 3(4):251-288, 1972.
    • (1972) Artf. Intel , vol.3 , Issue.4 , pp. 251-288
    • Fikes, R.E.1    Hart, P.E.2    Nilsson, N.J.3
  • 6
  • 7
    • 1142281638
    • Context specific multiagent coordination and planning with factored MDPs
    • C. E. Guestrin, S. Venkataraman, and D. Koller. Context specific multiagent coordination and planning with factored MDPs. In AAAI-02, 2002.
    • (2002) AAAI-02
    • Guestrin, C.E.1    Venkataraman, S.2    Koller, D.3
  • 8
  • 9
    • 0033189384
    • Learning action strategies for planning domains
    • DOI 10.1016/S0004-3702(99)00060-0
    • R. Khardon. Learning action strategies for planning domains. Artificial Intelligence, 113:125-148, 1999. (Pubitemid 30542740)
    • (1999) Artificial Intelligence , vol.113 , Issue.1 , pp. 125-148
    • Khardon, R.1
  • 11
    • 0038362668
    • Learning generalized policies in planning using concept languages
    • M. Martin and H. Geffner. Learning generalized policies in planning using concept languages. In KR, 2000.
    • (2000) KR
    • Martin, M.1    Geffner, H.2
  • 12
    • 0346738900
    • Flexible decomposition algorithms for weakly coupled markov decision problems
    • R. Parr. Flexible decomposition algorithms for weakly coupled markov decision problems. In UAI-98, 1998.
    • (1998) UAI-98
    • Parr, R.1
  • 13
    • 1542342765
    • Direct value-approximation for factored MDPs
    • D. Schuurmans and R. Patrascu. Direct value-approximation for factored MDPs. In NIPS-14, 2001.
    • (2001) NIPS-14
    • Schuurmans, D.1    Patrascu, R.2
  • 14
    • 0000273218
    • Generalized polynomial approximations in Markovian decision processes
    • P. Schweitzer and A. Seidmann. Generalized polynomial approximations in Markovian decision processes. J. of Mathematical Analysis and Applications, 110:568-582, 1985.
    • (1985) J. of Mathematical Analysis and Applications , vol.110 , pp. 568-582
    • Schweitzer, P.1    Seidmann, A.2
  • 16
    • 0042496037
    • Discovering structure in multiple learning tasks: The TC algorithm
    • S. Thrun and J. O'Sullivan. Discovering structure in multiple learning tasks: The TC algorithm. In ICML-96, 1996.
    • (1996) ICML-96
    • Thrun, S.1    O'Sullivan, J.2
  • 17
    • 13444310066
    • Inductive policy selection for first-order MDPs
    • S. W. Yoon, A. Fern, and B. Givan. Inductive policy selection for first-order MDPs. In UAI-02, 2002.
    • (2002) UAI-02
    • Yoon, S.W.1    Fern, A.2    Givan, B.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.