Volume 9285, 2015, Pages 327-342

Planning in discrete and continuous Markov decision processes by probabilistic programming

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTER PROGRAMMING LANGUAGES; IMPORTANCE SAMPLING; LEARNING ALGORITHMS; LEARNING SYSTEMS; MARKOV PROCESSES; REINFORCEMENT LEARNING

EID: 84959387419     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-319-23525-7_20     Document Type: Conference Paper
Times cited: 13

References (32)
  • 2
    • 40349089023
    • Probabilistic inductive logic programming
    • De Raedt, L., Frasconi, P., Kersting, K., Muggleton, S.H. (eds.), LNCS (LNAI), Springer, Heidelberg
    • De Raedt, L., Kersting, K.: Probabilistic inductive logic programming. In: De Raedt, L., Frasconi, P., Kersting, K., Muggleton, S.H. (eds.) Probabilistic Inductive Logic Programming. LNCS (LNAI), vol. 4911, pp. 1-27. Springer, Heidelberg (2008)
    • (2008) Probabilistic Inductive Logic Programming , vol.4911 , pp. 1-27
    • De Raedt, L.1    Kersting, K.2
  • 3
    • 1942421161
    • Relational instance based regression for relational reinforcement learning
    • Driessens, K., Ramon, J.: Relational instance based regression for relational reinforcement learning. In: Proc. ICML (2003)
    • (2003) Proc. ICML
    • Driessens, K.1    Ramon, J.2
  • 4
    • 29344460055
    • Dynamic programming for structured continuous Markov decision problems
    • Feng, Z., Dearden, R., Meuleau, N., Washington, R.: Dynamic programming for structured continuous Markov decision problems. In: Proc. UAI (2004)
    • (2004) Proc. UAI
    • Feng, Z.1    Dearden, R.2    Meuleau, N.3    Washington, R.4
  • 9
    • 84866455160
    • PROST: Probabilistic planning based on UCT
    • Keller, T., Eyerich, P.: PROST: probabilistic planning based on UCT. In: Proc. ICAPS (2012)
    • (2012) Proc. ICAPS
    • Keller, T.1    Eyerich, P.2
  • 10
    • 58549084036
    • On the efficient execution of ProbLog programs
    • Garcia de la Banda, M., Pontelli, E. (eds.), LNCS, Springer, Heidelberg
    • Kimmig, A., Santos Costa, V., Rocha, R., Demoen, B., De Raedt, L.: On the efficient execution of ProbLog programs. In: Garcia de la Banda, M., Pontelli, E. (eds.) ICLP 2008. LNCS, vol. 5366, pp. 175-189. Springer, Heidelberg (2008)
    • (2008) ICLP 2008 , vol.5366 , pp. 175-189
    • Kimmig, A.1    Santos Costa, V.2    Rocha, R.3    Demoen, B.4    De Raedt, L.5
  • 11
    • 33750293964
    • Bandit based Monte-Carlo planning
    • Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.), LNCS (LNAI), Springer, Heidelberg
    • Kocsis, L., Szepesvári, C.: Bandit based Monte-Carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282-293. Springer, Heidelberg (2006)
    • (2006) ECML 2006 , vol.4212 , pp. 282-293
    • Kocsis, L.1    Szepesvári, C.2
  • 13
    • 80054835987
    • Sample-Based planning for continuous action Markov decision processes
    • Mansley, C.R., Weinstein, A., Littman, M.L.: Sample-Based planning for continuous action Markov decision processes. In: Proc. ICAPS (2011)
    • (2011) Proc. ICAPS
    • Mansley, C.R.1    Weinstein, A.2    Littman, M.L.3
  • 17
    • 84949870173
    • A particle filter for hybrid relational domains
    • Nitti, D., De Laet, T., De Raedt, L.: A particle filter for hybrid relational domains. In: Proc. IROS (2013)
    • (2013) Proc. IROS
    • Nitti, D.1    De Laet, T.2    De Raedt, L.3
  • 20
    • 18544382314
    • Learning from scarce experience
    • Peshkin, L., Shelton, C.R.: Learning from scarce experience. In: Proc. ICML, pp. 498-505 (2002)
    • (2002) Proc. ICML , pp. 498-505
    • Peshkin, L.1    Shelton, C.R.2
  • 21
    • 0242393653
    • Eligibility traces for off-policy policy evaluation
    • Precup, D., Sutton, R.S., Singh, S.P.: Eligibility traces for off-policy policy evaluation. In: Proc. ICML (2000)
    • (2000) Proc. ICML
    • Precup, D.1    Sutton, R.S.2    Singh, S.P.3
  • 23
    • 80053161811
    • Symbolic dynamic programming for discrete and continuous state MDPs
    • Sanner, S., Delgado, K.V., de Barros, L.N.: Symbolic dynamic programming for discrete and continuous state MDPs. In: Proc. UAI (2011)
    • (2011) Proc. UAI
    • Sanner, S.1    Delgado, K.V.2    de Barros, L.N.3
  • 24
    • 18544374225
    • Policy improvement for POMDPs using normalized importance sampling
    • Shelton, C.R.: Policy improvement for POMDPs using normalized importance sampling. In: Proc. UAI, pp. 496-503 (2001)
    • (2001) Proc. UAI , pp. 496-503
    • Shelton, C.R.1
  • 26
    • 0001898381
    • Practical reinforcement learning in continuous spaces
    • Smart, W.D., Kaelbling, L.P.: Practical reinforcement learning in continuous spaces. In: Proc. ICML (2000)
    • (2000) Proc. ICML
    • Smart, W.D.1    Kaelbling, L.P.2
  • 30
    • 84919905597
    • Model-Based relational RL when object existence is partially observable
    • Vien, N.A., Toussaint, M.: Model-Based relational RL when object existence is partially observable. In: Proc. ICML (2014)
    • (2014) Proc. ICML
    • Vien, N.A.1    Toussaint, M.2
  • 31
    • 85167397400
    • Integrating sample-based planning and model-based reinforcement learning
    • Walsh, T.J., Goschin, S., Littman, M.L.: Integrating sample-based planning and model-based reinforcement learning. In: Proc. AAAI (2010)
    • (2010) Proc. AAAI
    • Walsh, T.J.1    Goschin, S.2    Littman, M.L.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.