SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 9285, Issue , 2015, Pages 327-342

Planning in discrete and continuous markov decision processes by probabilistic programming

(3) Nitti, Davide a Belle, Vaishak a De Raedt, Luc a

a UNIVERSITY OF LEUVEN (Belgium)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTER PROGRAMMING LANGUAGES; IMPORTANCE SAMPLING; LEARNING ALGORITHMS; LEARNING SYSTEMS; MARKOV PROCESSES; REINFORCEMENT LEARNING;

EMPIRICAL EVALUATIONS; MARKOV DECISION PROCESSES; NATURAL APPROACHES; PLANNING ALGORITHMS; PROBABILISTIC PROGRAMMING; PROBABILISTIC PROGRAMMING LANGUAGE; PROBABILISTIC PROGRAMS; REAL-WORLD PLANNING PROBLEM;

PROBABILITY DISTRIBUTIONS;

EID: 84959387419 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-319-23525-7_20 Document Type: Conference Paper

Times cited : (13)

References (32)

1
- 84959363979
- Université Paris Sud-Paris XI, Thesis
- Couetoux, A.: Monte Carlo Tree Search for Continuous and Stochastic Sequential Decision Making Problems. Université Paris Sud-Paris XI, Thesis (2013)
- (2013) Monte Carlo Tree Search for Continuous and Stochastic Sequential Decision Making Problems
- Couetoux, A.¹

2
- 40349089023
- Probabilistic inductive logic programming
- De Raedt, L., Frasconi, P., Kersting, K.,Muggleton, S.H. (eds.), LNCS (LNAI), Springer, Heidelberg
- De Raedt, L., Kersting, K.: Probabilistic inductive logic programming. In: De Raedt, L., Frasconi, P., Kersting, K.,Muggleton, S.H. (eds.) Probabilistic Inductive Logic Programming. LNCS (LNAI), vol. 4911, pp. 1-27. Springer, Heidelberg (2008)
- (2008) Probabilistic Inductive Logic Programming , vol.4911 , pp. 1-27
- De Raedt, L.¹ Kersting, K.²

3
- 1942421161
- Relational instance based regression for relational reinforcement learning
- Driessens, K., Ramon, J.: Relational instance based regression for relational reinforcement learning. In: Proc. ICML (2003)
- (2003) Proc. ICML
- Driessens, K.¹ Ramon, J.²

4
- 29344460055
- Dynamic programming for structured continuous Markov decision problems
- Feng, Z., Dearden, R., Meuleau, N., Washington, R.: Dynamic programming for structured continuous Markov decision problems. In: Proc. UAI (2004)
- (2004) Proc. UAI
- Feng, Z.¹ Dearden, R.² Meuleau, N.³ Washington, R.⁴

5
- 1942419282
- Representations for learning control policies
- Forbes, J., André, D.: Representations for learning control policies. In: Proc. of the ICML Workshop on Development of Representations (2002)
- (2002) Proc. of the ICML Workshop on Development of Representations
- Forbes, J.¹ André, D.²

6
- 70049098573
- Church: A language for generative models
- Goodman, N., Mansinghka, V.K., Roy, D.M., Bonawitz, K., Tenenbaum, J.B.: Church: A language for generative models. In: Proc. UAI, pp. 220-229 (2008)
- (2008) Proc. UAI , pp. 220-229
- Goodman, N.¹ Mansinghka, V.K.² Roy, D.M.³ Bonawitz, K.⁴ Tenenbaum, J.B.⁵

7
- 80054898934
- Theory and Practice of Logic Programming
- Gutmann, B., Thon, I., Kimmig, A., Bruynooghe, M., De Raedt, L.: The magic of logical inference in probabilistic programming. Theory and Practice of Logic Programming (2011)
- (2011) The magic of logical inference in probabilistic programming
- Gutmann, B.¹ Thon, I.² Kimmig, A.³ Bruynooghe, M.⁴ De Raedt, L.⁵

8
- 0036832951
- Machine Learning
- Kearns, M., Mansour, Y., Ng, A.Y.: A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes. Machine Learning (2002)
- (2002) A Sparse Sampling Algorithm for Near-Optimal Planning in Large Markov Decision Processes
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

9
- 84866455160
- PROST: Probabilistic planning based on UCT
- Keller, T., Eyerich, P.: PROST: probabilistic planning based on UCT. In: Proc. ICAPS (2012)
- (2012) Proc. ICAPS
- Keller, T.¹ Eyerich, P.²

10
- 58549084036
- On the efficient execution of problog programs
- Garcia de la Banda, M., Pontelli, E. (eds.), LNCS, Springer, Heidelberg
- Kimmig, A., Santos Costa, V., Rocha, R., Demoen, B., De Raedt, L.: On the efficient execution of problog programs. In: Garcia de la Banda, M., Pontelli, E. (eds.) ICLP 2008. LNCS, vol. 5366, pp. 175-189. Springer, Heidelberg (2008)
- (2008) ICLP 2008 , vol.5366 , pp. 175-189
- Kimmig, A.¹ Santos Costa, V.² Rocha, R.³ Demoen, B.⁴ De Raedt, L.⁵

11
- 33750293964
- Bandit based monte-carlo planning
- Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.), LNCS (LNAI), Springer, Heidelberg
- Kocsis, L., Szepesvári, C.: Bandit based monte-carlo planning. In: Fürnkranz, J., Scheffer, T., Spiliopoulou, M. (eds.) ECML 2006. LNCS (LNAI), vol. 4212, pp. 282-293. Springer, Heidelberg (2006)
- (2006) ECML 2006 , vol.4212 , pp. 282-293
- Kocsis, L.¹ Szepesvári, C.²

12
- 78651517373
- Planning with Noisy Probabilistic Relational Rules
- Lang, T., Toussaint, M.: Planning with Noisy Probabilistic Relational Rules. Journal of Artificial Intelligence Research 39, 1-49 (2010)
- (2010) Journal of Artificial Intelligence Research , vol.39 , pp. 1-49
- Lang, T.¹ Toussaint, M.²

13
- 80054835987
- Sample-Based planning for continuous action markov decision processes
- Mansley, C.R., Weinstein, A., Littman, M.L.: Sample-Based planning for continuous action markov decision processes. In: Proc. ICAPS (2011)
- (2011) Proc. ICAPS
- Mansley, C.R.¹ Weinstein, A.² Littman, M.L.³

14
- 65349138293
- A heuristic search approach to planning with continuous resources in stochastic domains
- Meuleau, N., Benazera, E., Brafman, R.I., Hansen, E.A., Mausam, M.: A heuristic search approach to planning with continuous resources in stochastic domains. Journal of Artificial Intelligence Research 34(1), 27 (2009)
- (2009) Journal of Artificial Intelligence Research , vol.34 , Issue.1 , pp. 27
- Meuleau, N.¹ Benazera, E.² Brafman, R.I.³ Hansen, E.A.⁴ Mausam, M.⁵

15
- 84880739933
- BLOG: Probabilistic models with unknown objects
- Milch, B., Marthi, B., Russell, S., Sontag, D., Ong, D., Kolobov, A.: BLOG: probabilistic models with unknown objects. In: Proc. IJCAI (2005)
- (2005) Proc. IJCAI
- Milch, B.¹ Marthi, B.² Russell, S.³ Sontag, D.⁴ Ong, D.⁵ Kolobov, A.⁶

16
- 84892915061
- Foundations and Trends in Machine Learning, Now Publishers
- Munos, R.: From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning. Foundations and Trends in Machine Learning, Now Publishers (2014)
- (2014) From Bandits to Monte-Carlo Tree Search: The Optimistic Principle Applied to Optimization and Planning
- Munos, R.¹

17
- 84949870173
- A particle filter for hybrid relational domains
- Nitti, D., De Laet, T., De Raedt, L.: A particle filter for hybrid relational domains. In: Proc. IROS (2013)
- (2013) Proc. IROS
- Nitti, D.¹ De Laet, T.² De Raedt, L.³

18
- 84929208405
- Relational object tracking and learning
- Nitti, D., De Laet, T., De Raedt, L.: Relational object tracking and learning. In: Proc. ICRA (2014)
- (2014) Proc. ICRA
- Nitti, D.¹ De Laet, T.² De Raedt, L.³

19
- 84904418787
- methods and examples
- Owen, A.B.: Monte Carlo theory, methods and examples (2013)
- (2013) Monte Carlo theory
- Owen, A.B.¹

20
- 18544382314
- Learning from scarce experience
- Peshkin, L., Shelton, C.R.: Learning from scarce experience. In: Proc. ICML, pp. 498-505 (2002)
- (2002) Proc. ICML , pp. 498-505
- Peshkin, L.¹ Shelton, C.R.²

21
- 0242393653
- Eligibility traces for off-policy policy evaluation
- Precup, D., Sutton, R.S., Singh, S.P.: Eligibility traces for off-policy policy evaluation. In: Proc. ICML (2000)
- (2000) Proc. ICML
- Precup, D.¹ Sutton, R.S.² Singh, S.P.³

22
- 84861364904
- (unpublished paper)
- Sanner, S.: Relational Dynamic Influence Diagram Language (RDDL): Language Description (unpublished paper)
- Relational Dynamic Influence Diagram Language (RDDL): Language Description
- Sanner, S.¹

23
- 80053161811
- Symbolic dynamic programming for discrete and continuous state MDPs
- Sanner, S., Delgado, K.V., de Barros, L.N.: Symbolic dynamic programming for discrete and continuous state MDPs. In: Proc. UAI (2011)
- (2011) Proc. UAI
- Sanner, S.¹ Delgado, K.V.² de Barros, L.N.³

24
- 18544374225
- Policy improvement for POMDPs using normalized importance sampling
- Shelton, C.R.: Policy improvement for POMDPs using normalized importance sampling. In: Proc. UAI, pp. 496-503 (2001)
- (2001) Proc. UAI , pp. 496-503
- Shelton, C.R.¹

25
- 0005942760
- Ph.D. thesis, MIT
- Shelton, C.R.: Importance Sampling for Reinforcement Learning with Multiple Objectives. Ph.D. thesis, MIT (2001)
- (2001) Importance Sampling for Reinforcement Learning with Multiple Objectives
- Shelton, C.R.¹

26
- 0001898381
- Practical reinforcement learning in continuous spaces
- Smart, W.D., Kaelbling, L.P.: Practical reinforcement learning in continuous spaces. In: Proc. ICML (2000)
- (2000) Proc. ICML
- Smart, W.D.¹ Kaelbling, L.P.²

27
- 84923305328
- First-order open-universe POMDPs
- Srivastava, S., Russell, S., Ruan, P., Cheng, X.: First-order open-universe POMDPs. In: Proc. UAI (2014)
- (2014) Proc. UAI
- Srivastava, S.¹ Russell, S.² Ruan, P.³ Cheng, X.⁴

28
- 0004102479
- MIT Press
- Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press (1998)
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

29
- 85130714337
- DTProbLog: A decision-theoretic probabilistic prolog
- Van den Broeck, G., Thon, I., van Otterlo, M., De Raedt, L.: DTProbLog: a decision-theoretic probabilistic prolog. In: Proc. AAAI (2010)
- (2010) Proc. AAAI
- Van den Broeck, G.¹ Thon, I.² van Otterlo, M.³ De Raedt, L.⁴

30
- 84919905597
- Model-Based relational RL when object existence is partially observable
- Vien, N.A., Toussaint, M.: Model-Based relational RL when object existence is partially observable. In: Proc. ICML (2014)
- (2014) Proc. ICML
- Vien, N.A.¹ Toussaint, M.²

31
- 85167397400
- Integrating sample-based planning and model-based reinforcement learning
- Walsh, T.J., Goschin, S., Littman, M.L.: Integrating sample-based planning and model-based reinforcement learning. In: Proc. AAAI (2010)
- (2010) Proc. AAAI
- Walsh, T.J.¹ Goschin, S.² Littman, M.L.³

32
- 84873574800
- Reinforcement learning: State-of-the-art
- Springer
- Wiering, M., van Otterlo, M.: Reinforcement learning: state-of-the-art. In: Adaptation, Learning, and Optimization. Springer (2012)
- (2012) Adaptation, Learning, and Optimization.
- Wiering, M.¹ van Otterlo, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.