SCOPUS 정보 검색 플랫폼

Lecture Notes in Artificial Intelligence (Subseries of Lecture Notes in Computer Science)

Volumn 3194, Issue , 2004, Pages 180-197

Logical Markov decision programs and the convergence of logical TD(λ)

(2) Kersting, Kristian a De Raedt, Luc a

a UNIVERSITY OF FREIBURG (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

DECISION THEORY; LOGIC PROGRAMMING; MARKOV PROCESSES; PROBABILISTIC LOGICS; REGRESSION ANALYSIS; TREES (MATHEMATICS); REINFORCEMENT LEARNING;

LOGICAL MARKOV DECISION PROGRAMS (LOMDP); MARKOV DECISION PROCESS (MDP); RELATIONAL LEARNING; RELATIONAL REINFORCEMENT LEARNING (RRL); LOGIC PROGRAMS; LOGICAL MARKOV DECISION PROGRAMS; MARKOV DECISION PROCESSES; RELATIONAL REINFORCEMENT LEARNING; REPRESENTATION FORMALISMS;

LEARNING SYSTEMS; INDUCTIVE LOGIC PROGRAMMING (ILP);

EID: 22944490192 PISSN: 03029743 EISSN: None Source Type: Conference Proceeding
DOI: 10.1007/978-3-540-30109-7_16 Document Type: Conference Paper

Times cited : (11)

References (35)

1
- 84898960325
- Programmable reinforcement learning agents
- MIT Press
- D. Andre and S. Russell. Programmable reinforcement learning agents. In Advances in Neural Information Processing Systems 13, pages 1019-1025. MIT Press, 2001.
- (2001) Advances in Neural Information Processing Systems , vol.13 , pp. 1019-1025
- Andre, D.¹ Russell, S.²

2
- 0032652172
- Towards a model of intelligence as an economy of agents
- E. B. Baum. Towards a Model of Intelligence as an Economy of Agents. Machine Learning, 35(2):155-185, 1999.
- (1999) Machine Learning , vol.35 , Issue.2 , pp. 155-185
- Baum, E.B.¹

3
- 0346942368
- Decision-theoretic planning: Structural assumptions and computational leverage
- C. Boutilier, T. Deam, and S. Hanks. Decision-Theoretic Planning: Structural Assumptions and Computational Leverage. JAIR, 11:1-94, 1999.
- (1999) JAIR , vol.11 , pp. 1-94
- Boutilier, C.¹ Deam, T.² Hanks, S.³

4
- 84880891360
- Symbolic dynamic programming for first-order MDPs
- Seattle, USA
- C. Boutilier, R. Reiter, and B. Price. Symbolic Dynamic Programming for First-order MDPs. In Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-01), pages 690-700, Seattle, USA, 2001.
- (2001) Seventeenth International Joint Conference on Artificial Intelligence (IJCAI-01) , pp. 690-700
- Boutilier, C.¹ Reiter, R.² Price, B.³

5
- 0001133021
- Generalization in reinforcement learning: Safely approximating the value function
- J. A. Boyan and A. W. Moore. Generalization in reinforcement learning: safely approximating the value function. In Advances in Neural Information Processing Systems, volume 7, 1995.
- (1995) Advances in Neural Information Processing Systems , vol.7
- Boyan, J.A.¹ Moore, A.W.²

6
- 18544371909
- Probabilistic logic learning
- L. De Raedt and K. Kersting. Probabilistic Logic Learning. ACM-SIGKDD Explorations: Special issue on Multi-Relational Data Mining, 5(1):31-48, 2003.
- (2003) ACM-SIGKDD Explorations: Special Issue on Multi-relational Data Mining , vol.5 , Issue.1 , pp. 31-48
- De Raedt, L.¹ Kersting, K.²

7
- 0030697013
- Abstraction and approximate decision theoretic planning
- R. Dearden and C. Boutilier. Abstraction and approximate decision theoretic planning. Artificial Intelligence, 89(1):219-283, 1997.
- (1997) Artificial Intelligence , vol.89 , Issue.1 , pp. 219-283
- Dearden, R.¹ Boutilier, C.²

8
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- Thomas G. Dietterich. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13:227-303, 2000.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.¹

9
- 1942421161
- Relational instance based regression for relational reinforcement learning
- Washington DC, USA
- K. Driessens and J. Ramon. Relational Instance Based Regression for Relational Reinforcement Learning. In Proceedings of the Twelfth International Conference on Machine Learning, pages 123-130, Washington DC, USA, 2003.
- (2003) Proceedings of the Twelfth International Conference on Machine Learning , pp. 123-130
- Driessens, K.¹ Ramon, J.²

10
- 0035312760
- Relational reinforcement learning
- S. Džeroski, L. De Raedt, and K. Driessens. Relational reinforcement learning. Machine Learning, 43(1/2):7-52, 2001.
- (2001) Machine Learning , vol.43 , Issue.1-2 , pp. 7-52
- Džeroski, S.¹ De Raedt, L.² Driessens, K.³

11
- 0011177327
- Springer-Verlag
- S. Džeroski and N. Lavrač. Relational Data Mining. Springer-Verlag, 2001.
- (2001) Relational Data Mining
- Džeroski, S.¹ Lavrač, N.²

12
- 58349113822
- Approximate policy iteration with a policy language bias
- A. Fern, S. Yoon, and R. Givan. Approximate policy iteration with a policy language bias. In Proceedings of the Neural Information Processing Conference (NIPS), 2003.
- (2003) Proceedings of the Neural Information Processing Conference (NIPS)
- Fern, A.¹ Yoon, S.² Givan, R.³

13
- 33749683139
- The thing that we tried didn't work very well: Deictic representation in reinforcement learning
- S. Finney, N. H. Gardiol, L. P. Kaelbling, and T. Oates. The thing that we tried didn't work very well: Deictic representation in reinforcement learning. In Proceedings of the Eighteenth International Conference on Uncertainty in Artificial Intelligence (UAI-02), 2002.
- (2002) Proceedings of the Eighteenth International Conference on Uncertainty in Artificial Intelligence (UAI-02)
- Finney, S.¹ Gardiol, N.H.² Kaelbling, L.P.³ Oates, T.⁴

14
- 0007414022
- John Wiley and Sons Ltd.
- P. Flach. Simply logical: intelligent reasoning by example. John Wiley and Sons Ltd., 1994.
- (1994) Simply Logical: Intelligent Reasoning by Example
- Flach, P.¹

15
- 84880688943
- Learning probabilistic relational models
- Stockholm, Sweden. Morgan Kaufmann
- N. Friedman, L. Getoor, D. Koller, and A. Pfeffer. Learning probabilistic relational models. In Proceedings of the Sixteenth International Joint Conferences on Artificial Intelligence (IJCAI-99), pages 1300-1309, Stockholm, Sweden, 1999. Morgan Kaufmann.
- (1999) Proceedings of the Sixteenth International Joint Conferences on Artificial Intelligence (IJCAI-99) , pp. 1300-1309
- Friedman, N.¹ Getoor, L.² Koller, D.³ Pfeffer, A.⁴

16
- 0038517214
- Equivalence notions and model minimization in Markov decision processes
- R. Givan, T. Dean, and M. Greig. Equivalence notions and model minimization in Markov decision processes. Artificial Intelligence, 147:163-224, 2003.
- (2003) Artificial Intelligence , vol.147 , pp. 163-224
- Givan, R.¹ Dean, T.² Greig, M.³

17
- 85156203891
- Stable fitted reinforcement learning
- MIT Press
- G. J. Gordon. Stable fitted reinforcement learning. In Advances in Neural Information Processing, pages 1052-1058. MIT Press, 1996.
- (1996) Advances in Neural Information Processing , pp. 1052-1058
- Gordon, G.J.¹

18
- 84880803349
- Generalizing plans to new environments in relational MDPs
- Acapulco, Mexico
- C. Guestrin, D. Koller, C. Gearhart, and N. Kanodia. Generalizing Plans to New Environments in Relational MDPs. In Proceedings of International Joint Conference on Artificial Intelligence (IJCAI-03), Acapulco, Mexico, 2003.
- (2003) Proceedings of International Joint Conference on Artificial Intelligence (IJCAI-03)
- Guestrin, C.¹ Koller, D.² Gearhart, C.³ Kanodia, N.⁴

19
- 21144439055
- Learning in worlds with objects
- L. P. Kaelbling, T. Oates, N. H. Gardiol, and S. Finney. Learning in worlds with objects. In Working Notes of the AAAI Stanford Spring Symposium on Learning Grounded Representations, 2001.
- (2001) Working Notes of the AAAI Stanford Spring Symposium on Learning Grounded Representations
- Kaelbling, L.P.¹ Oates, T.² Gardiol, N.H.³ Finney, S.⁴

20
- 4444242181
- Logical markov decision programs
- K. Kersting and L. De Raedt. Logical markov decision programs. In Working Notes of the IJCAI-2003 Workshop on Learning Statistical Models from Relational Data (SRL-03), pages pp. 63-70, 2003.
- (2003) Working Notes of the IJCAI-2003 Workshop on Learning Statistical Models from Relational Data (SRL-03) , pp. 63-70
- Kersting, K.¹ De Raedt, L.²

21
- 14344249892
- Bellman goes relational
- Banff, Alberta, Canada, July 4-8. (to appear)
- K. Kersting, M. Van Otterlo, and L. De Raedt. Bellman goes Relational. In Proceedings of the Twenty-First International Conference on Machine Learning (ICML-04), Banff, Alberta, Canada, July 4-8 2004. (to appear).
- (2004) Proceedings of the Twenty-first International Conference on Machine Learning (ICML-04)
- Kersting, K.¹ Van Otterlo, M.² De Raedt, L.³

22
- 0038178323
- Solving factored mdps using non-homogeneous partitions
- K.-E. Kim and T. Dean. Solving factored mdps using non-homogeneous partitions. Artificial Intelligence, 147:225-251, 2003.
- (2003) Artificial Intelligence , vol.147 , pp. 225-251
- Kim, K.-E.¹ Dean, T.²

23
- 0003932121
- PhD thesis, Department of Computer Science, University of Rochester
- A. K. McCallum. Reinforcement Learning with Selective Perception and Hidden States. PhD thesis, Department of Computer Science, University of Rochester, 1995.
- (1995) Reinforcement Learning with Selective Perception and Hidden States
- McCallum, A.K.¹

24
- 0028429573
- Inductive logic programming: Theory and methods
- S. Muggleton and L. De Raedt. Inductive logic programming: Theory and methods. Journal of Logic Programming, 19(20):629-679, 1994.
- (1994) Journal of Logic Programming , vol.19 , Issue.20 , pp. 629-679
- Muggleton, S.¹ De Raedt, L.²

25
- 0033315871
- Influence and variance of a Markov chain: Application to adaptive discretization in optimal control
- R. Munos and A. Moore. Influence and Variance of a Markov Chain: Application to Adaptive Discretization in Optimal Control. In Proceedings of the IEEE Conference on Decision and Control, 1999.
- (1999) Proceedings of the IEEE Conference on Decision and Control
- Munos, R.¹ Moore, A.²

26
- 0031187203
- The independent choice logic for modelling multiple agents under uncertainty
- D. Poole. The independent choice logic for modelling multiple agents under uncertainty. Artificial Intelligence, 94(1-2):7-56, 1997.
- (1997) Artificial Intelligence , vol.94 , Issue.1-2 , pp. 7-56
- Poole, D.¹

27
- 85102627959
- John Wiley &: Sons
- M. L. Puterman. Markov Decision Processes: Discrete Stochastic Dynamic Programming. John Wiley &: Sons, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

28
- 85153965130
- Reinforcement learning with soft state aggregation
- MIT Press
- S. P. Singh, T. Jaakkola, and M. I. Jordan. Reinforcement learning with soft state aggregation. In Advances in Neural Information Processing 7, pages 361-268. MIT Press, 1994.
- (1994) Advances in Neural Information Processing , vol.7 , pp. 361-1268
- Singh, S.P.¹ Jaakkola, T.² Jordan, M.I.³

29
- 0342420590
- Blocks world revisited
- J. Slaney and S. Thiébaux. Blocks World revisited. Artificial Intelligence, 125:119-153, 2001.
- (2001) Artificial Intelligence , vol.125 , pp. 119-153
- Slaney, J.¹ Thiébaux, S.²

30
- 0004102479
- The MIT Press
- R. S. Sutton and A. G. Barto. Reinforcement Learning: An Introduction. The MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

31
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- R. S. Sutton, D. Precup, and S. Singh. Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112:181-211, 1999.
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

32
- 0031143730
- An analysis of temporal-difference learning with function approximation
- J. N. Tsitsiklis and B. Van Roy. An analysis of temporal-difference learning with function approximation. IEEE Transactions of Automatic Control, 42:674-690, 1997.
- (1997) IEEE Transactions of Automatic Control , vol.42 , pp. 674-690
- Tsitsiklis, J.N.¹ Van Roy, B.²

33
- 40949136351
- Reinforcement learning for relational MDPs
- M. Van Otterlo. Reinforcement Learning for Relational MDPs. In Proceedings of the Annual Machine Learning Conference of Belgium and the Netherlands, 2004.
- (2004) Proceedings of the Annual Machine Learning Conference of Belgium and the Netherlands
- Van Otterlo, M.¹

34
- 0002557085
- Learning to perceive and act by trial and error
- S. D. Whitehead and D. H. Ballard. Learning to perceive and act by trial and error. Machine Learning, 7(1):45-83, 1991.
- (1991) Machine Learning , vol.7 , Issue.1 , pp. 45-83
- Whitehead, S.D.¹ Ballard, D.H.²

35
- 13444310066
- Inductive policy selection for first-order MDPs
- S. Yoon, A. Fern, and R. Givan. Inductive policy selection for first-order MDPs. In Proceedings of the International Conference on Uncertainty in Artificial Intelligence (UAI), 2002.
- (2002) Proceedings of the International Conference on Uncertainty in Artificial Intelligence (UAI)
- Yoon, S.¹ Fern, A.² Givan, R.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.