SCOPUS 정보 검색 플랫폼

Journal of Computers

Volumn 3, Issue 9, 2008, Pages 29-38

Agent learning in relational domains based on logical MDPs with negation

(3) Song, Zhiwei a Chen, Xiaoping a Cong, Shuang a

a UNIVERSITY OF SCIENCE AND TECHNOLOGY OF CHINA (China)

Author keywords

Logical MDPs with negation; Relational reineforcement learning; State refinement; ( ) Learning

Indexed keywords

GROUND STATE; MARKOV PROCESSES; REINFORCEMENT LEARNING;

FORMAL DEFINITION; GENERATING METHODS; LOGICAL MDPS WITH NEGATION; MARKOV DECISION PROCESSES; RELATIONAL FORMS; RELATIONAL REINEFORCEMENT LEARNING; RELATIONAL REINFORCEMENT LEARNING; STATE REFINEMENT;

LEARNING ALGORITHMS;

EID: 72949090743 PISSN: 1796203X EISSN: None Source Type: Journal
DOI: 10.4304/jcp.3.9.29-38 Document Type: Article

Times cited : (3)

References (36)

1
- 0029679044
- Reinforcement learning: A survey
- L. Kaelbling, M. Littman, and A. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.¹ Littman, M.² Moore, A.³

2
- 0004102479
- The MIT Press
- R. Sutton and A. Barto, Reinforcement Learning: An Introduction. The MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

3
- 27644480261
- Integrating relevance feedback techniques for image retrieval using reinforcement learning
- P.-Y. Yin, B. Bhanu, K.-C. Chang, and A. Dong, "Integrating relevance feedback techniques for image retrieval using reinforcement learning," IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 27, no. 10, pp. 1536-1551, 2005.
- (2005) IEEE Transactions on Pattern Analysis and Machine Intelligence , vol.27 , Issue.10 , pp. 1536-1551
- Yin, P.-Y.¹ Bhanu, B.² Chang, K.-C.³ Dong, A.⁴

4
- 33645712749
- LEAD: A methodology for learning efficient approaches to medical diagnosis
- S. Fakih and T. Das, "LEAD: a methodology for learning efficient approaches to medical diagnosis," IEEE Transactions on Information Technology in Biomedicine, vol. 10, no. 2, pp. 220-228, 2006.
- (2006) IEEE Transactions on Information Technology in Biomedicine , vol.10 , Issue.2 , pp. 220-228
- Fakih, S.¹ Das, T.²

5
- 33646492472
- An efficient dynamic algorithm for maintaining all-pairs shortest paths in stochastic networks
- S. Misra and B. Oommen, "An efficient dynamic algorithm for maintaining all-pairs shortest paths in stochastic networks," IEEE Transactions on Computers, vol. 55, no. 6, pp. 686-702, 2006.
- (2006) IEEE Transactions on Computers , vol.55 , Issue.6 , pp. 686-702
- Misra, S.¹ Oommen, B.²

6
- 0035312760
- Relational reinforcement learning
- S. Džeroski, L. De Raedt, and K. Driessens, "Relational reinforcement learning," Machine Learning, vol. 43, pp. 7-52, 2001.
- (2001) Machine Learning , vol.43 , pp. 7-52
- Džeroski, S.¹ De Raedt, L.² Driessens, K.³

7
- 37249061374
- A survey of reinforcement learning in relational domains,
- 1381-3625, July 2005
- M. Van Otterlo, "A survey of reinforcement learning in relational domains," CTIT Technical Report Series ISSN 1381-3625, July 2005.
- CTIT Technical Report Series ISSN
- Van Otterlo, M.¹

8
- 26944455336
- Relational reinforcement learning: An overview
- P. Tadepalli, R. Givan, and K. Driessens, "Relational reinforcement learning: An overview," in ICML'04 Workshop on Relational Reinforcement Learning, 2004.
- (2004) ICML'04 Workshop on Relational Reinforcement Learning
- Tadepalli, P.¹ Givan, R.² Driessens, K.³

9
- 13444310066
- Inductive policy selection for first-order MDPs
- S. Yoon, A. Fern, and R. Givan, "Inductive policy selection for first-order MDPs," in UAI'02, 2002.
- (2002) UAI'02
- Yoon, S.¹ Fern, A.² Givan, R.³

10
- 40949103747
- Symbolic learning for adaptive agents
- J. Cole, K. Lloyd, and K. Ng, "Symbolic learning for adaptive agents," in The Annual Partner Conference, Smart Internet Technology Cooperative Research Centre, 2003.
- (2003) The Annual Partner Conference, Smart Internet Technology Cooperative Research Centre
- Cole, J.¹ Lloyd, K.² Ng, K.³

11
- 84880803349
- Generalizing plans to new environments in relational MDPs
- C. Guestrin, D. Koller, C. Gearhart, and N. Kanodia, "Generalizing plans to new environments in relational MDPs," in IJCAI'03, 2003.
- (2003) IJCAI'03
- Guestrin, C.¹ Koller, D.² Gearhart, C.³ Kanodia, N.⁴

12
- 4444242181
- Logical markov decision programs
- K. Kersting and L. De Raedt, "Logical markov decision programs," in IJCAI'03 Workshop on Learning Statistical Models of Relational Data, 2003.
- (2003) IJCAI'03 Workshop on Learning Statistical Models of Relational Data
- Kersting, K.¹ De Raedt, L.²

13
- 4444326434
- Scaling up reinforcement learning with a relational representation
- Sydney
- E. Morales, "Scaling up reinforcement learning with a relational representation," in Proceedings of the Workshop on Adaptability in Multi-agent Systems at AORC'03, Sydney, 2003.
- (2003) Proceedings of the Workshop on Adaptability in Multi-agent Systems at AORC'03
- Morales, E.¹

14
- 14344249892
- Bellman goes relational
- K. Kersting, M. Van Otterlo, and L. De Raedt, "Bellman goes relational," in ICML'04, 2004.
- (2004) ICML'04
- Kersting, K.¹ Van Otterlo, M.² De Raedt, L.³

15
- 40949136351
- Reinforcement learning for relational MDPs
- M. Van Otterlo, "Reinforcement learning for relational MDPs," in Machine Learning Conference of Belgium and the Netherlands, 2004.
- (2004) Machine Learning Conference of Belgium and the Netherlands
- Van Otterlo, M.¹

16
- 31844440221
- Combining model-based and instance-based learning for first order regression
- K. Driessens and S. Džeroski, "Combining model-based and instance-based learning for first order regression," in Proceedings of the 22nd International Conference on Machine Learning, 2005, pp. 193-200.
- (2005) Proceedings of the 22nd International Conference on Machine Learning , pp. 193-200
- Driessens, K.¹ Džeroski, S.²

17
- 33745605805
- Convergence of reinforcement learning using a decision tree learner
- Bonn Germany
- J. Ramon, "Convergence of reinforcement learning using a decision tree learner," in Proceedings of the ICML'05 Workshop on Rich Representations for Reinforcement Learning, Bonn Germany, 2005.
- (2005) Proceedings of the ICML'05 Workshop on Rich Representations for Reinforcement Learning
- Ramon, J.¹

18
- 79551514274
- Relational sequence alignements
- T. Gärtner, G. C. Garriga, and T. Meinl, Eds, September
- A. Karwath and K. Kersting, "Relational sequence alignements," in Proceedings of The 4th International Workshop on Mining and Learning with Graphs (MLG '06), T. Gärtner, G. C. Garriga, and T. Meinl, Eds., September 2006.
- (2006) Proceedings of The 4th International Workshop on Mining and Learning with Graphs (MLG '06)
- Karwath, A.¹ Kersting, K.²

19
- 84880869367
- IJCAI
- C. Wang, S. Joshi, and R. Khardon, "First order decision diagrams for relational mdps," in IJCAI 2007, 2007.
- (2007) First order decision diagrams for relational mdps
- Wang, C.¹ Joshi, S.² Khardon, R.³

20
- 0004109057
- Springer-Verlag
- S.-H. Neinhuys-Cheng and R. de Wolf, Foundations of Inductive Logic Programming, vol. 1228 of Lecture Notes in Artifical Intelligence. Springer-Verlag, 1997.
- (1997) Foundations of Inductive Logic Programming, vol. 1228 of Lecture Notes in Artifical Intelligence
- Neinhuys-Cheng, S.-H.¹ de Wolf, R.²

21
- 0000826543
- Negation as failure
- K. Clark, "Negation as failure." in Logic and Data Bases, 1977, pp. 293-322.
- (1977) Logic and Data Bases , pp. 293-322
- Clark, K.¹

22
- 33750335125
- Challenges for relational reinforcement learning
- M. Van Otterlo and K. Kersting, "Challenges for relational reinforcement learning," in ICML'04 Workshop on Relational Reinforcement Learning, 2004.
- (2004) ICML'04 Workshop on Relational Reinforcement Learning
- Van Otterlo, M.¹ Kersting, K.²

23
- 0029333536
- An algorithm for probabilistic planning
- N. Kushmerick, S. Hanks, and D. Weld, "An algorithm for probabilistic planning," Artificial Intelligence, vol. 76, pp. 239-286, 1995.
- (1995) Artificial Intelligence , vol.76 , pp. 239-286
- Kushmerick, N.¹ Hanks, S.² Weld, D.³

24
- 22944490192
- Logical markov decision programs and the convergence of logical TD(λ)
- K. Kersting and L. De Raedt, "Logical markov decision programs and the convergence of logical TD(λ)," in Fourteenth International Conference on Inductive Logic Programming, 2004, pp. 180-197.
- (2004) Fourteenth International Conference on Inductive Logic Programming , pp. 180-197
- Kersting, K.¹ De Raedt, L.²

25
- 0342420590
- Blocks world revisited
- J. Slaney and S. Thiébaux, "Blocks world revisited," Artificial Intelligence, vol. 125, pp. 119-153, 2001.
- (2001) Artificial Intelligence , vol.125 , pp. 119-153
- Slaney, J.¹ Thiébaux, S.²

26
- 0033165498
- Reasoning about noisy sensors and effectors in the situation calculus
- F. Bacchus, J. Y. Halpern, and H. J. Levesque, "Reasoning about noisy sensors and effectors in the situation calculus," Artificial Intelligence, vol. 111, no. 1-2, pp. 171-208, 1999.
- (1999) Artificial Intelligence , vol.111 , Issue.1-2 , pp. 171-208
- Bacchus, F.¹ Halpern, J.Y.² Levesque, H.J.³

27
- 78651544767
- P. Mateus, A. Pacheco, and J. Pinto, Observations and the probabilistic situation calculus, in Proceedings of the 8th KR, 2002, pp. 327-340.
- P. Mateus, A. Pacheco, and J. Pinto, "Observations and the probabilistic situation calculus," in Proceedings of the 8th KR, 2002, pp. 327-340.

28
- 0346942368
- Decision-theoretic planning: Structural assumptions and computational leverage
- C. Boutilier, T. Dean, and S. Hanks, "Decision-theoretic planning: Structural assumptions and computational leverage," Journal of Artificial Intelligence Research (JAIR), vol. 11, pp. 1-94, 1999.
- (1999) Journal of Artificial Intelligence Research (JAIR) , vol.11 , pp. 1-94
- Boutilier, C.¹ Dean, T.² Hanks, S.³

29
- 0031187203
- The independent choice logic for modelling multiple agents under uncertainty
- D. Poole, "The independent choice logic for modelling multiple agents under uncertainty," Artificial Intelligence, vol. 94, no. 1-2, pp. 7-56, 1997.
- (1997) Artificial Intelligence , vol.94 , Issue.1-2 , pp. 7-56
- Poole, D.¹

30
- 84880891360
- Symbolic dynamic programming for first-order MDPs
- C. Boutilier, R. Reiter, and B. Price, "Symbolic dynamic programming for first-order MDPs," in Proceedings of the 17th IJCAI, 2001, pp. 690-700.
- (2001) Proceedings of the 17th IJCAI , pp. 690-700
- Boutilier, C.¹ Reiter, R.² Price, B.³

31
- 84880895882
- Decisino-theoretic, high-level agent programming in the situatino calculus
- C. Boutilier, R. Reiter, M. Soutchanski, and S. Thrun, "Decisino-theoretic, high-level agent programming in the situatino calculus," in Proceedings of the 17th AAAI/12th IAAI, 2000, pp. 355-362.
- (2000) Proceedings of the 17th AAAI/12th IAAI , pp. 355-362
- Boutilier, C.¹ Reiter, R.² Soutchanski, M.³ Thrun, S.⁴

32
- 0027694534
- Representing action and change by logic programs
- M. Gelfoun and V. Lifschitz, "Representing action and change by logic programs," Journal of Logical Programming, vol. 17, no. 2-4, pp. 301-321, 1993.
- (1993) Journal of Logical Programming , vol.17 , Issue.2-4 , pp. 301-321
- Gelfoun, M.¹ Lifschitz, V.²

33
- 0036923234
- Reasoning about actions in a probabilistic setting
- C. Baral, N. Tran, and L.-C. Tuan, "Reasoning about actions in a probabilistic setting," in Proceedings of the 18th AAAI/14th IAAI, 2002, pp. 507-512.
- (2002) Proceedings of the 18th AAAI/14th IAAI , pp. 507-512
- Baral, C.¹ Tran, N.² Tuan, L.-C.³

34
- 0002192119
- Input generalization in delayed reinforcement learning: An algorithm and performance comparisions
- D. Chapman and L. Kaelbling, "Input generalization in delayed reinforcement learning: An algorithm and performance comparisions," in Proccedings of the 12th International Joint Conference on Artificial Intelligence, 1991, pp. 726-731.
- (1991) Proccedings of the 12th International Joint Conference on Artificial Intelligence , pp. 726-731
- Chapman, D.¹ Kaelbling, L.²

35
- 0003932121
- PhD thesis, Computer Science Department, University of Rochester
- A. McCallum, Reinforcement Learning with Selective Perception and Hidden State. PhD thesis, Computer Science Department, University of Rochester, 1995.
- (1995) Reinforcement Learning with Selective Perception and Hidden State
- McCallum, A.¹

36
- 0029514510
- The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces
- A. Moore, "The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces," Machine Learning, vol. 21, no. 3, pp. 199-233, 1995.
- (1995) Machine Learning , vol.21 , Issue.3 , pp. 199-233
- Moore, A.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.