SCOPUS Information Retrieval Platform

Machine Learning

Volume 57, Issue 3, 2004, Pages 271-304

Integrating guidance into relational reinforcement learning

(2) Driessens, Kurt a Džeroski, Sašo b

a UNIVERSITY OF LEUVEN (Belgium)

b JOŽEF STEFAN INSTITUTE (Slovenia)

Author keywords

Guided exploration; Reinforcement learning; Relational learning

Indexed keywords

DECISION TREES; GUIDED EXPLORATIONS; RELATIONAL REINFORCEMENT LEARNING (RRL); STRUCTURAL DOMAINS;

ALGORITHMS; DATA STORAGE EQUIPMENT; DECISION THEORY; DISTRIBUTED DATABASE SYSTEMS; LEARNING SYSTEMS; NEURAL NETWORKS; PARAMETER ESTIMATION; TREES (MATHEMATICS);

RELATIONAL DATABASE SYSTEMS;

EID: 4444312102 PISSN: 08856125 EISSN: None Source Type: Journal
DOI: 10.1023/B:MACH.0000039779.47329.3a Document Type: Conference Paper

Times cited : (79)

References (45)

1
- 0025725905
- Instance-based learning algorithms
- Aha, D. W., Kibler, D., & Albert, M. K. (1991). Instance-based learning algorithms. Machine Learning, 6:1, 37-66.
- (1991) Machine Learning , vol.6 , Issue.1 , pp. 37-66
- Aha, D.W.¹ Kibler, D.² Albert, M.K.³

2
- 1942450674
- A framework for behavioral cloning
- S. Muggleton, K. Furukawa, & D. Michie (Eds.). Oxford University Press
- Bain, M., & Sammut, C. (1995). A framework for behavioral cloning. In S. Muggleton, K. Furukawa, & D. Michie (Eds.), Machine Intelligence, vol. 15. Oxford University Press.
- (1995) Machine Intelligence , vol.15
- Bain, M.¹ Sammut, C.²

3
- 0003487482
- Athena Scientific
- Bertsekas, & Tsitsiklis (1996). Neuro-dynamic programming. Athena Scientific.
- (1996) Neuro-dynamic Programming
- Bertsekas¹ Tsitsiklis²

4
- 0032069371
- Top-down induction of first order logical decision trees
- Blockeel, H., & De Raedt, L. (1998). Top-down induction of first order logical decision trees. Artificial Intelligence, 101:1/2, 285-297.
- (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 285-297
- Blockeel, H.¹ De Raedt, L.²

5
- 0003802343
- Belmont: Wadsworth
- Breiman, L., Friedman, J., Olshen, R., & Stone, C. (1984). Classification and regression trees. Belmont: Wadsworth.
- (1984) Classification and Regression Trees
- Breiman, L.¹ Friedman, J.² Olshen, R.³ Stone, C.⁴

6
- 0347592018
- Man-machine co-operation on a learning task
- Chambers, R. A., & Michie, D. (1969). Man-machine co-operation on a learning task. Computer Graphics: Techniques and Applications (pp. 179-186).
- (1969) Computer Graphics: Techniques and Applications , pp. 179-186
- Chambers, R.A.¹ Michie, D.²

7
- 0002192119
- Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
- Chapman, D., & Kaelbling, L. P. (1991). Input generalization in delayed reinforcement learning: An algorithm and performance comparisons. In Proceedings of the 12th International Joint Conference on Artificial Intelligence (pp. 726-731).
- (1991) Proceedings of the 12th International Joint Conference on Artificial Intelligence , pp. 726-731
- Chapman, D.¹ Kaelbling, L.P.²

8
- 0031198976
- Logical settings for concept learning
- De Raedt, L. (1997). Logical settings for concept learning. Artificial Intelligence, 95, 187-201.
- (1997) Artificial Intelligence , vol.95 , pp. 187-201
- De Raedt, L.¹

9
- 0028529173
- First order jk-clausal theories are PAC-learnable
- De Raedt, L., & Džeroski, S. (1994). First order jk-clausal theories are PAC-learnable. Artificial Intelligence, 70, 375-392.
- (1994) Artificial Intelligence , vol.70 , pp. 375-392
- De Raedt, L.¹ Džeroski, S.²

10
- 0003979861
- Incorporating prior knowledge and previously learned information into reinforcement learning agents
- Institute for Complex Engineered Systems, Carnegie Mellon University
- Dixon, K., Malak, R., & Khosla, P. (2000). Incorporating prior knowledge and previously learned information into reinforcement learning agents. Technical report, Institute for Complex Engineered Systems, Carnegie Mellon University.
- (2000) Technical Report
- Dixon, K.¹ Malak, R.² Khosla, P.³

11
- 4444267543
- Learning digger using hierarchical reinforcement learning for concurrent goals
- Onderwijsinstituut CKI, University of Utrecht
- Driessens, K., & Blockeel, H. (2001). Learning digger using hierarchical reinforcement learning for concurrent goals. In Proceedings of the 5th European Workshop on Reinforcement Learning (pp. 11-12). Onderwijsinstituut CKI, University of Utrecht.
- (2001) Proceedings of the 5th European Workshop on Reinforcement Learning , pp. 11-12
- Driessens, K.¹ Blockeel, H.²

12
- 1942421161
- Relational instance based regression for relational reinforcement learning
- Submitted to
- Driessens, K., & Ramon, J. (2003). Relational instance based regression for relational reinforcement learning. In Submitted to ICML 2003.
- (2003) ICML 2003
- Driessens, K.¹ Ramon, J.²

13
- 84948172455
- Speeding up relational reinforcement learning through the use of an incremental first order decision tree learner
- Springer-Verlag
- Driessens, K., Ramon, J., & Blockeel, H. (2001). Speeding up relational reinforcement learning through the use of an incremental first order decision tree learner. In Proceedings of the 13th European Conference on Machine Learning (pp. 97-108). Springer-Verlag.
- (2001) Proceedings of the 13th European Conference on Machine Learning , pp. 97-108
- Driessens, K.¹ Ramon, J.² Blockeel, H.³

14
- 0242445843
- Relational reinforcement learning
- J. Shavlik (Ed.). Morgan Kaufmann
- Džeroski, S., De Raedt, L., & Blockeel, H. (1998). Relational reinforcement learning. In J. Shavlik (Ed.), Proceedings of the 15th International Conference on Machine Learning (ICML'98) (pp. 136-143). Morgan Kaufmann.
- (1998) Proceedings of the 15th International Conference on Machine Learning (ICML'98) , pp. 136-143
- Džeroski, S.¹ De Raedt, L.² Blockeel, H.³

15
- 0035312760
- Relational reinforcement learning
- Džeroski, S., De Raedt, L., & Driessens, K. (2001). Relational reinforcement learning. Machine Learning, 43, 7-52.
- (2001) Machine Learning , vol.43 , pp. 7-52
- Džeroski, S.¹ De Raedt, L.² Driessens, K.³

16
- 0002479240
- Relational instance-based learning
- L. Saitta (Ed.). Morgan Kaufmann
- Emde, W., & Wettschereck, D. (1996). Relational instance-based learning. In L. Saitta (Ed.), Proceedings of the Thirteenth International Conference on Machine Learning (pp. 122-130). Morgan Kaufmann.
- (1996) Proceedings of the Thirteenth International Conference on Machine Learning , pp. 122-130
- Emde, W.¹ Wettschereck, D.²

17
- 58349113822
- Approximate policy iteration with a policy language bias
- T. S., L. Saul, & B. Schölkopf (Eds.). The MIT Press
- Fern, A., Yoon, S., & Givan, R. (2003). Approximate policy iteration with a policy language bias. In T. S., L. Saul, & B. Schölkopf (Eds.), Proceedings of the Seventeenth Annual Conference on Neural Information Processing Systems. The MIT Press.
- (2003) Proceedings of the Seventeenth Annual Conference on Neural Information Processing Systems
- Fern, A.¹ Yoon, S.² Givan, R.³

18
- 84881576361
- STRIPS: A new approach to the application of theorem proving to problem solving
- Edinburgh, Scotland
- Fikes, R. E., & Nilsson, N. J. (1971). STRIPS: A new approach to the application of theorem proving to problem solving. In Advance Papers of the Second International Joint Conference on Artificial Intelligence (pp. 608-620). Edinburgh, Scotland.
- (1971) Advance Papers of the Second International Joint Conference on Artificial Intelligence , pp. 608-620
- Fikes, R.E.¹ Nilsson, N.J.²

19
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L., Littman, M., & Moore, A. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.¹ Littman, M.² Moore, A.³

20
- 4444242181
- Logical Markov decision programs
- Kersting, K., & De Raedt, L. (2003). Logical Markov decision programs. In Proceedings of the IJCAI'03 Workshop on Learning Statistical Models of Relational Data (pp. 63-70).
- (2003) Proceedings of the IJCAI'03 Workshop on Learning Statistical Models of Relational Data , pp. 63-70
- Kersting, K.¹ De Raedt, L.²

21
- 4444283088
- Distance based approaches to relational learning and clustering
- S. Džeroski and N. Lavrač (Eds.). Springer-Verlag
- Kirsten, M., Wrobel, S., & Horvath, T. (2001). Distance based approaches to relational learning and clustering. In S. Džeroski and N. Lavrač (Eds.), Relational data mining (pp. 213-232). Springer-Verlag.
- (2001) Relational Data Mining , pp. 213-232
- Kirsten, M.¹ Wrobel, S.² Horvath, T.³

22
- 0030349752
- Structural regression trees
- Cambridge/Menlo Park. AAAI Press/MIT Press
- Kramer, S. (1996). Structural regression trees. In Proceedings of the Thirteenth National Conference on Artificial Intelligence (pp. 812-819). Cambridge/Menlo Park. AAAI Press/MIT Press.
- (1996) Proceedings of the Thirteenth National Conference on Artificial Intelligence , pp. 812-819
- Kramer, S.¹

23
- 35048819671
- Least-squares methods in reinforcement learning for control
- Springer
- Lagoudakis, M., Parr, R., & Littman, M. (2002). Least-squares methods in reinforcement learning for control. In Proceedings of the 2nd Hellenic Conference on Artificial Intelligence (SETN-02) (pp. 249-260), Springer.
- (2002) Proceedings of the 2nd Hellenic Conference on Artificial Intelligence (SETN-02) , pp. 249-260
- Lagoudakis, M.¹ Parr, R.² Littman, M.³

24
- 0004080766
- Ellis Horwood
- Lavrač, N., & Džeroski, S. (1994). Inductive logic programming: Techniques and applications. Ellis Horwood.
- (1994) Inductive Logic Programming: Techniques and Applications
- Lavrač, N.¹ Džeroski, S.²

25
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- Lin, L.-J. (1992). Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning, 8, 293-321.
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.-J.¹

26
- 4444326434
- Scaling up reinforcement learning with a relational representation
- Morales, E. (2003). Scaling up reinforcement learning with a relational representation. In Proc. of the Workshop on Adaptability in Multi-agent Systems (pp. 15-26).
- (2003) Proc. of the Workshop on Adaptability in Multi-agent Systems , pp. 15-26
- Morales, E.¹

27
- 0028429573
- Inductive logic programming: Theory and methods
- Muggleton, S., & De Raedt, L. (1994). Inductive logic programming: Theory and methods. Journal of Logic Programming, 19/20, 629-679.
- (1994) Journal of Logic Programming , vol.19-20 , pp. 629-679
- Muggleton, S.¹ De Raedt, L.²

28
- 0003500248
- Morgan Kaufmann series in machine learning Morgan Kaufmann
- Quinlan, J. R. (1993). C4.5: Programs for Machine Learning, Morgan Kaufmann series in machine learning. Morgan Kaufmann.
- (1993) C4.5: Programs for Machine Learning
- Quinlan, J.R.¹

29
- 4444243659
- Ph.D. thesis, Department of Computer Science, K.U. Leuven
- Ramon, J. (2002). Clustering and instance based learning in first order logic. Ph.D. thesis, Department of Computer Science, K.U. Leuven.
- (2002) Clustering and Instance Based Learning in First Order Logic
- Ramon, J.¹

30
- 0035402326
- A polynomial time computable metric between point sets
- Ramon, J., & Bruynooghe, M. (2001). A polynomial time computable metric between point sets. Acta Informatica, 37, 765-780.
- (2001) Acta Informatica , vol.37 , pp. 765-780
- Ramon, J.¹ Bruynooghe, M.²

31
- 0003444646
- ed. w/PDP Research Group, Cambridge, MA: MIT Press
- Rumelhart, D. E., & McClelland, J. L. (1986). Parallel distributed processing: Foundations (ed. w/PDP Research Group), vol. 1, Cambridge, MA: MIT Press.
- (1986) Parallel Distributed Processing: Foundations , vol.1
- Rumelhart, D.E.¹ McClelland, J.L.²

32
- 4444242180
- Why experimentation can be better than "perfect guidance"
- Morgan Kaufmann
- Scheffer, T., Greiner, R., & Darken, C. (1997). Why experimentation can be better than "Perfect Guidance". In Proceedings of the 14th International Conference on Machine Learning (pp. 331-339). Morgan Kaufmann.
- (1997) Proceedings of the 14th International Conference on Machine Learning , pp. 331-339
- Scheffer, T.¹ Greiner, R.² Darken, C.³

33
- 0034819986
- Using background knowledge to speed reinforcement learning in physical agents
- Association for Computing Machinery
- Shapiro, D., Langley, P., & Shachter, R. (2001). Using background knowledge to speed reinforcement learning in physical agents. In Proceedings of the 5th International Conference on Autonomous Agents. Association for Computing Machinery.
- (2001) Proceedings of the 5th International Conference on Autonomous Agents
- Shapiro, D.¹ Langley, P.² Shachter, R.³

34
- 0001898381
- Practical reinforcement learning in continuous spaces
- Morgan Kaufmann
- Smart, W. D., & Kaelbling, L. P. (2000). Practical reinforcement learning in continuous spaces. In Proceedings of the 17th International Conference on Machine Learning (pp. 903-910). Morgan Kaufmann.
- (2000) Proceedings of the 17th International Conference on Machine Learning , pp. 903-910
- Smart, W.D.¹ Kaelbling, L.P.²

35
- 85156221438
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- Cambridge, MA: The MIT Press
- Sutton, R. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. In Proceedings of the 8th Conference on Advances in Neural Information Processing Systems (pp. 1038-1044). Cambridge, MA: The MIT Press.
- (1996) Proceedings of the 8th Conference on Advances in Neural Information Processing Systems , pp. 1038-1044
- Sutton, R.¹

36
- 0004102479
- Cambridge, MA: The MIT Press
- Sutton, R., & Barto, A. (1998). Reinforcement learning: An introduction. Cambridge, MA: The MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

37
- 4444285247
- Learning models of control skills: Phenomena, results and problems
- IFAC
- Urbancic, T., Bratko, I., & Sammut, C. (1996). Learning models of control skills: Phenomena, results and problems. In Proceedings of the 13th Triennial World Congress of the International Federation of Automatic Control (pp. 391-396). IFAC.
- (1996) Proceedings of the 13th Triennial World Congress of the International Federation of Automatic Control , pp. 391-396
- Urbancic, T.¹ Bratko, I.² Sammut, C.³

38
- 0031246271
- Decision tree induction based on efficient tree restructuring
- Utgoff, P., Berkman, N., & Clouse, J. (1997). Decision tree induction based on efficient tree restructuring. Machine Learning, 29:1, 5-44.
- (1997) Machine Learning , vol.29 , Issue.1 , pp. 5-44
- Utgoff, P.¹ Berkman, N.² Clouse, J.³

39
- 4444306027
- How to upgrade propositional learners to first order logic: A case study
- S. Džeroski, & N. Lavrač (Eds.). Springer-Verlag
- Van Laer, W., & De Raedt, L. (2001). How to upgrade propositional learners to first order logic: A case study. In S. Džeroski, & N. Lavrač (Eds.), Relational Data Mining (pp. 235-261). Springer-Verlag.
- (2001) Relational Data Mining , pp. 235-261
- Van Laer, W.¹ De Raedt, L.²

40
- 40949136351
- Reinforcement learning for relational MDPs
- van Otterlo, M. (2004). Reinforcement learning for relational MDPs. In Proceedings of the Machine Learning Conference of Belgium and the Netherlands 2004.
- (2004) Proceedings of the Machine Learning Conference of Belgium and the Netherlands 2004
- Van Otterlo, M.¹

41
- 0015960104
- The string to string correction problem
- Wagner, R., & Fischer, M. (1974). The string to string correction problem. Journal of the ACM, 21(1), 168-173.
- (1974) Journal of the ACM , vol.21 , Issue.1 , pp. 168-173
- Wagner, R.¹ Fischer, M.²

42
- 84969334117
- Learning by observation and practice: An incremental approach for planning operator acquisition
- Wang, X. (1995). Learning by observation and practice: An incremental approach for planning operator acquisition. In Proceedings of the 12th International Conference on Machine Learning (pp. 549-557).
- (1995) Proceedings of the 12th International Conference on Machine Learning , pp. 549-557
- Wang, X.¹

43
- 0004049893
- Ph.D. thesis, King's College, Cambridge
- Watkins, C. (1989). Learning from delayed rewards. Ph.D. thesis, King's College, Cambridge.
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

44
- 0345161977
- Ph.D. thesis, University of Amsterdam
- Wiering, M. (1999). Explorations in efficient reinforcement learning. Ph.D. thesis, University of Amsterdam.
- (1999) Explorations in Efficient Reinforcement Learning
- Wiering, M.¹

45
- 13444310066
- Inductive policy selection for first order MDPs
- Yoon, S., Fern, A., & Givan, R. (2002). Inductive policy selection for first order MDPs. In Proceedings of UAI'02.
- (2002) Proceedings of UAI'02
- Yoon, S.¹ Fern, A.² Givan, R.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.