메뉴 건너뛰기




Volumn , Issue , 2007, Pages 738-743

Utile distinctions for relational reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

CHANGING ENVIRONMENT; FUNCTION APPROXIMATION; REINFORCEMENT LEARNING AGENT; RELATIONAL LEARNING; RELATIONAL REINFORCEMENT LEARNING; RELATIONAL REPRESENTATIONS; STOCHASTIC SAMPLING; TEMPORAL SAMPLING;

EID: 79952415359     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (10)

References (19)
  • 1
    • 0002192119 scopus 로고    scopus 로고
    • Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
    • D. Chapman and L. P. Kaelbling. Input generalization in delayed reinforcement learning: An algorithm and performance comparisons. In Proceedings of IJCAI-91, 1991.
    • Proceedings of IJCAI-91, 1991
    • Chapman, D.1    Kaelbling, L.P.2
  • 2
    • 84948172455 scopus 로고    scopus 로고
    • Speeding up relational reinfocement learning through the use of an incremental first order decision tree learner
    • K. Driessens, J. Ramon, and H. Blockeel. Speeding up relational reinfocement learning through the use of an incremental first order decision tree learner. In Proceedings of ECML - European Conference on Machine Learning, pages 97-108, 2001.
    • (2001) Proceedings of ECML - European Conference on Machine Learning , pp. 97-108
    • Driessens, K.1    Ramon, J.2    Blockeel, H.3
  • 4
  • 5
    • 0035312760 scopus 로고    scopus 로고
    • Relational reinforcement learning
    • April
    • S. Dzeroski, L. De Raedt, and K. Driessens. Relational reinforcement learning. Machine Learning, 43(1/2):5-52, April 2001.
    • (2001) Machine Learning , vol.43 , Issue.1-2 , pp. 5-52
    • Dzeroski, S.1    De Raedt, L.2    Driessens, K.3
  • 8
    • 0032073263 scopus 로고    scopus 로고
    • Planning and acting in partially observable stochastic domains
    • L.P. Kaebling, M. L. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2):99-134, 1998.
    • (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
    • Kaebling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 11
    • 33750741792 scopus 로고    scopus 로고
    • Exploiting relational structure to understand publication patterns in high-energy physics
    • December
    • A. McGovern, L. Friedland, M. Hay, B. Gallagher, A. Fast, J. Neville, and D. Jensen. Exploiting relational structure to understand publication patterns in high-energy physics. SIGKDD Explorations, 5(2):165-172, December 2003.
    • (2003) SIGKDD Explorations , vol.5 , Issue.2 , pp. 165-172
    • McGovern, A.1    Friedland, L.2    Hay, M.3    Gallagher, B.4    Fast, A.5    Neville, J.6    Jensen, D.7
  • 12
    • 23044531386 scopus 로고    scopus 로고
    • Learning a tsume-go heuristic with TILDE
    • Proceedings of CG2000, the Second International Conference on Computers and Games, Springer-Verlag
    • J. Ramon, T. Francis, and H. Blockeel. Learning a tsume-go heuristic with TILDE. In Proceedings of CG2000, the Second International Conference on Computers and Games, volume 2063 of Lecture Notes in Computer Science, pages 151-169. Springer-Verlag, 2001.
    • (2001) Lecture Notes in Computer Science , vol.2063 , pp. 151-169
    • Ramon, J.1    Francis, T.2    Blockeel, H.3
  • 14
    • 22644449927 scopus 로고    scopus 로고
    • A study of two probabilistic methods for searching large spaces with ILP
    • A Srinivasan. A study of two probabilistic methods for searching large spaces with ILP. Data Mining and Knowledge Discovery, 3(1):95-123, 1999.
    • (1999) Data Mining and Knowledge Discovery , vol.3 , Issue.1 , pp. 95-123
    • Srinivasan, A.1
  • 17
    • 0031246271 scopus 로고    scopus 로고
    • Decision Tree Induction Based on Efficient Tree Restructuring
    • P. E. Utgoff, N.C. Berkman, and J. A. Clouse. Decision tree induction based on efficient tree restructuring. Machine Learning, 29:5-44, 1997. (Pubitemid 127507172)
    • (1997) Machine Learning , vol.29 , Issue.1 , pp. 5-44
    • Utgoff, P.E.1    Berkman, N.C.2    Clouse, J.A.3
  • 18
    • 37249061374 scopus 로고    scopus 로고
    • A survey of reinforcement learning in relational domains
    • Technical Report TR-CTIT-05-31, ISSN 1381-3625
    • M. van Otterlo. A survey of reinforcement learning in relational domains. Technical Report TR-CTIT-05-31, CTIT Technical Report Series, ISSN 1381-3625, 2005.
    • (2005) CTIT Technical Report Series
    • Van Otterlo, M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.