메뉴 건너뛰기




Volumn 3, Issue 9, 2008, Pages 29-38

Agent learning in relational domains based on logical MDPs with negation

Author keywords

Logical MDPs with negation; Relational reineforcement learning; State refinement; ( ) Learning

Indexed keywords

GROUND STATE; MARKOV PROCESSES; REINFORCEMENT LEARNING;

EID: 72949090743     PISSN: 1796203X     EISSN: None     Source Type: Journal    
DOI: 10.4304/jcp.3.9.29-38     Document Type: Article
Times cited : (3)

References (36)
  • 4
    • 33645712749 scopus 로고    scopus 로고
    • LEAD: A methodology for learning efficient approaches to medical diagnosis
    • S. Fakih and T. Das, "LEAD: a methodology for learning efficient approaches to medical diagnosis," IEEE Transactions on Information Technology in Biomedicine, vol. 10, no. 2, pp. 220-228, 2006.
    • (2006) IEEE Transactions on Information Technology in Biomedicine , vol.10 , Issue.2 , pp. 220-228
    • Fakih, S.1    Das, T.2
  • 5
    • 33646492472 scopus 로고    scopus 로고
    • An efficient dynamic algorithm for maintaining all-pairs shortest paths in stochastic networks
    • S. Misra and B. Oommen, "An efficient dynamic algorithm for maintaining all-pairs shortest paths in stochastic networks," IEEE Transactions on Computers, vol. 55, no. 6, pp. 686-702, 2006.
    • (2006) IEEE Transactions on Computers , vol.55 , Issue.6 , pp. 686-702
    • Misra, S.1    Oommen, B.2
  • 7
    • 37249061374 scopus 로고    scopus 로고
    • A survey of reinforcement learning in relational domains,
    • 1381-3625, July 2005
    • M. Van Otterlo, "A survey of reinforcement learning in relational domains," CTIT Technical Report Series ISSN 1381-3625, July 2005.
    • CTIT Technical Report Series ISSN
    • Van Otterlo, M.1
  • 9
    • 13444310066 scopus 로고    scopus 로고
    • Inductive policy selection for first-order MDPs
    • S. Yoon, A. Fern, and R. Givan, "Inductive policy selection for first-order MDPs," in UAI'02, 2002.
    • (2002) UAI'02
    • Yoon, S.1    Fern, A.2    Givan, R.3
  • 11
  • 21
    • 0000826543 scopus 로고
    • Negation as failure
    • K. Clark, "Negation as failure." in Logic and Data Bases, 1977, pp. 293-322.
    • (1977) Logic and Data Bases , pp. 293-322
    • Clark, K.1
  • 23
  • 26
    • 0033165498 scopus 로고    scopus 로고
    • Reasoning about noisy sensors and effectors in the situation calculus
    • F. Bacchus, J. Y. Halpern, and H. J. Levesque, "Reasoning about noisy sensors and effectors in the situation calculus," Artificial Intelligence, vol. 111, no. 1-2, pp. 171-208, 1999.
    • (1999) Artificial Intelligence , vol.111 , Issue.1-2 , pp. 171-208
    • Bacchus, F.1    Halpern, J.Y.2    Levesque, H.J.3
  • 27
    • 78651544767 scopus 로고    scopus 로고
    • P. Mateus, A. Pacheco, and J. Pinto, Observations and the probabilistic situation calculus, in Proceedings of the 8th KR, 2002, pp. 327-340.
    • P. Mateus, A. Pacheco, and J. Pinto, "Observations and the probabilistic situation calculus," in Proceedings of the 8th KR, 2002, pp. 327-340.
  • 29
    • 0031187203 scopus 로고    scopus 로고
    • The independent choice logic for modelling multiple agents under uncertainty
    • D. Poole, "The independent choice logic for modelling multiple agents under uncertainty," Artificial Intelligence, vol. 94, no. 1-2, pp. 7-56, 1997.
    • (1997) Artificial Intelligence , vol.94 , Issue.1-2 , pp. 7-56
    • Poole, D.1
  • 32
    • 0027694534 scopus 로고
    • Representing action and change by logic programs
    • M. Gelfoun and V. Lifschitz, "Representing action and change by logic programs," Journal of Logical Programming, vol. 17, no. 2-4, pp. 301-321, 1993.
    • (1993) Journal of Logical Programming , vol.17 , Issue.2-4 , pp. 301-321
    • Gelfoun, M.1    Lifschitz, V.2
  • 36
    • 0029514510 scopus 로고
    • The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces
    • A. Moore, "The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces," Machine Learning, vol. 21, no. 3, pp. 199-233, 1995.
    • (1995) Machine Learning , vol.21 , Issue.3 , pp. 199-233
    • Moore, A.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.