메뉴 건너뛰기




Volumn , Issue , 2005, Pages 461-466

Toward a topological theory of relational reinforcement learning for navigation tasks

Author keywords

[No Author keywords available]

Indexed keywords

CLOSED-FORM ENVELOPE; MARKOV DECISION PROCESS (MDP); RELATIONAL REINFORCEMENT LEARNING;

EID: 32844454618     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (8)

References (17)
  • 2
    • 0034248853 scopus 로고    scopus 로고
    • Stochastic dynamic programming with factored representations
    • Boutilier, C.; Dearden, R.; and Goldszmidt, M. 2000. Stochastic dynamic programming with factored representations. Artificial Intelligence 121(1-2):49-107.
    • (2000) Artificial Intelligence , vol.121 , Issue.1-2 , pp. 49-107
    • Boutilier, C.1    Dearden, R.2    Goldszmidt, M.3
  • 5
    • 84942867726 scopus 로고    scopus 로고
    • An overview of MAXQ hierarchical reinforcement learning
    • Choueiry, B. Y., and Walsh, T., eds., Lecture Notes in Artificial Intelligence. New York: Springer Verlag
    • Dietterich, T. G. 2000. An overview of MAXQ hierarchical reinforcement learning. In Choueiry, B. Y., and Walsh, T., eds., Proceedings of the Symposium on Abstraction, Reformulation and Approximation (SARA 2000), Lecture Notes in Artificial Intelligence. New York: Springer Verlag.
    • (2000) Proceedings of the Symposium on Abstraction, Reformulation and Approximation (SARA 2000)
    • Dietterich, T.G.1
  • 7
    • 0041779094 scopus 로고    scopus 로고
    • Learning Probabilistic Relational Models
    • Dzeroski, S. and Lavrac, N., eds.. Springer-Verlag
    • Getoor, L.; Friedman, N.; Koller, D.; and Pfeffer, A. 2001. Learning Probabilistic Relational Models. In Dzeroski, S. and Lavrac, N., eds., Relational Data Mining. Springer-Verlag.
    • (2001) Relational Data Mining
    • Getoor, L.1    Friedman, N.2    Koller, D.3    Pfeffer, A.4
  • 12
    • 0003392384 scopus 로고    scopus 로고
    • Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst, MA
    • Precup, D. 2000. Temporal Abstraction in Reinforcement Learning. Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst, MA.
    • (2000) Temporal Abstraction in Reinforcement Learning
    • Precup, D.1
  • 15
    • 1942484796 scopus 로고    scopus 로고
    • Relativized options: Choosing the right transformation
    • Fawcett, T., and Mishra, N., eds. Washington, DC: AAAI Press
    • Ravindran, B., and Barto, A. G. 2003. Relativized options: Choosing the right transformation. In Fawcett, T., and Mishra, N., eds., Proceedings of the Twentieth International Conference on Machine Learning, 608-615. Washington, DC: AAAI Press.
    • (2003) Proceedings of the Twentieth International Conference on Machine Learning , pp. 608-615
    • Ravindran, B.1    Barto, A.G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.