메뉴 건너뛰기




Volumn 2006, Issue , 2006, Pages 49-56

Relational temporal difference learning

Author keywords

[No Author keywords available]

Indexed keywords

DECISION THEORY; FUNCTION EVALUATION; GAME THEORY; HIERARCHICAL SYSTEMS; MARKOV PROCESSES; MULTI AGENT SYSTEMS;

EID: 33749265162     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (4)

References (18)
  • 5
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • Dietterich, T. G. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 7
    • 84948172455 scopus 로고    scopus 로고
    • Speeding up relational reinforcement learning through the use of an incremental first order decision tree learning
    • Freiburg, Germany
    • Driessens, K., Ramon, J., & Blockeel, H. (2001). Speeding up relational reinforcement learning through the use of an incremental first order decision tree learning. Proceedings of the Twelfth European Conference on Machine Learning (pp. 97-108). Freiburg, Germany.
    • (2001) Proceedings of the Twelfth European Conference on Machine Learning , pp. 97-108
    • Driessens, K.1    Ramon, J.2    Blockeel, H.3
  • 14
    • 85149834820 scopus 로고
    • Markov games as a framework for multi-agent reinforcement learning
    • New Brunswick, NJ: Morgan Kaufmann
    • Littman, M. L. (1994). Markov games as a framework for multi-agent reinforcement learning. Proceedings of the Eleventh International Conference on Machine Learning (pp. 157-163). New Brunswick, NJ: Morgan Kaufmann.
    • (1994) Proceedings of the Eleventh International Conference on Machine Learning , pp. 157-163
    • Littman, M.L.1
  • 16
    • 0033570798 scopus 로고    scopus 로고
    • A unified analysis of value-function-based reinforcement learning algorithms
    • Szepesvari, C., & Littman, M. (1999). A unified analysis of value-function-based reinforcement learning algorithms. Neural Computation, 11, 2017-2060.
    • (1999) Neural Computation , vol.11 , pp. 2017-2060
    • Szepesvari, C.1    Littman, M.2
  • 18
    • 0000985504 scopus 로고
    • TD-Gammon, a self-teaching backgammon program
    • Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program. Neural Computation, 6, 215-219.
    • (1994) Neural Computation , vol.6 , pp. 215-219
    • Tesauro, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.