메뉴 건너뛰기




Volumn , Issue , 2010, Pages 541-546

Free-energy-based reinforcement learning in a partially observable environment

Author keywords

[No Author keywords available]

Indexed keywords

ACTIVATION PATTERNS; EQUILIBRIUM FREE ENERGY; FUTURE OBSERVATIONS; HIGH-DIMENSIONAL; MARKOV DECISION PROCESSES; PARTIALLY OBSERVABLE ENVIRONMENTS; RESTRICTED BOLTZMANN MACHINE; VALUE FUNCTIONS;

EID: 84887013392     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (14)

References (7)
  • 1
    • 84880741298 scopus 로고    scopus 로고
    • Solving POMDPs with continuous or large discrete observation spaces
    • J. Hoey and P. Poupart. Solving POMDPs with continuous or large discrete observation spaces. In IJCAI, volume 19, page 1332, 2005.
    • (2005) IJCAI , vol.19 , pp. 1332
    • Hoey, J.1    Poupart, P.2
  • 2
    • 67349102783 scopus 로고    scopus 로고
    • Hierarchical POMDP controller optimization by likelihood maximization
    • M. Toussaint, L. Charlin, and P. Poupart. Hierarchical POMDP controller optimization by likelihood maximization. UAI, 2008.
    • (2008) UAI
    • Toussaint, M.1    Charlin, L.2    Poupart, P.3
  • 3
    • 32844474095 scopus 로고    scopus 로고
    • Reinforcement learning with factored states and actions
    • B. Sallans and G. E. Hinton. Reinforcement learning with factored states and actions. Journal of Machine Learning Research, 5:1063-1088, 2004.
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 1063-1088
    • Sallans, B.1    Hinton, G.E.2
  • 4
    • 0029250080 scopus 로고
    • Reinforcement learning of non-Markov decision processes
    • S. D. Whitehead and L. J. Lin. Reinforcement learning of non-Markov decision processes. Artificial Intelligence, 73:271-306, 1995.
    • (1995) Artificial Intelligence , vol.73 , pp. 271-306
    • Whitehead, S.D.1    Lin, L.J.2
  • 6
    • 0025503558 scopus 로고
    • Backpropagation through time: What it does and how to do it
    • P. J. Werbos. Backpropagation through time: what it does and how to do it. Proceedings of the IEEE, 78(10):1550-1560, 1990.
    • (1990) Proceedings of the IEEE , vol.78 , Issue.10 , pp. 1550-1560
    • Werbos, P.J.1
  • 7
    • 84899015857 scopus 로고    scopus 로고
    • Reinforcement learning with long short-term memory
    • B. Bakker. Reinforcement learning with long short-term memory. NIPS, 2:1475-1482, 2002.
    • (2002) NIPS , vol.2 , pp. 1475-1482
    • Bakker, B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.