SCOPUS Information Search Platform

Proceedings of the 18th European Symposium on Artificial Neural Networks - Computational Intelligence and Machine Learning, ESANN 2010

2010, Pages 541-546

Free-energy-based reinforcement learning in a partially observable environment

Authors (3): Otsuka, Makoto a,b; Yoshimoto, Junichiro a,b; Doya, Kenji a,b

a OKINAWA INSTITUTE OF SCIENCE AND TECHNOLOGY GRADUATE UNIVERSITY (Japan)

b NARA INSTITUTE OF SCIENCE AND TECHNOLOGY (Japan)

Author keywords

[No Author keywords available]

Indexed keywords

ACTIVATION PATTERNS; EQUILIBRIUM FREE ENERGY; FUTURE OBSERVATIONS; HIGH-DIMENSIONAL; MARKOV DECISION PROCESSES; PARTIALLY OBSERVABLE ENVIRONMENTS; RESTRICTED BOLTZMANN MACHINE; VALUE FUNCTIONS;

ARTIFICIAL INTELLIGENCE; LEARNING ALGORITHMS; MARKOV PROCESSES; RECURRENT NEURAL NETWORKS; REINFORCEMENT LEARNING;

FREE ENERGY;

EID: 84887013392
PISSN: None
EISSN: None
DOI: None
Source Type: Conference Proceeding
Document Type: Conference Paper

Times cited: 14

References (7)

1
- 84880741298
- Solving POMDPs with continuous or large discrete observation spaces
- J. Hoey and P. Poupart. Solving POMDPs with continuous or large discrete observation spaces. In IJCAI, volume 19, page 1332, 2005.
- (2005) IJCAI , vol.19 , pp. 1332
- Hoey, J.¹ Poupart, P.²

2
- 67349102783
- Hierarchical POMDP controller optimization by likelihood maximization
- M. Toussaint, L. Charlin, and P. Poupart. Hierarchical POMDP controller optimization by likelihood maximization. UAI, 2008.
- (2008) UAI
- Toussaint, M.¹ Charlin, L.² Poupart, P.³

3
- 32844474095
- Reinforcement learning with factored states and actions
- B. Sallans and G. E. Hinton. Reinforcement learning with factored states and actions. Journal of Machine Learning Research, 5:1063-1088, 2004.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 1063-1088
- Sallans, B.¹ Hinton, G.E.²

4
- 0029250080
- Reinforcement learning of non-Markov decision processes
- S. D. Whitehead and L. J. Lin. Reinforcement learning of non-Markov decision processes. Artificial Intelligence, 73:271-306, 1995.
- (1995) Artificial Intelligence , vol.73 , pp. 271-306
- Whitehead, S.D.¹ Lin, L.J.²

5
- 0004007508
- Reinforcement Learning
- R. S. Sutton and A. G. Barto. Reinforcement Learning. MIT Press, 1998.
- (1998) Reinforcement Learning
- Sutton, R.S.¹ Barto, A.G.²

6
- 0025503558
- Backpropagation through time: What it does and how to do it
- P. J. Werbos. Backpropagation through time: what it does and how to do it. Proceedings of the IEEE, 78(10):1550-1560, 1990.
- (1990) Proceedings of the IEEE , vol.78 , Issue.10 , pp. 1550-1560
- Werbos, P.J.¹

7
- 84899015857
- Reinforcement learning with long short-term memory
- B. Bakker. Reinforcement learning with long short-term memory. NIPS, 2:1475-1482, 2002.
- (2002) NIPS , vol.2 , pp. 1475-1482
- Bakker, B.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.