SCOPUS Information Retrieval Platform

Volume 4131 LNCS - I, 2006, Pages 830-839

Reinforcement learning with echo state networks

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; CONVERGENCE OF NUMERICAL METHODS; DECISION MAKING; MARKOV PROCESSES; NEURAL NETWORKS;

ECHO STATE NETWORKS; K-ORDER MARKOV DECISION PROCESSES; REINFORCEMENT LEARNING;

LEARNING SYSTEMS;

EID: 33749845123 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/11840817_86 Document Type: Conference Paper

Times cited: 55

References (17)

1
- 0004102479
- MIT Press, Cambridge, MA
- Sutton, R., Barto, A.G.: Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA (1998)
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.G.²

2
- 57649089060
- Gordon, G.J.: Chattering in SARSA (lambda) - a CMU Learning Lab Internal Report (1996)
- (1996) Chattering in SARSA (Lambda) - A CMU Learning Lab Internal Report
- Gordon, G.J.¹

4
- 1842421269
- Harnessing nonlinearity: Predicting chaotic systems and saving energy in wireless telecommunication
- Jaeger, H., Haas, H.: Harnessing nonlinearity: predicting chaotic systems and saving energy in wireless telecommunication. Science (2004) 78-80
- (2004) Science , pp. 78-80
- Jaeger, H.¹ Haas, H.²

5
- 0024702037
- A parallel network that learns to play backgammon
- Tesauro, G., Sejnowski, T.J.: A parallel network that learns to play backgammon. Artificial Intelligence 39 (1989) 357-390
- (1989) Artificial Intelligence , vol.39 , pp. 357-390
- Tesauro, G.¹ Sejnowski, T.J.²

6
- 33749859740
- Athena Scientific, Belmont, MA
- Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific, Belmont, MA (1996)
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

8
- 26444565569
- Finding structure in time
- Elman, J.L.: Finding structure in time, Cognitive Science 14 (1990) 179-211
- (1990) Cognitive Science , vol.14 , pp. 179-211
- Elman, J.L.¹

10
- 84899015857
- Reinforcement learning with long short-term memory
- Bakker, B.: Reinforcement learning with long short-term memory. Advances in Neural Information Processing Systems 14 (2002) 1475-1482
- (2002) Advances in Neural Information Processing Systems , vol.14 , pp. 1475-1482
- Bakker, B.¹

11
- 18444387230
- PhD thesis, Universiteit Leiden
- Bakker, P.B.: The State of Mind - Reinforcement Learning with Recurrent Neural Networks. PhD thesis, Universiteit Leiden (2004)
- (2004) The State of Mind - Reinforcement Learning with Recurrent Neural Networks
- Bakker, P.B.¹

13
- 85151728371
- Residual algorithms: Reinforcement learning with function approximation
- Baird, L.C.: Residual algorithms: Reinforcement learning with function approximation. In: International Conference on Machine Learning. (1995) 30-37
- (1995) International Conference on Machine Learning , pp. 30-37
- Baird, L.C.¹

14
- 0004049893
- PhD thesis, Cambridge University, Cambridge, UK
- Watkins, C.J.C.H.: Learning from Delayed Rewards. PhD thesis, Cambridge University, Cambridge, UK (1989)
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

16
- 0028564629
- Acting optimally in partially observable stochastic domains
- Kaelbling, L.P., Cassandra, A.R., Littman, M.L.: Acting optimally in partially observable stochastic domains. In: Proc. of the 12th Nat'l Conf. on Artif. Intell. (1994)
- (1994) Proc. of the 12th Nat'l Conf. on Artif. Intell.
- Kaelbling, L.P.¹ Cassandra, A.R.² Littman, M.L.³

17
- 0003584577
- Prentice-Hall, Englewood Cliffs, New Jersey
- Russell, S.J., Norvig, P.: Artificial Intelligence: A Modern Approach. Prentice-Hall, Englewood Cliffs, New Jersey (1994)
- (1994) Artificial Intelligence: A Modern Approach
- Russell, S.J.¹ Norvig, P.²

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.