SCOPUS Information Search Platform

Neurocomputing

Volume 70, Issue 1-3, 2006, Pages 21-34

Influence zones: A strategy to enhance reinforcement learning

Authors (2): Braga, Arthur Plínio de S.ᵃ; Araújo, Aluízio F. R.ᵇ

a FEDERAL UNIVERSITY OF CEARÁ (Brazil)

b FEDERAL UNIVERSITY OF PERNAMBUCO (Brazil)

Author keywords

Instantaneous topological map; Learning acceleration; Reinforcement learning; Self organizing map

Indexed keywords

ESTIMATION; LEARNING ALGORITHMS; LEARNING SYSTEMS; SELF ORGANIZING MAPS; TOPOLOGY;

INSTANTANEOUS TOPOLOGICAL MAP; LEARNING ACCELERATION; REINFORCEMENT LEARNING; VALUE FUNCTION ESTIMATION;

DECISION MAKING;

ALGORITHM; ANALYTICAL ERROR; ARTICLE; CONTROLLED STUDY; DECISION MAKING; INTERMETHOD COMPARISON; LEARNING; PERFORMANCE; PRIORITY JOURNAL; REINFORCEMENT;

EID: 33750404375
PISSN: 09252312
EISSN: None
Source Type: Journal
DOI: 10.1016/j.neucom.2006.07.010
Document Type: Article

Times cited: 14

References (32)

1
- 0003787146
- Princeton University Press, Princeton, NJ
- Bellman R. Dynamic Programming (1957), Princeton University Press, Princeton, NJ
- (1957) Dynamic Programming
- Bellman, R.¹

2
- 0003487482
- Athena Scientific, Belmont, MA
- Bertsekas D.P., and Tsitsiklis J.N. Neuro-Dynamic Programming (1996), Athena Scientific, Belmont, MA
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

3
- 0346342550
- A topological reinforcement learning agent for navigation
- Braga A.P.S., and Araújo A.F.R. A topological reinforcement learning agent for navigation. Neural Comput. Appl. 12 3-4 (2003) 220-236
- (2003) Neural Comput. Appl. , vol.12 , Issue.3-4 , pp. 220-236
- Braga, A.P.S.¹ Araújo, A.F.R.²

4
- 85135470835
- A growing neural gas network learns topologies
- Fritzke B. A growing neural gas network learns topologies. Adv. Neural Inform. Process. Syst. 7 (1995) 625-632
- (1995) Adv. Neural Inform. Process. Syst. , vol.7 , pp. 625-632
- Fritzke, B.¹

5
- 0012532267
- Unsupervised ontogenetic networks
- Fiesler E., and Beale R. (Eds), Institute of Physics Publishing and Oxford University Press
- Fritzke B. Unsupervised ontogenetic networks. In: Fiesler E., and Beale R. (Eds). Handbook of Neural Computation (1996), Institute of Physics Publishing and Oxford University Press
- (1996) Handbook of Neural Computation
- Fritzke, B.¹

6
- 0003777193
- Wiley, New York
- George P.L. Automatic Mesh Generation-Application to Finite Element Methods (1991), Wiley, New York
- (1991) Automatic Mesh Generation-Application to Finite Element Methods
- George, P.L.¹

7
- 33750393529
- A. Großmann, Continual learning for mobile robots, Ph.D. thesis, School of Computer Science, The University of Birmingham, Birmingham, UK, 2001.

8
- 85131710558
- J. Jockusch, H. Ritter, An instantaneous topological mapping model for correlated stimuli. Proceedings of the IJCNN'99, 1999, pp. 445.

9
- 0029679044
- Reinforcement learning: a survey
- Kaelbling L.P., Littman M.L., and Moore A.W. Reinforcement learning: a survey. J. Artif. Intell. Res. 4 (1996) 237-285
- (1996) J. Artif. Intell. Res. , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

10
- 0003882190
- Wiley, New York
- Kalos M.H., and Whitlock P.A. Monte Carlo Methods (1986), Wiley, New York
- (1986) Monte Carlo Methods
- Kalos, M.H.¹ Whitlock, P.A.²

11
- 3142715411
- Three-dimensional map building for mobile robot navigation environments using a self-organizing neural network
- Kim M.Y., and Cho H. Three-dimensional map building for mobile robot navigation environments using a self-organizing neural network. J. Robot. Syst. 21 6 (2004) 323-343
- (2004) J. Robot. Syst. , vol.21 , Issue.6 , pp. 323-343
- Kim, M.Y.¹ Cho, H.²

12
- 0029751419
- The effect of representation and knowledge on goal-directed exploration with reinforcement learning algorithms
- Koenig S., and Simmons R.G. The effect of representation and knowledge on goal-directed exploration with reinforcement learning algorithms. Mach. Learn. 22 (1996) 227-250
- (1996) Mach. Learn. , vol.22 , pp. 227-250
- Koenig, S.¹ Simmons, R.G.²

13
- 0003527079
- Springer, Heidelberg
- Kohonen T. Self-Organization and Associative Memory (1984), Springer, Heidelberg
- (1984) Self-Organization and Associative Memory
- Kohonen, T.¹

14
- 0028204732
- Topology representing networks
- Martinetz T., and Schulten K. Topology representing networks. Neural Networks 7 3 (1994) 507-522
- (1994) Neural Networks , vol.7 , Issue.3 , pp. 507-522
- Martinetz, T.¹ Schulten, K.²

15
- 0004080833
- Wiley, New York
- Mason R.L., Gunst R.F., and Hess J.L. Statistical Design and Analysis of Experiments (1989), Wiley, New York
- (1989) Statistical Design and Analysis of Experiments
- Mason, R.L.¹ Gunst, R.F.² Hess, J.L.³

16
- 33750393758
- R.A. McCallum, Using transitional proximity for faster reinforcement learning, Proceedings of the Ninth International Conference on Machine Learning, 1992, pp. 316-321.

17
- 0030171602
- Rapid, safe, and incremental learning of navigation strategies
- Millán J. del R. Rapid, safe, and incremental learning of navigation strategies. IEEE Trans. Syst., Man, Cybernet. 26 (1996) 408-420
- (1996) IEEE Trans. Syst., Man, Cybernet. , vol.26 , pp. 408-420
- Millán, J. del R.¹

18
- 0036832960
- Continuous-action Q-learning
- Millán J. del R., Posenato D., and Dedieu E. Continuous-action Q-learning. Mach. Learn. 49 (2002) 247-265
- (2002) Mach. Learn. , vol.49 , pp. 247-265
- Millán, J. del R.¹ Posenato, D.² Dedieu, E.³

19
- 0003486756
- Wiley, New York
- Montgomery D.C. Design and Analysis of Experiments (1984), Wiley, New York
- (1984) Design and Analysis of Experiments
- Montgomery, D.C.¹

20
- 0027684215
- Prioritized sweeping: reinforcement learning with less data and less time
- Moore A.W., and Atkeson C.G. Prioritized sweeping: reinforcement learning with less data and less time. Mach. Learn. 13 (1993) 103-130
- (1993) Mach. Learn. , vol.13 , pp. 103-130
- Moore, A.W.¹ Atkeson, C.G.²

21
- 0000955979
- Incremental multi-step Q-learning
- Peng J., and Williams R.J. Incremental multi-step Q-learning. Mach. Learn. 22 (1996) 283-290
- (1996) Mach. Learn. , vol.22 , pp. 283-290
- Peng, J.¹ Williams, R.J.²

22
- 33750392533
- C.H.C. Ribeiro, Aspects of the behaviour of a learning agent in control tasks, Ph.D. thesis, Imperial College of Science, Technology and Medicine, University of London, 1998.

23
- 0036570250
- Reinforcement learning agents
- Ribeiro C.H.C. Reinforcement learning agents. Artif. Intell. Rev. 17 (2002) 223-250
- (2002) Artif. Intell. Rev. , vol.17 , pp. 223-250
- Ribeiro, C.H.C.¹

24
- 0003647182
- Addison-Wesley Publishing Company, Reading, MA
- Ritter H., Martinetz T., and Schulten K. Neural Computation and Self-Organizing Maps-An Introduction (1992), Addison-Wesley Publishing Company, Reading, MA
- (1992) Neural Computation and Self-Organizing Maps-An Introduction
- Ritter, H.¹ Martinetz, T.² Schulten, K.³

25
- 33750397173
- G.A. Rummery, Problem solving with reinforcement learning, Ph.D. thesis, Cambridge University, 1995.

26
- 0004282314
- The Benjamin/Cummings Publishing Company, Inc.
- Schefler W.C. Statistics-Concepts and Applications (1988), The Benjamin/Cummings Publishing Company, Inc.
- (1988) Statistics-Concepts and Applications
- Schefler, W.C.¹

27
- 0036790898
- Applications of the self-organising map to reinforcement learning
- Smith A.J. Applications of the self-organising map to reinforcement learning. Neural Networks 15 8-9 (2002) 1107-1124
- (2002) Neural Networks , vol.15 , Issue.8-9 , pp. 1107-1124
- Smith, A.J.¹

28
- 0012929784
- Dyna, an integrated architecture for learning, planning, and reacting
- ACM Press
- Sutton R.S. Dyna, an integrated architecture for learning, planning, and reacting. SIGART Bull. 2 (1991) 160-163 ACM Press
- (1991) SIGART Bull. , vol.2 , pp. 160-163
- Sutton, R.S.¹

29
- 0003420416
- MIT Press/Bradford Books, Cambridge, MA
- Sutton R.S., and Barto A. Introduction to Reinforcement Learning (1998), MIT Press/Bradford Books, Cambridge, MA
- (1998) Introduction to Reinforcement Learning
- Sutton, R.S.¹ Barto, A.²

30
- 0031341345
- Neural reinforcement learning for behaviour synthesis
- Touzet C. Neural reinforcement learning for behaviour synthesis. Robot. Autonom. Syst. 22 3-4 (1997) 251-281
- (1997) Robot. Autonom. Syst. , vol.22 , Issue.3-4 , pp. 251-281
- Touzet, C.¹

31
- 33750392301
- C.J.C.H. Watkins, Learning from delayed rewards, Ph.D. thesis, King's College, Cambridge, 1989.

32
- 0032182997
- Fast online Q(λ)
- Wiering M., and Schmidhuber J. Fast online Q(λ). Mach. Learn. 33 (1998) 105-115
- (1998) Mach. Learn. , vol.33 , pp. 105-115
- Wiering, M.¹ Schmidhuber, J.²

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.