SCOPUS Information Retrieval Platform

Volume 9, Issue 2, 1999, Pages 119-127

Training reinforcement neurocontrollers using the polytope algorithm

Author keywords

[No Author keywords available]

Indexed keywords

INTELLIGENT CONTROL; LEARNING ALGORITHMS; LEARNING SYSTEMS; OPTIMIZATION;

POLYTOPE ALGORITHMS; REINFORCEMENT NEUROCONTROLLERS;

NEURAL NETWORKS;

EID: 0032676817 PISSN: 13704621 EISSN: None Source Type: Journal
DOI: 10.1023/A:1018669223478 Document Type: Article

Times cited : (3)

References (16)

1
- 0003997198
- Technical Report TR87-509.3, GTE Labs, Waltham, MA
- C.W. Anderson, Strategy learning with multilayer connectionist representations, Technical Report TR87-509.3, GTE Labs, Waltham, MA.
- Strategy Learning with Multilayer Connectionist Representations
- Anderson, C.W.¹

2
- 0024646143
- Learning to control an inverted pendulum using neural networks
- C.W. Anderson, "Learning to control an inverted pendulum using neural networks", IEEE Control Systems Magazine, Vol. 2, pp. 31-37, 1989.
- (1989) IEEE Control Systems Magazine , vol.2 , pp. 31-37
- Anderson, C.W.¹

3
- 0020970738
- Neuronlike elements that can solve difficult control problems
- A.G. Barto, R.S. Sutton and C.W. Anderson, "Neuronlike elements that can solve difficult control problems", IEEE Trans. on Systems, Man and Cybernetics, Vol. 13, pp. 835-846, 1983.
- (1983) IEEE Trans. on Systems, Man and Cybernetics , vol.13 , pp. 835-846
- Barto, A.G.¹ Sutton, R.S.² Anderson, C.W.³

4
- 0026923465
- Learning and tuning fuzzy logic controllers using reinforcements
- H.R. Berenji and P. Khedkar, "Learning and tuning fuzzy logic controllers using reinforcements", IEEE Trans. on Neural Networks, Vol. 3, pp. 724-740, 1992.
- (1992) IEEE Trans. on Neural Networks , vol.3 , pp. 724-740
- Berenji, H.R.¹ Khedkar, P.²

5
- 0024304385
- C.S. Chassapis, D.G. Papageorgiou and I.E. Lagaris, "MCL - Optimization oriented programming language", Computer Physics Communications, Vol. 52, pp. 223-239, 1989.
- (1989) Computer Physics Communications , vol.52 , pp. 223-239
- Chassapis, C.S.¹ Papageorgiou, D.G.² Lagaris, I.E.³

6
- 45949117576
- Merlin - A portable system for multidimensional minimization
- G.A. Evangelakis, J.P. Rizos, I.E. Lagaris and I.N. Demetropoulos, "Merlin - A portable system for multidimensional minimization", Computer Physics Communications, Vol. 46, pp. 402-412, 1987.
- (1987) Computer Physics Communications , vol.46 , pp. 402-412
- Evangelakis, G.A.¹ Rizos, J.P.² Lagaris, I.E.³ Demetropoulos, I.N.⁴

7
- 0024304980
- MERLIN-2.0 - Enhanced and programmable version
- D.G. Papageorgiou, C.S. Chassapis and I.E. Lagaris, "MERLIN-2.0 - Enhanced and programmable version", Computer Physics Communications, Vol. 52, pp. 241-247, 1989.
- (1989) Computer Physics Communications , vol.52 , pp. 241-247
- Papageorgiou, D.G.¹ Chassapis, C.S.² Lagaris, I.E.³

8
- 0004240547
- Academic Press
- P. Gill, W. Murray and M. Wright, Practical Optimization, Academic Press, 1989.
- (1989) Practical Optimization
- Gill, P.¹ Murray, W.² Wright, M.³

9
- 0029679044
- Reinforcement learning: A survey
- L. Kaelbling, M. Littman and A. Moore, "Reinforcement learning: A survey", Journal of Artificial Intelligence Research, Vol. 4, pp. 237-285, 1996.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.¹ Littman, M.² Moore, A.³

11
- 0030147547
- Reinforcement learning for an ART-based fuzzy adaptive learning control network
- C-J. Lin and C-T. Lin, "Reinforcement learning for an ART-based fuzzy adaptive learning control network", IEEE Trans. on Neural Networks, Vol. 7, pp. 709-731, 1996.
- (1996) IEEE Trans. on Neural Networks , vol.7 , pp. 709-731
- Lin, C.-J.¹ Lin, C.-T.²

12
- 0000238336
- A simplex method for function minimization
- J.A. Nelder and R. Mead, "A simplex method for function minimization", Computer Journal, Vol. 7, pp. 308-313, 1965.
- (1965) Computer Journal , vol.7 , pp. 308-313
- Nelder, J.A.¹ Mead, R.²

13
- 0004182779
- McGraw-Hill
- S. Nash and A. Sofer, Linear and Nonlinear Programming, McGraw-Hill, 1996.
- (1996) Linear and Nonlinear Programming
- Nash, S.¹ Sofer, A.²

14
- 84946640734
- Sequential application of simplex designs in optimization and evolutionary operation
- W. Spendley, G. Hext and F. Himsworth, "Sequential application of simplex designs in optimization and evolutionary operation", Technometrics, Vol. 4, pp. 441-461, 1962.
- (1962) Technometrics , vol.4 , pp. 441-461
- Spendley, W.¹ Hext, G.² Himsworth, F.³

15
- 34249833101
- Learning from delayed rewards
- C. Watkins and P. Dayan, "Learning from delayed rewards", Machine Learning, Vol. 8, pp. 279-292, 1992.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

16
- 0027701513
- Genetic reinforcement learning for neurocontrol problems
- D. Whitley, S. Dominic, R. Das and C.W. Anderson, "Genetic reinforcement learning for neurocontrol problems", Machine Learning, Vol. 13, pp. 259-284, 1993.
- (1993) Machine Learning , vol.13 , pp. 259-284
- Whitley, D.¹ Dominic, S.² Das, R.³ Anderson, C.W.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.