SCOPUS Information Retrieval Platform

Volume 12, Issue 2, 2003, Pages 81-88

Neural Q-learning

Author keywords

Feed forward network; Learning from real systems; Nonlinear systems; Optimal control; Reinforcement learning

Indexed keywords

GENERAL FUNCTION APPROXIMATORS; REINFORCEMENT LEARNING (RL); LEARNING SYSTEMS; MOBILE ROBOTS; NONLINEAR SYSTEMS; OPTIMAL CONTROL SYSTEMS; OPTIMIZATION; FEEDFORWARD NEURAL NETWORKS

EID: 0345393286 PISSN: 09410643 EISSN: None Source Type: Journal
DOI: 10.1007/s00521-003-0369-9 Document Type: Review

Times cited: (28)

References (21)

1
- 0025399567
- Identification and control for dynamic systems using neural networks
- Narendra K, Parthasarathy K (1990) Identification and control for dynamic systems using neural networks. IEEE Transactions on Neural Networks 1(1): 447-457
- (1990) IEEE Transactions on Neural Networks , vol.1 , Issue.1 , pp. 447-457
- Narendra, K.¹ Parthasarathy, K.²

2
- 0033731028
- Nonlinear adaptive control using networks of piecewise linear approximators
- Choi J, Farrell J (2000) Nonlinear adaptive control using networks of piecewise linear approximators. IEEE Transactions on Neural Networks 11: 390-401
- (2000) IEEE Transactions on Neural Networks , vol.11 , pp. 390-401
- Choi, J.¹ Farrell, J.²

3
- 0000922214
- Stable neural controller design for unknown nonlinear systems using backstepping
- Zhang Y, Peng P, Jiang Z (2000) Stable neural controller design for unknown nonlinear systems using backstepping. IEEE Transactions on Neural Networks 11: 1347-1360
- (2000) IEEE Transactions on Neural Networks , vol.11 , pp. 1347-1360
- Zhang, Y.¹ Peng, P.² Jiang, Z.³

5
- 0031236002
- Adaptive critic design
- Prokhorov D, Wunsch II D (1997) Adaptive critic design. IEEE Transactions on Neural Networks 8(5): 997-1007
- (1997) IEEE Transactions on Neural Networks , vol.8 , Issue.5 , pp. 997-1007
- Prokhorov, D.¹ Wunsch II, D.²

8
- 0031259122
- Synthesis of reinforcement learning, neural networks, and PI control applied to a simulated heating coil
- Anderson C, Hittle D, Katz A, Kretchmar R (1997) Synthesis of reinforcement learning, neural networks, and PI control applied to a simulated heating coil. J Artific Intell Eng 11(4): 421-429
- (1997) J Artific Intell Eng , vol.11 , Issue.4 , pp. 421-429
- Anderson, C.¹ Hittle, D.² Katz, A.³ Kretchmar, R.⁴

10
- 84855627460
- Neurocontrol by reinforcement learning
- Schram G, Kröse B, Babuska R, Krijgsman A (1996) Neurocontrol by reinforcement learning. J Auto Control 37: 59-64
- (1996) J Auto Control , vol.37 , pp. 59-64
- Schram, G.¹ Kröse, B.² Babuska, R.³ Krijgsman, A.⁴

13
- 33847202724
- Kluwer
- Riedmiller M (1999) Concepts and facilities of a neural reinforcement learning control architecture for technical process control. Neural Computation and Application Journal 8: 323-338, Springer Verlag London [14] Machine Learning, Kluwer, 3(1): 9-44
- Machine Learning , vol.3 , Issue.1 , pp. 9-44

14
- 0033750123
- Neurocontroller alternatives for 'fuzzy' ball-and-beam systems with nonuniform nonlinear friction
- Eaton P, Prokhorov D, Wunsch II D (2000) Neurocontroller alternatives for 'fuzzy' ball-and-beam systems with nonuniform nonlinear friction. IEEE Transactions on Neural Networks
- (2000) IEEE Transactions on Neural Networks
- Eaton, P.¹ Prokhorov, D.² Wunsch II, D.³

16
- 0004049893
- PhD thesis, University of Cambridge
- Watkins C (1989) Learning from Delayed Rewards. PhD thesis, University of Cambridge
- (1989) Learning from Delayed Rewards
- Watkins, C.¹

20
- 0003754075
- PhD thesis, Linköping University
- Landelius T (1997) Reinforcement Learning and Distributed Local Model Synthesis. PhD thesis, Linköping University
- (1997) Reinforcement Learning and Distributed Local Model Synthesis
- Landelius, T.¹

21
- 0042415648
- PhD thesis, Computer Science Institute, University of Amsterdam, The Netherlands
- ten Hagen S (2001) Continuous State Space Q-Learning for Control of Nonlinear Systems. PhD thesis, Computer Science Institute, University of Amsterdam, The Netherlands
- (2001) Continuous State Space Q-Learning for Control of Nonlinear Systems
- Ten Hagen, S.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.