SCOPUS 정보 검색 플랫폼

2009 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, ADPRL 2009 - Proceedings

Volumn , Issue , 2009, Pages 169-176

Learning continuous-action control policies

(2) Pazis, Jason a G Lagoudakis, Michail a

a TECHNICAL UNIVERSITY OF CRETE (Greece)

Author keywords

[No Author keywords available]

Indexed keywords

ACTION POLICIES; ACTION SPACES; BINARY DECISION; CONTINUOUS STATE; CONTINUOUS STATE SPACE; CONTROL POLICY; DISCRETIZATION; EFFICIENT METHOD; INVERTED PENDULUM; LEAST SQUARE; POLICY ITERATION; Q-LEARNING; REAL-WORLD PROBLEM; STOCHASTIC PROCESS;

DYNAMIC PROGRAMMING; EDUCATION; REINFORCEMENT; REINFORCEMENT LEARNING; SYSTEMS ENGINEERING;

LEARNING ALGORITHMS;

EID: 67650505310 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ADPRL.2009.4927541 Document Type: Conference Paper

Times cited : (9)

References (17)

1
- 0004102479
- Cambridge, Massachusetts: The MIT Press
- R. Sutton and A. Barto, Reinforcement Learning: An Introduction. Cambridge, Massachusetts: The MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

2
- 0004049893
- Ph.D. dissertation, Cambridge University, Cambridge, United Kingdom
- C. J. C. H. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Cambridge University, Cambridge, United Kingdom, 1989.
- (1989) Learning from delayed rewards
- Watkins, C.J.C.H.¹

3
- 21844465127
- Tree-based batch mode reinforcement learning
- D. Ernst, P. Geurts, and L. Wehenkel, "Tree-based batch mode reinforcement learning," Journal of Machine Learning Research, Vol. 6, pp. 503-556, 2005. (Pubitemid 40958851)
- (2005) Journal of Machine Learning Research , vol.6
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

4
- 0036832956
- Kernel-based reinforcement learning
- DOI 10.1023/A:1017928328829
- D. Ormoneit and Ś. Sen, "Kernel-based reinforcement learning," Machine Learning, Vol. 49, no. 2-3, pp. 161-178, 2002. (Pubitemid 34325684)
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
- Ormoneit, D.¹ Sen, A.²

5
- 4644323293
- Least-squares policy iteration
- M. G. Lagoudakis and R. Parr, "Least-squares policy iteration," Journal of Machine Learning Research, Vol. 4, pp. 1107-1149, 2003.
- (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

6
- 0004210389
- Prentice Hall
- J. G. Proakis and M. Salehi, Communication Systems Engineering. Prentice Hall, 2001.
- (2001) Communication Systems Engineering
- Proakis, J.G.¹ Salehi, M.²

7
- 84957629024
- Q-learning in continuous state and action spaces
- Springer-Verlag
- C. Gaskett, D. Wettergreen, and E. Zelinsky, "Q-learning in continuous state and action spaces," in Proceedings of the 12th Australian Joint Conference on Artificial Intelligence. Springer-Verlag, 1999, pp. 417-428.
- (1999) Proceedings of the 12th Australian Joint Conference on Artificial Intelligence , pp. 417-428
- Gaskett, C.¹ Wettergreen, D.² Zelinsky, E.³

8
- 0031645424
- A neural field approach to topological reinforcement learning in continuous action spaces
- H. M. Gross, V. Stephan, and M. Krabbes, "A neural field approach to topological reinforcement learning in continuous action spaces," in Proceedings of the International Joint Conference on Neural Networks, 1998, pp. 1992-1997.
- (1998) Proceedings of the International Joint Conference on Neural Networks , pp. 1992-1997
- Gross, H.M.¹ Stephan, V.² Krabbes, M.³

9
- 70049109079
- Reinforcement learning in continuous state and action space
- T. Strösslin and W. Gerstner, "Reinforcement learning in continuous state and action space," in International Conference on Artificial Neural Networks, 2003.
- (2003) International Conference on Artificial Neural Networks
- Strösslin, T.¹ Gerstner, W.²

10
- 0031341345
- Neural reinforcement learning for behaviour synthesis
- PII S0921889097000420
- C. Touzet, "Neural reinforcement learning for behaviour synthesis," Robotics and Autonomous Systems, Vol. 22, pp. 251-281, 1997. (Pubitemid 127398213)
- (1997) Robotics and Autonomous Systems , vol.22 , Issue.3-4 , pp. 251-281
- Touzet, C.F.¹

11
- 85161968592
- Reinforcement learning in continuous action spaces through sequential monte carlo methods
- Cambridge, MA: MIT Press
- A. Lazaric, M. Restelli, and A. Bonarini, "Reinforcement learning in continuous action spaces through sequential monte carlo methods," in Advances in Neural Information Processing Systems 20. Cambridge, MA: MIT Press, 2008, pp. 833-840.
- (2008) Advances in Neural Information Processing Systems , vol.20 , pp. 833-840
- Lazaric, A.¹ Restelli, M.² Bonarini, A.³

12
- 0347625319
- A learning algorithm for the control of continuous action set-point regulator systems
- A. O. Esogbue and W. E. Hearnes, "A learning algorithm for the control of continuous action set-point regulator systems," Journal of Computational Analysis and Applications, Vol. 1, no. 2, pp. 121-234, 1999.
- (1999) Journal of Computational Analysis and Applications , vol.1 , Issue.2 , pp. 121-234
- Esogbue, A.O.¹ Hearnes, W.E.²

13
- 32844474095
- Reinforcement learning with factored states and actions
- B. Sallans and G. E. Hinton, "Reinforcement learning with factored states and actions," Journal of Machine Learning Research, Vol. 5, pp. 1063-1088, 2004.
- (2004) Journal of Machine Learning Research , vol.5 , pp. 1063-1088
- Sallans, B.¹ Hinton, G.E.²

14
- 0031231885
- Experiments with reinforcement learning in problems with continuous state and action spaces
- J. C. Santamaría, R. S. Sutton, and A. Ram, "Experiments with reinforcement learning in problems with continuous state and action spaces," Adaptive Behavior, Vol. 6, pp. 163-218, 1998.
- (1998) Adaptive Behavior , vol.6 , pp. 163-218
- Santamaria, J.C.¹ Sutton, R.S.² Ram, A.³

15
- 67650370700
- Application of a self-learning controller with continuous control signals based on the DOE-approach
- M. Riedmiller, "Application of a self-learning controller with continuous control signals based on the DOE-approach," in Proceedings of the European Symposium on Neural Networks, 1997.
- (1997) Proceedings of the European Symposium on Neural Networks
- Riedmiller, M.¹

16
- 0030082891
- An approach to fuzzy control of nonlinear systems: Stability and design issues
- PII S106367069600639X
- H. O. Wang, K. Tanaka, and M. F. Griffin, "An approach to fuzzy control of nonlinear systems: Stability and design issues," IEEE Transactions on Fuzzy Systems, Vol. 4, no. 1, pp. 14-23, 1996. (Pubitemid 126782417)
- (1996) IEEE Transactions on Fuzzy Systems , vol.4 , Issue.1 , pp. 14-23
- Wang, H.O.¹ Tanaka, K.² Griffin, M.F.³

17
- 1642401055
- Learning to drive a bicycle using reinforcement learning and shaping
- J. Randløv and P. Alstrøm, "Learning to drive a bicycle using reinforcement learning and shaping," in Proceedings of The Fifteenth International Conference on Machine Learning, 1998, pp. 463-471.
- (1998) Proceedings of The Fifteenth International Conference on Machine Learning , pp. 463-471
- Randløv, J.¹ Alstrøm, P.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.