



2009, Pages 169-176

Learning continuous-action control policies

Author keywords

[No Author keywords available]

Indexed keywords

ACTION POLICIES; ACTION SPACES; BINARY DECISION; CONTINUOUS STATE; CONTINUOUS STATE SPACE; CONTROL POLICY; DISCRETIZATION; EFFICIENT METHOD; INVERTED PENDULUM; LEAST SQUARE; POLICY ITERATION; Q-LEARNING; REAL-WORLD PROBLEM; STOCHASTIC PROCESS;

EID: 67650505310     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2009.4927541     Document Type: Conference Paper
Times cited: 9

References (17)
  • 2
    • 0004049893
    • Ph.D. dissertation, Cambridge University, Cambridge, United Kingdom
    • C. J. C. H. Watkins, "Learning from delayed rewards," Ph.D. dissertation, Cambridge University, Cambridge, United Kingdom, 1989.
    • (1989) Learning from delayed rewards
    • Watkins, C.J.C.H.
  • 4
    • 0036832956
    • Kernel-based reinforcement learning
    • DOI 10.1023/A:1017928328829
    • D. Ormoneit and Ś. Sen, "Kernel-based reinforcement learning," Machine Learning, Vol. 49, no. 2-3, pp. 161-178, 2002. (Pubitemid 34325684)
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
    • Ormoneit, D.; Sen, A.
  • 10
    • 0031341345
    • Neural reinforcement learning for behaviour synthesis
    • PII S0921889097000420
    • C. Touzet, "Neural reinforcement learning for behaviour synthesis," Robotics and Autonomous Systems, Vol. 22, pp. 251-281, 1997. (Pubitemid 127398213)
    • (1997) Robotics and Autonomous Systems , vol.22 , Issue.3-4 , pp. 251-281
    • Touzet, C.F.
  • 11
    • 85161968592
    • Reinforcement learning in continuous action spaces through sequential Monte Carlo methods
    • Cambridge, MA: MIT Press
    • A. Lazaric, M. Restelli, and A. Bonarini, "Reinforcement learning in continuous action spaces through sequential Monte Carlo methods," in Advances in Neural Information Processing Systems 20. Cambridge, MA: MIT Press, 2008, pp. 833-840.
    • (2008) Advances in Neural Information Processing Systems , vol.20 , pp. 833-840
    • Lazaric, A.; Restelli, M.; Bonarini, A.
  • 12
    • 0347625319
    • A learning algorithm for the control of continuous action set-point regulator systems
    • A. O. Esogbue and W. E. Hearnes, "A learning algorithm for the control of continuous action set-point regulator systems," Journal of Computational Analysis and Applications, Vol. 1, no. 2, pp. 121-234, 1999.
    • (1999) Journal of Computational Analysis and Applications , vol.1 , Issue.2 , pp. 121-234
    • Esogbue, A.O.; Hearnes, W.E.
  • 13
    • 32844474095
    • Reinforcement learning with factored states and actions
    • B. Sallans and G. E. Hinton, "Reinforcement learning with factored states and actions," Journal of Machine Learning Research, Vol. 5, pp. 1063-1088, 2004.
    • (2004) Journal of Machine Learning Research , vol.5 , pp. 1063-1088
    • Sallans, B.; Hinton, G.E.
  • 14
    • 0031231885
    • Experiments with reinforcement learning in problems with continuous state and action spaces
    • J. C. Santamaría, R. S. Sutton, and A. Ram, "Experiments with reinforcement learning in problems with continuous state and action spaces," Adaptive Behavior, Vol. 6, pp. 163-218, 1998.
    • (1998) Adaptive Behavior , vol.6 , pp. 163-218
    • Santamaria, J.C.; Sutton, R.S.; Ram, A.
  • 15
    • 67650370700
    • Application of a self-learning controller with continuous control signals based on the DOE-approach
    • M. Riedmiller, "Application of a self-learning controller with continuous control signals based on the DOE-approach," in Proceedings of the European Symposium on Neural Networks, 1997.
    • (1997) Proceedings of the European Symposium on Neural Networks
    • Riedmiller, M.
  • 16
    • 0030082891
    • An approach to fuzzy control of nonlinear systems: Stability and design issues
    • PII S106367069600639X
    • H. O. Wang, K. Tanaka, and M. F. Griffin, "An approach to fuzzy control of nonlinear systems: Stability and design issues," IEEE Transactions on Fuzzy Systems, Vol. 4, no. 1, pp. 14-23, 1996. (Pubitemid 126782417)
    • (1996) IEEE Transactions on Fuzzy Systems , vol.4 , Issue.1 , pp. 14-23
    • Wang, H.O.; Tanaka, K.; Griffin, M.F.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.