Volume 1, Issue 3, 2014, Pages 323-336

Continuous action reinforcement learning for control-affine systems with unknown dynamics

Author keywords

approximate value iteration; continuous action spaces; control affine nonlinear systems; fitted value iteration; policy approximation; Reinforcement learning

Indexed keywords

ANTENNAS; BALANCING; DECISION MAKING; DIFFERENTIAL EQUATIONS; ITERATIVE METHODS; NONLINEAR EQUATIONS; NONLINEAR SYSTEMS; REINFORCEMENT LEARNING;

EID: 84969983915     PISSN: 2329-9266     EISSN: 2329-9274     Source Type: Journal
DOI: 10.1109/JAS.2014.7004690     Document Type: Article
Times cited: 28

References (33)
  • 1
    • 68849115332
    • Analysis and control of nonlinear systems: A flatness-based approach
    • New York: Springer
    • Levine J. Analysis and control of nonlinear systems: a flatness-based approach. Mathematical Engineering. New York: Springer, 2009.
    • (2009) Mathematical Engineering
    • Levine, J.1
  • 5
    • 81355166317
    • Approximate value iteration in the reinforcement learning context. Application to electrical power system control
    • Ernst D, Glavic M, Geurts P, Wehenkel L. Approximate value iteration in the reinforcement learning context. Application to electrical power system control. International Journal of Emerging Electric Power Systems, 2005, 3(1): 10661-106637
    • (2005) International Journal of Emerging Electric Power Systems , vol.3 , Issue.1 , pp. 10661-106637
    • Ernst, D.1    Glavic, M.2    Geurts, P.3    Wehenkel, L.4
  • 7
    • 34548331001
    • Cambridge, U.K.: Cambridge University Press
    • LaValle S M. Planning Algorithms. Cambridge, U.K.: Cambridge University Press, 2006.
    • (2006) Planning Algorithms
    • LaValle, S.M.1
  • 9
    • 80455160265
    • Decentralized optimal control of a class of interconnected nonlinear discrete-time systems by using online Hamilton-Jacobi-Bellman formulation
    • Mehraeen S, Jagannathan S. Decentralized optimal control of a class of interconnected nonlinear discrete-time systems by using online Hamilton-Jacobi-Bellman formulation. IEEE Transactions on Neural Networks, 2011, 22(11): 1757-1769
    • (2011) IEEE Transactions on Neural Networks , vol.22 , Issue.11 , pp. 1757-1769
    • Mehraeen, S.1    Jagannathan, S.2
  • 10
    • 84875270081
    • Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update
    • Dierks T, Jagannathan S. Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update. IEEE Transactions on Neural Networks and Learning Systems, 2012, 23(7): 1118-1129
    • (2012) IEEE Transactions on Neural Networks and Learning Systems , vol.23 , Issue.7 , pp. 1118-1129
    • Dierks, T.1    Jagannathan, S.2
  • 12
    • 79959473178
    • Decentralized nearly optimal control of a class of interconnected nonlinear discrete-time systems by using online Hamilton-Bellman-Jacobi formulation
    • Barcelona: IEEE
    • Mehraeen S, Jagannathan S. Decentralized nearly optimal control of a class of interconnected nonlinear discrete-time systems by using online Hamilton-Bellman-Jacobi formulation. In: Proceedings of the 2010 International Joint Conference on Neural Networks (IJCNN). Barcelona: IEEE, 2010. 1-8
    • (2010) Proceedings of the 2010 International Joint Conference on Neural Networks (IJCNN) , pp. 1-8
    • Mehraeen, S.1    Jagannathan, S.2
  • 13
  • 14
    • 84881373865
    • A policy iteration approach to online optimal control of continuous-time constrained-input systems
    • Modares H, Sistani M B N, Lewis F L. A policy iteration approach to online optimal control of continuous-time constrained-input systems. ISA Transactions, 2013, 52(5): 611-621
    • (2013) ISA Transactions , vol.52 , Issue.5 , pp. 611-621
    • Modares, H.1    Sistani, M.B.N.2    Lewis, F.L.3
  • 15
    • 39549085591
    • Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete time systems
    • Chen Z, Jagannathan S. Generalized Hamilton-Jacobi-Bellman formulation-based neural network control of affine nonlinear discrete time systems. IEEE Transactions on Neural Networks, 2008, 19(1): 90-106
    • (2008) IEEE Transactions on Neural Networks , vol.19 , Issue.1 , pp. 90-106
    • Chen, Z.1    Jagannathan, S.2
  • 16
    • 84865467087
    • Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics
    • Jiang Y, Jiang Z P. Computational adaptive optimal control for continuous-time linear systems with completely unknown dynamics. Automatica, 2012, 48(10): 2699-2704
    • (2012) Automatica , vol.48 , Issue.10 , pp. 2699-2704
    • Jiang, Y.1    Jiang, Z.P.2
  • 18
    • 33846781133
    • A neural network solution for fixed-final time optimal control of nonlinear systems
    • Cheng T, Lewis F L, Abu-Khalaf M. A neural network solution for fixed-final time optimal control of nonlinear systems. Automatica, 2007, 43(3): 482-490
    • (2007) Automatica , vol.43 , Issue.3 , pp. 482-490
    • Cheng, T.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 20
    • 85042095332
    • Reinforcement learning in continuous state and action spaces
    • Berlin Heidelberg: Springer
    • Hasselt H. Reinforcement learning in continuous state and action spaces. Adaptation, Learning, and Optimization. Berlin Heidelberg: Springer, 2012. 207-251
    • (2012) Adaptation, Learning, and Optimization , pp. 207-251
    • Hasselt, H.1
  • 22
    • 51349128679
    • Reinforcement learning in multi-dimensional state-action space using random rectangular coarse coding and Gibbs sampling
    • San Diego, CA: IEEE
    • Kimura H. Reinforcement learning in multi-dimensional state-action space using random rectangular coarse coding and Gibbs sampling. In: Proceedings of the 2007 IEEE International Conference on Intelligent Robots and Systems (IROS). San Diego, CA: IEEE, 2007. 88-95
    • (2007) Proceedings of the 2007 IEEE International Conference on Intelligent Robots and Systems (IROS) , pp. 88-95
    • Kimura, H.1
  • 31
    • 0003787146
    • Mineola, NY: Dover Publications, Incorporated
    • Bellman R E. Dynamic Programming. Mineola, NY: Dover Publications, Incorporated, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.E.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.