메뉴 건너뛰기




Volumn 11, Issue 4, 2014, Pages 1176-1190

A novel iterative -adaptive dynamic programming for discrete-time nonlinear systems

Author keywords

Adaptive critic designs; Adaptive dynamic programming; Approximate dynamic programming; Neural networks; Neuro dynamic programming; Nonlinear systems; Optima

Indexed keywords

NONLINEAR SYSTEMS;

EID: 84908658175     PISSN: 15455955     EISSN: None     Source Type: Journal    
DOI: 10.1109/TASE.2013.2280974     Document Type: Article
Times cited : (122)

References (41)
  • 1
    • 84876138680 scopus 로고    scopus 로고
    • Swarm intelligence approaches to optimal power flow problem with distributed generator failures in power networks
    • Apr
    • Q. Kang, M. Zhou, J. An, and Q. Wu, "Swarm intelligence approaches to optimal power flow problem with distributed generator failures in power networks," IEEE Trans. Autom. Sci. Eng., vol. 10, no. 2, pp. 343-353, Apr. 2013.
    • (2013) IEEE Trans. Autom. Sci. Eng , vol.10 , Issue.2 , pp. 343-353
    • Kang, Q.1    Zhou, M.2    An, J.3    Wu, Q.4
  • 3
    • 84892570718 scopus 로고    scopus 로고
    • Noncyclic scheduling for timed discrete- event systems with application to single-armed cluster tools using pareto-optimal optimization
    • Jul
    • U. Wikborg and T.-E. Lee, "Noncyclic scheduling for timed discrete- event systems with application to single-armed cluster tools using pareto-optimal optimization," IEEE Trans. Autom. Sci. Eng., vol. 10, no. 3, pp. 699-710, Jul. 2013.
    • (2013) IEEE Trans. Autom. Sci. Eng , vol.10 , Issue.3 , pp. 699-710
    • Wikborg, U.1    Lee, T.-E.2
  • 4
    • 84892617907 scopus 로고    scopus 로고
    • Smart management of multiple energy systems in automotive painting shop
    • Jul
    • Z. Xu, Q. S. Jia, X. Guan, and J. Shen, "Smart management of multiple energy systems in automotive painting shop," IEEE Trans. Autom. Sci. Eng., vol. 10, no. 3, pp. 603-614, Jul. 2013.
    • (2013) IEEE Trans. Autom. Sci. Eng , vol.10 , Issue.3 , pp. 603-614
    • Xu, Z.1    Jia, Q.S.2    Guan, X.3    Shen, J.4
  • 5
    • 85012688561 scopus 로고
    • Princeton, NJ, USA: Princeton Univ. Press
    • R. E. Bellman, Dynamic Programming. Princeton, NJ, USA: Princeton Univ. Press, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 6
    • 84855353346 scopus 로고    scopus 로고
    • A polynomial dynamic programming algorithm for crude oil transportation planning
    • Jan
    • C. Chu, F. Chu, M. Zhou, H. Chen, and Q. Shen, "A polynomial dynamic programming algorithm for crude oil transportation planning," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 1, pp. 42-55, Jan. 2012.
    • (2012) IEEE Trans. Autom. Sci. Eng , vol.9 , Issue.1 , pp. 42-55
    • Chu, C.1    Chu, F.2    Zhou, M.3    Chen, H.4    Shen, Q.5
  • 7
    • 80053632509 scopus 로고    scopus 로고
    • Optimization of train regulation and energy usage of metro lines using an adaptive-optimal-control algorithm
    • Oct
    • W. S. Lin and J.W. Sheu, "Optimization of train regulation and energy usage of metro lines using an adaptive-optimal-control algorithm," IEEE Trans. Autom. Sci. Eng., vol. 8, no. 4, pp. 121-131, Oct. 2011.
    • (2011) IEEE Trans. Autom. Sci. Eng , vol.8 , Issue.4 , pp. 121-131
    • Lin, W.S.1    Sheu, J.W.2
  • 8
    • 0002557583 scopus 로고
    • Advanced forecasting methods for global crisis warning and models of intelligence
    • P. J. Werbos, "Advanced forecasting methods for global crisis warning and models of intelligence," General Syst. Yearbook, vol. 22, pp. 25-38, 1977.
    • (1977) General Syst. Yearbook , vol.22 , pp. 25-38
    • Werbos, P.J.1
  • 9
    • 0002011091 scopus 로고
    • A menu of designs for reinforcement learning over time
    • W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA, USA: MIT Press
    • P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA, USA: MIT Press, 1991, pp. 67-95.
    • (1991) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 10
    • 77950867306 scopus 로고    scopus 로고
    • Developing a stochastic dynamic programming framework for optical tweezer-based automated particle transport operations
    • Apr
    • A. G. Banerjee, A. Pomerance, W. Losert, and S. K. Gupta, "Developing a stochastic dynamic programming framework for optical tweezer-based automated particle transport operations," IEEE Trans. Autom. Sci. Eng., vol. 7, no. 2, pp. 218-227, Apr. 2010.
    • (2010) IEEE Trans. Autom. Sci. Eng , vol.7 , Issue.2 , pp. 218-227
    • Banerjee, A.G.1    Pomerance, A.2    Losert, W.3    Gupta, S.K.4
  • 11
    • 84880065287 scopus 로고    scopus 로고
    • Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics
    • Jan
    • A. Heydari and S. N. Balakrishnan, "Finite-horizon control-constrained nonlinear optimal control using single network adaptive critics," IEEE Trans. Neural Netw. Learning Syst., vol. 24, no. 1, pp. 145-157, Jan. 2013.
    • (2013) IEEE Trans. Neural Netw. Learning Syst , vol.24 , Issue.1 , pp. 145-157
    • Heydari, A.1    Balakrishnan, S.N.2
  • 12
    • 26844483839 scopus 로고    scopus 로고
    • Aself-learning call admission control scheme for cdma cellular networks
    • Sep
    • D. Liu, Y. Zhang, andH. Zhang, "Aself-learning call admission control scheme for CDMA cellular networks," IEEE Trans. Neural Netw., vol. 16, no. 5, pp. 855-864, Sep. 2011.
    • (2011) IEEE Trans. Neural Netw , vol.16 , Issue.5 , pp. 855-864
    • Liu, D.1    Zhang, Y.2    Zhang, H.3
  • 13
    • 84867404137 scopus 로고    scopus 로고
    • Modeling and optimization of building emergency evacuation considering blocking effects on crowd movement
    • Sep
    • P. B. Luh, C. T. Wilkie, S. C. Chang, K. L. Marsh, and N. Olderman, "Modeling and optimization of building emergency evacuation considering blocking effects on crowd movement," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 4, pp. 687-700, Sep. 2012.
    • (2012) IEEE Trans. Autom. Sci. Eng , vol.9 , Issue.4 , pp. 687-700
    • Luh, P.B.1    Wilkie, C.T.2    Chang, S.C.3    Marsh, K.L.4    Olderman, N.5
  • 14
    • 84864489666 scopus 로고    scopus 로고
    • Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming
    • Aug
    • D. Wang, D. Liu, Q. Wei, D. Zhao, and N. Jin, "Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming," Autom., vol. 48, no. 8, pp. 1825-1832, Aug. 2012.
    • (2012) Autom , vol.48 , Issue.8 , pp. 1825-1832
    • Wang, D.1    Liu, D.2    Wei, Q.3    Zhao, D.4    Jin, N.5
  • 15
    • 0002031779 scopus 로고
    • Approximate dynamic programming for real-time control and neural modeling
    • D. A. White andD. A. Sofge, Eds. New York, NY, USA: Van Nostrand Reinhold, ch. 13
    • P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches,D. A. White andD. A. Sofge, Eds. New York, NY, USA: Van Nostrand Reinhold, 1992, ch. 13.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
    • Werbos, P.J.1
  • 16
    • 0031236002 scopus 로고    scopus 로고
    • Adaptive critic designs
    • Sep
    • D. V. Prokhorov and D. C. Wunsch, "Adaptive critic designs," IEEE Trans. Neural Netw., vol. 8, no. 5, pp. 997-1007, Sep. 1997.
    • (1997) IEEE Trans. Neural Netw , vol.8 , Issue.5 , pp. 997-1007
    • Prokhorov, D.V.1    Wunsch, D.C.2
  • 17
    • 84892442931 scopus 로고    scopus 로고
    • A multiagent -learningbased optimal allocation approach for urban water resource management system
    • to be published
    • J. Ni, M. Liu, L. Ren, and S. X. Yang, "A Multiagent -learningbased optimal allocation approach for urban water resource management system," IEEE Trans. Autom. Sci. Eng., 2013, to be published.
    • (2013) IEEE Trans. Autom. Sci. Eng
    • Ni, J.1    Liu, M.2    Ren, L.3    Yang, S.X.4
  • 18
    • 84857646134 scopus 로고    scopus 로고
    • Wide-area measurement based dynamic stochastic optimal power flow control for smart grids with high variability and uncertainty
    • Mar
    • J. Liang, G. K. Venayagamoorthy, and R. G. Harley, "Wide-area measurement based dynamic stochastic optimal power flow control for smart grids with high variability and uncertainty," IEEE Trans. Smart Grid, vol. 3, no. 1, pp. 59-69, Mar. 2012.
    • (2012) IEEE Trans. Smart Grid , vol.3 , Issue.1 , pp. 59-69
    • Liang, J.1    Venayagamoorthy, G.K.2    Harley, R.G.3
  • 20
    • 84859774473 scopus 로고    scopus 로고
    • Real-time adaptive control of a flexible manipulator using reinforcement learning
    • Apr
    • S. K. Pradhan and B. Subudhi, "Real-time adaptive control of a flexible manipulator using reinforcement learning," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 2, pp. 237-249, Apr. 2012.
    • (2012) IEEE Trans. Autom. Sci. Eng , vol.9 , Issue.2 , pp. 237-249
    • Pradhan, S.K.1    Subudhi, B.2
  • 21
    • 66449130966 scopus 로고    scopus 로고
    • Adaptive dynamic programming: An introduction
    • Mar
    • F. Wang, H. Zhang, and D. Liu, "Adaptive dynamic programming: An introduction," IEEE Comput. Intell.Mag., vol. 4, no. 2, pp. 39-47, Mar. 2009.
    • (2009) IEEE Comput. Intell.Mag , vol.4 , Issue.2 , pp. 39-47
    • Wang, F.1    Zhang, H.2    Liu, D.3
  • 22
    • 61849184281 scopus 로고    scopus 로고
    • Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions
    • Mar
    • Q. Wei, H. Zhang, and J. Dai, "Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions," Neurocomput., vol. 72, no. 7-9, pp. 1839-1848, Mar. 2009.
    • (2009) Neurocomput , vol.72 , Issue.7-9 , pp. 1839-1848
    • Wei, Q.1    Zhang, H.2    Dai, J.3
  • 23
    • 84862811062 scopus 로고    scopus 로고
    • An iterative -optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state
    • Q. Wei and D. Liu, "An iterative -optimal control scheme for a class of discrete-time nonlinear systems with unfixed initial state," Neural Netw., vol. 32, pp. 236-244, 2012.
    • (2012) Neural Netw , vol.32 , pp. 236-244
    • Wei, Q.1    Liu, D.2
  • 24
    • 70349253929 scopus 로고    scopus 로고
    • The rbf neural network-based nearoptimal control for a class of discrete-time affine nonlinear systems with control constraint
    • Sep
    • H. Zhang, Y. Luo, and D. Liu, "The RBF neural network-based nearoptimal control for a class of discrete-time affine nonlinear systems with control constraint," IEEE Trans. Neural Netw., vol. 20, no. 9, pp. 1490-1503, Sep. 2009.
    • (2009) IEEE Trans. Neural Netw , vol.20 , Issue.9 , pp. 1490-1503
    • Zhang, H.1    Luo, Y.2    Liu, D.3
  • 25
    • 78650805234 scopus 로고    scopus 로고
    • An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
    • Jan
    • H. Zhang, Q. Wei, and D. Liu, "An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games," Autom., vol. 47, no. 1, pp. 207-214, Jan. 2011.
    • (2011) Autom , vol.47 , Issue.1 , pp. 207-214
    • Zhang, H.1    Wei, Q.2    Liu, D.3
  • 26
    • 49049119493 scopus 로고    scopus 로고
    • A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy hdp iteration algorithm
    • Jul
    • H. Zhang, Q. Wei, and Y. Luo, "A novel infinite-time optimal tracking control scheme for a class of discrete-time nonlinear systems via the greedy HDP iteration algorithm," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 937-942, Jul. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.38 , Issue.4 , pp. 937-942
    • Zhang, H.1    Wei, Q.2    Luo, Y.3
  • 27
    • 70349116541 scopus 로고    scopus 로고
    • Reinforcement learning and adaptive dynamic programming for feedback control
    • May
    • F. L. Lewis and D. Vrabie, "Reinforcement learning and adaptive dynamic programming for feedback control," IEEE Circuits Syst. Mag., vol. 9, no. 3, pp. 32-50, May 2009.
    • (2009) IEEE Circuits Syst. Mag , vol.9 , Issue.3 , pp. 32-50
    • Lewis, F.L.1    Vrabie, D.2
  • 28
    • 14844340822 scopus 로고    scopus 로고
    • Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network hjb approach
    • May
    • M. Abu-Khalaf and F. L. Lewis, "Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach," Autom., vol. 41, no. 5, pp. 779-791, May 2005.
    • (2005) Autom , vol.41 , Issue.5 , pp. 779-791
    • Abu-Khalaf, M.1    Lewis, F.L.2
  • 29
    • 78651311269 scopus 로고    scopus 로고
    • Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with -error bound
    • Jan
    • F. Wang, N. Jin, D. Liu, and Q. Wei, "Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with -error bound," IEEE Trans. Neural Netw., vol. 22, no. 1, pp. 24-36, Jan. 2011.
    • (2011) IEEE Trans. Neural Netw , vol.22 , Issue.1 , pp. 24-36
    • Wang, F.1    Jin, N.2    Liu, D.3    Wei, Q.4
  • 31
    • 49049089962 scopus 로고    scopus 로고
    • Discrete-time nonlinear hjb solution using approximate dynamic programming: Convergence proof
    • Aug
    • A. Al-Tamimi, F. L. Lewis, and M. Abu-Khalaf, "Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 943-949, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.38 , Issue.4 , pp. 943-949
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 32
    • 84863467146 scopus 로고    scopus 로고
    • Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming
    • Jul
    • D. Liu, D. Wang, D. Zhao, Q. Wei, and N. Jin, "Neural-network-based optimal control for a class of unknown discrete-time nonlinear systems using globalized dual heuristic programming," IEEE Trans. Autom. Sci. Eng., vol. 9, no. 3, pp. 628-634, Jul. 2012.
    • (2012) IEEE Trans. Autom. Sci. Eng , vol.9 , Issue.3 , pp. 628-634
    • Liu, D.1    Wang, D.2    Zhao, D.3    Wei, Q.4    Jin, N.5
  • 35
    • 48949116222 scopus 로고    scopus 로고
    • Neurodynamic programming and zero-sum games for constrained control systems
    • Apr
    • M. Abu-Khalaf, F. L. Lewis, and J. Huang, "Neurodynamic programming and zero-sum games for constrained control systems," IEEE Trans. Neural Netw., vol. 19, no. 7, pp. 1243-1252, Apr. 2008.
    • (2008) IEEE Trans. Neural Netw , vol.19 , Issue.7 , pp. 1243-1252
    • Abu-Khalaf, M.1    Lewis, F.L.2    Huang, J.3
  • 36
    • 0035273403 scopus 로고    scopus 로고
    • On-line learning control by association and reinforcement
    • Mar
    • J. Si and Y. T. Wang, "On-line learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001.
    • (2001) IEEE Trans. Neural Netw , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 37
    • 84859001250 scopus 로고    scopus 로고
    • Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators
    • Apr
    • Q. Yang and S. Jagannathan, "Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 42, no. 2, pp. 377-390, Apr. 2012.
    • (2012) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.42 , Issue.2 , pp. 377-390
    • Yang, Q.1    Jagannathan, S.2
  • 40
    • 34249012415 scopus 로고    scopus 로고
    • Local feedback passivation of nonlinear discrete- time systems through the speed-gradient algorithm
    • Jul
    • E. M. Navarro-Lopez, "Local feedback passivation of nonlinear discrete- time systems through the speed-gradient algorithm," Autom., vol. 43, no. 7, pp. 1302-1306, Jul. 2007.
    • (2007) Autom , vol.43 , Issue.7 , pp. 1302-1306
    • Navarro-Lopez, E.M.1
  • 41
    • 0026254227 scopus 로고
    • Non-linear discrete variable structure systems in quasi-sliding mode
    • H. Sira-Ramirez, "Non-linear discrete variable structure systems in quasi-sliding mode," Int. J. Control, vol. 54, no. 5, pp. 1171-1187, 1991.
    • (1991) Int. J. Control , vol.54 , Issue.5 , pp. 1171-1187
    • Sira-Ramirez, H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.