Volume 53, Issue 19, 2014, Pages 8106-8119

Data-based suboptimal neuro-control design with reinforcement learning for dissipative spatially distributed processes

Author keywords

[No Author keywords available]

Indexed keywords

COST FUNCTIONS; DATA REDUCTION; EIGENVALUES AND EIGENFUNCTIONS; ITERATIVE METHODS; NONLINEAR EQUATIONS; ORDINARY DIFFERENTIAL EQUATIONS; PARTIAL DIFFERENTIAL EQUATIONS; PERTURBATION TECHNIQUES; REINFORCEMENT LEARNING; SPATIAL DISTRIBUTION

EID: 84988290534     PISSN: 08885885     EISSN: 15205045     Source Type: Journal    
DOI: 10.1021/ie4031743     Document Type: Article
Times cited: 32

References (59)
  • 2
    • 84864324494
    • Online policy iteration algorithm for optimal control of linear hyperbolic PDE systems
    • Luo, B.; Wu, H.-N. Online policy iteration algorithm for optimal control of linear hyperbolic PDE systems J. Process Control 2012, 22, 1161-1170
    • (2012) J. Process Control , vol.22 , pp. 1161-1170
    • Luo, B.1    Wu, H.-N.2
  • 3
    • 34547133970
    • Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks
    • Yadav, V.; Padhi, R.; Balakrishnan, S. N. Robust/optimal temperature profile control of a high-speed aerospace vehicle using neural networks IEEE Trans. Neural Networks 2007, 18, 1115-1128
    • (2007) IEEE Trans. Neural Networks , vol.18 , pp. 1115-1128
    • Yadav, V.1    Padhi, R.2    Balakrishnan, S.N.3
  • 4
    • 45949083297
    • Optimal control of diffusion-convection-reaction processes using reduced-order models
    • Li, M.; Christofides, P. D. Optimal control of diffusion-convection-reaction processes using reduced-order models Comput. Chem. Eng. 2008, 32, 2123-2135
    • (2008) Comput. Chem. Eng. , vol.32 , pp. 2123-2135
    • Li, M.1    Christofides, P.D.2
  • 5
    • 84869489097
    • Approximate optimal control design for nonlinear one-dimensional parabolic PDE systems using empirical eigenfunctions and neural network
    • Luo, B.; Wu, H.-N. Approximate optimal control design for nonlinear one-dimensional parabolic PDE systems using empirical eigenfunctions and neural network IEEE Trans. Syst. Man Cybern. Part B 2012, 42, 1538-1549
    • (2012) IEEE Trans. Syst. Man Cybern. Part B , vol.42 , pp. 1538-1549
    • Luo, B.1    Wu, H.-N.2
  • 6
    • 0036899040
    • Dynamic optimization of dissipative PDE systems using nonlinear order reduction
    • Armaou, A.; Christofides, P. D. Dynamic optimization of dissipative PDE systems using nonlinear order reduction Chem. Eng. Sci. 2002, 57, 5083-5114
    • (2002) Chem. Eng. Sci. , vol.57 , pp. 5083-5114
    • Armaou, A.1    Christofides, P.D.2
  • 7
    • 33846101066
    • Predictive output feedback control of parabolic partial differential equations (PDEs)
    • Dubljevic, S.; Christofides, P. D. Predictive output feedback control of parabolic partial differential equations (PDEs) Ind. Eng. Chem. Res. 2006, 45, 8421-8429
    • (2006) Ind. Eng. Chem. Res. , vol.45 , pp. 8421-8429
    • Dubljevic, S.1    Christofides, P.D.2
  • 8
    • 34547112281
    • Optimal LQ-feedback regulation of a nonisothermal plug flow reactor model by spectral factorization
    • Aksikas, I.; Winkin, J. J.; Dochain, D. Optimal LQ-feedback regulation of a nonisothermal plug flow reactor model by spectral factorization IEEE Trans. Automat. Control 2007, 52, 1179-1193
    • (2007) IEEE Trans. Automat. Control , vol.52 , pp. 1179-1193
    • Aksikas, I.1    Winkin, J.J.2    Dochain, D.3
  • 9
    • 58349106441
    • Optimal control of switched distributed parameter systems with spatially scheduled actuators
    • Iftime, O. V.; Demetriou, M. A. Optimal control of switched distributed parameter systems with spatially scheduled actuators Automatica 2009, 45, 312-323
    • (2009) Automatica , vol.45 , pp. 312-323
    • Iftime, O.V.1    Demetriou, M.A.2
  • 10
    • 84863856475
    • Heuristic dynamic programming algorithm for optimal control design of linear continuous-time hyperbolic PDE systems
    • Wu, H.-N.; Luo, B. Heuristic dynamic programming algorithm for optimal control design of linear continuous-time hyperbolic PDE systems Ind. Eng. Chem. Res. 2012, 51, 9310-9319
    • (2012) Ind. Eng. Chem. Res. , vol.51 , pp. 9310-9319
    • Wu, H.-N.1    Luo, B.2
  • 11
    • 0001233894
    • Control of nonlinear distributed process systems: Recent developments and challenges
    • Christofides, P. D. Control of nonlinear distributed process systems: Recent developments and challenges AIChE J. 2001, 47, 514-518
    • (2001) AIChE J. , vol.47 , pp. 514-518
    • Christofides, P.D.1
  • 14
    • 34247503604
    • An input/output approach to the optimal transition control of a class of distributed chemical reactors
    • Li, M. H.; Christofides, P. D. An input/output approach to the optimal transition control of a class of distributed chemical reactors Chem. Eng. Sci. 2007, 62, 2979-2988
    • (2007) Chem. Eng. Sci. , vol.62 , pp. 2979-2988
    • Li, M.H.1    Christofides, P.D.2
  • 17
    • 79958779459
    • Reinforcement learning in feedback control
    • Hafner, R.; Riedmiller, M. Reinforcement learning in feedback control Mach. Learn. 2011, 84, 137-169
    • (2011) Mach. Learn. , vol.84 , pp. 137-169
    • Hafner, R.1    Riedmiller, M.2
  • 18
    • 84865442280
    • Autonomous adaptive and active tuning up of the dissolved oxygen setpoint in a wastewater treatment plant using reinforcement learning
    • Hernandez-del-Olmo, F.; Gaudioso, E.; Nevado, A. Autonomous adaptive and active tuning up of the dissolved oxygen setpoint in a wastewater treatment plant using reinforcement learning IEEE Trans. Syst. Man Cybern. Part C 2012, 42, 768-774
    • (2012) IEEE Trans. Syst. Man Cybern. Part C , vol.42 , pp. 768-774
    • Hernandez-Del-Olmo, F.1    Gaudioso, E.2    Nevado, A.3
  • 21
    • 0033629916
    • Reinforcement learning in continuous time and space
    • Doya, K. Reinforcement learning in continuous time and space Neural Comput. 2000, 12, 219-245
    • (2000) Neural Comput. , vol.12 , pp. 219-245
    • Doya, K.1
  • 22
    • 84871756682
    • A survey of actor-critic reinforcement learning: Standard and natural policy gradients
    • Grondman, I.; Busoniu, L.; Lopes, G. A. D.; Babuska, R. A survey of actor-critic reinforcement learning: standard and natural policy gradients IEEE Trans. Syst. Man Cybern. Part C 2012, 42, 1291-1307
    • (2012) IEEE Trans. Syst. Man Cybern. Part C , vol.42 , pp. 1291-1307
    • Grondman, I.1    Busoniu, L.2    Lopes, G.A.D.3    Babuska, R.4
  • 24
    • 0031143730
    • An analysis of temporal-difference learning with function approximation
    • Tsitsiklis, J. N.; Van Roy, B. An analysis of temporal-difference learning with function approximation IEEE Trans. Automat. Control 1997, 42, 674-690
    • (1997) IEEE Trans. Automat. Control , vol.42 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 25
    • 70349116541
    • Reinforcement learning and adaptive dynamic programming for feedback control
    • Lewis, F. L.; Vrabie, D. Reinforcement learning and adaptive dynamic programming for feedback control IEEE Circ. Syst. Mag. 2009, 9, 32-50
    • (2009) IEEE Circ. Syst. Mag. , vol.9 , pp. 32-50
    • Lewis, F.L.1    Vrabie, D.2
  • 26
    • 34047138362
    • Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints
    • He, P.; Jagannathan, S. Reinforcement learning neural-network-based controller for nonlinear discrete-time systems with input constraints IEEE Trans. Syst. Man Cybern. Part B 2007, 37, 425-436
    • (2007) IEEE Trans. Syst. Man Cybern. Part B , vol.37 , pp. 425-436
    • He, P.1    Jagannathan, S.2
  • 27
    • 49049089962
    • Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof
    • Al-Tamimi, A.; Lewis, F. L.; Abu-Khalaf, M. Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof IEEE Trans. Syst. Man Cybern. Part B 2008, 38, 943-949
    • (2008) IEEE Trans. Syst. Man Cybern. Part B , vol.38 , pp. 943-949
    • Al-Tamimi, A.1    Lewis, F.L.2    Abu-Khalaf, M.3
  • 28
    • 49049106959
    • Direct heuristic dynamic programming for damping oscillations in a large power system
    • Lu, C.; Si, J.; Xie, X. R. Direct heuristic dynamic programming for damping oscillations in a large power system IEEE Trans. Syst. Man Cybern. Part B 2008, 38, 1008-1013
    • (2008) IEEE Trans. Syst. Man Cybern. Part B , vol.38 , pp. 1008-1013
    • Lu, C.1    Si, J.2    Xie, X.R.3
  • 29
    • 83855165164
    • Optimal tracking control for a class of nonlinear discrete-time systems with time delays based on heuristic dynamic programming
    • Zhang, H.; Song, R.; Wei, Q.; Zhang, T. Optimal tracking control for a class of nonlinear discrete-time systems with time delays based on heuristic dynamic programming IEEE Trans. Neural Netw. 2011, 22, 1851-1862
    • (2011) IEEE Trans. Neural Netw. , vol.22 , pp. 1851-1862
    • Zhang, H.1    Song, R.2    Wei, Q.3    Zhang, T.4
  • 30
    • 65149086182
    • Approximate dynamic programming based optimal control applied to an integrated plant with a reactor and a distillation column with recycle
    • Tosukhowong, T.; Lee, J. H. Approximate dynamic programming based optimal control applied to an integrated plant with a reactor and a distillation column with recycle AIChE J. 2009, 55, 919-930
    • (2009) AIChE J. , vol.55 , pp. 919-930
    • Tosukhowong, T.1    Lee, J.H.2
  • 31
    • 78651311269
    • Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound
    • Wang, F.; Jin, N.; Liu, D.; Wei, Q. Adaptive dynamic programming for finite-horizon optimal control of discrete-time nonlinear systems with ε-error bound IEEE Trans. Neural Netw. 2011, 22, 24-36
    • (2011) IEEE Trans. Neural Netw. , vol.22 , pp. 24-36
    • Wang, F.1    Jin, N.2    Liu, D.3    Wei, Q.4
  • 32
    • 79551685808
    • Reinforcement learning for partially observable dynamic processes: Adaptive dynamic programming using measured output data
    • Lewis, F. L.; Vamvoudakis, K. G. Reinforcement learning for partially observable dynamic processes: adaptive dynamic programming using measured output data IEEE Trans. Syst. Man Cybern. Part B 2011, 41, 14-25
    • (2011) IEEE Trans. Syst. Man Cybern. Part B , vol.41 , pp. 14-25
    • Lewis, F.L.1    Vamvoudakis, K.G.2
  • 33
    • 84859001250
    • Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators
    • Yang, Q.; Jagannathan, S. Reinforcement learning controller design for affine nonlinear discrete-time systems using online approximators IEEE Trans. Syst. Man Cybern. Part B 2012, 42, 377-390
    • (2012) IEEE Trans. Syst. Man Cybern. Part B , vol.42 , pp. 377-390
    • Yang, Q.1    Jagannathan, S.2
  • 34
    • 0035273403
    • Online learning control by association and reinforcement
    • Si, J.; Wang, Y. Online learning control by association and reinforcement IEEE Trans. Neural Netw. 2001, 12, 264-276
    • (2001) IEEE Trans. Neural Netw. , vol.12 , pp. 264-276
    • Si, J.1    Wang, Y.2
  • 36
    • 58349110975
    • Adaptive optimal control for continuous-time linear systems based on policy iteration
    • Vrabie, D.; Pastravanu, O.; Abu-Khalaf, M.; Lewis, F. L. Adaptive optimal control for continuous-time linear systems based on policy iteration Automatica 2009, 45, 477-484
    • (2009) Automatica , vol.45 , pp. 477-484
    • Vrabie, D.1    Pastravanu, O.2    Abu-Khalaf, M.3    Lewis, F.L.4
  • 37
    • 67349145396
    • Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems
    • Vrabie, D.; Lewis, F. L. Neural network approach to continuous-time direct adaptive optimal control for partially unknown nonlinear systems Neural Netw. 2009, 22, 237-246
    • (2009) Neural Netw. , vol.22 , pp. 237-246
    • Vrabie, D.1    Lewis, F.L.2
  • 38
    • 77950630017
    • Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem
    • Vamvoudakis, K. G.; Lewis, F. L. Online actor-critic algorithm to solve the continuous-time infinite horizon optimal control problem Automatica 2010, 46, 878-888
    • (2010) Automatica , vol.46 , pp. 878-888
    • Vamvoudakis, K.G.1    Lewis, F.L.2
  • 39
    • 33644523102
    • Advances in distributed parameter approach to the dynamics and control of activated sludge processes for wastewater treatment
    • Lee, T. T.; Wang, F. Y.; Newell, R. B. Advances in distributed parameter approach to the dynamics and control of activated sludge processes for wastewater treatment Water Res. 2006, 40, 853-869
    • (2006) Water Res. , vol.40 , pp. 853-869
    • Lee, T.T.1    Wang, F.Y.2    Newell, R.B.3
  • 40
    • 0033231203
    • Output feedback control of parabolic PDE systems with nonlinear spatial differential operators
    • Baker, J.; Christofides, P. D. Output feedback control of parabolic PDE systems with nonlinear spatial differential operators Ind. Eng. Chem. Res. 1999, 38, 4372-4380
    • (1999) Ind. Eng. Chem. Res. , vol.38 , pp. 4372-4380
    • Baker, J.1    Christofides, P.D.2
  • 41
    • 0035135921
    • Crystal temperature control in the Czochralski crystal growth process
    • Armaou, A.; Christofides, P. D. Crystal temperature control in the Czochralski crystal growth process AIChE J. 2001, 47, 79-106
    • (2001) AIChE J. , vol.47 , pp. 79-106
    • Armaou, A.1    Christofides, P.D.2
  • 43
    • 84861114041
    • Probabilistic PCA-based spatiotemporal multimodeling for nonlinear distributed parameter processes
    • Qi, C.-K.; Li, H.-X.; Li, S.; Zhao, X.; Gao, F. Probabilistic PCA-based spatiotemporal multimodeling for nonlinear distributed parameter processes Ind. Eng. Chem. Res. 2012, 51, 6811-6822
    • (2012) Ind. Eng. Chem. Res. , vol.51 , pp. 6811-6822
    • Qi, C.-K.1    Li, H.-X.2    Li, S.3    Zhao, X.4    Gao, F.5
  • 44
    • 84874531352
    • Output feedback control of dissipative PDE systems with partial sensor information based on adaptive model reduction
    • Pitchaiah, S.; Armaou, A. Output feedback control of dissipative PDE systems with partial sensor information based on adaptive model reduction AIChE J. 2013, 59, 747-760
    • (2013) AIChE J. , vol.59 , pp. 747-760
    • Pitchaiah, S.1    Armaou, A.2
  • 45
    • 78049407663
    • Output feedback control of distributed parameter systems using adaptive proper orthogonal decomposition
    • Pitchaiah, S.; Armaou, A. Output feedback control of distributed parameter systems using adaptive proper orthogonal decomposition Ind. Eng. Chem. Res. 2010, 49, 10496-10509
    • (2010) Ind. Eng. Chem. Res. , vol.49 , pp. 10496-10509
    • Pitchaiah, S.1    Armaou, A.2
  • 46
    • 0030247167
    • REVIEW: MEMS and its applications for flow control
    • Ho, C.; Tai, Y. REVIEW: MEMS and its applications for flow control ASME J. Fluid Eng. 1996, 118, 437-447
    • (1996) ASME J. Fluid Eng. , vol.118 , pp. 437-447
    • Ho, C.1    Tai, Y.2
  • 49
    • 0031366767
    • Finite-dimensional control of parabolic PDE systems using approximate inertial manifolds
    • Christofides, P. D.; Daoutidis, P. Finite-dimensional control of parabolic PDE systems using approximate inertial manifolds J. Math. Anal. Appl. 1997, 216, 398-420
    • (1997) J. Math. Anal. Appl. , vol.216 , pp. 398-420
    • Christofides, P.D.1    Daoutidis, P.2
  • 50
    • 0032555622
    • Robust control of parabolic PDE systems
    • Christofides, P. D. Robust control of parabolic PDE systems Chem. Eng. Sci. 1998, 53, 2949-2965
    • (1998) Chem. Eng. Sci. , vol.53 , pp. 2949-2965
    • Christofides, P.D.1
  • 51
    • 0018441647
    • An approximation theory of optimal control for trainable manipulators
    • Saridis, G. N.; Lee, C. G. An approximation theory of optimal control for trainable manipulators IEEE Trans. Syst. Man Cybern. Part B 1979, 9, 152-159
    • (1979) IEEE Trans. Syst. Man Cybern. Part B , vol.9 , pp. 152-159
    • Saridis, G.N.1    Lee, C.G.2
  • 56
    • 0021386070
    • Persistence of excitation conditions and the convergence of adaptive schemes
    • Bitmead, R. Persistence of excitation conditions and the convergence of adaptive schemes IEEE Trans. Inf. Theory 1984, 30, 183-191
    • (1984) IEEE Trans. Inf. Theory , vol.30 , pp. 183-191
    • Bitmead, R.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.