Volume 35, Issue 2, 2005, Pages 381-385

Second-order training of adaptive critics for online process control

Author keywords

Action dependent adaptive critic; Intelligent control; Multilayer perceptrons; Neural networks; Nonlinear process control; Process optimization; Reinforcement learning

Indexed keywords

APPROXIMATION THEORY; CHEMICAL REACTORS; COMPUTER SIMULATION; IDENTIFICATION (CONTROL SYSTEMS); INTELLIGENT CONTROL; LEARNING ALGORITHMS; MATHEMATICAL MODELS; MULTILAYER NEURAL NETWORKS; NONLINEAR CONTROL SYSTEMS; OPTIMAL CONTROL SYSTEMS; PROCESS CONTROL

EID: 17444428905     PISSN: 10834419     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCB.2004.843276     Document Type: Article
Times cited: 22

References (28)
  • 1
    • 0002011091
    • "A menu of designs for reinforcement learning over time"
    • W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press
    • P. J. Werbos, "A menu of designs for reinforcement learning over time," in Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press, 1990, pp. 67-95.
    • (1990) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 2
    • 0002031779
    • "Approximate dynamic programming for real-time control and neural modeling"
    • D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold
    • P. J. Werbos, "Approximate dynamic programming for real-time control and neural modeling," in Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches, D. A. White and D. A. Sofge, Eds. New York: Van Nostrand Reinhold, 1992, pp. 493-525.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy and Adaptive Approaches , pp. 493-525
    • Werbos, P.J.1
  • 3
    • 0027656692
    • "Reinforcement learning control of unknown dynamic systems"
    • Q. H. Wu and A. C. Pugh, "Reinforcement learning control of unknown dynamic systems," Proc. Inst. Elect. Eng. D, vol. 140, pp. 313-322, 1993.
    • (1993) Proc. Inst. Elect. Eng. D , vol.140 , pp. 313-322
    • Wu, Q.H.1    Pugh, A.C.2
  • 4
    • 0029346234
    • "Reinforcement learning control using interconnected learning automata"
    • Q. H. Wu, "Reinforcement learning control using interconnected learning automata," Int. J. Control, vol. 62, no. 1, pp. 1-16, 1995.
    • (1995) Int. J. Control , vol.62 , Issue.1 , pp. 1-16
    • Wu, Q.H.1
  • 6
    • 0001085193
    • "A strategy for controlling nonlinear systems using a learning automaton"
    • X. Zeng, J. Zhou, and C. Vasseur, "A strategy for controlling nonlinear systems using a learning automaton," Automatica, vol. 36, pp. 1517-1524, 2000.
    • (2000) Automatica , vol.36 , pp. 1517-1524
    • Zeng, X.1    Zhou, J.2    Vasseur, C.3
  • 7
    • 0033233953
    • "Concepts and facilities of a neural reinforcement learning control architecture for technical process control"
    • London, U.K.: Springer-Verlag
    • M. Riedmiller, "Concepts and facilities of a neural reinforcement learning control architecture for technical process control," in Neural Computing and Applications. London, U.K.: Springer-Verlag, 1999, vol. 8, pp. 323-338.
    • (1999) Neural Computing and Applications , vol.8 , pp. 323-338
    • Riedmiller, M.1
  • 8
    • 1442265466
    • "Power system stability control: Reinforcement learning framework"
    • D. Ernst, M. Glavic, and L. Wehenkel, "Power system stability control: Reinforcement learning framework," IEEE Trans. Power Syst., vol. 19, no. 1, pp. 427-436, Feb. 2004.
    • (2004) IEEE Trans. Power Syst. , vol.19 , Issue.1 , pp. 427-436
    • Ernst, D.1    Glavic, M.2    Wehenkel, L.3
  • 9
    • 0141459837
    • "Adaptive critic designs and their implementations on different neural network architectures"
    • J. W. Park, R. G. Harley, and G. K. Venayagamoorthy, "Adaptive critic designs and their implementations on different neural network architectures," in Proc. Int. Joint Conf. Neural Networks, vol. 3, 2003, pp. 1879-1884.
    • (2003) Proc. Int. Joint Conf. Neural Networks , vol.3 , pp. 1879-1884
    • Park, J.W.1    Harley, R.G.2    Venayagamoorthy, G.K.3
  • 10
    • 0036446485
    • "Adaptive critic based optimal neurocontrol for synchronous generator in power system using MLP/RBF neural networks"
    • J. W. Park, "Adaptive critic based optimal neurocontrol for synchronous generator in power system using MLP/RBF neural networks," in Conf. Record - IAS Annu. Meeting, vol. 2, 2002, pp. 1447-1454.
    • (2002) Conf. Record - IAS Annu. Meeting , vol.2 , pp. 1447-1454
    • Park, J.W.1
  • 12
    • 0034878578
    • "Excitation and turbine neurocontrol with derivative adaptive critics of multiple generators on the power grid"
    • G. K. Venayagamoorthy, "Excitation and turbine neurocontrol with derivative adaptive critics of multiple generators on the power grid," in Proc. Int. Joint Conf. Neural Networks, vol. 2, 2001, pp. 984-989.
    • (2001) Proc. Int. Joint Conf. Neural Networks , vol.2 , pp. 984-989
    • Venayagamoorthy, G.K.1
  • 13
    • 0036565019
    • "Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator"
    • G. K. Venayagamoorthy, "Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol of a turbogenerator," IEEE Trans. Neural Netw., vol. 13, no. 3, pp. 764-773, May 2002.
    • (2002) IEEE Trans. Neural Netw. , vol.13 , Issue.3 , pp. 764-773
    • Venayagamoorthy, G.K.1
  • 14
    • 0035506833
    • "Dynamic re-optimization of a fed-batch fermentor using adaptive critic designs"
    • M. S. Iyer and D. C. Wunsch II, "Dynamic re-optimization of a fed-batch fermentor using adaptive critic designs," IEEE Trans. Neural Netw., vol. 12, no. 6, pp. 1433-1444, Nov. 2001.
    • (2001) IEEE Trans. Neural Netw. , vol.12 , Issue.6 , pp. 1433-1444
    • Iyer, M.S.1    Wunsch II, D.C.2
  • 15
    • 0037707291
    • "Proper orthogonal decomposition based optimal neurocontrol synthesis of a chemical reactor process using approximate dynamic programming"
    • P. Radhakant and S. N. Balakrishnan, "Proper orthogonal decomposition based optimal neurocontrol synthesis of a chemical reactor process using approximate dynamic programming," Neural Netw., vol. 16, pp. 719-728, 2003.
    • (2003) Neural Netw. , vol.16 , pp. 719-728
    • Radhakant, P.1    Balakrishnan, S.N.2
  • 16
    • 0025569234
    • "Neural network based process optimization and control"
    • D. A. Sofge and D. A. White, "Neural network based process optimization and control," in Proc. IEEE Conf. Decision and Control, vol. 6, 1990, pp. 3270-3276.
    • (1990) Proc. IEEE Conf. Decision and Control , vol.6 , pp. 3270-3276
    • Sofge, D.A.1    White, D.A.2
  • 17
    • 0026849113
    • "Process control via artificial neural networks and reinforcement learning"
    • J. C. Hoskins and D. M. Himmelblau, "Process control via artificial neural networks and reinforcement learning," Comput. Chem. Eng., vol. 16, no. 4, pp. 241-251, 1992.
    • (1992) Comput. Chem. Eng. , vol.16 , Issue.4 , pp. 241-251
    • Hoskins, J.C.1    Himmelblau, D.M.2
  • 18
    • 0003630733
    • "Learning and Problem Solving With Multilayer Connectionist Systems"
    • Ph.D. dissertation, Dept. Comput. Inform. Sci., Univ. Massachusetts, Amherst
    • C. W. Anderson, "Learning and Problem Solving With Multilayer Connectionist Systems," Ph.D. dissertation, Dept. Comput. Inform. Sci., Univ. Massachusetts, Amherst, 1986.
    • (1986)
    • Anderson, C.W.1
  • 19
    • 0003997198
    • "Strategy Learning With Multilayer Connectionist Representations"
    • GTE Laboratories, Waltham, MA, Tech. Rep. TR87-507.3
    • C. W. Anderson, "Strategy Learning With Multilayer Connectionist Representations," GTE Laboratories, Waltham, MA, Tech. Rep. TR87-507.3, 1987.
    • (1987)
    • Anderson, C.W.1
  • 20
    • 85012688561
    • Princeton, NJ: Princeton Univ. Press
    • R. E. Bellman, Dynamic Programming. Princeton, NJ: Princeton Univ. Press, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 21
    • 0035273403
    • "Online learning control by association and reinforcement"
    • J. Si and Y. T. Wang, "Online learning control by association and reinforcement," IEEE Trans. Neural Netw., vol. 12, no. 2, pp. 264-276, Mar. 2001.
    • (2001) IEEE Trans. Neural Netw. , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 22
    • 33847202724
    • "Learning to predict by the method of temporal differences"
    • R. S. Sutton, "Learning to predict by the method of temporal differences," in Mach. Learn., 1988, vol. 3, pp. 9-44.
    • (1988) Mach. Learn. , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 23
    • 0032255815
    • "Neural network identification: A survey of gradient based methods"
    • London, U.K., Nov. Dig. 98/521
    • S. McLoone, "Neural network identification: A survey of gradient based methods," in IEE Colloq. Optimization in Control: Methods and Applications, London, U.K., Nov. 1998, Dig. 98/521.
    • (1998) IEE Colloq. Optimization in Control: Methods and Applications
    • McLoone, S.1
  • 24
    • 0033716744
    • "Efficient training of neural nets for nonlinear adaptive filtering using a recursive Levenberg-Marquardt algorithm"
    • L. S. H. Ngia and J. Sjöberg, "Efficient training of neural nets for nonlinear adaptive filtering using a recursive Levenberg-Marquardt algorithm," IEEE Trans. Signal Process., vol. 48, no. 7, pp. 1915-1926, Jul. 2000.
    • (2000) IEEE Trans. Signal Process. , vol.48 , Issue.7 , pp. 1915-1926
    • Ngia, L.S.H.1    Sjöberg, J.2
  • 26
    • 0025546834
    • "An adaptive nonlinear predictive controller"
    • J. D. Morningred, "An adaptive nonlinear predictive controller," in Proc. American Control Conf. (ACC), vol. 2, 1990, pp. 1614-1619.
    • (1990) Proc. American Control Conf. (ACC) , vol.2 , pp. 1614-1619
    • Morningred, J.D.1
  • 27
    • 0026172111
    • "Dynamic matrix based control of fossil power plants"
    • J. A. Rovlak and R. Corlis, "Dynamic matrix based control of fossil power plants," IEEE Trans. Energy Convers., vol. 6, no. 2, pp. 320-326, Jun. 1991.
    • (1991) IEEE Trans. Energy Convers. , vol.6 , Issue.2 , pp. 320-326
    • Rovlak, J.A.1    Corlis, R.2
  • 28
    • 8844273598
    • "Electrical power and chemical process applications"
    • ser. IEE Control Engineering Series 53, G. W. Irwin, K. Warwick, and K. J. Hunt, Eds. London, U.K.: IEE
    • G. W. Irwin, P. O'Reilly, G. Lightbody, M. Brown, and E. Swidenbank, "Electrical power and chemical process applications," in Neural Network Applications in Control, ser. IEE Control Engineering Series 53, G. W. Irwin, K. Warwick, and K. J. Hunt, Eds. London, U.K.: IEE, 1995.
    • (1995) Neural Network Applications in Control
    • Irwin, G.W.1    O'Reilly, P.2    Lightbody, G.3    Brown, M.4    Swidenbank, E.5


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.