메뉴 건너뛰기




Volumn 1, Issue , 2004, Pages 316-321

Reinforcement learning for process identification, control and optimisation

Author keywords

Action dependent adaptive critics; Neural network; Nonlinear PI control; Reinforcement learning

Indexed keywords

APPROXIMATION THEORY; BACKPROPAGATION; COMPUTER SIMULATION; INTEGRATED CONTROL; LEARNING SYSTEMS; MATHEMATICAL MODELS; NEURAL NETWORKS; OPTIMIZATION; PROPORTIONAL CONTROL SYSTEMS;

EID: 8844245617     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (7)

References (16)
  • 1
    • 0002011091 scopus 로고
    • A menu of designs for reinforcement learning over time
    • Miller, W. T., Sutton, R. S. and Werbos P. J. eds, MIT Press, Cambridge, MA
    • Werbos, P. J., "A Menu of Designs for Reinforcement Learning Over Time", in Miller, W. T., Sutton, R. S. and Werbos P. J. eds, Neural Networks for Control, MIT Press, Cambridge, MA, pp. 67 - 95, 1990.
    • (1990) Neural Networks for Control , pp. 67-95
    • Werbos, P.J.1
  • 2
    • 0002031779 scopus 로고
    • Approximate dynamic programming control for real-time control and neural modelling
    • (Chapter 13), Edited by D. A. White and D. A. Sofge, New York, NY: Van Nostrand Reinhold
    • P. J. Werbos, "Approximate dynamic programming control for real-time control and neural modelling", Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches (Chapter 13), Edited by D. A. White and D. A. Sofge, New York, NY: Van Nostrand Reinhold, 1992.
    • (1992) Handbook of Intelligent Control: Neural, Fuzzy, and Adaptive Approaches
    • Werbos, P.J.1
  • 3
    • 0003787146 scopus 로고
    • Princeton, NJ, Princeton University Press
    • R. E. Bellman, "Dynamic Programming", Princeton, NJ, Princeton University Press, 1917.
    • (1917) Dynamic Programming
    • Bellman, R.E.1
  • 4
    • 0035273403 scopus 로고    scopus 로고
    • Online learning control by association and reinforcement
    • March
    • J. Si and Y. T. Wang, "Online Learning Control by Association and Reinforcement", IEEE Transactions on Neural Networks, vol. 12, no. 2, pp. 264-276, March 2001.
    • (2001) IEEE Transactions on Neural Networks , vol.12 , Issue.2 , pp. 264-276
    • Si, J.1    Wang, Y.T.2
  • 6
    • 0033716744 scopus 로고    scopus 로고
    • Efficient training of neural nets for nonlinear adaptive filtering using a recursive levenberg-marquardt algorithm
    • July
    • Ngia L.S.H. and Sjöberg J., "Efficient Training of Neural Nets for Nonlinear Adaptive Filtering using a Recursive Levenberg-Marquardt Algorithm", IEEE Transactions on Signal Processing, vol. 48, no. 7, pp. 1915-1926, July 2000.
    • (2000) IEEE Transactions on Signal Processing , vol.48 , Issue.7 , pp. 1915-1926
    • Ngia, L.S.H.1    Sjöberg, J.2
  • 7
    • 0029592634 scopus 로고
    • Adaptive critic designs: A case study for neurocontrol
    • Prokhorov, D., Santiago, R., and D. Wunsch, "Adaptive Critic Designs: A Case Study For Neurocontrol", Neural Networks, vol. 8, no. 9, pp. 1367-1372, 1995.
    • (1995) Neural Networks , vol.8 , Issue.9 , pp. 1367-1372
    • Prokhorov, D.1    Santiago, R.2    Wunsch, D.3
  • 9
    • 0025229247 scopus 로고
    • Consistency of HDP applied to a simple reinforcement learning problem
    • Werbos, P. J., "Consistency of HDP Applied to a Simple Reinforcement Learning Problem", Neural Networks, vol. 3, no. 2, pp. 179-189, 1990.
    • (1990) Neural Networks , vol.3 , Issue.2 , pp. 179-189
    • Werbos, P.J.1
  • 11
    • 33847202724 scopus 로고
    • Learning to predict by the method of temporal differences
    • Sutton R. S., "Learning to Predict by the Method of Temporal Differences", Machine Learning, Vol. 3, pp. 9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 12
    • 0004052906 scopus 로고
    • Implementation details of the TD(1) procedure for the case of vector predictions and backpropagation
    • Aug.
    • Sutton R.S., "Implementation Details of the TD(1) Procedure for the Case of Vector Predictions and Backpropagation", GTE Laboratories Technical Note TN87-509.1, Aug., 1989.
    • (1989) GTE Laboratories Technical Note , vol.TN87-509.1
    • Sutton, R.S.1
  • 15
    • 0026172111 scopus 로고
    • Dynamic matrix based control of fossil power plants
    • Rovlak, J. A. and Corlis, R., "Dynamic matrix based control of fossil power plants", IEEE Transactions on Energy Conversion, vol. 6, no. 2, pp. 320-326, 1991.
    • (1991) IEEE Transactions on Energy Conversion , vol.6 , Issue.2 , pp. 320-326
    • Rovlak, J.A.1    Corlis, R.2
  • 16
    • 8844273598 scopus 로고
    • Electrical power and chemical process applications
    • Neural network applications in control, Irwin, G. W., Warwick, K. and Hunt, K. J., eds., The Institution of Electrical Engineers, London, UK
    • Irwin, G. W., O'Reilly, P., Ligthbody, G., Brown, M., and Swidenbank, E., "Electrical power and chemical process applications", In Neural network applications in control, Irwin, G. W., Warwick, K. and Hunt, K. J., eds., IEE Control Engineering Series 53, The Institution of Electrical Engineers, London, UK, 1995.
    • (1995) IEE Control Engineering Series , vol.53
    • Irwin, G.W.1    O'Reilly, P.2    Ligthbody, G.3    Brown, M.4    Swidenbank, E.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.