메뉴 건너뛰기




Volumn 9, Issue 2, 1999, Pages 119-127

Training reinforcement neurocontrollers using the polytope algorithm

Author keywords

[No Author keywords available]

Indexed keywords

INTELLIGENT CONTROL; LEARNING ALGORITHMS; LEARNING SYSTEMS; OPTIMIZATION;

EID: 0032676817     PISSN: 13704621     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1018669223478     Document Type: Article
Times cited : (3)

References (16)
  • 2
    • 0024646143 scopus 로고
    • Learning to control an inverted pendulum using neural networks
    • C.W. Anderson, "Learning to control an inverted pendulum using neural networks", IEEE Control Systems Magazine, Vol. 2, pp. 31-37, 1989.
    • (1989) IEEE Control Systems Magazine , vol.2 , pp. 31-37
    • Anderson, C.W.1
  • 4
    • 0026923465 scopus 로고
    • Learning and tuning fuzzy logic controllers using reinforcements
    • H.R. Berenji and P. Khedkar, "Learning and tuning fuzzy logic controllers using reinforcements", IEEE Trans, on Neural Networks, Vol. 3, pp. 724-740, 1992.
    • (1992) IEEE Trans, on Neural Networks , vol.3 , pp. 724-740
    • Berenji, H.R.1    Khedkar, P.2
  • 10
    • 0004650460 scopus 로고
    • Efficient reinforcement learning strategies for the pole balancing problem
    • M. Marinaro and P. Morasso (eds), Springer-Verlag
    • D. Kontoravdis, A. Likas and A. Stafylopatis, "Efficient reinforcement learning strategies for the pole balancing problem", in M. Marinaro and P. Morasso (eds), Proc. ICANN'94, pp. 659-662, Springer-Verlag, 1994.
    • (1994) Proc. ICANN'94 , pp. 659-662
    • Kontoravdis, D.1    Likas, A.2    Stafylopatis, A.3
  • 11
    • 0030147547 scopus 로고    scopus 로고
    • Reinforcement learning for an ART-based fuzzy adaptive learning control network
    • C-J. Lin and C-T. Lin, "Reinforcement learning for an ART-based fuzzy adaptive learning control network", IEEE Trans. on Neural Networks, Vol. 7, pp. 709-731, 1996.
    • (1996) IEEE Trans. on Neural Networks , vol.7 , pp. 709-731
    • Lin, C.-J.1    Lin, C.-T.2
  • 12
    • 0000238336 scopus 로고
    • A simplex method for function minimization
    • J.A. Nelder and R. Mead, "A simplex method for function minimization", Computer Journal, Vol. 7, pp. 308-313, 1965.
    • (1965) Computer Journal , vol.7 , pp. 308-313
    • Nelder, J.A.1    Mead, R.2
  • 14
    • 84946640734 scopus 로고
    • Sequential application of simplex designs in optimization and evolutionary operation
    • W. Spendley, G. Hext and F. Himsworth, "Sequential application of simplex designs in optimization and evolutionary operation", Technometrics, Vol. 4, pp. 441-461, 1962.
    • (1962) Technometrics , vol.4 , pp. 441-461
    • Spendley, W.1    Hext, G.2    Himsworth, F.3
  • 15
    • 34249833101 scopus 로고
    • Learning from delayed rewards
    • C. Watkins and P. Dayan, "Learning from delayed rewards", Machine Learning, Vol. 8, pp. 279-292, 1992.
    • (1992) Machine Learning , vol.8 , pp. 279-292
    • Watkins, C.1    Dayan, P.2
  • 16
    • 0027701513 scopus 로고
    • Genetic reinforcement learning for neurocontrol problems
    • D. Whitley, S. Dominic, R. Das and C.W. Anderson, "Genetic reinforcement learning for neurocontrol problems", Machine Learning, Vol. 13, pp. 259-284, 1993.
    • (1993) Machine Learning , vol.13 , pp. 259-284
    • Whitley, D.1    Dominic, S.2    Das, R.3    Anderson, C.W.4


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.