Volume 2, 1997, Pages 755-760

Partitioning input space for reinforcement learning for control

Author keywords

[No Author keywords available]

Indexed keywords

INPUT SPACE; SYSTEM DESIGNERS; SYSTEM LEARNING; TEMPORAL DOMAIN

EID: 0030691689     PISSN: 10987576     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICNN.1997.616117     Document Type: Conference Paper
Times cited : (8)

References (18)
  • 1
    • 0024646143
    • Learning to control an inverted pendulum using neural networks
    • C. W. Anderson. Learning to control an inverted pendulum using neural networks. IEEE Control Systems Magazine, 9(3):31-37, 1989.
    • (1989) IEEE Control Systems Magazine , vol.9 , Issue.3 , pp. 31-37
    • Anderson, C.W.1
  • 2
  • 3
    • 0003787146
    • Princeton University Press, Princeton, NJ
    • R. E. Bellman. Dynamic Programming. Princeton University Press, Princeton, NJ, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 9
    • 0026837390
    • Adaptive fuzzy systems for backing up a truck-and-trailer
    • S.-G. Kong and B. Kosko. Adaptive fuzzy systems for backing up a truck-and-trailer. IEEE Trans. on Neural Networks, 3(2):211-223, 1992.
    • (1992) IEEE Trans. on Neural Networks , vol.3 , Issue.2 , pp. 211-223
    • Kong, S.-G.1    Kosko, B.2
  • 10
    • 0024882715
    • 3d-neural-net for learning visuomotor-coordination of a robot arm
    • Erlbaum
    • T. M. Martinetz, H. J. Ritter, and K. J. Schulten. 3d-neural-net for learning visuomotor-coordination of a robot arm. In Int'l Joint Conf. on Neural Networks, volume II, pages 351-356. Erlbaum, 1989.
    • (1989) Int'l Joint Conf. on Neural Networks , vol.2 , pp. 351-356
    • Martinetz, T.M.1    Ritter, H.J.2    Schulten, K.J.3
  • 11
    • 0000827179
    • Boxes: An experiment in adaptive control
    • In E. Dale and D. Michie, editors. Oliver and Boyd, Edinburgh
    • D. Michie and R. Chambers. Boxes: an experiment in adaptive control. In E. Dale and D. Michie, editors, Machine Intelligence. Oliver and Boyd, Edinburgh, 1968.
    • (1968) Machine Intelligence
    • Michie, D.1    Chambers, R.2
  • 12
    • 0029514510
    • The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces
    • A. W. Moore and C. G. Atkeson. The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces. Machine Learning, 21:199-233, 1995.
    • (1995) Machine Learning , vol.21 , pp. 199-233
    • Moore, A.W.1    Atkeson, C.G.2
  • 15
    • 0020199330
    • A self-learning automaton with variable resolution for high precision assembly by industrial robots
    • October
    • J. Simons, H. V. Brussel, J. D. Schutter, and J. Verhaert. A self-learning automaton with variable resolution for high precision assembly by industrial robots. IEEE Trans. on Automatic Control, 27(5):1109-1113, October 1982.
    • (1982) IEEE Trans. on Automatic Control , vol.27 , Issue.5 , pp. 1109-1113
    • Simons, J.1    Brussel, H.V.2    Schutter, J.D.3    Verhaert, J.4
  • 16
    • 0029753630
    • Reinforcement learning with replacing eligibility traces
    • S. P. Singh and R. S. Sutton. Reinforcement learning with replacing eligibility traces. Machine Learning, 22:123-158, 1996.
    • (1996) Machine Learning , vol.22 , pp. 123-158
    • Singh, S.P.1    Sutton, R.S.2
  • 17
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • R. S. Sutton. Learning to predict by the methods of temporal differences. Machine Learning, 3:9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 18
    • 0345182481
    • Fuzzy BOXES as an alternative to neural networks for difficult control problems
    • N. Woodcock, N. J. Hallam, and P. D. Picton. Fuzzy BOXES as an alternative to neural networks for difficult control problems. Artificial Intelligence in Engineering, pages 903-919, 1991.
    • (1991) Artificial Intelligence in Engineering , pp. 903-919
    • Woodcock, N.1    Hallam, N.J.2    Picton, P.D.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.