메뉴 건너뛰기




Volumn 1, Issue , 2002, Pages 1062-1067

Approximating the value function for continuous space reinforcement learning in robot control

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; MATHEMATICAL MODELS; MOBILE ROBOTS; MOTION CONTROL; NEURAL NETWORKS; STATE SPACE METHODS;

EID: 0036453749     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (4)

References (20)
  • 1
    • 0030149709 scopus 로고    scopus 로고
    • Purposive behavior acquisition for a real robot by vision-based reinforcement learning
    • M. Asada, S. Tawaratsumida, and K. Hosoda: Purposive Behavior Acquisition for a Real Robot by Vision-Based Reinforcement Learning. Machine Learning Journal, 23:279-303, 1996.
    • (1996) Machine Learning Journal , vol.23 , pp. 279-303
    • Asada, M.1    Tawaratsumida, S.2    Hosoda, K.3
  • 4
    • 0001133021 scopus 로고
    • Generalization in reinforcement learning: Safely approximating the value function
    • In Tesauro, G., D. S. Touretzky, and T. K. Leen (eds.); MIT Press
    • J. Boyan and A. Moore: Generalization in Reinforcement Learning: Safely approximating the value function. In Tesauro, G., D. S. Touretzky, and T. K. Leen (eds.), Advances in Neural Information Processing Systems 7 (NIPS). MIT Press, 1995.
    • (1995) Advances in Neural Information Processing Systems 7 (NIPS)
    • Boyan, J.1    Moore, A.2
  • 5
    • 0011934925 scopus 로고    scopus 로고
    • M-ROSE: A multi robot simulation environment for learning cooperative behavior
    • In H. Asama, T. Arai, T. Fukuda, and T. Hasegawa (eds.):; Springer Verlag
    • S. Buck, M. Beetz, and T. Schmitt: M-ROSE: A Multi Robot Simulation Environment for Learning Cooperative Behavior. In H. Asama, T. Arai, T. Fukuda, and T. Hasegawa (eds.): Distributed Autonomous Robotic Systems 5, Springer Verlag, 2002.
    • (2002) Distributed Autonomous Robotic Systems , vol.5
    • Buck, S.1    Beetz, M.2    Schmitt, T.3
  • 6
    • 0033629916 scopus 로고    scopus 로고
    • Reinforcement learning in continuous time and space
    • K. Doya: Reinforcement Learning In Continuous Time and Space. Neural Computation, 12, 219-245, 2000.
    • (2000) Neural Computation , vol.12 , pp. 219-245
    • Doya, K.1
  • 9
    • 0003932121 scopus 로고
    • Reinforcement learning with selective perception and hidden state
    • PhD thesis
    • A. McCallum: Reinforcement Learning with Selective Perception and Hidden State. PhD thesis, 1995.
    • (1995)
    • McCallum, A.1
  • 10
    • 0029514510 scopus 로고
    • The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces
    • A. Moore and C. Atkeson: The Parti-game Algorithm for Variable Resolution Reinforcement Learning in Multidimensional State-spaces. Machine Learning Journal, 21(3):199-233, 1995.
    • (1995) Machine Learning Journal , vol.21 , Issue.3 , pp. 199-233
    • Moore, A.1    Atkeson, C.2
  • 11
    • 0012003008 scopus 로고    scopus 로고
    • Barycentric interpolators for continuous space & time reinforcement learning
    • May
    • R. Munos and A. Moore: Barycentric Interpolators for Continuous Space & Time Reinforcement Learning, Advances in Neural Information Processing Systems 11, May 1999.
    • (1999) Advances in Neural Information Processing Systems , vol.11
    • Munos, R.1    Moore, A.2
  • 16
    • 0004102479 scopus 로고    scopus 로고
    • Reinforcement learning: An introduction
    • MIT Press, Cambridge, MA
    • R.S. Sutton and A.G. Barto: Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
    • (1998)
    • Sutton, R.S.1    Barto, A.G.2
  • 18
    • 0003270924 scopus 로고
    • Issues in using function approximation for reinforcement learning
    • In M. Mozer, P. Smolensky, D. Touretzky, J. Elman, and A. Weigend, editors; Hillsdale, NJ
    • S. Thrun and A. Schwartz: Issues in Using Function Approximation for Reinforcement Learning. In M. Mozer, P. Smolensky, D. Touretzky, J. Elman, and A. Weigend, editors, Proceedings of the Connectionist Models Summer School, pp. 255-263, Hillsdale, NJ, 1993.
    • (1993) Proceedings of the Connectionist Models Summer School , pp. 255-263
    • Thrun, S.1    Schwartz, A.2
  • 20
    • 0002011091 scopus 로고
    • A menu of designs for reinforcement learning over time
    • W.T. Miller, R.S. Sutton, and P.J. Werbos, editors; MIT Press, MA, USA
    • P. Werbos: A menu of designs for reinforcement learning over time. In Neural Networks for Control, W.T. Miller, R.S. Sutton, and P.J. Werbos, editors, pp. 67-95, MIT Press, MA, USA, 1990.
    • (1990) Neural Networks for Control , pp. 67-95
    • Werbos, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.