SCOPUS Information Retrieval Platform

IEEE International Conference on Intelligent Robots and Systems

Volume 1, 2002, Pages 1062-1067

Approximating the value function for continuous space reinforcement learning in robot control

(3) Buck, Sebastian (a); Beetz, Michael (a); Schmitt, Thorsten (a)

(a) Technical University of Munich (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION THEORY; MATHEMATICAL MODELS; MOBILE ROBOTS; MOTION CONTROL; NEURAL NETWORKS; STATE SPACE METHODS; CONTINUOUS SPACE VALUE FUNCTION; REINFORCEMENT LEARNING; ROBOT CONTROL; ROBOT LEARNING

EID: 0036453749 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (4)

References (20)

1
- 0030149709
- Purposive behavior acquisition for a real robot by vision-based reinforcement learning
- M. Asada, S. Tawaratsumida, and K. Hosoda: Purposive Behavior Acquisition for a Real Robot by Vision-Based Reinforcement Learning. Machine Learning Journal, 23:279-303, 1996.
- (1996) Machine Learning Journal , vol.23 , pp. 279-303
- Asada, M.¹ Tawaratsumida, S.² Hosoda, K.³

2
- 0026829832
- Numerical potential field techniques for robot path planning
- March/April
- J. Barraquand, B. Langlois, and J. Latombe: Numerical potential field techniques for robot path planning. IEEE Transactions on Systems, Man, Cybernetics, 22(2):224-241, March/April 1992.
- (1992) IEEE Transactions on Systems, Man, Cybernetics , vol.22 , Issue.2 , pp. 224-241
- Barraquand, J.¹ Langlois, B.² Latombe, J.³

3
- 0003487482
- Athena Scientific, Belmont MA
- D. Bertsekas, and J. Tsitsiklis: Neuro-Dynamic Programming. Athena Scientific, Belmont MA, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

4
- 0001133021
- Generalization in reinforcement learning: Safely approximating the value function
- In Tesauro, G., D. S. Touretzky, and T. K. Leen (eds.); MIT Press
- J. Boyan and A. Moore: Generalization in Reinforcement Learning: Safely approximating the value function. In Tesauro, G., D. S. Touretzky, and T. K. Leen (eds.), Advances in Neural Information Processing Systems 7 (NIPS). MIT Press, 1995.
- (1995) Advances in Neural Information Processing Systems 7 (NIPS)
- Boyan, J.¹ Moore, A.²

5
- 0011934925
- M-ROSE: A multi robot simulation environment for learning cooperative behavior
- In H. Asama, T. Arai, T. Fukuda, and T. Hasegawa (eds.); Springer Verlag
- S. Buck, M. Beetz, and T. Schmitt: M-ROSE: A Multi Robot Simulation Environment for Learning Cooperative Behavior. In H. Asama, T. Arai, T. Fukuda, and T. Hasegawa (eds.): Distributed Autonomous Robotic Systems 5, Springer Verlag, 2002.
- (2002) Distributed Autonomous Robotic Systems , vol.5
- Buck, S.¹ Beetz, M.² Schmitt, T.³

6
- 0033629916
- Reinforcement learning in continuous time and space
- K. Doya: Reinforcement Learning In Continuous Time and Space. Neural Computation, 12, 219-245, 2000.
- (2000) Neural Computation , vol.12 , pp. 219-245
- Doya, K.¹

7
- 0141517979
- Real-time reinforcement learning in continuous domains
- J. Forbes and D. Andre: Real-time reinforcement learning in continuous domains. AAAI Spring Symposium on Real-Time Autonomous Systems. 2000.
- AAAI Spring Symposium on Real-Time Autonomous Systems. 2000
- Forbes, J.¹ Andre, D.²

8
- 2342560826
- Module based reinforcement learning for a real robot
- Z. Kalmar, C. Szepesvari, and A. Lorincz: Module Based Reinforcement Learning for a Real Robot. Proceedings of the 6th European Workshop on Learning Robots, Lecture Notes in AI. 1998.
- Proceedings of the 6th European Workshop on Learning Robots, Lecture Notes in AI. 1998
- Kalmar, Z.¹ Szepesvari, C.² Lorincz, A.³

9
- 0003932121
- Reinforcement learning with selective perception and hidden state
- PhD thesis
- A. McCallum: Reinforcement Learning with Selective Perception and Hidden State. PhD thesis, 1995.
- (1995)
- McCallum, A.¹

10
- 0029514510
- The parti-game algorithm for variable resolution reinforcement learning in multidimensional state-spaces
- A. Moore and C. Atkeson: The Parti-game Algorithm for Variable Resolution Reinforcement Learning in Multidimensional State-spaces. Machine Learning Journal, 21(3):199-233, 1995.
- (1995) Machine Learning Journal , vol.21 , Issue.3 , pp. 199-233
- Moore, A.¹ Atkeson, C.²

11
- 0012003008
- Barycentric interpolators for continuous space & time reinforcement learning
- May
- R. Munos and A. Moore: Barycentric Interpolators for Continuous Space & Time Reinforcement Learning, Advances in Neural Information Processing Systems 11, May 1999.
- (1999) Advances in Neural Information Processing Systems , vol.11
- Munos, R.¹ Moore, A.²

12
- 0011995148
- Karlsruhe brainstormers - a reinforcement learning approach to robotic soccer II
- M. Riedmiller and A. Merke: Karlsruhe Brainstormers - a reinforcement learning approach to robotic soccer II. In 5th International Workshop on RoboCup, Lecture Notes in Artificial Intelligence, 2001, Springer Verlag.
- 5th International Workshop on RoboCup, Lecture Notes in Artificial Intelligence, 2001, Springer Verlag
- Riedmiller, M.¹ Merke, A.²

13
- 84943274699
- A direct adaptive method for faster backpropagation learning: The rprop algorithm
- M. Riedmiller and H. Braun: A direct adaptive method for faster backpropagation learning: the Rprop algorithm, Proceedings of the ICNN, San Francisco, 1993.
- Proceedings of the ICNN, San Francisco, 1993
- Riedmiller, M.¹ Braun, H.²

14
- 0001898381
- Practical reinforcement learning in continuous spaces
- W.D. Smart and L.P. Kaelbling: Practical Reinforcement Learning in Continuous Spaces. In Proceedings of the Seventeenth International Conference on Machine Learning, pp. 903-910, 2000.
- (2000) Proceedings of the Seventeenth International Conference on Machine Learning , pp. 903-910
- Smart, W.D.¹ Kaelbling, L.P.²

15
- 0013528313
- Scaling reinforcement learning toward roboCup soccer
- P. Stone and R. Sutton: Scaling Reinforcement Learning toward RoboCup Soccer. Eighteenth International Conference on Machine Learning, 2001.
- Eighteenth International Conference on Machine Learning, 2001
- Stone, P.¹ Sutton, R.²

16
- 0004102479
- Reinforcement learning: An introduction
- MIT Press, Cambridge, MA
- R.S. Sutton and A.G. Barto: Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
- (1998)
- Sutton, R.S.¹ Barto, A.G.²

17
- 0035558924
- Continuous valued Q-learning method able to incrementally refine state space
- M. Takeda, T. Nakamura, and T. Ogasawara: Continuous Valued Q-learning Method Able to Incrementally Refine State Space. Proceedings of the IEEE International Conference on Intelligent Robots and Systems, 2001.
- Proceedings of the IEEE International Conference on Intelligent Robots and Systems, 2001
- Takeda, M.¹ Nakamura, T.² Ogasawara, T.³

18
- 0003270924
- Issues in using function approximation for reinforcement learning
- In M. Mozer, P. Smolensky, D. Touretzky, J. Elman, and A. Weigend, editors; Hillsdale, NJ
- S. Thrun and A. Schwartz: Issues in Using Function Approximation for Reinforcement Learning. In M. Mozer, P. Smolensky, D. Touretzky, J. Elman, and A. Weigend, editors, Proceedings of the Connectionist Models Summer School, pp. 255-263, Hillsdale, NJ, 1993.
- (1993) Proceedings of the Connectionist Models Summer School , pp. 255-263
- Thrun, S.¹ Schwartz, A.²

19
- 34249833101
- Q-learning
- C.J. Watkins and P. Dayan: Q-learning. Machine Learning Journal, 8:279-292, 1992.
- (1992) Machine Learning Journal , vol.8 , pp. 279-292
- Watkins, C.J.¹ Dayan, P.²

20
- 0002011091
- A menu of designs for reinforcement learning over time
- W.T. Miller, R.S. Sutton, and P.J. Werbos, editors; MIT Press, MA, USA
- P. Werbos: A menu of designs for reinforcement learning over time. In Neural Networks for Control, W.T. Miller, R.S. Sutton, and P.J. Werbos, editors, pp. 67-95, MIT Press, MA, USA, 1990.
- (1990) Neural Networks for Control , pp. 67-95
- Werbos, P.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.