SCOPUS 정보 검색 플랫폼

IEEE International Conference on Intelligent Robots and Systems

Volumn 1, Issue , 2003, Pages 406-411

Using Policy Gradient Reinforcement Learning on Autonomous Robot Controllers

(3) Grudic, Gregory Z a Kumar, Vijay b Ungar, Lyle b

a UCB 450 (United States)

b UNIVERSITY OF PENNSYLVANIA (United States)

Author keywords

[No Author keywords available]

Indexed keywords

ALGEBRA; ALGORITHMS; APPROXIMATION THEORY; CONTROL EQUIPMENT; CONTROL SYSTEM ANALYSIS; ESTIMATION; FEEDBACK; LEARNING SYSTEMS; ROBOT PROGRAMMING; ROBUSTNESS (CONTROL SYSTEMS);

CONTINOUS CONTROLLERS; REINFORCEMENT FEEDBACK;

ROBOTICS;

EID: 0347410594 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (20)

References (21)

1
- 0003633537
- ch. Cooperative Multiagent Robot Systems. MIT Press
- R. Arkin and T. Balch, Artificial Intelligence and Mobile Robots, ch. Cooperative Multiagent Robot Systems. MIT Press, 1998.
- (1998) Artificial Intelligence and Mobile Robots
- Arkin, R.¹ Balch, T.²

2
- 0029537980
- Issues and approaches in the design of collective autonomous a gents
- Dec
- M. Mataric, "Issues and approaches in the design of collective autonomous a gents," Robotics and Autonoumous Systems, vol. 16, pp. 321-331, Dec 1995.
- (1995) Robotics and Autonoumous Systems , vol.16 , pp. 321-331
- Mataric, M.¹

3
- 84957702766
- Multiobjective hybrid control synthisis
- Proceedings of hybrid and realtime systems, Grenoble: Springer-Verlag, March
- J.Lygeros, C.J.Tomlin, and S.Sastry, "Multiobjective hybrid control synthisis," in Proceedings of hybrid and realtime systems, vol. 1201 of Lecture Notes in Computer Science, Grenoble: Springer-Verlag, March 1997.
- (1997) Lecture Notes in Computer Science , vol.1201
- Lygeros, J.¹ Tomlin, C.J.² Sastry, S.³

4
- 0033311181
- Basic problems in stability and design of switched systems
- Oct.
- D. Liberzon and A. S. Morse, "Basic problems in stability and design of switched systems," IEEE Control Systems, vol. 19, pp. 59-70, Oct. 1999.
- (1999) IEEE Control Systems , vol.19 , pp. 59-70
- Liberzon, D.¹ Morse, A.S.²

5
- 0003672832
- PhD thesis, MIT, Cambridge, MA
- M. Branicky, Studies in Hybrid Systems: Modeling, Analysis and Control. PhD thesis, MIT, Cambridge, MA, 1995.
- (1995) Studies in Hybrid Systems: Modeling, Analysis and Control
- Branicky, M.¹

6
- 0004102479
- Cambridge, MA: MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

7
- 0029679044
- Reinforcement learning: A survey
- L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," Journal of Artificial Intelligence Research, vol. 4, pp. 237-285, 1996.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.L.² Moore, A.W.³

8
- 0033348437
- Representation of behavioral history for learning in nonstationary conditions
- F. Michaud and M. J. Mataric, "Representation of behavioral history for learning in nonstationary conditions," Robotics and Autonomous Systems, vol. 29, no. 2, pp. 187-200, 1999.
- (1999) Robotics and Autonomous Systems , vol.29 , Issue.2 , pp. 187-200
- Michaud, F.¹ Mataric, M.J.²

9
- 0001898381
- Practical reinforcement learning in continuous spaces
- Morgan Kaufmann, June 29 - July 2
- W. D. Smart and L. P. Kaelbling, "Practical reinforcement learning in continuous spaces," in Proceedings of the Seventeenth International Conference on Machine Learning, vol. 17, pp. 903-910, Morgan Kaufmann, June 29 - July 2 2000.
- (2000) Proceedings of the Seventeenth International Conference on Machine Learning , vol.17 , pp. 903-910
- Smart, W.D.¹ Kaelbling, L.P.²

10
- 0348132949
- Enhancing transfer in reinforcement learning by building stochastic models of robot actions
- Morgan Kaufmann
- S. Mahadevan, "Enhancing transfer in reinforcement learning by building stochastic models of robot actions," in Proceedings of the Ninth International Conference on Machine Learning, vol. 9, pp. 290-299, Morgan Kaufmann, 1992.
- (1992) Proceedings of the Ninth International Conference on Machine Learning , vol.9 , pp. 290-299
- Mahadevan, S.¹

11
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- L. J. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching," Machine Learning, vol. 8, pp. 293-321, 1992.
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.J.¹

12
- 0030149709
- Purposive behaviour aquisition for a real robot by vision-based reinforcement learning
- M. Asada, S. Noda, S. Tawaratsumida, and K. Hosoda, "Purposive behaviour aquisition for a real robot by vision-based reinforcement learning," Machine Learning, vol. 23, pp. 279-303, 1996.
- (1996) Machine Learning , vol.23 , pp. 279-303
- Asada, M.¹ Noda, S.² Tawaratsumida, S.³ Hosoda, K.⁴

13
- 0036058423
- Effective reinforcement learning for mobile robots
- IEEE Intl. Conf. on Robot, and Automat., 2002
- W. D. Smart and L. P. Kaelbling, "Effective reinforcement learning for mobile robots," in IEEE Int. Conf. on Robotics and Automation, ICRA 02, 2002. IEEE Intl. Conf. on Robot, and Automat., 2002.
- (2002) IEEE Int. Conf. on Robotics and Automation, ICRA 02
- Smart, W.D.¹ Kaelbling, L.P.²

14
- 0036452535
- Robot behavioral selection using q-learning
- A. S. E. Martinson and R. C. Arkin, "Robot behavioral selection using q-learning," in In the Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2002.
- (2002) In the Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
- Martinson, A.S.E.¹ Arkin, R.C.²

15
- 0036453118
- Learning optimal switching policies for path tracking tasks on a mobile robot
- A. H. F. Y. Wang, B. Thibodeau and R. Grupen, "Learning optimal switching policies for path tracking tasks on a mobile robot," in In the Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2002.
- (2002) In the Proceedings of the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
- Wang, A.H.F.Y.¹ Thibodeau, B.² Grupen, R.³

16
- 0013528312
- Continuous-time hierarchical reinforcement learning
- Morgan Kaufmann
- S. Mahadevan, "Continuous-time hierarchical reinforcement learning," in Proceedings of the Eighteenth International Conference on Machine Learning, vol. 18, pp. 186-193, Morgan Kaufmann, 2002.
- (2002) Proceedings of the Eighteenth International Conference on Machine Learning , vol.18 , pp. 186-193
- Mahadevan, S.¹

17
- 0001794302
- Localizing search in reinforcement learning
- Menlo park, CA: AAAI Press / Cambridge, MA: MIT Press, July 30 - August 3
- G. Z. Grudic and L. H. Ungar, "Localizing search in reinforcement learning," in Proceedings of the Seventeenth National Conference on Artificial Intelligence, vol. 17, pp. 590-595, Menlo park, CA: AAAI Press / Cambridge, MA: MIT Press, July 30 - August 3 2000.
- (2000) Proceedings of the Seventeenth National Conference on Artificial Intelligence , vol.17 , pp. 590-595
- Grudic, G.Z.¹ Ungar, L.H.²

18
- 84898958374
- Gradient descent for general reinforcement learning
- M. I. Jordan, M. J. Kearns, and S. A. Solla, eds., Cambridge, MA, MIT Press
- L. Baird and A. W. Moore, "Gradient descent for general reinforcement learning," in Advances in Neural Information Processing Systems (M. I. Jordan, M. J. Kearns, and S. A. Solla, eds.), vol. 11, (Cambridge, MA), MIT Press, 1999.
- (1999) Advances in Neural Information Processing Systems , vol.11
- Baird, L.¹ Moore, A.W.²

19
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- R. J. Williams, "Simple statistical gradient-following algorithms for connectionist reinforcement learning," Machine Learning, vol. 8, no. 3, pp. 229-256, 1992.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 229-256
- Williams, R.J.¹

20
- 0348132950
- Submitted
- G. Z. Grudic, V. Kumar, and L. H. Ungar, "Refining autonomous robot controllers using reinforcemnt learning," Submitted, 2003.
- (2003) Refining Autonomous Robot Controllers Using Reinforcemnt Learning
- Grudic, G.Z.¹ Kumar, V.² Ungar, L.H.³

21
- 0003774798
- Cambridge; New York: Cambridge University Press
- D. Lee, The map-building and exploration strategies of a simple sonar-equipped robot: an experimental, quantitative evaluation. Cambridge; New York: Cambridge University Press, 1996.
- (1996) The Map-building and Exploration Strategies of a Simple Sonar-equipped Robot: An Experimental, Quantitative Evaluation
- Lee, D.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.