SCOPUS 정보 검색 플랫폼

2008 IEEE/RSJ International Conference on Intelligent Robots and Systems, IROS

Volumn , Issue , 2008, Pages 3635-3640

Policy gradient based reinforcement learning for real autonomous underwater cable tracking

(2) El Fakdi, Andres a Carreras, Marc a

a UNIVERSITY OF GIRONA (Spain)

Author keywords

[No Author keywords available]

Indexed keywords

ACTION SELECTION; AUTONOMOUS ROBOT; CONVERGENCE TIME; DIRECT POLICY SEARCH; FIELD APPLICATION; LEARNING PHASE; LEARNING PROCESS; POLICY GRADIENT; REAL ROBOT; SIMULATED ENVIRONMENT; SPEED-UPS; UNDERWATER CABLES; UNDERWATER ROBOTS;

AUTONOMOUS UNDERWATER VEHICLES; CABLES; CONVERGENCE OF NUMERICAL METHODS; EDUCATION; INTELLIGENT ROBOTS; REINFORCEMENT; REINFORCEMENT LEARNING;

ROBOTICS;

EID: 69549136968 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IROS.2008.4650873 Document Type: Conference Paper

Times cited : (27)

References (23)

1
- 0004102479
- MIT Press
- R. Sutton and A. Barto, Reinforcement Learning, an introduction. MIT Press, 1998.
- (1998) Reinforcement Learning, An Introduction
- Sutton, R.¹ Barto, A.²

2
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- R. Sutton, D. McAllester, S. Singh, and Y. Mansour, "Policy gradient methods for reinforcement learning with function approximation," Advances in Neural Information Processing Systems, vol.12, pp. 1057-1063, 2000.
- (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 1057-1063
- Sutton, R.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

3
- 32844458161
- University of Colorado State," Computer Science Technical Report
- C. Anderson, "Approximating a policy can be easier than approximating a value function," University of Colorado State," Computer Science Technical Report, 2000.
- (2000) Approximating a Policy Can Be Easier Than Approximating a Value Function
- Anderson, C.¹

4
- 14344253499
- Ph.D. dissertation, Australian National University, April
- D. A. Aberdeen, "Policy-gradient algorithms for partially observable markov decision processes," Ph.D. dissertation, Australian National University, April 2003.
- (2003) Policy-gradient Algorithms for Partially Observable Markov Decision Processes
- Aberdeen, D.A.¹

5
- 0034859944
- Autonomous helicopter control using reinforcement learning policy search methods
- J. Bagnell and J. Schneider, "Autonomous helicopter control using reinforcement learning policy search methods," in Proceedings of the IEEE International Conference on Robotics and Automation, Korea, 2001.
- Proceedings of the IEEE International Conference on Robotics and Automation, Korea, 2001
- Bagnell, J.¹ Schneider, J.²

6
- 84880911162
- Robot weightlifting by direct policy search
- M. Rosenstein and A. Barto, "Robot weightlifting by direct policy search," in Proceedings of the International Joint Conference on Artificial Intelligence, 2001.
- (2001) Proceedings of the International Joint Conference on Artificial Intelligence
- Rosenstein, M.¹ Barto, A.²

7
- 3042534761
- Policy gradient reinforcement learning for fast quadrupedal locomotion
- N. Kohl and P. Stone, "Policy gradient reinforcement learning for fast quadrupedal locomotion," in IEEE International Conference on Robotics and Automation (ICRA), 2004.
- (2004) IEEE International Conference on Robotics and Automation (ICRA)
- Kohl, N.¹ Stone, P.²

8
- 33847238318
- Center for Communications Systems Research, University of Cambridge, Tech. Rep., March
- P. Marbach and J. N. Tsitsiklis, "Gradient-based optimization of Markov reward processes: Practical variants," Center for Communications Systems Research, University of Cambridge, Tech. Rep., March 2000.
- (2000) Gradient-based Optimization of Markov Reward Processes: Practical Variants
- Marbach, P.¹ Tsitsiklis, J.N.²

9
- 4043069840
- On actor-critic algorithms
- V. Konda and J. Tsitsiklis, "On actor-critic algorithms," SIAM Journal on Control and Optimization, vol.42, number 4, pp. 1143-1166, 2003.
- (2003) SIAM Journal on Control and Optimization , vol.42 , Issue.4 , pp. 1143-1166
- Konda, V.¹ Tsitsiklis, J.²

10
- 33746878798
- Massachusetts Institute of Technology, AI Memo Tech. Rep., April
- N. Meuleau, L. Peshkin, and K. Kim, "Exploration in gradient based reinforcement learning," Massachusetts Institute of Technology, AI Memo 2001-2003, Tech. Rep., April 2001.
- (2001) Exploration in Gradient Based Reinforcement Learning , pp. 2001-2003
- Meuleau, N.¹ Peshkin, L.² Kim, K.³

11
- 14044262287
- Stochastic policy gradient reinforcement learning on a simple 3D biped
- R. Tedrake, T. W. Zhang, and H. S. Seung, "Stochastic policy gradient reinforcement learning on a simple 3D biped," in IEEE/RSJ International Conference on Intelligent Robots and Systems IROS'04, Sendai, Japan, September 28 - October 2 2004.
- IEEE/RSJ International Conference on Intelligent Robots and Systems IROS'04, Sendai, Japan, September 28 - October 2 2004
- Tedrake, R.¹ Zhang, T.W.² Seung, H.S.³

12
- 33846174631
- Learning sensory feedback to CPG with policy gradient for biped locomotion
- T. Matsubara, J. Morimoto, J. Nakanishi, M. Sato, and K. Doya, "Learning sensory feedback to CPG with policy gradient for biped locomotion," in Proceedings of the International Conference on Robotics and Automation ICRA, Barcelona, Spain, April 2005.
- Proceedings of the International Conference on Robotics and Automation ICRA, Barcelona, Spain, April 2005
- Matsubara, T.¹ Morimoto, J.² Nakanishi, J.³ Sato, M.⁴ Doya, K.⁵

13
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- L. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching." Machine Learning, vol.8(3/4), pp. 293-321, 1992.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 293-321
- Lin, L.¹

14
- 0004090962
- Ph.D. dissertation, Department of Computer Science at Brown University, Rhode Island, May
- W. Smart, "Making reinforcement learning work on real robots," Ph.D. dissertation, Department of Computer Science at Brown University, Rhode Island, May 2002.
- (2002) Making Reinforcement Learning Work on Real Robots
- Smart, W.¹

15
- 0031074521
- Locally weighted learning
- C. Atkenson, A. Moore, and S. Schaal, "Locally weighted learning," Artificial Intelligence Review, vol.11, pp. 11-73, 1997.
- (1997) Artificial Intelligence Review , vol.11 , pp. 11-73
- Atkenson, C.¹ Moore, A.² Schaal, S.³

16
- 33846984231
- Learning obstacle avoidance parameters from operator behavior
- December
- B. Hammer, S. Singh, and S. Scherer, "Learning obstacle avoidance parameters from operator behavior," Journal of Field Robotics, Special Issue on Machine Learning Based Robotics in Unstructured Environments, vol.23 (11/12), December 2006.
- (2006) Journal of Field Robotics, Special Issue on Machine Learning Based Robotics in Unstructured Environments , vol.23 , Issue.11-12
- Hammer, B.¹ Singh, S.² Scherer, S.³

17
- 0004142943
- Australian National University, Tech. Rep.
- J. Baxter and P. Bartlett, "Direct gradient-based reinforcement learning: I. gradient estimation algorithms," Australian National University, Tech. Rep., 1999.
- (1999) Direct Gradient-based Reinforcement Learning: I. Gradient Estimation Algorithms
- Baxter, J.¹ Bartlett, P.²

18
- 34250644253
- Towards direct policy search reinforcement learning for robot control
- A. El-Fakdi, M. Carreras, and P. Ridao, "Towards direct policy search reinforcement learning for robot control," in IEEE/RSJ International Conference on Intelligent Robots and Systems, 2006.
- (2006) IEEE/RSJ International Conference on Intelligent Robots and Systems
- El-Fakdi, A.¹ Carreras, M.² Ridao, P.³

19
- 0003413187
- Prentice Hall
- S. Haykin, Neural Networks, a comprehensive foundation, 2nd ed. Prentice Hall, 1999.
- (1999) Neural Networks, a Comprehensive Foundation, 2nd Ed
- Haykin, S.¹

20
- 36348971779
- Ictineu auv wins the first sauc-e competition
- D. Ribas, N. Palomeras, P. Ridao, M. Carreras, and E. Hernandez, "Ictineu auv wins the first sauc-e competition," in IEEE International Conference on Robotics and Automation, 2007.
- (2007) IEEE International Conference on Robotics and Automation
- Ribas, D.¹ Palomeras, N.² Ridao, P.³ Carreras, M.⁴ Hernandez, E.⁵

21
- 3342922286
- On the identification of non-linear models of unmanned underwater vehicles
- DOI 10.1016/j.conengprac.2004.01.004, PII S0967066104000152
- P. Ridao, A. Tiano, A. El-Fakdi, M. Carreras, and A. Zirilli, "On the identification of non-linear models of unmanned underwater vehicles," Control Engineering Practice, vol.12, pp. 1483-1499, 2004. (Pubitemid 38994782)
- (2004) Control Engineering Practice , vol.12 , Issue.12 SPEC. ISS , pp. 1483-1499
- Ridao, P.¹ Tiano, A.² El-Fakdi, A.³ Carreras, M.⁴ Zirilli, A.⁵

22
- 8844227781
- A vision system for an underwater cable tracker
- DOI 10.1007/s001380100065
- A. Ortiz, M. Simo, and G. Oliver, "A vision system for an underwater cable tracker," International Journal of Machine Vision and Applications, vol.13 (3), pp. 129-140, 2002. (Pubitemid 41200797)
- (2002) Machine Vision and Applications , vol.13 , Issue.3 , pp. 129-140
- Ortiz, A.¹ Simo, M.² Oliver, G.³

23
- 35248838766
- Underwater cable tracking by visual feedback
- J. Antich and A. Ortiz, "Underwater cable tracking by visual feedback," in First Iberian Conference on Pattern recognition and Image Analysis (IbPRIA, LNCS 2652), Port d'Andratx, Spain, 2003.
- First Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA, LNCS 2652), Port D'Andratx, Spain, 2003
- Antich, J.¹ Ortiz, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.