SCOPUS 정보 검색 플랫폼

Proceedings - IEEE International Conference on Robotics and Automation

Volumn , Issue , 2010, Pages 2267-2272

Two steps natural actor critic learning for underwater cable tracking

(3) El Fakdi, Andres a Carreras, Marc a Galceran, Enric a

a UNIVERSITY OF GIRONA (Spain)

Author keywords

[No Author keywords available]

Indexed keywords

ACTION SELECTION; ACTOR CRITIC; ACTOR-CRITIC LEARNING; AUTONOMOUS ROBOT; CONVERGENCE PROCESS; FAST CONVERGENCE; FIELD APPLICATION; FUNCTION APPROXIMATION; HYDRODYNAMIC MODEL; LEARNING PROCEDURES; LEARNING PROCESS; PARTIAL OBSERVABILITY; POLICY GRADIENT; REAL ENVIRONMENTS; SIMULATED RESULTS; UNDERWATER CABLES; UNDERWATER VEHICLES; VALUE FUNCTIONS;

CABLES; COMPUTER SIMULATION; OBSERVABILITY; ROBOTICS; ROBOTS; SUBMERSIBLES; WATER CRAFT;

CONVERGENCE OF NUMERICAL METHODS;

EID: 77955825214 PISSN: 10504729 EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ROBOT.2010.5509751 Document Type: Conference Paper

Times cited : (5)

References (19)

1
- 0004102479
- MIT Press
- R. Sutton and A. Barto, Reinforcement Learning, an introduction. MIT Press, 1998.
- (1998) Reinforcement Learning, An Introduction
- Sutton, R.¹ Barto, A.²

2
- 34250635407
- Policy gradient methods for robotics
- J. Peters and S. Schaal, "Policy gradient methods for robotics," in IEEE/RSJ International Conference on Intelligent Robots and Systems IROS'06, Beijing, China, October 9-15 2006.
- IEEE/RSJ International Conference on Intelligent Robots and Systems IROS'06, Beijing, China, October 9-15 2006
- Peters, J.¹ Schaal, S.²

3
- 4043069840
- On actor-critic algorithms
- V. Konda and J. Tsitsiklis, "On actor-critic algorithms," SIAM Journal on Control and Optimization, vol. 42, number 4, pp. 1143-1166, 2003.
- (2003) SIAM Journal on Control and Optimization , vol.42 , Issue.4 , pp. 1143-1166
- Konda, V.¹ Tsitsiklis, J.²

4
- 84898939480
- Policy gradient methods for reinforcement learning with function approximation
- R. Sutton, D. McAllester, S. Singh, and Y. Mansour, "Policy gradient methods for reinforcement learning with function approximation," Advances in Neural Information Processing Systems, vol. 12, pp. 1057-1063, 2000.
- (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 1057-1063
- Sutton, R.¹ McAllester, D.² Singh, S.³ Mansour, Y.⁴

5
- 14044262287
- Stochastic policy gradient reinforcement learning on a simple 3D biped
- R. Tedrake, T. W. Zhang, and H. S. Seung, "Stochastic policy gradient reinforcement learning on a simple 3D biped," in IEEE/RSJ International Conference on Intelligent Robots and Systems IROS'04, Sendai, Japan, September 28 - October 2 2004.
- IEEE/RSJ International Conference on Intelligent Robots and Systems IROS'04, Sendai, Japan, September 28 - October 2 2004
- Tedrake, R.¹ Zhang, T.W.² Seung, H.S.³

6
- 33846174631
- Learning sensory feedback to CPG with policy gradient for biped locomotion
- T. Matsubara, J. Morimoto, J. Nakanishi, M. Sato, and K. Doya, "Learning sensory feedback to CPG with policy gradient for biped locomotion," in Proceedings of the International Conference on Robotics and Automation ICRA, Barcelona, Spain, April 2005.
- Proceedings of the International Conference on Robotics and Automation ICRA, Barcelona, Spain, April 2005
- Matsubara, T.¹ Morimoto, J.² Nakanishi, J.³ Sato, M.⁴ Doya, K.⁵

7
- 70049104346
- Ph.D. dissertation, Department of Computer Science, University of Southern California.
- J. Peters, "Machine learning of motor skills for robotics," Ph.D. dissertation, Department of Computer Science, University of Southern California., 2007.
- (2007) Machine Learning of Motor Skills for Robotics
- Peters, J.¹

8
- 0000396062
- Natural gradient works efficiently in learning
- S. Amari, "Natural gradient works efficiently in learning," Neural Computation, vol. 10, pp. 251-276, 1998.
- (1998) Neural Computation , vol.10 , pp. 251-276
- Amari, S.¹

9
- 84864064043
- Natural actor-critic for road traffic optimisation
- S. Richter, D. Aberdeen, and J. Yu, "Natural actor-critic for road traffic optimisation," in Neural Information Processing Systems, NIPS'06, 2006, pp. 1169-1176.
- Neural Information Processing Systems, NIPS'06, 2006 , pp. 1169-1176
- Richter, S.¹ Aberdeen, D.² Yu, J.³

10
- 34250613580
- Stable learning of quasi-passive dynamic walking by an unstable biped robot based on off-policy natural actor-critic
- T. Ueno, Y. Nakamura, T. Shibata, K. Hosoda, and S. Ishii, "Stable learning of quasi-passive dynamic walking by an unstable biped robot based on off-policy natural actor-critic," in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2006.
- IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2006
- Ueno, T.¹ Nakamura, Y.² Shibata, T.³ Hosoda, K.⁴ Ishii, S.⁵

11
- 0004090962
- Ph.D. dissertation, Department of Computer Science at Brown University, Rhode Island, May
- W. Smart, "Making reinforcement learning work on real robots," Ph.D. dissertation, Department of Computer Science at Brown University, Rhode Island, May 2002.
- (2002) Making Reinforcement Learning Work on Real Robots
- Smart, W.¹

12
- 33846984231
- Learning obstacle avoidance parameters from operator behavior
- December
- B. Hammer, S. Singh, and S. Scherer, "Learning obstacle avoidance parameters from operator behavior," Journal of Field Robotics, Special Issue on Machine Learning Based Robotics in Unstructured Environments, vol. 23 (11/12), December 2006.
- (2006) Journal of Field Robotics, Special Issue on Machine Learning Based Robotics in Unstructured Environments , vol.23 , Issue.11-12
- Hammer, B.¹ Singh, S.² Scherer, S.³

13
- 33646413135
- Natural actor-critic
- J. Peters, S. Vijayakumar, and S. Schaal, "Natural actor-critic," in ECML, 2005, pp. 280-291.
- (2005) ECML , pp. 280-291
- Peters, J.¹ Vijayakumar, S.² Schaal, S.³

14
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- L. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching." Machine Learning, vol. 8(3/4), pp. 293-321, 1992.
- (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 293-321
- Lin, L.¹

15
- 69549136968
- Policy gradient based reinforcement learning for real autonomous underwater cable tracking
- A. El-Fakdi and M. Carreras, "Policy gradient based reinforcement learning for real autonomous underwater cable tracking," in IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2008.
- IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2008
- El-Fakdi, A.¹ Carreras, M.²

16
- 0004170597
- John Wiley and Sons
- T. I. Fossen, Guidance and Control of Ocean Vehicles. John Wiley and Sons, 1995.
- (1995) Guidance and Control of Ocean Vehicles
- Fossen, T.I.¹

17
- 36348971779
- Ictineu auv wins the first sauc-e competition
- D. Ribas, N. Palomeras, P. Ridao, M. Carreras, and E. Hernandez, "Ictineu auv wins the first sauc-e competition," in IEEE International Conference on Robotics and Automation, 2007.
- IEEE International Conference on Robotics and Automation, 2007
- Ribas, D.¹ Palomeras, N.² Ridao, P.³ Carreras, M.⁴ Hernandez, E.⁵

18
- 3342922286
- On the identification of non-linear models of unmanned underwater vehicles
- P. Ridao, A. Tiano, A. El-Fakdi, M. Carreras, and A. Zirilli, "On the identification of non-linear models of unmanned underwater vehicles," Control Engineering Practice, vol. 12, pp. 1483-1499, 2004.
- (2004) Control Engineering Practice , vol.12 , pp. 1483-1499
- Ridao, P.¹ Tiano, A.² El-Fakdi, A.³ Carreras, M.⁴ Zirilli, A.⁵

19
- 35248838766
- Underwater cable tracking by visual feedback
- J. Antich and A. Ortiz, "Underwater cable tracking by visual feedback," in First Iberian Conference on Pattern recognition and Image Analysis (IbPRIA, LNCS 2652), Port d'Andratx, Spain, 2003.
- First Iberian Conference on Pattern Recognition and Image Analysis (IbPRIA, LNCS 2652), Port D'Andratx, Spain, 2003
- Antich, J.¹ Ortiz, A.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.