메뉴 건너뛰기




Volumn , Issue , 2010, Pages 2267-2272

Two steps natural actor critic learning for underwater cable tracking

Author keywords

[No Author keywords available]

Indexed keywords

ACTION SELECTION; ACTOR CRITIC; ACTOR-CRITIC LEARNING; AUTONOMOUS ROBOT; CONVERGENCE PROCESS; FAST CONVERGENCE; FIELD APPLICATION; FUNCTION APPROXIMATION; HYDRODYNAMIC MODEL; LEARNING PROCEDURES; LEARNING PROCESS; PARTIAL OBSERVABILITY; POLICY GRADIENT; REAL ENVIRONMENTS; SIMULATED RESULTS; UNDERWATER CABLES; UNDERWATER VEHICLES; VALUE FUNCTIONS;

EID: 77955825214     PISSN: 10504729     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ROBOT.2010.5509751     Document Type: Conference Paper
Times cited : (5)

References (19)
  • 7
    • 70049104346 scopus 로고    scopus 로고
    • Ph.D. dissertation, Department of Computer Science, University of Southern California.
    • J. Peters, "Machine learning of motor skills for robotics," Ph.D. dissertation, Department of Computer Science, University of Southern California., 2007.
    • (2007) Machine Learning of Motor Skills for Robotics
    • Peters, J.1
  • 8
    • 0000396062 scopus 로고    scopus 로고
    • Natural gradient works efficiently in learning
    • S. Amari, "Natural gradient works efficiently in learning," Neural Computation, vol. 10, pp. 251-276, 1998.
    • (1998) Neural Computation , vol.10 , pp. 251-276
    • Amari, S.1
  • 11
    • 0004090962 scopus 로고    scopus 로고
    • Ph.D. dissertation, Department of Computer Science at Brown University, Rhode Island, May
    • W. Smart, "Making reinforcement learning work on real robots," Ph.D. dissertation, Department of Computer Science at Brown University, Rhode Island, May 2002.
    • (2002) Making Reinforcement Learning Work on Real Robots
    • Smart, W.1
  • 14
    • 0000123778 scopus 로고
    • Self-improving reactive agents based on reinforcement learning, planning and teaching
    • L. Lin, "Self-improving reactive agents based on reinforcement learning, planning and teaching." Machine Learning, vol. 8(3/4), pp. 293-321, 1992.
    • (1992) Machine Learning , vol.8 , Issue.3-4 , pp. 293-321
    • Lin, L.1
  • 18


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.