2007, Pages 254-261

Evaluation of policy gradient methods and variants on the cart-pole benchmark

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; BENCHMARKING; COMPUTATIONAL METHODS; FINITE DIFFERENCE METHOD; MATHEMATICAL MODELS; OBJECT ORIENTED PROGRAMMING; OPTIMIZATION; PUBLIC POLICY

EID: 34548763245     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2007.368196     Document Type: Conference Paper
Times cited: 57

References (13)
  • 1
    • 0000396062
    • Natural gradient works efficiently in learning
    • S. Amari. Natural gradient works efficiently in learning. Neural Computation, 10, 1998.
    • (1998) Neural Computation , vol.10
    • Amari, S.1
  • 2
    • 34250653774
    • Learning CPG sensory feedback with policy gradient for biped locomotion for a full-body humanoid
    • G. Endo, J. Morimoto, T. Matsubara, J. Nakanishi, and G. Cheng. Learning CPG sensory feedback with policy gradient for biped locomotion for a full-body humanoid. In AAAI 2005, 2005.
    • (2005) AAAI 2005
    • Endo, G.1    Morimoto, J.2    Matsubara, T.3    Nakanishi, J.4    Cheng, G.5
  • 3
    • 0012260296
    • Feature article: Optimization for simulation: Theory vs. practice
    • M. C. Fu. Feature article: Optimization for simulation: Theory vs. practice. INFORMS Journal on Computing, 14(3): 192-215, 2002.
    • (2002) INFORMS Journal on Computing , vol.14 , Issue.3 , pp. 192-215
    • Fu, M.C.1
  • 4
    • 0028381374
    • Acquiring robot skills via reinforcement learning
    • V. Gullapalli, J. Franklin, and H. Benbrahim. Acquiring robot skills via reinforcement learning. IEEE Control Systems, -(39), 1994.
  • 10
    • 0028466750
    • Advanced supervised learning in multi-layer perceptrons: from backpropagation to adaptive learning algorithms
    • M. Riedmiller. Advanced supervised learning in multi-layer perceptrons: from backpropagation to adaptive learning algorithms. Int. Journal of Computer Standards and Interfaces, 16:265-278, 1994. Special Issue on Neural Networks.
  • 11
    • 34548720281
    • Clsquare - a closed loop simulation system
    • M. Riedmiller, R. Hafner, S. Lange, and S. Timmer. Clsquare - a closed loop simulation system.
  • 13
    • 34548757155
    • Learning to walk in 20 minutes
    • R. Tedrake, T. W. Zhang, and H. S. Seung. Learning to walk in 20 minutes. In Proceedings of the Fourteenth Yale Workshop on Adaptive and Learning Systems, Yale University, New Haven, CT, 2005.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.