SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Studies in Computational Intelligence

Volumn 264, Issue , 2010, Pages 293-309

Motor learning at intermediate reynolds number: Experiments with policy gradient on the flapping flight of a rigid wing

(4) Roberts, John W a Moret, Lionel b Zhang, Jun b Tedrake, Russ a

a MASSACHUSETTS INSTITUTE OF TECHNOLOGY (United States)

b NEW YORK UNIVERSITY (United States)

Author keywords

[No Author keywords available]

Indexed keywords

EID: 74049113741 PISSN: 1860949X EISSN: None Source Type: Book Series
DOI: 10.1007/978-3-642-05181-4_13 Document Type: Conference Paper

Times cited : (14)

References (19)

1
- 23844516492
- Coherent locomotion as an attracting state for a free flapping body
- Alben, S., Shelley, M.: Coherent locomotion as an attracting state for a free flapping body. Proceedings of the National Academy of Science 102, 11163-11166 (2005)
- (2005) Proceedings of the National Academy of Science , vol.102 , pp. 11163-11166
- Alben, S.¹ Shelley, M.²

2
- 0000396062
- Natural gradient works efficiently in learning
- Amari, S.: Natural gradient works efficiently in learning. Neural Computation 10, 251-276 (1998) (Pubitemid 128463152)
- (1998) Neural Computation , vol.10 , Issue.2 , pp. 251-276
- Amari, S.-I.¹

3
- 0013535965
- Infinite-horizon policy-gradient estimation
- Baxter, J., Bartlett, P.: Infinite-horizon policy-gradient estimation. Journal of Artificial Intelligence Research 15, 319-350 (2001)
- (2001) Journal of Artificial Intelligence Research , vol.15 , pp. 319-350
- Baxter, J.¹ Bartlett, P.²

4
- 74049108803
- Implementation of a highly parameterized digital PIV system on reconfigurable hardware
- Lexington, MA
- Bennis, A., Leeser, M., Tadmor, G., Tedrake, R.: Implementation of a highly parameterized digital PIV system on reconfigurable hardware. In: Proceedings of the Twelfth Annual Workshop on High Performance Embedded Computing (HPEC), Lexington, MA (2008)
- (2008) Proceedings of the Twelfth Annual Workshop on High Performance Embedded Computing (HPEC)
- Bennis, A.¹ Leeser, M.² Tadmor, G.³ Tedrake, R.⁴

5
- 13844306287
- Efficient bipedal robots based on passivedynamic walkers
- Collins, S.H., Ruina, A., Tedrake, R., Wisse, M.: Efficient bipedal robots based on passivedynamic walkers. Science 307, 1082-1085 (2005)
- (2005) Science , vol.307 , pp. 1082-1085
- Collins, S.H.¹ Ruina, A.² Tedrake, R.³ Wisse, M.⁴

6
- 84897694817
- Variance reduction techniques for gradient estimates in reinforcement learning
- Greensmith, E., Bartlett, P.L., Baxter, J.: Variance reduction techniques for gradient estimates in reinforcement learning. Journal of Machine Learning Research 5, 1471-1530 (2004)
- (2004) Journal of Machine Learning Research , vol.5 , pp. 1471-1530
- Greensmith, E.¹ Bartlett, P.L.² Baxter, J.³

7
- 74049141404
- Methods for learning control policies from variable-constraint demonstrations
- Sigaud, O., Peters, J. (eds.). SCI. Springer, Heidelberg
- Howard, M., Klanke, S., Gienger, M., Goerick, C., Vijayakumar, S.: Methods for learning control policies from variable-constraint demonstrations. In: Sigaud, O., Peters, J. (eds.) From Motor Learning to Interaction Learning in Robots. SCI, vol.264, pp. 253-291. Springer, Heidelberg (2010)
- (2010) From Motor Learning to Interaction Learning in Robots , vol.264 , pp. 253-291
- Howard, M.¹ Klanke, S.² Gienger, M.³ Goerick, C.⁴ Vijayakumar, S.⁵

8
- 0026712578
- Weight perturbation: An optimal architecture and learning technique for analog VLSI feedforward and recurrent multilayer networks
- Jabri, M., Flower, B.:Weight perturbation: An optimal architecture and learning technique for analog VLSI feedforward and recurrent multilayer networks. IEEE Trans. Neural Netw. 3, 154-157 (1992)
- (1992) IEEE Trans. Neural Netw. , vol.3 , pp. 154-157
- Jabri, M.¹ Flower, B.²

9
- 0032073263
- Planning and acting in partially observable stochastic domains
- Kaelbling, L.P., Littman, M.L., Cassandra, A.R.: Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101 (1998)
- (1998) Artificial Intelligence , vol.101
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

10
- 74049090442
- Imitation and reinforcement learning for motor primitives with perceptual coupling
- Sigaud, O., Peters, J. (eds.). SCI. Springer, Heidelberg
- Kober, J., Mohler, B., Peters, J.: Imitation and reinforcement learning for motor primitives with perceptual coupling. In: Sigaud, O., Peters, J. (eds.) From Motor Learning to Interaction Learning in Robots. SCI, vol.264, pp. 209-225. Springer, Heidelberg (2010)
- (2010) From Motor Learning to Interaction Learning in Robots , vol.264 , pp. 209-225
- Kober, J.¹ Mohler, B.² Peters, J.³

11
- 74049151713
- Off-policy policy search
- Meuleau, N., Peshkin, L., Kaelbling, L.P., Kim, K.-E.: Off-policy policy search. In: NIPS (2000)
- (2000) NIPS
- Meuleau, N.¹ Peshkin, L.² Kaelbling, L.P.³ Kim, K.-E.⁴

12
- 74049092902
- Policy gradient methods for robot control
- University of Southern California
- Peters, J., Vijayakumar, S., Schaal, S.: Policy gradient methods for robot control (Technical Report CS-03-787). University of Southern California (2003)
- (2003) Technical Report CS-03-787
- Peters, J.¹ Vijayakumar, S.² Schaal, S.³

13
- 74049159258
- Signal-to-noise ratio analysis of policy gradient algorithms
- Roberts, J.W., Tedrake, R.: Signal-to-noise ratio analysis of policy gradient algorithms. In: Advances of Neural Information Processing Systems (NIPS), vol.21, p. 8 (2009)
- (2009) Advances of Neural Information Processing Systems (NIPS) , vol.21 , pp. 8
- Roberts, J.W.¹ Tedrake, R.²

14
- 74049126171
- Shelley, M.: Personal Communication (2007)
- (2007) Personal Communication
- Shelley, M.¹

15
- 14044262287
- Stochastic policy gradient reinforcement learning on a simple 3D biped
- Sendai, Japan
- Tedrake, R., Zhang, T.W., Seung, H.S.: Stochastic policy gradient reinforcement learning on a simple 3D biped. In: Proceedings of the IEEE International Conference on Intelligent Robots and Systems (IROS), Sendai, Japan, pp. 2849-2854 (2004)
- (2004) Proceedings of the IEEE International Conference on Intelligent Robots and Systems (IROS) , pp. 2849-2854
- Tedrake, R.¹ Zhang, T.W.² Seung, H.S.³

16
- 33645659820
- On unidirectional flight of a free flapping wing
- Vandenberghe, N., Childress, S., Zhang, J.: On unidirectional flight of a free flapping wing. Physics of Fluids, 18 (2006)
- (2006) Physics of Fluids , vol.18
- Vandenberghe, N.¹ Childress, S.² Zhang, J.³

17
- 2442651249
- Symmetry breaking leads to forward flapping flight
- Vandenberghe, N., Zhang, J., Childress, S.: Symmetry breaking leads to forward flapping flight. Journal of Fluid Mechanics 506, 147-155 (2004)
- (2004) Journal of Fluid Mechanics , vol.506 , pp. 147-155
- Vandenberghe, N.¹ Zhang, J.² Childress, S.³

18
- 34047226109
- Importance sampling actor-critic algorithms
- Williams, J.L., Fisher III, J.W., Willsky, A.S.: Importance sampling actor-critic algorithms. In: Proceedings of the 2006 American Control Conference (2006)
- (2006) Proceedings of the 2006 American Control Conference
- Williams, J.L.¹ Fisher III, J.W.² Willsky, A.S.³

19
- 0000337576
- Simple statistical gradient-following algorithms for connectionist reinforcement learning
- Williams, R.: Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning 8, 229-256 (1992)
- (1992) Machine Learning , vol.8 , pp. 229-256
- Williams, R.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.