-
3
-
-
0013535965
-
Infinite-horizon policy-gradient estimation
-
Baxter, J., and Bartlett, P. L. 2001. Infinite-horizon policy-gradient estimation. Journal of AI Research 15:319-350.
-
(2001)
Journal of AI Research
, vol.15
, pp. 319-350
-
-
Baxter, J.1
Bartlett, P.L.2
-
5
-
-
84867449438
-
Omnidirectional motion for quadruped robots
-
Birk, A. et al. eds. Springer
-
Hengst, B.; Ibbotson, D.; Pham, S. B.; and Sammut, C. 2001. Omnidirectional motion for quadruped robots. In Birk, A. et al. eds., RoboCup International Symposium, Lecture Notes in Computer Science, LNAI 2377, 368. Springer.
-
(2001)
RoboCup International Symposium, Lecture Notes in Computer Science, LNAI
, vol.2377
, pp. 368
-
-
Hengst, B.1
Ibbotson, D.2
Pham, S.B.3
Sammut, C.4
-
6
-
-
0042634673
-
Autonomous evolution of gaits with the Sony quadruped robot
-
Banzhaf, W. et al. eds. Orlando, Florida, USA: Morgan Kaufmann
-
Hornby, G. S.; Fujita, M.; Takamura, S.; Yamamoto, T.; and Hanagata, O. 1999. Autonomous evolution of gaits with the Sony quadruped robot. In Banzhaf, W. et al. eds., Proceedings of the Genetic and Evolutionary Computation Conference, volume 2, 1297-1304. Orlando, Florida, USA: Morgan Kaufmann.
-
(1999)
Proceedings of the Genetic and Evolutionary Computation Conference
, vol.2
, pp. 1297-1304
-
-
Hornby, G.S.1
Fujita, M.2
Takamura, S.3
Yamamoto, T.4
Hanagata, O.5
-
9
-
-
0039836198
-
Q2: Memory-based active learning for optimizing noisy continuous functions
-
Shavlik, J., ed. 340 Pine Street, 6th Fl., San Francisco, CA 94104: Morgan Kaufmann
-
Moore, A.; Schneider, J.; Boyan, J.; and Lee, M. S. 1998. Q2: Memory-based active learning for optimizing noisy continuous functions. In Shavlik, J., ed., Proceedings of the Fifteenth International Conference of Machine Learning, 386-394. 340 Pine Street, 6th Fl., San Francisco, CA 94104: Morgan Kaufmann.
-
(1998)
Proceedings of the Fifteenth International Conference of Machine Learning
, pp. 386-394
-
-
Moore, A.1
Schneider, J.2
Boyan, J.3
Lee, M.S.4
-
10
-
-
84898980684
-
Autonomous helicopter flight via reinforcement learning
-
MIT Press. To Appear
-
Ng, A. et al. 2004. Autonomous helicopter flight via reinforcement learning. In Advances in Neural Information Processing Systems 17. MIT Press. To Appear.
-
(2004)
Advances in Neural Information Processing Systems
, vol.17
-
-
Ng, A.1
-
13
-
-
3042530303
-
GermanTeam RoboCup 2003
-
Sony. 2004. Aibo robot
-
Rofer, T. et al. 2003. GermanTeam RoboCup 2003. Tech report.
Sony. 2004. Aibo robot. www.sony.net/Products/aibo.
-
(2003)
Tech Report
-
-
Rofer, T.1
-
14
-
-
3042623052
-
UT Austin Villa 2003: A new RoboCup four-legged team
-
The University of Texas at Austin, Department of Computer Sciences, AI Laboratory
-
Stone, P. et al. 2003. UT Austin Villa 2003: A new RoboCup four-legged team. Technical Report UT-AI-TR-03-304, The University of Texas at Austin, Department of Computer Sciences, AI Laboratory. At http://www.cs.utexas.edu/home/department/pubsforms.shtml.
-
(2003)
Technical Report
, vol.UT-AI-TR-03-304
-
-
Stone, P.1
-
15
-
-
84862476377
-
A model-based approach to robot joint control
-
Stronger, D., and Stone, P. 2003. A model-based approach to robot joint control. Under Review. Available from http://www.cs.utexas.edu/~pstone/papers.html.
-
(2003)
Under Review
-
-
Stronger, D.1
Stone, P.2
-
16
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
The MIT Press
-
Sutton, R.; McAllester, D.; Singh, S.; and Mansour, Y. 2000. Policy gradient methods for reinforcement learning with function approximation. In Advances in Neural Information Processing Systems, volume 12, 1057-1063. The MIT Press.
-
(2000)
Advances in Neural Information Processing Systems
, vol.12
, pp. 1057-1063
-
-
Sutton, R.1
McAllester, D.2
Singh, S.3
Mansour, Y.4