-
1
-
-
0034859944
-
Autonomous helicopter control using reinforcement learning policy search methods
-
Piscataway NJ: Institute of Electrical and Electronics Engineers, Inc.
-
Bagnell, J., and Schneider, J. 2001. Autonomous Helicopter Control Using Reinforcement Learning Policy Search Methods. In Proceedings of the International Conference on Robotics and Automation 2001, 1615-1620. Piscataway NJ: Institute of Electrical and Electronics Engineers, Inc.
-
(2001)
Proceedings of the International Conference on Robotics and Automation 2001
, pp. 1615-1620
-
-
Bagnell, J.1
Schneider, J.2
-
2
-
-
0004870746
-
A problem in the sequential design of experiments
-
Bellman, R. E. 1956. A Problem in the Sequential Design of Experiments. Sankhya 16(3,4): 221-229.
-
(1956)
Sankhya
, vol.16
, Issue.3-4
, pp. 221-229
-
-
Bellman, R.E.1
-
4
-
-
85153940465
-
Generalization in reinforcement learning: Safely approximating the value function
-
Cambridge, MA: The MIT Press
-
Boyan, J. A., and Moore, A. W. 1995. Generalization in Reinforcement Learning: Safely Approximating the Value Function. In Advances in Neural Information Processing Systems 7, 369-376. Cambridge, MA: The MIT Press.
-
(1995)
Advances in Neural Information Processing Systems 7
, pp. 369-376
-
-
Boyan, J.A.1
Moore, A.W.2
-
5
-
-
0028605089
-
Swinging up the acrobot: An example of intelligent control
-
Piscataway, NJ: Institute of Electrical and Electronics Engineers, Inc.
-
Dejong, G., and Spong, M. W. 1994. Swinging Up the Acrobot: An Example of Intelligent Control. In Proceedings of the American Control Conference, 2158-2162. Piscataway, NJ: Institute of Electrical and Electronics Engineers, Inc.
-
(1994)
Proceedings of the American Control Conference
, pp. 2158-2162
-
-
Dejong, G.1
Spong, M.W.2
-
6
-
-
35248818685
-
Tetris is hard, even to approximate
-
Lecture Notes in Computer Science, Springer
-
Demaine, D. E.; Hohenberger, S.; and Liben-Nowell, D. 2003. Tetris Is Hard, Even to Approximate. In Proceedings of the Ninth International Computing and Combinatorics Conference, 351-363. Lecture Notes in Computer Science, Volume 2697. Berlin: Springer.
-
(2003)
Proceedings of the Ninth International Computing and Combinatorics Conference
, vol.2697
, pp. 351-363
-
-
Demaine, D.E.1
Hohenberger, S.2
Liben-Nowell, D.3
-
11
-
-
33744488034
-
Inverted autonomous helicopter flight via reinforcement learning
-
Berlin: Springer
-
Ng, A. Y.; Coates, A.; Diel, M.; Ganapathi, V.; Schulte, J.; Tse, B.; Berger, E.; and Liang, E. 2004. Inverted Autonomous Helicopter Flight Via Reinforcement Learning. In Proceedings of the International Symposium on Experimental Robotics, 363-372. Berlin: Springer.
-
(2004)
Proceedings of the International Symposium on Experimental Robotics
, pp. 363-372
-
-
Ng, A.Y.1
Coates, A.2
Diel, M.3
Ganapathi, V.4
Schulte, J.5
Tse, B.6
Berger, E.7
Liang, E.8
-
12
-
-
0032021222
-
Soccer server: A tool for research on multiagent systems
-
Noda, I.; Matsubara, H.; Hiraki, K.; and Frank, I. 1998. Soccer Server: A Tool for Research on Multiagent Systems. Applied Artificial Intelligence 12(1): 233-250. (Pubitemid 127619180)
-
(1998)
Applied Artificial Intelligence
, vol.12
, Issue.2-3
, pp. 233-250
-
-
Noda, I.1
Matsubara, H.2
Hiraki, K.3
Frank, I.4
-
13
-
-
79951937255
-
A novel benchmark methodology and data repository for real-life reinforcement learning
-
New York: Association for Computing Machinery
-
Nouri, A.; Littman, M. L.; Li, L.; Parr, R.; Painter-Wakefield, C.; and Taylor, G. 2009. A Novel Benchmark Methodology and Data Repository for Real-Life Reinforcement Learning. In Proceedings of the 26th International Conference on Machine Learning. New York: Association for Computing Machinery.
-
(2009)
Proceedings of the 26th International Conference on Machine Learning
-
-
Nouri, A.1
Littman, M.L.2
Li, L.3
Parr, R.4
Painter-Wakefield, C.5
Taylor, G.6
-
14
-
-
37249034293
-
Keepaway soccer: From machine learning testbed to benchmark
-
Berlin: Springer Verlag
-
Stone, P.; Kuhlmann, G.; Taylor, M. E.; and Liu, Y. 2005. Keepaway Soccer: From Machine Learning Testbed to Benchmark. In Robocup-2005: Robot Soccer World Cup IX, Volume 4020, 93-105. Berlin: Springer Verlag.
-
(2005)
Robocup-2005: Robot Soccer World Cup IX
, vol.4020
, pp. 93-105
-
-
Stone, P.1
Kuhlmann, G.2
Taylor, M.E.3
Liu, Y.4
-
15
-
-
27544506565
-
Reinforcement learning in robocup-soccer keepaway
-
Stone, P.; Sutton, R. S.; and Kuhlmann, G. 2005. Reinforcement Learning in Robocup-Soccer Keepaway. Adaptive Behavior 13(3): 165-188.
-
(2005)
Adaptive Behavior
, vol.13
, Issue.3
, pp. 165-188
-
-
Stone, P.1
Sutton, R.S.2
Kuhlmann, G.3
-
16
-
-
85156221438
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
Cambridge, MA: The MIT Press
-
Sutton, R. S. 1996. Generalization in Reinforcement Learning: Successful Examples Using Sparse Coarse Coding. In Proceedings of Advances in Neural Information Processing Systems 8, 1038-1044. Cambridge, MA: The MIT Press.
-
(1996)
Proceedings of Advances in Neural Information Processing Systems
, vol.8
, pp. 1038-1044
-
-
Sutton, R.S.1
-
17
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R. S. 1988. Learning to Predict by the Methods of Temporal Differences. Machine Learning 3(1): 9-44.
-
(1988)
Machine Learning
, vol.3
, Issue.1
, pp. 9-44
-
-
Sutton, R.S.1
-
19
-
-
33845344721
-
Learning tetris using the noisy cross-entropy method
-
DOI 10.1162/neco.2006.18.12.2936
-
Szita, I., and Lörincz, A. 2006. Learning Tetris Using the Noisy Cross-Entropy Method. Neural Computation 18(12): 2936-2941. (Pubitemid 44879147)
-
(2006)
Neural Computation
, vol.18
, Issue.12
, pp. 2936-2941
-
-
Szita, I.1
Lorincz, A.2
-
20
-
-
70449370276
-
RL-Glue: Language-independent software for reinforcement-learning experiments
-
September
-
Tanner, B., and White, A. 2009. RL-Glue: Language-Independent Software for Reinforcement-Learning Experiments. Journal of Machine Learning Research 10 (September): 2133-2136.
-
(2009)
Journal of Machine Learning Research
, vol.10
, pp. 2133-2136
-
-
Tanner, B.1
White, A.2
-
21
-
-
79951880135
-
-
Master's Thesis, Department of Computing, University of Alberta, Edmonton, Alberta, Canada
-
White, A. 2006. A Standard Benchmarking System for Reinforcement Learning. Master's Thesis, Department of Computing, University of Alberta, Edmonton, Alberta, Canada.
-
(2006)
A Standard Benchmarking System for Reinforcement Learning
-
-
White, A.1
-
22
-
-
84869461477
-
Generalized domains for empirical evaluations in reinforcement learning
-
Paper presented Montreal, Quebec, Canada, 25 March
-
Whiteson, S.; Tanner, B.; Taylor, M. E.; and Stone, P. 2009. Generalized Domains for Empirical Evaluations in Reinforcement Learning. Paper presented at the 4th Workshop on Evaluation Methods for Machine Learning, Montreal, Quebec, Canada, 25 March.
-
(2009)
4th Workshop on Evaluation Methods for Machine Learning
-
-
Whiteson, S.1
Tanner, B.2
Taylor, M.E.3
Stone, P.4
-
23
-
-
23044435398
-
Dynamic model of the octopus arm. I. biomechanics of the octopus reaching movement
-
DOI 10.1152/jn.00684.2004
-
Yekutieli, Y.; Sagiv-Zohar, R.; Aharonov, R.; Engel, Y.; Hochner, B.; and Flash, T. 2005. A Dynamic Model of the Octopus Arm. I. Biomechanics of the Octopus Reaching Movement. Journal of Neurophysiology 94(2): 1443-1458. (Pubitemid 41061378)
-
(2005)
Journal of Neurophysiology
, vol.94
, Issue.2
, pp. 1443-1458
-
-
Yekutieli, Y.1
Sagiv-Zohar, R.2
Aharonov, R.3
Engel, Y.4
Hochner, B.5
Flash, T.6
|