-
1
-
-
0028401306
-
Case-based reasoning; foundational issues, methodological variations, and system approaches
-
Aamodt, A., & Plaza, E. (1994). Case-Based Reasoning; Foundational Issues, Methodological Variations, and System Approaches. AI Communications, 7 (1), 39-59.
-
(1994)
AI Communications
, vol.7
, Issue.1
, pp. 39-59
-
-
Aamodt, A.1
Plaza, E.2
-
2
-
-
84883027643
-
Autonomous Autorotation of an RC Helicopter
-
Abbeel, P., Coates, A., Hunter, T., & Ng, A. Y. (2008). Autonomous Autorotation of an RC Helicopter. In ISER, pp. 385-394.
-
(2008)
ISER
, pp. 385-394
-
-
Abbeel, P.1
Coates, A.2
Hunter, T.3
Ng, A.Y.4
-
3
-
-
77955809093
-
Autonomous helicopter aerobatics through apprenticeship learning. I
-
Abbeel, P., Coates, A., & Ng, A. Y. (2010). Autonomous helicopter aerobatics through apprenticeship learning. I. J. Robotic Res., 29 (13), 1608-1639.
-
(2010)
J. Robotic Res.
, vol.29
, Issue.13
, pp. 1608-1639
-
-
Abbeel, P.1
Coates, A.2
Ng, A.Y.3
-
4
-
-
50249164874
-
Robocup 2007: Robot soccer world cup xi
-
Springer-Verlag, Berlin, Heidelberg
-
Abbott, R. G. (2008). Robocup 2007: Robot soccer world cup xi.. chap. Behavioral Cloning for Simulator Validation, pp. 329-336. Springer-Verlag, Berlin, Heidelberg.
-
(2008)
Chap. Behavioral Cloning for Simulator Validation
, pp. 329-336
-
-
Abbott, R.G.1
-
5
-
-
0000217085
-
Tolerating noisy, irrelevant and novel attributes in instance-based learning algorithms
-
Aha, D. W. (1992). Tolerating Noisy, Irrelevant and Novel Attributes in Instance-Based Learning Algorithms. International Journal Man-Machine Studies, 36 (2), 267-287.
-
(1992)
International Journal Man-Machine Studies
, vol.36
, Issue.2
, pp. 267-287
-
-
Aha, D.W.1
-
6
-
-
0025725905
-
Instance-based learning algorithms
-
Aha, D. W., & Kibler, D. (1991). Instance-based learning algorithms. In Machine Learning, pp. 37-66.
-
(1991)
Machine Learning
, pp. 37-66
-
-
Aha, D.W.1
Kibler, D.2
-
7
-
-
1942515258
-
Behavioral cloning of student pilots with modular neural networks
-
Morgan Kaufmann
-
Anderson, C. W., Draper, B. A., & Peterson, D. A. (2000). Behavioral cloning of student pilots with modular neural networks. In Proceedings of the Seventeenth International Conference on Machine Learning, pp. 25-32. Morgan Kaufmann.
-
(2000)
Proceedings of the Seventeenth International Conference on Machine Learning
, pp. 25-32
-
-
Anderson, C.W.1
Draper, B.A.2
Peterson, D.A.3
-
8
-
-
63149159130
-
A survey of robot learning from demonstration
-
Argall, B., Chernova, S., Veloso, M., & Browning, B. (2009). A Survey of Robot Learning from Demonstration. Robotics and Autonomous Systems, 57 (5), 469-483.
-
(2009)
Robotics and Autonomous Systems
, vol.57
, Issue.5
, pp. 469-483
-
-
Argall, B.1
Chernova, S.2
Veloso, M.3
Browning, B.4
-
9
-
-
84901708832
-
Case-based reasoning: Survey and future directions
-
Puppe, F. (Ed.), Vol. 1570 of Lecture Notes in Computer Science, Springer
-
Bartsch-Sprl, B., Lenz, M., & Hbner, A. (1999). Case-based reasoning: Survey and future directions.. In Puppe, F. (Ed.), XPS, Vol. 1570 of Lecture Notes in Computer Science, pp. 67-89. Springer.
-
(1999)
XPS
, pp. 67-89
-
-
Bartsch-Sprl, B.1
Lenz, M.2
Hbner, A.3
-
10
-
-
70350352555
-
Improving reinforcement learning by using case-based heuristics
-
Springer, Lecture Notes in Artificial Intelligence, Springer
-
Bianchi, R., Ros, R., & de Mántaras, R. L. (2009). Improving reinforcement learning by using case-based heuristics.. Vol. 5650, pp. 75-89. Lecture Notes in Artificial Intelligence, Springer, Lecture Notes in Artificial Intelligence, Springer.
-
(2009)
Lecture Notes in Artificial Intelligence
, vol.5650
, pp. 75-89
-
-
Bianchi, R.1
Ros, R.2
De Mántaras, R.L.3
-
11
-
-
72249118874
-
SIMBA: A simulator for business education and research
-
Borrajo, F., Bueno, Y., de Pablo, I., Santos, B. n., Fernandez, F., Garcia, J., & Sagredo, I. (2010). SIMBA: A Simulator for Business Education and Research. Decission Support Systems, 48 (3), 498-506.
-
(2010)
Decission Support Systems
, vol.48
, Issue.3
, pp. 498-506
-
-
Borrajo, F.1
Bueno, Y.2
De Pablo, I.3
Santos, B.N.4
Fernandez, F.5
García, J.6
Sagredo, I.7
-
12
-
-
84875147306
-
Proceedings of the workshop on value function approximation, machine learning conference 1995
-
Boyan, J., Moore, A., & Sutton, R. (1995). Proceedings of the workshop on value function approximation, machine learning conference 1995... Technical Report CMU-CS-95- 206.
-
(1995)
Technical Report CMU-CS-95- 206
-
-
Boyan, J.1
Moore, A.2
Sutton, R.3
-
14
-
-
67650691600
-
Multi-thresholded approach to demonstration selection for interactive robot learning
-
New York, NY, USA. ACM
-
Chernova, S., & Veloso, M. (2008). Multi-thresholded approach to demonstration selection for interactive robot learning. In Proceedings of the 3rd ACM/IEEE international conference on Human robot interaction, HRI '08, pp. 225-232, New York, NY, USA. ACM.
-
(2008)
Proceedings of the 3rd ACM/IEEE International Conference on Human Robot Interaction, HRI '08
, pp. 225-232
-
-
Chernova, S.1
Veloso, M.2
-
15
-
-
0007512578
-
Truncating temporal differences: On the efficient implementation of td(lambda) for reinforcement learning
-
Cichosz, P. (1995). Truncating temporal differences: On the efficient implementation of td(lambda) for reinforcement learning. Journal of Artificial Intelligence Research (JAIR), 2, 287-318.
-
(1995)
Journal of Artificial Intelligence Research (JAIR)
, vol.2
, pp. 287-318
-
-
Cichosz, P.1
-
17
-
-
0033077715
-
Risk-sensitive and minimax control of discrete- time, finite-state markov decision processes
-
Coraluppi, S. P., & Marcus, S. I. (1999). Risk-Sensitive and Minimax Control of Discrete- Time, Finite-State Markov Decision Processes. AUTOMATICA, 35, 301-309.
-
(1999)
Automatica
, vol.35
, pp. 301-309
-
-
Coraluppi, S.P.1
Marcus, S.I.2
-
20
-
-
4444312102
-
Integrating guidance into relational reinforcement learning
-
Driessens, K., & Dẑeroski, S. (2004). Integrating guidance into relational reinforcement learning. Machine Learning, 57 (3), 271-304.
-
(2004)
Machine Learning
, vol.57
, Issue.3
, pp. 271-304
-
-
Driessens, K.1
Dẑeroski, S.2
-
21
-
-
39549117816
-
Local feature weighting in nearest prototype classification
-
Fernandez, F., & Isasi, P. (2008). Local feature weighting in nearest prototype classification. Neural Networks, IEEE Transactions on, 19 (1), 40-53.
-
(2008)
Neural Networks, IEEE Transactions on
, vol.19
, Issue.1
, pp. 40-53
-
-
Fernandez, F.1
Isasi, P.2
-
22
-
-
38949129339
-
Two steps reinforcement learning
-
Fernandez, F., & Borrajo, D. (2008). Two steps reinforcement learning. International Journal of Intelligent Systems, 23 (2), 213-245.
-
(2008)
International Journal of Intelligent Systems
, vol.23
, Issue.2
, pp. 213-245
-
-
Fernandez, F.1
Borrajo, D.2
-
24
-
-
52449097334
-
A case-based reasoning approach to imitating robocup players
-
Floyd, M. W., Esfandiari, B., & Lam, K. (2008). A Case-Based Reasoning Approach to Imitating Robocup Players. In Proceedings of the 21st International Florida Artificial Intelligence Research Society Conference, pp. 251-256.
-
(2008)
Proceedings of the 21st International Florida Artificial Intelligence Research Society Conference
, pp. 251-256
-
-
Floyd, M.W.1
Esfandiari, B.2
Lam, K.3
-
29
-
-
79956136559
-
Safe exploration for reinforcement learning
-
Hans, A., Schneegass, D., Schäfer, A. M., & Udluft, S. (2008). Safe Exploration for Reinforcement Learning. In European Symposium on Artificial Neural Network, pp. 143-148.
-
(2008)
European Symposium on Artificial Neural Network
, pp. 143-148
-
-
Hans, A.1
Schneegass, D.2
Schäfer, A.M.3
Udluft, S.4
-
31
-
-
55749100315
-
Seeding the initial population of a multi-objective evolutionary algorithm using gradient-based information
-
IEEE
-
Hernández-Díaz, A. G., Coello, C. A. C., Perez, F., Caballero, R., Luque, J. M., & Santana- Quintero, L. V. (2008). Seeding the initial population of a multi-objective evolutionary algorithm using gradient-based information. In IEEE Congress on Evolutionary Computation, pp. 1617-1624. IEEE.
-
(2008)
IEEE Congress on Evolutionary Computation
, pp. 1617-1624
-
-
Hernández-Díaz, A.G.1
Coello, C.A.C.2
Perez, F.3
Caballero, R.4
Luque, J.M.5
Santana- Quintero, L.V.6
-
32
-
-
84875140551
-
-
Tech. rep. arXiv e-Prints 1105.1749, arXiv
-
Hester, T., Quinlan, M., & Stone, P. (2011). A real-time model-based reinforcement learning architecture for robot control. Tech. rep. arXiv e-Prints 1105.1749, arXiv.
-
(2011)
A Real-time Model-based Reinforcement Learning Architecture for Robot Control
-
-
Hester, T.1
Quinlan, M.2
Stone, P.3
-
33
-
-
84867438662
-
Essex wizards 2001 team description
-
Birk, A. Coradeschi, S. & Tadokoro, S. (Eds.), Vol. 2377 of Lecture Notes in Computer Science, Springer
-
Hu, H., Kostiadis, K., Hunter, M., & Kalyviotis, N. (2001). Essex wizards 2001 team description. In Birk, A., Coradeschi, S., & Tadokoro, S. (Eds.), RoboCup, Vol. 2377 of Lecture Notes in Computer Science, pp. 511-514. Springer.
-
(2001)
RoboCup
, pp. 511-514
-
-
Hu, H.1
Kostiadis, K.2
Hunter, M.3
Kalyviotis, N.4
-
35
-
-
0029679044
-
Reinforcement learning: A survey
-
Kaelbling, L., Littman, M., & Moore, A. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research (JAIR), 4, 237-285.
-
(1996)
Journal of Artificial Intelligence Research (JAIR)
, vol.4
, pp. 237-285
-
-
Kaelbling, L.1
Littman, M.2
Moore, A.3
-
36
-
-
84866396617
-
Reinforcement learning for games: Failures and successes
-
New York, NY, USA. ACM
-
Konen, W., & Bartz-Beielstein, T. (2009). Reinforcement learning for games: failures and successes. In Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers, GECCO '09, pp. 2641- 2648, New York, NY, USA. ACM.
-
(2009)
Proceedings of the 11th Annual Conference Companion on Genetic and Evolutionary Computation Conference: Late Breaking Papers, GECCO '09
, pp. 2641-2648
-
-
Konen, W.1
Bartz-Beielstein, T.2
-
38
-
-
80955137547
-
Neuroevolutionary reinforcement learning for generalized control of simulated helicopters
-
Koppejan, R., & Whiteson, S. (2011). Neuroevolutionary reinforcement learning for generalized control of simulated helicopters. Evolutionary Intelligence, 4, 219-241.
-
(2011)
Evolutionary Intelligence
, vol.4
, pp. 219-241
-
-
Koppejan, R.1
Whiteson, S.2
-
41
-
-
9444276079
-
Reinforcement learning for average reward zero-sum games
-
Shawe- Taylor, J. & Singer, Y. (Eds.), Vol. 3120 of Lecture Notes in Computer Science, Springer
-
Mannor, S. (2004). Reinforcement learning for average reward zero-sum games. In Shawe- Taylor, J., & Singer, Y. (Eds.), COLT, Vol. 3120 of Lecture Notes in Computer Science, pp. 49-63. Springer.
-
(2004)
COLT
, pp. 49-63
-
-
Mannor, S.1
-
44
-
-
0036832952
-
Risk-Sensitive reinforcement learning
-
Mihatsch, O., & Neuneier, R. (2002). Risk-Sensitive reinforcement learning. Machine Learning, 49 (2-3), 267-290.
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 267-290
-
-
Mihatsch, O.1
Neuneier, R.2
-
46
-
-
0016082525
-
Learning automata - A survey
-
Narendra, K. S., & Thathachar, M. A. L. (1974). Learning automata - a survey. Ieee Transactions On Systems Man And Cybernetics, SMC-4(4), 323-334.
-
(1974)
Ieee Transactions on Systems Man and Cybernetics, SMC-4
, Issue.4
, pp. 323-334
-
-
Narendra, K.S.1
Thathachar, M.A.L.2
-
47
-
-
0003891507
-
-
Prentice-Hall, Inc. Upper Saddle River, NJ, USA
-
Narendra, K. S., & Thathachar, M. A. L. (1989). Learning automata: an introduction. Prentice-Hall, Inc., Upper Saddle River, NJ, USA.
-
(1989)
Learning Automata: An Introduction
-
-
Narendra, K.S.1
Thathachar, M.A.L.2
-
48
-
-
3042583887
-
Autonomous helicopter flight via reinforcement learning
-
Thrun, S. Saul, L. K. & Scholkopf, B. (Eds.), MIT Press
-
Ng, A. Y., Kim, H. J., Jordan, M. I., & Sastry, S. (2003). Autonomous Helicopter Flight via Reinforcement Learning. In Thrun, S., Saul, L. K., & Scholkopf, B. (Eds.), NIPS. MIT Press.
-
(2003)
NIPS
-
-
Ng, A.Y.1
Kim, H.J.2
Jordan, M.I.3
Sastry, S.4
-
49
-
-
84875131154
-
Robot learning
-
Sammut, C. & Webb, G. I. (Eds.), Springer
-
Peters, J., Tedrake, R., Roy, N., & Morimoto, J. (2010). Robot learning. In Sammut, C., & Webb, G. I. (Eds.), Encyclopedia of Machine Learning, pp. 865-869. Springer.
-
(2010)
Encyclopedia of Machine Learning
, pp. 865-869
-
-
Peters, J.1
Tedrake, R.2
Roy, N.3
Morimoto, J.4
-
50
-
-
0242667271
-
Genetic programming with user-driven selection: Experiments on the evolution of algorithms for image enhancement
-
Morgan Kaufmann
-
Poli, R., & Cagnoni, S. (1997). Genetic programming with user-driven selection: Experiments on the evolution of algorithms for image enhancement. In Genetic Programming 1997: Proceedings of the Second Annual Conference, pp. 269-277. Morgan Kaufmann.
-
(1997)
Genetic Programming 1997: Proceedings of the Second Annual Conference
, pp. 269-277
-
-
Poli, R.1
Cagnoni, S.2
-
51
-
-
62949112174
-
A collaborative reinforcement learning approach to urban tra-c control optimization
-
Salkham, A., Cunningham, R., Garg, A., & Cahill, V. (2008). A collaborative reinforcement learning approach to urban tra-c control optimization. In Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on, Vol. 2, pp. 560-566.
-
(2008)
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
, vol.2
, pp. 560-566
-
-
Salkham, A.1
Cunningham, R.2
Garg, A.3
Cahill, V.4
-
52
-
-
0031231885
-
Experiments with reinforcement learning in problems with continuous state and action spaces
-
Santamaría, J. C., Sutton, R. S., & Ram, A. (1998). Experiments with reinforcement learning in problems with continuous state and action spaces. Adaptive Behavior, 6, 163-218.
-
(1998)
Adaptive Behavior
, vol.6
, pp. 163-218
-
-
Santamaría, J.C.1
Sutton, R.S.2
Ram, A.3
-
53
-
-
80054035256
-
Transfer learning in real-time strategy games using hybrid cbr/rl
-
Sharma, M., Holmes, M., Santamaria, J., Irani, A., Isbell, C., & Ram, A. (2007). Transfer learning in real-time strategy games using hybrid cbr/rl. In In Proceedings of the Twentieth International Joint Conference on Artificial Intelligence.
-
(2007)
Proceedings of the Twentieth International Joint Conference on Artificial Intelligenced
-
-
Sharma, M.1
Holmes, M.2
Santamaria, J.3
Irani, A.4
Isbell, C.5
Ram, A.6
-
55
-
-
0001898381
-
Practical reinforcement learning in continuous spaces
-
Morgan Kaufmann
-
Smart, W. D., & Kaelbling, L. P. (2000). Practical reinforcement learning in continuous spaces. In Artificial Intelligence, pp. 903-910. Morgan Kaufmann.
-
(2000)
Artificial Intelligence
, pp. 903-910
-
-
Smart, W.D.1
Kaelbling, L.P.2
-
56
-
-
0036058423
-
Effective reinforcement learning for mobile robots
-
IEEE
-
Smart, W. D., & Kaelbling, L. P. (2002). Effective reinforcement learning for mobile robots. In ICRA, pp. 3404-3410. IEEE.
-
(2002)
ICRA
, pp. 3404-3410
-
-
Smart, W.D.1
Kaelbling, L.P.2
-
58
-
-
77955839705
-
Parameterized maneuver learning for autonomous helicopter ight
-
Tang, J., Singh, A., Goehausen, N., & Abbeel, P. (2010). Parameterized maneuver learning for autonomous helicopter ight. In International Conference on Robotics and Automation (ICRA).
-
(2010)
International Conference on Robotics and Automation (ICRA)
-
-
Tang, J.1
Singh, A.2
Goehausen, N.3
Abbeel, P.4
-
62
-
-
0033362601
-
Evolving artificial neural networks
-
Yao, X. (1999). Evolving artificial neural networks. PIEEE: Proceedings of the IEEE, 87, 1423-1447.
-
(1999)
PIEEE: Proceedings of the IEEE
, vol.87
, pp. 1423-1447
-
-
Yao, X.1
|