-
1
-
-
0016556021
-
A new approach to manipulator control: The cerebellar model articulation controller (CMAC)
-
Albus, J.S. (1975). A new approach to manipulator control: The cerebellar model articulation controller (CMAC). Dynamic Systems, Measurement and Control, 97, 220-227.
-
(1975)
Dynamic Systems, Measurement and Control
, vol.97
, pp. 220-227
-
-
Albus, J.S.1
-
2
-
-
2542445555
-
A vision-based reinforcement learning for coordination of soccer playing behaviors
-
Asada, M., Uchibe, E., Noda, S., Tawaratsumida, S., & Hosoda, K. (1994). A vision-based reinforcement learning for coordination of soccer playing behaviors. Proceedings of AAAI-94 Workshop on AI and A-life and Entertainment (pp. 16-21).
-
(1994)
Proceedings of AAAI-94 Workshop on AI and A-life and Entertainment
, pp. 16-21
-
-
Asada, M.1
Uchibe, E.2
Noda, S.3
Tawaratsumida, S.4
Hosoda, K.5
-
4
-
-
85011779332
-
Removing the genetics from the standard genetic algorithm
-
A. Prieditis, & S. Russell (Eds.), San Francisco, CA: Morgan Kaufmann Publishers
-
Baluja, S., & Caruana, R. (1995). Removing the genetics from the standard genetic algorithm. In A. Prieditis, & S. Russell (Eds.), Machine Learning: Proceedings of the Twelfth International Conference (pp. 38-46). San Francisco, CA: Morgan Kaufmann Publishers.
-
(1995)
Machine Learning: Proceedings of the Twelfth International Conference
, pp. 38-46
-
-
Baluja, S.1
Caruana, R.2
-
6
-
-
0002258659
-
A representation for the adaptive generation of simple sequential programs
-
J. Grefenstette (Ed.), Hillsdale, NJ: Lawrence Erlbaum Associates
-
Cramer, N.L. (1985). A representation for the adaptive generation of simple sequential programs. In J. Grefenstette (Ed.), Proceedings of an International Conference on Genetic Algorithms and their Applications (pp. 183-187). Hillsdale, NJ: Lawrence Erlbaum Associates.
-
(1985)
Proceedings of An International Conference on Genetic Algorithms and Their Applications
, pp. 183-187
-
-
Cramer, N.L.1
-
7
-
-
85156187730
-
Improving elevator performance using reinforcement learning
-
D. Touretzky, M. Mozer, & M. Hasselmo (Eds.), Cambridge, MA: MIT Press
-
Crites, R., & Barto, A. (1996). Improving elevator performance using reinforcement learning. In D. Touretzky, M. Mozer, & M. Hasselmo (Eds.), Advances in Neural Information Processing Systems (Vol. 8, pp. 1017-1023). Cambridge, MA: MIT Press.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1017-1023
-
-
Crites, R.1
Barto, A.2
-
8
-
-
0344299454
-
-
Fortgeschrittenenpraktikum, Institut für Informatik, Lehrstuhl Prof. Radig, Technische Universität München
-
Dickmanns, D., Schmidhuber, J., & Winklhofer, A. (1987). Der genetische Algorithmus: Eine Implementierung in Prolog. Fortgeschrittenenpraktikum, Institut für Informatik, Lehrstuhl Prof. Radig, Technische Universität München.
-
(1987)
Der Genetische Algorithmus: Eine Implementierung in Prolog
-
-
Dickmanns, D.1
Schmidhuber, J.2
Winklhofer, A.3
-
11
-
-
0000202647
-
Universal sequential search problems
-
Levin, L.A. (1973). Universal sequential search problems. Problems of Information Transmission, 9(3), 265-266.
-
(1973)
Problems of Information Transmission
, vol.9
, Issue.3
, pp. 265-266
-
-
Levin, L.A.1
-
12
-
-
0021404339
-
Randomness conservation inequalities: Information and independence in mathematical theories
-
Levin, L.A. (1984). Randomness conservation inequalities: Information and independence in mathematical theories. Information and Control, 61, 15-37.
-
(1984)
Information and Control
, vol.61
, pp. 15-37
-
-
Levin, L.A.1
-
15
-
-
85149834820
-
Markov games as a framework for multi-agent reinforcement learning
-
A. Prieditis, & S. Russell (Eds.), San Francisco, CA: Morgan Kaufmann Publishers
-
Littman, M.L. (1994). Markov games as a framework for multi-agent reinforcement learning. In A. Prieditis, & S. Russell (Eds.), Machine Learning: Proceedings of the Eleventh International Conference (pp. 157-163). San Francisco, CA: Morgan Kaufmann Publishers.
-
(1994)
Machine Learning: Proceedings of the Eleventh International Conference
, pp. 157-163
-
-
Littman, M.L.1
-
16
-
-
0003322602
-
Co-evolving soccer softbot team coordination with genetic programming
-
Luke, S., Hohn, C., Farris, J., Jackson, G., & Hendler, J. (1997). Co-evolving soccer softbot team coordination with genetic programming. Proceedings of the First International Workshop on RoboCup, at the International Joint Conference on Artificial Intelligence (IJCAI-97).
-
(1997)
Proceedings of the First International Workshop on RoboCup, at the International Joint Conference on Artificial Intelligence (IJCAI-97)
-
-
Luke, S.1
Hohn, C.2
Farris, J.3
Jackson, G.4
Hendler, J.5
-
17
-
-
0002650559
-
Learning of cooperative actions in multi-agent systems: A case study of pass play in soccer
-
S. Sen (Ed.), Menlo Park, CA: AAAI Press
-
Matsubara, H., Noda, I., & Hiraki, K. (1996). Learning of cooperative actions in multi-agent systems: A case study of pass play in soccer. In S. Sen (Ed.), Working Notes for the AAAI-96 Spring Symposium on Adaptation, Coevolution and Learning in Multi-agent Systems (pp. 63-67). Menlo Park, CA: AAAI Press.
-
(1996)
Working Notes for the AAAI-96 Spring Symposium on Adaptation, Coevolution and Learning in Multi-agent Systems
, pp. 63-67
-
-
Matsubara, H.1
Noda, I.2
Hiraki, K.3
-
19
-
-
0001765492
-
Simplifying neural networks by soft weight sharing
-
Nowlan, S.J., & Hinton, G.E. (1992). Simplifying neural networks by soft weight sharing. Neural Computation, 4, 173-193.
-
(1992)
Neural Computation
, vol.4
, pp. 173-193
-
-
Nowlan, S.J.1
Hinton, G.E.2
-
20
-
-
0000955979
-
Incremental multi-step Q-learning
-
Peng, J., & Williams, R. (1996). Incremental multi-step Q-learning. Machine Learning, 22, 283-290.
-
(1996)
Machine Learning
, vol.22
, pp. 283-290
-
-
Peng, J.1
Williams, R.2
-
22
-
-
0000108169
-
Probabilistic incremental program evolution
-
Sałustowicz, R.P., & Schmidhuber, J. (1997). Probabilistic incremental program evolution. Evolutionary Computation, 5(2), 123-141.
-
(1997)
Evolutionary Computation
, vol.5
, Issue.2
, pp. 123-141
-
-
Sałustowicz, R.P.1
Schmidhuber, J.2
-
24
-
-
0345593940
-
On learning soccer strategies
-
W. Gerstner, A. Germond, M. Hasler, & J.-D. Nicoud (Eds.), Proceedings of the Seventh International Conference on Artificial Neural Networks (ICANN'97), Berlin Heidelberg: Springer-Verlag
-
Sałustowicz, R.P., Wiering, M.A., & Schmidhuber, J. (1997b). On learning soccer strategies. In W. Gerstner, A. Germond, M. Hasler, & J.-D. Nicoud (Eds.), Proceedings of the Seventh International Conference on Artificial Neural Networks (ICANN'97), volume 1327 of Lecture Notes in Computer Science (pp. 769-774). Berlin Heidelberg: Springer-Verlag.
-
(1997)
Lecture Notes in Computer Science
, vol.1327
, pp. 769-774
-
-
Sałustowicz, R.P.1
Wiering, M.A.2
Schmidhuber, J.3
-
25
-
-
0031194381
-
Discovering neural nets with low Kolmogorov complexity and high generalization capability
-
Schmidhuber, J. (1997a). Discovering neural nets with low Kolmogorov complexity and high generalization capability. Neural Networks, 10(5), 857-873.
-
(1997)
Neural Networks
, vol.10
, Issue.5
, pp. 857-873
-
-
Schmidhuber, J.1
-
26
-
-
0007918330
-
A general method for incremental self-improvement and multi-agent learning in unrestricted environments
-
X. Yao (Ed.), Singapore: Scientific Publ. Co., in press
-
Schmidhuber, J. (1997b). A general method for incremental self-improvement and multi-agent learning in unrestricted environments. In X. Yao (Ed.), Evolutionary Computation: Theory and Applications. Singapore: Scientific Publ. Co., in press.
-
(1997)
Evolutionary Computation: Theory and Applications
-
-
Schmidhuber, J.1
-
27
-
-
0000156236
-
Reinforcement learning with self-modifying policies
-
S. Thrun & L. Pratt (Eds.), Boston, MA: Kluwer
-
Schmidhuber, J., Zhao, J., & Schraudolph, N. (1997a). Reinforcement learning with self-modifying policies. In S. Thrun & L. Pratt (Eds.), Learning to Learn (pp. 293-309). Boston, MA: Kluwer.
-
(1997)
Learning to Learn
, pp. 293-309
-
-
Schmidhuber, J.1
Zhao, J.2
Schraudolph, N.3
-
28
-
-
0031186687
-
Shifting inductive bias with success-story algorithm, adaptive Levin search, and incremental self-improvement
-
Schmidhuber, J., Zhao, J., & Wiering, M. (1997b). Shifting inductive bias with success-story algorithm, adaptive Levin search, and incremental self-improvement. Machine Learning, 28, 105-130.
-
(1997)
Machine Learning
, vol.28
, pp. 105-130
-
-
Schmidhuber, J.1
Zhao, J.2
Wiering, M.3
-
29
-
-
0022825723
-
An application of algorithmic probability to problems in artificial intelligence
-
L.N. Kanal & J.F. Lemmer (Eds.), Elsevier Science Publishers
-
Solomonoff, R. (1986). An application of algorithmic probability to problems in artificial intelligence. In L.N. Kanal & J.F. Lemmer (Eds.), Uncertainty in Artificial Intelligence (pp. 473-491). Elsevier Science Publishers.
-
(1986)
Uncertainty in Artificial Intelligence
, pp. 473-491
-
-
Solomonoff, R.1
-
30
-
-
85156255116
-
Beating a defender in robotic soccer: Memory-based learning of a continuous function
-
G. Tesauro, D.S. Touretzky, & T.K. Leen (Eds.), Cambridge, MA: MIT Press
-
Stone, P., & Veloso, M. (1996a). Beating a defender in robotic soccer: Memory-based learning of a continuous function. In G. Tesauro, D.S. Touretzky, & T.K. Leen (Eds.), Advances in Neural Information Processing Systems (Vol. 8, pp. 896-902). Cambridge, MA: MIT Press.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 896-902
-
-
Stone, P.1
Veloso, M.2
-
31
-
-
0032020927
-
A layered approach to learning client behaviors in the RoboCup soccer server
-
1998, to appear
-
Stone, P., & Veloso, M. (1996b). A layered approach to learning client behaviors in the RoboCup soccer server. Applied Artificial Intelligence (AAI), 1998, to appear.
-
(1996)
Applied Artificial Intelligence (AAI)
-
-
Stone, P.1
Veloso, M.2
-
32
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R.S. (1988). Learning to predict by the methods of temporal differences. Machine Learning, 3, 9-44.
-
(1988)
Machine Learning
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
33
-
-
85156221438
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
D.S. Touretzky, M.C. Mozer, & M.E. Hasselmo (Eds.), Cambridge, MA: MIT Press
-
Sutton, R.S. (1996). Generalization in reinforcement learning: Successful examples using sparse coarse coding. In D.S. Touretzky, M.C. Mozer, & M.E. Hasselmo (Eds.), Advances in Neural Information Processing Systems (Vol. 8, pp. 1038-1045). Cambridge, MA: MIT Press.
-
(1996)
Advances in Neural Information Processing Systems
, vol.8
, pp. 1038-1045
-
-
Sutton, R.S.1
-
35
-
-
0000985504
-
TD-Gammon, a self-teaching backgammon program, achieves master-level play
-
Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2), 215-219.
-
(1994)
Neural Computation
, vol.6
, Issue.2
, pp. 215-219
-
-
Tesauro, G.1
-
36
-
-
84890657801
-
Learning real team solutions
-
G. Weiss (Ed.), DAI Meets Machine Learning, Berlin: Springer-Verlag
-
Versino, C., & Gambardella, L.M. (1997). Learning real team solutions. In G. Weiss (Ed.), DAI Meets Machine Learning, volume 1221 of Lecture Notes in Artificial Intelligence (pp. 40-61). Berlin: Springer-Verlag.
-
(1997)
Lecture Notes in Artificial Intelligence
, vol.1221
, pp. 40-61
-
-
Versino, C.1
Gambardella, L.M.2
-
38
-
-
84949977009
-
Adaptation and learning in multi-agent systems: Some remarks and a bibliography
-
G. Weiss & S. Sen (Eds.), Adaptation and Learning in Multi-Agent Systems, Berlin Heidelberg: Springer-Verlag
-
Weiss, G. (1996). Adaptation and learning in multi-agent systems: Some remarks and a bibliography. In G. Weiss & S. Sen (Eds.), Adaptation and Learning in Multi-Agent Systems, volume 1042 of Lecture Notes in Artificial Intelligence (pp. 1-21). Berlin Heidelberg: Springer-Verlag.
-
(1996)
Lecture Notes in Artificial Intelligence
, vol.1042
, pp. 1-21
-
-
Weiss, G.1
-
39
-
-
0002278965
-
Adaptive switching circuits
-
New York: IRE. Reprinted in Anderson and Rosenfeld (1988)
-
Widrow, B., & Hoff, M.E. (1960). Adaptive switching circuits. 1960 IRE WESCON Convention Record (Vol. 4, pp. 96-104). New York: IRE. Reprinted in Anderson and Rosenfeld (1988).
-
(1960)
1960 IRE WESCON Convention Record
, vol.4
, pp. 96-104
-
-
Widrow, B.1
Hoff, M.E.2
-
40
-
-
0010888394
-
Solving POMDPs with Levin search and EIRA
-
L. Saitta (Ed.), San Francisco, CA: Morgan Kaufmann Publishers
-
Wiering, M.A., & Schmidhuber, J. (1996). Solving POMDPs with Levin search and EIRA. In L. Saitta (Ed.), Machine Learning: Proceedings of the Thirteenth International Conference (pp. 534-542). San Francisco, CA: Morgan Kaufmann Publishers.
-
(1996)
Machine Learning: Proceedings of the Thirteenth International Conference
, pp. 534-542
-
-
Wiering, M.A.1
Schmidhuber, J.2
-
41
-
-
2542471725
-
-
(Technical Report IDSIA-21-97), IDSIA, Lugano, Switzerland
-
Wiering, M.A., & Schmidhuber, J. (1997). Fast online Q(λ) (Technical Report IDSIA-21-97), IDSIA, Lugano, Switzerland.
-
(1997)
Fast Online Q(λ)
-
-
Wiering, M.A.1
Schmidhuber, J.2