-
1
-
-
85153940465
-
Generalization in reinforcement learning: Safely approximating the value function
-
J. Tesauro, and D. Touretzky, and T. Leen, (eds.) MIT Press, Cambridge, MA
-
J. Boyan and A. Moore, (1995). Generalization in reinforcement learning: safely approximating the value function. in: J. Tesauro, and D. Touretzky, and T. Leen, (eds.) Neural Information Processing Systems, 369-376, MIT Press, Cambridge, MA.
-
(1995)
Neural Information Processing Systems
, pp. 369-376
-
-
Boyan, J.1
Moore, A.2
-
2
-
-
0004291566
-
-
Wadsworth, Belmont, CA
-
L. Breiman, L. Friedman, and P. Stone, (1984). Classification and Regression. Wadsworth, Belmont, CA.
-
(1984)
Classification and Regression
-
-
Breiman, L.1
Friedman, L.2
Stone, P.3
-
3
-
-
0001940458
-
Adaptive mixtures of local experts
-
R. Jacobs, M. Jordan, S. Nowlan, and G. Hinton, (1991). Adaptive mixtures of local experts. Neural Computation. 3, 79-87.
-
(1991)
Neural Computation.
, vol.3
, pp. 79-87
-
-
Jacobs, R.1
Jordan, M.2
Nowlan, S.3
Hinton, G.4
-
4
-
-
0000262562
-
Hierarchical mixtures of experts and the em algorithm
-
M. Jordan and R. Jacobs, (1994). Hierarchical mixtures of experts and the EM algorithm. Neural Computation. 6, 181-214.
-
(1994)
Neural Computation.
, vol.6
, pp. 181-214
-
-
Jordan, M.1
Jacobs, R.2
-
6
-
-
0000123778
-
Self-improving reactive agents based on reinforcement learning
-
L. Lin, (1992). Self-improving reactive agents based on reinforcement learning, planning, and teaching. Machine Learning. Vol.8, pp.293-321.
-
(1992)
Planning, and Teaching. Machine Learning
, vol.8
, pp. 293-321
-
-
Lin, L.1
-
9
-
-
21144465016
-
On variable binding in connectionist networks
-
1992
-
R. Sun, (1992). On variable binding in connectionist networks. Connection Science, Vol.4, No.2, pp.93-124. 1992.
-
(1992)
Connection Science
, vol.4
, Issue.2
, pp. 93-124
-
-
Sun, R.1
-
10
-
-
0031258480
-
Learning, action, and consciousness: A hybrid approach towards modeling consciousness
-
R. Sun, (1997). Learning, action, and consciousness: a hybrid approach towards modeling consciousness. Neural Networks, 10 (7), pp.1317-1331
-
(1997)
Neural Networks
, vol.10
, Issue.7
, pp. 1317-1331
-
-
Sun, R.1
-
11
-
-
0030675538
-
A hybrid model for learning sequential navigation. Proc. of
-
IEEE Press, Piscateway, NJ
-
R. Sun and T. Peterson, (1997). A hybrid model for learning sequential navigation. Proc. of IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA'97). Monterey, CA. pp.234-239. IEEE Press, Piscateway, NJ.
-
(1997)
IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA'97). Monterey, CA.
, pp. 234-239
-
-
Sun, R.1
Peterson, T.2
-
12
-
-
0032207548
-
Autonomous learning of sequential tasks: Experiments and analyses
-
R. Sun and T. Peterson, (1998). Autonomous learning of sequential tasks: experiments and analyses. IEEE Transactions on Neural Networks, Vol.9, No.6, pp.12171234.
-
(1998)
IEEE Transactions on Neural Networks
, vol.9
, Issue.6
, pp. 12171234
-
-
Sun, R.1
Peterson, T.2
-
13
-
-
0032772352
-
Multi-agent reinforcement learning: Weighting and partitioning
-
R. Sun and T. Peterson, (1999). Multi-agent reinforcement learning: weighting and partitioning. Neural Networks, Vol.12 No.4-5. pp.127-153.
-
(1999)
Neural Networks
, vol.12
, Issue.4-5
, pp. 127-153
-
-
Sun, R.1
Peterson, T.2
-
14
-
-
0033165135
-
A hybrid architecture for situated learning of reactive sequential decision making
-
in press
-
R. Sun, T. Peterson, and E. Merrill, (1999). A hybrid architecture for situated learning of reactive sequential decision making. Applied Intelligence, in press.
-
(1999)
Applied Intelligence
-
-
Sun, R.1
Peterson, T.2
Merrill, E.3
-
15
-
-
84900180782
-
Extracting plans from reinforcement learners
-
eds. L. Xu, L. Chan, I. King, and A. Fu. Springer Verlag, Heidelberg
-
R. Sun and C. Sessions, (1998a). Extracting plans from reinforcement learners. Proceedings of the 1998 International Symposium on Intelligent Data Engineering and Learning (IDEAL'98). pp.243-248. eds. L. Xu, L. Chan, I. King, and A. Fu. Springer-Verlag, Heidelberg.
-
(1998)
Proceedings of the 1998 International Symposium on Intelligent Data Engineering and Learning (IDEAL'98).
, pp. 243-248
-
-
Sun, R.1
Sessions, C.2
-
18
-
-
85132026293
-
Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
-
Morgan Kaufmann, San Meteo, CA
-
R. Sutton, (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proc. of Seventh International Conference on Machine Learning. Morgan Kaufmann, San Meteo, CA.
-
(1990)
Proc. of Seventh International Conference on Machine Learning
-
-
Sutton, R.1
-
19
-
-
0001046225
-
Practical issues in temporal difference learning
-
T. Tesauro, (1992). Practical issues in temporal difference learning. Machine Learning. Vol.8, 257-277.
-
(1992)
Machine Learning
, vol.8
, pp. 257-277
-
-
Tesauro, T.1
-
20
-
-
0027678679
-
Extracting refined rules from Knowledge-Based Neural Networks
-
G. Towell and J. Shavlik, (1993). Extracting refined rules from Knowledge-Based Neural Networks, Machine Learning. 13 (1), 71-101.
-
(1993)
Machine Learning.
, vol.13
, Issue.1
, pp. 71-101
-
-
Towell, G.1
Shavlik, J.2
-
21
-
-
0004049895
-
-
Ph.D Thesis, Cambridge University, Cambridge, UK
-
C. Watkins, (1989). Learning with Delayed Rewards. Ph.D Thesis, Cambridge University, Cambridge, UK.
-
(1989)
Learning with Delayed Rewards
-
-
Watkins, C.1
-
22
-
-
85158158334
-
A complexity analysis of cooperative mechanisms in reinforcement learning
-
Morgan Kaufmann, San Francisco, CA
-
S. Whitehead, (1993). A complexity analysis of cooperative mechanisms in reinforcement learning. Proc. of the National Conference on Artificial Intelligence (AAAI'93), 607-613. Morgan Kaufmann, San Francisco, CA.
-
(1993)
Proc. of the National Conference on Artificial Intelligence (AAAI'93
, pp. 607-613
-
-
Whitehead, S.1
|