-
1
-
-
0004049893
-
-
Ph.D dissertation, Cambridge University, Cambridge, England
-
Watkins, C. J. C. H., Learning from Delayed Rewards. Ph.D dissertation, Cambridge University, Cambridge, England, 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.J.C.H.1
-
2
-
-
0004102479
-
-
MIT Press, Cambridge, MA
-
Sutton, R. S. and Barto, A. G., Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
-
(1998)
Reinforcement Learning: An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
3
-
-
0025529853
-
Advances in reinforcement learning and their implications for intelligent control
-
Whitehead, S. D., Sutton, R. S., and Ballard, D. H., Advances in reinforcement learning and their implications for intelligent control. Proceedings of IEEE International Symposium on Intelligent Control, 1990, pp. 1289-1297.
-
(1990)
Proceedings of IEEE International Symposium on Intelligent Control
, pp. 1289-1297
-
-
Whitehead, S.D.1
Sutton, R.S.2
Ballard, D.H.3
-
4
-
-
0029276036
-
Temporal difference learning and TD-Gammon
-
Tesauro, G. J., Temporal difference learning and TD-Gammon. Commun. ACM 38, 58-68 (1995).
-
(1995)
Commun. ACM
, vol.38
, pp. 58-68
-
-
Tesauro, G.J.1
-
5
-
-
0033347508
-
A dynamic channel assignment policy through Q-learning
-
Nie, J. and Haykin, S., A dynamic channel assignment policy through Q-learning. IEEE Trans. Neural Netw. 10, 1443-1455 (1999).
-
(1999)
IEEE Trans. Neural Netw.
, vol.10
, pp. 1443-1455
-
-
Nie, J.1
Haykin, S.2
-
6
-
-
0029277469
-
A sensor-based navigation for a mobile robot using fuzzy logic and reinforcement learning
-
Beom, H. R. and Cho, H. S., A sensor-based navigation for a mobile robot using fuzzy logic and reinforcement learning. IEEE Trans. Syst. Man Cybern. 25, 464-477 (1995).
-
(1995)
IEEE Trans. Syst. Man Cybern.
, vol.25
, pp. 464-477
-
-
Beom, H.R.1
Cho, H.S.2
-
7
-
-
0032289291
-
Dynamical categories and control policy selection
-
Coelho, J. A., Araujo, E. G., Huber, M., and Grupen, R. A., Dynamical categories and control policy selection. Proceedings of IEEE International Symposium on Intelligent Control, 1998, pp. 459-464.
-
(1998)
Proceedings of IEEE International Symposium on Intelligent Control
, pp. 459-464
-
-
Coelho, J.A.1
Araujo, E.G.2
Huber, M.3
Grupen, R.A.4
-
8
-
-
0016873783
-
The apparent conflict between estimation and control - A survey of the two-armed problem
-
Wirten, I. H., The apparent conflict between estimation and control - A survey of the two-armed problem. J. Franklin Inst. 301, 161-189 (1976).
-
(1976)
J. Franklin Inst.
, vol.301
, pp. 161-189
-
-
Wirten, I.H.1
-
10
-
-
0003487482
-
-
Athena Scientific, Belmont, MA
-
Bertsekas, D. P. and Tsitsiklis, J. N., Neural Dynamic Programming. Athena Scientific, Belmont, MA, 1996.
-
(1996)
Neural Dynamic Programming
-
-
Bertsekas, D.P.1
Tsitsiklis, J.N.2
-
12
-
-
2142764562
-
-
Sutton, R. S., editor, A Special Issue of Machine Learning on Reinforcement Learning, Volume 8. Machine Learning, 1992, Also published as Reinforcement Learning, Kluwer Academic Press, Boston, MA, 1992.
-
(1992)
A Special Issue of Machine Learning on Reinforcement Learning, Volume 8. Machine Learning
, vol.8
-
-
Sutton, R.S.1
-
13
-
-
0004007508
-
-
Kluwer Academic Press, Boston, MA
-
Sutton, R. S., editor, A Special Issue of Machine Learning on Reinforcement Learning, Volume 8. Machine Learning, 1992, Also published as Reinforcement Learning, Kluwer Academic Press, Boston, MA, 1992.
-
(1992)
Reinforcement Learning
-
-
-
16
-
-
0034449143
-
Fuzzy landmark-based localization for a legged robot
-
Buschka, P., Saffiotti, A., and Wasik, Z., Fuzzy landmark-based localization for a legged robot. Proceedings of Intelligent Robots and Systems Conference, 2000, pp. 1205-1210.
-
(2000)
Proceedings of Intelligent Robots and Systems Conference
, pp. 1205-1210
-
-
Buschka, P.1
Saffiotti, A.2
Wasik, Z.3
-
17
-
-
0033279889
-
Reactive navigation in dynamic environment using a multisensor predictor
-
Song, K. T. and Chang, C. C., Reactive navigation in dynamic environment using a multisensor predictor. IEEE Trans. Syst. Man Cybern. 29, 870-880 (1999).
-
(1999)
IEEE Trans. Syst. Man Cybern.
, vol.29
, pp. 870-880
-
-
Song, K.T.1
Chang, C.C.2
-
18
-
-
0032287655
-
A neuro-fuzzy controller for mobile robot navigation and multirobot convoying
-
Ng, K. C. and Trivedi, M. M., A neuro-fuzzy controller for mobile robot navigation and multirobot convoying. IEEE Trans. Syst. Man Cybern. 28, 829-840 (1998).
-
(1998)
IEEE Trans. Syst. Man Cybern.
, vol.28
, pp. 829-840
-
-
Ng, K.C.1
Trivedi, M.M.2
-
19
-
-
0003584577
-
-
Prentice Hall, Upper Saddle River, NJ
-
Russell, S. and Norvig, P., Artificial Intelligence: A Modern Approach. Prentice Hall, Upper Saddle River, NJ, 1995.
-
(1995)
Artificial Intelligence: A Modern Approach
-
-
Russell, S.1
Norvig, P.2
-
21
-
-
0033692820
-
Active multimodel control for dynamic maneuver optimization in unmanned air vehicles
-
Godbole, D., Samad, T., and Gopal, V., Active multimodel control for dynamic maneuver optimization in unmanned air vehicles. Proceedings of IEEE International Conference on Robotics and Automation, 2000, pp. 1257-1262.
-
(2000)
Proceedings of IEEE International Conference on Robotics and Automation
, pp. 1257-1262
-
-
Godbole, D.1
Samad, T.2
Gopal, V.3
-
22
-
-
85012688561
-
-
Princeton University Press, Princeton, NJ
-
Bellman, R. E., Dynamic Programming. Princeton University Press, Princeton, NJ, 1957.
-
(1957)
Dynamic Programming
-
-
Bellman, R.E.1
|