-
2
-
-
0003602259
-
Learning and sequential decision making
-
Barto, A.G., Sutton, R.S. and Watkins, C.J.C.H., 1989, Learning and Sequential Decision Making, COINS Technical Report.
-
(1989)
COINS Technical Report
-
-
Barto, A.G.1
Sutton, R.S.2
Watkins, C.J.C.H.3
-
4
-
-
0001373628
-
Random neural networks with negative and positive signals and product form solution
-
Gelenbe, E., 1989, Random neural networks with negative and positive signals and product form solution. Neural Comput. 1(4), 502-510.
-
(1989)
Neural Comput.
, vol.1
, Issue.4
, pp. 502-510
-
-
Gelenbe, E.1
-
5
-
-
0000145931
-
Stability of the random neural network model
-
Gelenbe, E., 1990, Stability of the random neural network model. Neural Comput. 2(2), 239-247.
-
(1990)
Neural Comput.
, vol.2
, Issue.2
, pp. 239-247
-
-
Gelenbe, E.1
-
6
-
-
0000428263
-
Learning in the recurrent random neural network
-
Gelenbe, E., 1993, Learning in the recurrent random neural network. Neural Computation 5(1), 154-164.
-
(1993)
Neural Computation
, vol.5
, Issue.1
, pp. 154-164
-
-
Gelenbe, E.1
-
7
-
-
0342419711
-
Neural networks in mazes
-
Beijing, China, November
-
Halici, U. and Yaranli, U., 1992, Neural Networks in Mazes, in: Proc. of IEEE-INNS International Joint Conference on Neural Networks. Beijing, China, November, Vol-II, 711-716.
-
(1992)
Proc. of IEEE-INNS International Joint Conference on Neural Networks
, vol.2
, pp. 711-716
-
-
Halici, U.1
Yaranli, U.2
-
9
-
-
85194580887
-
Simulation results for Connectionist Maze Learning
-
Dept. of Electrical Engineering, METU
-
Madenoglu, A., 1994, Simulation results for Connectionist Maze Learning, B.Sc. Project Report, Dept. of Electrical Engineering, METU.
-
(1994)
B.Sc. Project Report
-
-
Madenoglu, A.1
-
12
-
-
85194529880
-
A general framework for reinforcement learning
-
Paris
-
Szepesvary, C., 1995, A general framework for reinforcement learning, in: Proc. of International Conference on Artificial Neural Networks, Paris, II, 165-170.
-
(1995)
Proc. of International Conference on Artificial Neural Networks, Paris
, vol.2
, pp. 165-170
-
-
Szepesvary, C.1
-
14
-
-
33847202724
-
Learning to predict by the methods of temporal differences
-
Sutton, R.S., 1988, Learning to predict by the methods of temporal differences. Machine Learn. 3, 9-44.
-
(1988)
Machine Learn.
, vol.3
, pp. 9-44
-
-
Sutton, R.S.1
-
16
-
-
34249833101
-
Technical Note: Q-learning
-
Watkins, C. and Dayan, P., 1992, Technical Note: Q-learning. Machine Learn. 8, 55-68.
-
(1992)
Machine Learn.
, vol.8
, pp. 55-68
-
-
Watkins, C.1
Dayan, P.2
-
17
-
-
85194560129
-
Indirect adaptive explorations in entropy-based reinforcement learning
-
Zhang, P. and Canu, S., 1995, Indirect Adaptive Explorations in Entropy-based Reinforcement Learning, in: Proc. of International Conference on Artificial Neural Networks, Paris, II, 171-176.
-
(1995)
Proc. of International Conference on Artificial Neural Networks, Paris
, vol.2
, pp. 171-176
-
-
Zhang, P.1
Canu, S.2