-
2
-
-
0029679044
-
Reinforcement Learning: A Survey
-
L.P.Kaelbling, M.L.Littman, A.W.Moore, "Reinforcement Learning: A Survey," Journal of Artificial Intelligence Research, Vol.4, pp. 237-285, 1996.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
3
-
-
27944496814
-
State Space Partitioning and Clustering with Sensor Alignment for Autonomous Robots
-
Man and Cybernetics, pp
-
T.Hamagami, H.Hirata, "State Space Partitioning and Clustering with Sensor Alignment for Autonomous Robots," Proc of IEEE International Conference on Systems, Man and Cybernetics, pp.2655-2660, 2005.
-
(2005)
Proc of IEEE International Conference on Systems
, pp. 2655-2660
-
-
Hamagami, T.1
Hirata, H.2
-
4
-
-
15744378552
-
Development of Intelligent Wheelchair Acquiring Autonomous, Cooperative, and Collaborative Behavior
-
Man and Cybernetics, pp
-
T.Hamagami, H.Hirata, T.Hamagami, H.Hirata, "Development of Intelligent Wheelchair Acquiring Autonomous, Cooperative, and Collaborative Behavior," Proc of IEEE International Conference on Systems, Man and Cybernetics, pp.3235-3530, 2004.
-
(2004)
Proc of IEEE International Conference on Systems
, pp. 3235-3530
-
-
Hamagami, T.1
Hirata, H.2
Hamagami, T.3
Hirata, H.4
-
6
-
-
0031215211
-
-
M. Wiering and J. Schmidhuber, HQ-learning, Adaptive Behavior, 6.2, pp.219-246, 1998.
-
M. Wiering and J. Schmidhuber, "HQ-learning," Adaptive Behavior, vol. 6.2, pp.219-246, 1998.
-
-
-
-
7
-
-
1942452236
-
Learning predictive state representations
-
Satinder Singh, Michael L. Littman, Nicholas K. Jong, David Pardoe, and Peter Stone, "Learning predictive state representations," In Proceedings of the Twentieth International Conference on Machine Learning, pp. 712-719, 2003.
-
(2003)
Proceedings of the Twentieth International Conference on Machine Learning
, pp. 712-719
-
-
Singh, S.1
Littman, M.L.2
Jong, N.K.3
Pardoe, D.4
Stone, P.5
-
8
-
-
0036978717
-
-
Hamagami, T.; Koakutsu, S.; Hirata, H.,Reinforcement learning to compensate for perceptual aliasing using dynamic additional parameter: motivational value, Systems, Man and Cybernetics, 2002 IEEE International Conference on 2, pp. 1-6, 2002.
-
Hamagami, T.; Koakutsu, S.; Hirata, H.,"Reinforcement learning to compensate for perceptual aliasing using dynamic additional parameter: motivational value," Systems, Man and Cybernetics, 2002 IEEE International Conference on Volume. 2, pp. 1-6, 2002.
-
-
-
-
9
-
-
34548115736
-
-
Complex-Valued Neural Networks : Theories and Applications A. Hirose, ed., Series on Innovative Intelligence, World Scientific Publishing Co. Pte. Ltd., Singapore, Nov. 2003.
-
"Complex-Valued Neural Networks : Theories and Applications" A. Hirose, ed., Series on Innovative Intelligence, World Scientific Publishing Co. Pte. Ltd., Singapore, Nov. 2003.
-
-
-
-
10
-
-
0020970738
-
Neuronlike elements that can solve difficult learning control problems
-
A.G.Barto, R.S.Sutton, C.W.Anderson, "Neuronlike elements that can solve difficult learning control problems," IEEE Trans. on Systems, Man, and Cybernetics, 13, pp.835-846, 1983.
-
(1983)
IEEE Trans. on Systems, Man, and Cybernetics
, vol.13
, pp. 835-846
-
-
Barto, A.G.1
Sutton, R.S.2
Anderson, C.W.3
|