-
2
-
-
0029679044
-
Reinforcement learning: A survey
-
L. Kaelbling, M.L. Littman, & A.W. Moore, Reinforcement learning: A survey, Journal of Artificial Intelligence Research, 4, 1996, 237-285.
-
(1996)
Journal of Artificial Intelligence Research
, vol.4
, pp. 237-285
-
-
Kaelbling, L.1
Littman, M.L.2
Moore, A.W.3
-
3
-
-
33747997674
-
Variable resolution dynamic programming: Efficiently learning action maps in multivariate real-valued spaces
-
Evanston, IL
-
A.W. Moore, Variable resolution dynamic programming: Efficiently learning action maps in multivariate real-valued spaces, Proc. 8th Int. Conf. on Machine Learning, Evanston, IL, 1991, 333-337.
-
(1991)
Proc. 8th Int. Conf. on Machine Learning
, pp. 333-337
-
-
Moore, A.W.1
-
4
-
-
84880680664
-
Variable resolution discretization for high-accuracy solutions of optimal control problems
-
Stockholm, Sweden
-
R. Munos & A. Moore, Variable resolution discretization for high-accuracy solutions of optimal control problems, Proc. 16th Int. Joint Conf. on Artificial Intelligence, 2, Stockholm, Sweden, 1999, 1348-1355.
-
(1999)
Proc. 16th Int. Joint Conf. on Artificial Intelligence
, vol.2
, pp. 1348-1355
-
-
Munos, R.1
Moore, A.2
-
5
-
-
0002192119
-
Input generalization in delayed reinforcement learning: An algorithm and performance comparisons
-
Sydney, Australia
-
D. Chapman & L.P. Kaelbling, Input generalization in delayed reinforcement learning: An algorithm and performance comparisons, Proc. Int. Joint Conf. on Artificial Intelligence, Sydney, Australia, 1991, 726-731.
-
(1991)
Proc. Int. Joint Conf. on Artificial Intelligence
, pp. 726-731
-
-
Chapman, D.1
Kaelbling, L.P.2
-
6
-
-
0030380251
-
Action-based sensor space categorization for robot learning
-
Osaka, Japan
-
M. Asada, S. Noda, & K. Hosoda, Action-based sensor space categorization for robot learning, Proc. 1996 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems (IROS'96), 3, Osaka, Japan, 1996, 1502-1509.
-
(1996)
Proc. 1996 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems (IROS'96)
, vol.3
, pp. 1502-1509
-
-
Asada, M.1
Noda, S.2
Hosoda, K.3
-
7
-
-
0030395609
-
Simultaneous learning of situation classification based on rewards and behavior selection based on the situation
-
Osaka, Japan
-
A. Ueno, K. Hori, & S. Nakasuka, Simultaneous learning of situation classification based on rewards and behavior selection based on the situation, Proc. 1996 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems (IROS'96), 3, Osaka, Japan, 1996, 1510-1517.
-
(1996)
Proc. 1996 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems (IROS'96)
, vol.3
, pp. 1510-1517
-
-
Ueno, A.1
Hori, K.2
Nakasuka, S.3
-
8
-
-
0033346697
-
Autonomous action-mode change in a two-mobile robotics system: S-temperature based on-line learning
-
Kyongju, Korea
-
T. Sawada, S. Ichikawa, & F. Hara, Autonomous action-mode change in a two-mobile robotics system: S-temperature based on-line learning, Proc. 1999 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems (IROS'99), 1, Kyongju, Korea, 1999, 393-399.
-
(1999)
Proc. 1999 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems (IROS'99)
, vol.1
, pp. 393-399
-
-
Sawada, T.1
Ichikawa, S.2
Hara, F.3
-
9
-
-
84947709855
-
Adaptive state-space quantization for reinforcement learning of collision-free navigation
-
Raleigh, NC
-
B.J.A. Krose & J.W.M. van Dam, Adaptive state-space quantization for reinforcement learning of collision-free navigation, Proc. 1992 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems (IROS'92), 2, Raleigh, NC, 1992, 1327-1332.
-
(1992)
Proc. 1992 IEEE/RSJ Int. Conf. on Intelligent Robots and Systems (IROS'92)
, vol.2
, pp. 1327-1332
-
-
Krose, B.J.A.1
Van Dam, J.W.M.2
-
10
-
-
0008364145
-
Reinforcement learning using functional approximation for generalization and their application to cart centering and fractal compression
-
Stockholm, Sweden
-
C. Claussen, S. Gutta, & H. Wechsler, Reinforcement learning using functional approximation for generalization and their application to cart centering and fractal compression, Proc. 16th Int. Joint Conf. on Artificial Intelligence, 2, Stockholm, Sweden, 1999, 1362-1367.
-
(1999)
Proc. 16th Int. Joint Conf. on Artificial Intelligence
, vol.2
, pp. 1362-1367
-
-
Claussen, C.1
Gutta, S.2
Wechsler, H.3
-
11
-
-
0003505613
-
-
Report TKK-F-A601, Helsinki University of Technology, Espoo, Finland
-
T. Kohonen, Learning vector quantization for pattern recognition, Report TKK-F-A601, Helsinki University of Technology, Espoo, Finland, 1986.
-
(1986)
Learning Vector Quantization for Pattern Recognition
-
-
Kohonen, T.1
-
12
-
-
0004049893
-
-
doctoral diss., King's College, Cambridge, UK
-
C.J.C.H. Watkins, Learning from delayed rewards, doctoral diss., King's College, Cambridge, UK, 1989.
-
(1989)
Learning from Delayed Rewards
-
-
Watkins, C.J.C.H.1
-
13
-
-
0031341345
-
Neural reinforcement learning for behaviour synthesis
-
C. Touzet, Neural reinforcement learning for behaviour synthesis, Robotics and Autonomous Systems, 22(3-4), 1997, 251-282.
-
(1997)
Robotics and Autonomous Systems
, vol.22
, Issue.3-4
, pp. 251-282
-
-
Touzet, C.1
-
15
-
-
0003356379
-
VQQL: Applying vector quantization to reinforcement learning
-
Stockholm, Sweden: Springer Verlag
-
F. Fernández & D. Borrajo, VQQL: Applying vector quantization to reinforcement learning, RoboCup-99: Robot Soccer World Cup III (Stockholm, Sweden: Springer Verlag, 2000).
-
(2000)
RoboCup-99: Robot Soccer World Cup III
-
-
Fernández, F.1
Borrajo, D.2
-
17
-
-
0018918171
-
An algorithm for vector quantizer design
-
Com-28
-
Y. Linde, A. Buzo, & R.M. Gray, An algorithm for vector quantizer design, IEEE Trans. on Communications, 1 (1), Com-28, 1980, 84-95.
-
(1980)
IEEE Trans. on Communications
, vol.1
, Issue.1
, pp. 84-95
-
-
Linde, Y.1
Buzo, A.2
Gray, R.M.3
-
18
-
-
0033312347
-
A case study for life-long learning and adaptation in cooperative robot teams
-
Boston, MA
-
L.E. Parker, A case study for life-long learning and adaptation in cooperative robot teams, Proc. SPIE Sensor Fusion and Decentralized Control in Robotic Systems II, 3839, Boston, MA, 1999, 92-101.
-
(1999)
Proc. SPIE Sensor Fusion and Decentralized Control in Robotic Systems II
, vol.3839
, pp. 92-101
-
-
Parker, L.E.1
-
19
-
-
0001790234
-
Broadcast of local eligibility for multi-target observation
-
L.E. Parker, G. Bekey, & J. Barhen (Eds.), Tokyo, Japan: Springer
-
B.B. Werger & M. Matarić, Broadcast of local eligibility for multi-target observation, in L.E. Parker, G. Bekey, & J. Barhen (Eds.), Distributed Autonomous Robotic Systems, 4 (Tokyo, Japan: Springer, 2000), 347-356.
-
(2000)
Distributed Autonomous Robotic Systems
, vol.4
, pp. 347-356
-
-
Werger, B.B.1
Matarić, M.2
-
20
-
-
0008303956
-
Ultrafast neural network training for robot learning from uncertain data
-
Tokyo
-
J. Barhen & V. Protopopescu, Ultrafast neural network training for robot learning from uncertain data, Distributed Autonomous Robotic Systems, 4, Tokyo, 2000, 347-356.
-
(2000)
Distributed Autonomous Robotic Systems
, vol.4
, pp. 347-356
-
-
Barhen, J.1
Protopopescu, V.2
-
21
-
-
0001534236
-
Multi-robot learning in a cooperative observation task
-
L.E. Parker, G. Bekey, & J. Barhen (Eds.), Tokyo, Japan: Springer
-
L. Parker & C. Touzet, Multi-robot learning in a cooperative observation task, in L.E. Parker, G. Bekey, & J. Barhen (Eds.), Distributed Autonomous Robotic Systems, 4 (Tokyo, Japan: Springer, 2000), 391-401.
-
(2000)
Distributed Autonomous Robotic Systems
, vol.4
, pp. 391-401
-
-
Parker, L.1
Touzet, C.2
-
22
-
-
5644301833
-
Iterative VQQL for learning skills
-
Leganés, Madrid, Spain
-
F. Fernández & D. Borrajo, Iterative VQQL for learning skills, Proc. of Learning '00, Leganés, Madrid, Spain, 2000.
-
(2000)
Proc. of Learning '00
-
-
Fernández, F.1
Borrajo, D.2
-
24
-
-
0004491880
-
Robocup: The robot world cup initiative
-
Montreal, Canada
-
H. Kitano, M. Asada, Y. Kuniyoshi, I. Noda, & E. Osawa, Robocup: The robot world cup initiative, Proc. IJCAI-95 Workshop on Learning Robots, Montreal, Canada, 1995, 19-24.
-
(1995)
Proc. IJCAI-95 Workshop on Learning Robots
, pp. 19-24
-
-
Kitano, H.1
Asada, M.2
Kuniyoshi, Y.3
Noda, I.4
Osawa, E.5
-
25
-
-
0003356517
-
Soccer server: A simulator of robocup
-
Tokyo, Japan
-
I. Noda, Soccer server: A simulator of robocup, Proc. 4th Int. Symposium'95, Tokyo, Japan, 1995, 29-34.
-
(1995)
Proc. 4th Int. Symposium'95
, pp. 29-34
-
-
Noda, I.1
-
26
-
-
0004267735
-
-
Dordrecht: Kluwer
-
D. Aha, ed., Lazy learning (Dordrecht: Kluwer, 1997).
-
(1997)
Lazy Learning
-
-
Aha, D.1
-
27
-
-
0032000094
-
Multiple-prototype classifier design
-
J.C. Bezdek, T.R. Reichherzer, G.S. Lim, & Y. Attikiouzel, Multiple-prototype classifier design, IEEE Trans. on Systems, Man and Cybernetics, 28(1), 1998, 67-79.
-
(1998)
IEEE Trans. on Systems, Man and Cybernetics
, vol.28
, Issue.1
, pp. 67-79
-
-
Bezdek, J.C.1
Reichherzer, T.R.2
Lim, G.S.3
Attikiouzel, Y.4
-
28
-
-
0028748949
-
Growing cell structures: A self-organizing network for unsupervised and supervised learning
-
B. Fritzke, Growing cell structures: A self-organizing network for unsupervised and supervised learning, Neural Networks, 7(9), 1994, 1441-1460.
-
(1994)
Neural Networks
, vol.7
, Issue.9
, pp. 1441-1460
-
-
Fritzke, B.1