-
2
-
-
84966275544
-
Minimization of functions having Lipschitz continuous first-partial derivatives
-
Armijo L. Minimization of functions having Lipschitz continuous first-partial derivatives. Pacific Journal of Mathematics 16 (1966) 1-3
-
(1966)
Pacific Journal of Mathematics
, vol.16
, pp. 1-3
-
-
Armijo, L.1
-
3
-
-
0028584964
-
-
S.J. Bradtke, B.E. Ydstie, A.G. Barto, Adaptive linear quadratic control using policy iteration, in: American Control Conference, 1994, pp. 3475-3479
-
-
-
-
4
-
-
0023736550
-
-
R.W. Brockett, On the computer control of movement, in: Proceedings of the 1988 IEEE Conference on Robotics and Automation, April 1988, New York, pp. 534-540
-
-
-
-
5
-
-
33744738305
-
-
L. Crawford, S.S. Sastry, Learning controllers for complex behavioral systems, in: Neural Information Processing Systems Tenth Annual Conference, NIPS 96, 1996
-
-
-
-
6
-
-
24344502534
-
On the specification complexity of linguistic control procedures
-
Egerstedt M. On the specification complexity of linguistic control procedures. International Journal of Hybrid Systems 2 1-2 (2002) 129-140
-
(2002)
International Journal of Hybrid Systems
, vol.2
, Issue.1-2
, pp. 129-140
-
-
Egerstedt, M.1
-
7
-
-
0036994152
-
-
M. Egerstedt, D. Hristu-Varsakelis, Observability and policy optimization for mobile robots, in: IEEE Conference on Decision and Control, Las Vegas, NV, December 2002
-
-
-
-
8
-
-
0037291277
-
Feedback can reduce the specification complexity of motor programs
-
Egerstedt M., and Brockett R.W. Feedback can reduce the specification complexity of motor programs. IEEE Transactions on Automatic Control 48 2 (2003) 213-223
-
(2003)
IEEE Transactions on Automatic Control
, vol.48
, Issue.2
, pp. 213-223
-
-
Egerstedt, M.1
Brockett, R.W.2
-
9
-
-
33744753814
-
-
T.T. Georgiou, Relative entropy and the multi-variable multi-dimesional moment problem, IEEE Transaction on IT CLN 04-326. Revised December 2004 (submitted for publication)
-
-
-
-
10
-
-
85128575686
-
-
D. Hristu-Varsakelis, M. Egerstedt, P.S. Krishnaprasad, On the structural complexity of the motion description language MDLe, in: IEEE Conference on Decision and Control, December 2003
-
-
-
-
11
-
-
0000439891
-
On the convergence of stochastic iterative dynamic programming algorithms
-
Jaakkola T., Jordan M.I., and Singh S.P. On the convergence of stochastic iterative dynamic programming algorithms. Neural Computation 6 6 (1994)
-
(1994)
Neural Computation
, vol.6
, Issue.6
-
-
Jaakkola, T.1
Jordan, M.I.2
Singh, S.P.3
-
13
-
-
0033100636
-
Controllers for reachability specifications for hybrid systems
-
Lygeros J., Tomlin C., and Sastry S. Controllers for reachability specifications for hybrid systems. Automatica 35 3 (1999) 349-370
-
(1999)
Automatica
, vol.35
, Issue.3
, pp. 349-370
-
-
Lygeros, J.1
Tomlin, C.2
Sastry, S.3
-
14
-
-
0034546446
-
-
K. Morgansen, R.W. Brockett, Optimal regulation and reinforcement learning for the nonholonomic integrator, in: Proceedings of the American Control Conference, June 2000, pp. 462-466
-
-
-
-
15
-
-
0242366144
-
Bisimilar linear systems
-
Pappas G.J. Bisimilar linear systems. Automatica 39 12 (2003) 2035-2047
-
(2003)
Automatica
, vol.39
, Issue.12
, pp. 2035-2047
-
-
Pappas, G.J.1
-
17
-
-
0000723997
-
Generalization in reinforcement learning: Successful examples using sparse coarse coding
-
Sutton R.S. Generalization in reinforcement learning: Successful examples using sparse coarse coding. Neural Information Processing Systems 8 (1996)
-
(1996)
Neural Information Processing Systems
, vol.8
-
-
Sutton, R.S.1
-
18
-
-
0004102479
-
-
MIT Press, Cambridge, MA
-
Sutton R.S., and Barto A.G. Reinforcement Learning, An Introduction (1998), MIT Press, Cambridge, MA
-
(1998)
Reinforcement Learning, An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
19
-
-
33744777857
-
-
R.C. Thompson, Lecture 10 : Part I - Convergence domains of the Campbell Baker Hausdorff formula, in: John Hopkins Lecture Notes, 1988
-
-
-
-
20
-
-
0028497630
-
Asynchronous stochastic approximation and Q-learning
-
Tsitsiklis J.N. Asynchronous stochastic approximation and Q-learning. Machine Learning 16 3 (1994)
-
(1994)
Machine Learning
, vol.16
, Issue.3
-
-
Tsitsiklis, J.N.1
|