-
1
-
-
0000396062
-
Natural gradient works efficiently in learning
-
S. Amari. Natural gradient works efficiently in learning. Neural Computation, 10:251-276, 1998.
-
(1998)
Neural Computation
, vol.10
, pp. 251-276
-
-
Amari, S.1
-
2
-
-
0003706925
-
-
ISA: The Instrumentation, Systems, and Automation Society
-
K. J. Åström and T. Hägglund. PID Controllers: Theory, Design, and Tuning. ISA: The Instrumentation, Systems, and Automation Society, 1995.
-
(1995)
PID Controllers: Theory, Design, and Tuning
-
-
Åström, K.J.1
Hägglund, T.2
-
3
-
-
0037211015
-
Fast calculation of stabilizing PID controllers
-
M. T. Söylemez, N. Munro, and H. Baki. Fast calculation of stabilizing PID controllers. Automatica, 39(1):121-126, 2003.
-
(2003)
Automatica
, vol.39
, Issue.1
, pp. 121-126
-
-
Söylemez, M.T.1
Munro, N.2
Baki, H.3
-
5
-
-
67149094917
-
A real-time 3-D musculoskeletal model for dynamic simulation of arm movements
-
E. K. Chadwick, D. Blana, A. J. van den Bogert, and R. F. Kirsch. A real-time 3-D musculoskeletal model for dynamic simulation of arm movements. IEEE Transactions on Biomedical Engineering, 56:941-948, 2009.
-
(2009)
IEEE Transactions on Biomedical Engineering
, vol.56
, pp. 941-948
-
-
Chadwick, E.K.1
Blana, D.2
van den Bogert, A.J.3
Kirsch, R.F.4
-
10
-
-
27644511603
-
Control of Markov chains with safety bounds
-
October
-
A. Arapostathis, R. Kumar, and S. P. Hsu. Control of Markov chains with safety bounds. IEEE Transactions on Automation Science and Engineering, 2:333-343, October 2005.
-
(2005)
IEEE Transactions on Automation Science and Engineering
, vol.2
, pp. 333-343
-
-
Arapostathis, A.1
Kumar, R.2
Hsu, S.P.3
-
11
-
-
84898984859
-
Control design for Markov chains under safety constraints: A convex approach
-
abs/1209.2883
-
E. Arvelo and N. C. Martins. Control design for Markov chains under safety constraints: A convex approach. CoRR, abs/1209.2883, 2012.
-
(2012)
CoRR
-
-
Arvelo, E.1
Martins, N.C.2
-
12
-
-
31144477417
-
Risk-sensitive reinforcement learning applied to control under constraints
-
P. Geibel and F. Wysotzki. Risk-sensitive reinforcement learning applied to control under constraints. Journal of Artificial Intelligence Research, 24:81-108, 2005.
-
(2005)
Journal of Artificial Intelligence Research
, vol.24
, pp. 81-108
-
-
Geibel, P.1
Wysotzki, F.2
-
14
-
-
70349984547
-
Natural actor-critic algorithms
-
S. Bhatnagar, R. S. Sutton, M. Ghavamzadeh, and M. Lee. Natural actor-critic algorithms. Automatica, 45(11):2471-2482, 2009.
-
(2009)
Automatica
, vol.45
, Issue.11
, pp. 2471-2482
-
-
Bhatnagar, S.1
Sutton, R.S.2
Ghavamzadeh, M.3
Lee, M.4
-
16
-
-
84892188881
-
Why natural gradient?
-
S. Amari and S. Douglas. Why natural gradient? In Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing, volume 2, pages 1213-1216, 1998.
-
(1998)
Proceedings of the 1998 IEEE International Conference on Acoustics, Speech, and Signal Processing
, pp. 1213-1216
-
-
Amari, S.1
Douglas, S.2
-
18
-
-
0037403111
-
Mirror descent and nonlinear projected subgradient methods for convex optimization
-
A. Beck and M. Teboulle. Mirror descent and nonlinear projected subgradient methods for convex optimization. Operations Research Letters, 2003.
-
(2003)
Operations Research Letters
-
-
Beck, A.1
Teboulle, M.2
-
24
-
-
84898939480
-
Policy gradient methods for reinforcement learning with function approximation
-
R. S. Sutton, D. McAllester, S. Singh, and Y. Mansour. Policy gradient methods for reinforcement learning with function approximation. In Advances in Neural Information Processing Systems 12, pages 1057-1063, 2000.
-
(2000)
Advances in Neural Information Processing Systems
, vol.12
, pp. 1057-1063
-
-
Sutton, R.S.1
McAllester, D.2
Singh, S.3
Mansour, Y.4
-
26
-
-
40649106649
-
Natural actor-critic
-
J. Peters and S. Schaal. Natural actor-critic. Neurocomputing, 71:1180-1190, 2008.
-
(2008)
Neurocomputing
, vol.71
, pp. 1180-1190
-
-
Peters, J.1
Schaal, S.2
-
30
-
-
67349216631
-
Combined feedforward and feedback control of a redundant, nonlinear, dynamic musculoskeletal system
-
D. Blana, R. F. Kirsch, and E. K. Chadwick. Combined feedforward and feedback control of a redundant, nonlinear, dynamic musculoskeletal system. Medical and Biological Engineering and Computing, 47:533-542, 2009.
-
(2009)
Medical and Biological Engineering and Computing
, vol.47
, pp. 533-542
-
-
Blana, D.1
Kirsch, R.F.2
Chadwick, E.K.3