-
1
-
-
74049127928
-
Fitted Q-iteration in continuous action-space MDPs
-
Platt J.C., Koller D., Singer Y., and Roweis S.T. (Eds), MIT Press
-
Antos A., Munos R., and Szepesvári Cs. Fitted Q-iteration in continuous action-space MDPs. In: Platt J.C., Koller D., Singer Y., and Roweis S.T. (Eds). Advances in neural information processing systems: Vol. 20 (2008), MIT Press 9-16
-
(2008)
Advances in neural information processing systems: Vol. 20
, pp. 9-16
-
-
Antos, A.1
Munos, R.2
Szepesvári, Cs.3
-
2
-
-
0041877717
-
A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
-
Berenji H.R., and Vengerov D. A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters. IEEE Transactions on Fuzzy Systems 11 4 (2003) 478-485
-
(2003)
IEEE Transactions on Fuzzy Systems
, vol.11
, Issue.4
, pp. 478-485
-
-
Berenji, H.R.1
Vengerov, D.2
-
6
-
-
34547223380
-
-
Buşoniu, L., De Schutter, B., & Babuška, R. (2006). Decentralized reinforcement learning control of a robotic manipulator. In Proceedings 9th international conference of control, automation, robotics, and vision, ICARCV-06 (pp. 1347-1352). Singapore, 5-8 December.
-
Buşoniu, L., De Schutter, B., & Babuška, R. (2006). Decentralized reinforcement learning control of a robotic manipulator. In Proceedings 9th international conference of control, automation, robotics, and vision, ICARCV-06 (pp. 1347-1352). Singapore, 5-8 December.
-
-
-
-
7
-
-
50249166041
-
-
Buşoniu, L., Ernst, D., De Schutter, B., & Babuška, R. (2007). Fuzzy approximation for convergent model-based reinforcement learning. In Proceedings 2007 IEEE international conference on fuzzy systems, FUZZ-IEEE-07 (pp. 968-973). London, UK, 23-26 July.
-
Buşoniu, L., Ernst, D., De Schutter, B., & Babuška, R. (2007). Fuzzy approximation for convergent model-based reinforcement learning. In Proceedings 2007 IEEE international conference on fuzzy systems, FUZZ-IEEE-07 (pp. 968-973). London, UK, 23-26 July.
-
-
-
-
8
-
-
55249099118
-
-
Buşoniu, L., Ernst, D., De Schutter, B., & Babuška, R. (2008a). Consistency of fuzzy model-based reinforcement learning. In Proceedings 2008 IEEE international conference on fuzzy systems, FUZZ-IEEE-08 (pp. 518-524). Hong Kong, 1-6 June.
-
Buşoniu, L., Ernst, D., De Schutter, B., & Babuška, R. (2008a). Consistency of fuzzy model-based reinforcement learning. In Proceedings 2008 IEEE international conference on fuzzy systems, FUZZ-IEEE-08 (pp. 518-524). Hong Kong, 1-6 June.
-
-
-
-
9
-
-
49949101369
-
Continuous-state reinforcement learning with fuzzy approximation
-
Adaptive agents and multi-agent systems III. Tuyls I.K., Nowé A., Guessoum Z., and Kudenko D. (Eds), Springer
-
Buşoniu L., Ernst D., De Schutter B., and Babuška R. Continuous-state reinforcement learning with fuzzy approximation. In: Tuyls I.K., Nowé A., Guessoum Z., and Kudenko D. (Eds). Adaptive agents and multi-agent systems III. Lecture notes in computer science Vol. 4865 (2008), Springer 27-43
-
(2008)
Lecture notes in computer science
, vol.4865
, pp. 27-43
-
-
Buşoniu, L.1
Ernst, D.2
De Schutter, B.3
Babuška, R.4
-
10
-
-
0026206780
-
An optimal one-way multigrid algorithm for discrete-time stochastic control
-
Chow C.-S., and Tsitsiklis J.N. An optimal one-way multigrid algorithm for discrete-time stochastic control. IEEE Transactions on Automatic Control 36 8 (1991) 898-914
-
(1991)
IEEE Transactions on Automatic Control
, vol.36
, Issue.8
, pp. 898-914
-
-
Chow, C.-S.1
Tsitsiklis, J.N.2
-
12
-
-
70449644892
-
-
Farahmand, A. M., Ghavamzadeh, M., Szepesvári, Cs., & Mannor, S. (2009). Regularized fitted Q-iteration for planning in continuous-space Markovian decision problems. In Proceedings 2009 American control conference, ACC-09(pp. 725-730). St. Louis, US, 10-12 June.
-
Farahmand, A. M., Ghavamzadeh, M., Szepesvári, Cs., & Mannor, S. (2009). Regularized fitted Q-iteration for planning in continuous-space Markovian decision problems. In Proceedings 2009 American control conference, ACC-09(pp. 725-730). St. Louis, US, 10-12 June.
-
-
-
-
14
-
-
77950858524
-
-
Gordon, G. (1995). Stable function approximation in dynamic programming. In Proceedings 12th international conference on machine learning, ICML-95(pp. 261-268). Tahoe City, US, 9-12 July.
-
Gordon, G. (1995). Stable function approximation in dynamic programming. In Proceedings 12th international conference on machine learning, ICML-95(pp. 261-268). Tahoe City, US, 9-12 July.
-
-
-
-
15
-
-
0030377615
-
-
Horiuchi, T., Fujino, A., Katai, O., & Sawaragi, T. (1996). Fuzzy interpolation-based Q-learning with continuous states and actions. In Proceedings 5th IEEE international conference on fuzzy systems, FUZZ-IEEE-96 (pp. 594-600). New Orleans, US, 8-11 September.
-
Horiuchi, T., Fujino, A., Katai, O., & Sawaragi, T. (1996). Fuzzy interpolation-based Q-learning with continuous states and actions. In Proceedings 5th IEEE international conference on fuzzy systems, FUZZ-IEEE-96 (pp. 594-600). New Orleans, US, 8-11 September.
-
-
-
-
20
-
-
0041876271
-
A reinforcement learning adaptive fuzzy controller for robots
-
Lin C.-K. A reinforcement learning adaptive fuzzy controller for robots. Fuzzy Sets and Systems 137 3 (2003) 339-352
-
(2003)
Fuzzy Sets and Systems
, vol.137
, Issue.3
, pp. 339-352
-
-
Lin, C.-K.1
-
21
-
-
0036832953
-
Variable-resolution discretization in optimal control
-
Munos R., and Moore A. Variable-resolution discretization in optimal control. Machine Learning 49 2-3 (2002) 291-323
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 291-323
-
-
Munos, R.1
Moore, A.2
-
23
-
-
0008872081
-
Analysis of a numerical dynamic programming algorithm applied to economic models
-
Santos M.S., and Vigo-Aguiar J. Analysis of a numerical dynamic programming algorithm applied to economic models. Econometrica 66 2 (1998) 409-426
-
(1998)
Econometrica
, vol.66
, Issue.2
, pp. 409-426
-
-
Santos, M.S.1
Vigo-Aguiar, J.2
-
25
-
-
14344263882
-
-
Szepesvári, Cs., & Smart, W. D. (2004). Interpolation-based Q-learning. In Proceedings 21st international conference on machine learning, ICML-04(pp. 791-798). Bannf, Canada, 4-8 July.
-
Szepesvári, Cs., & Smart, W. D. (2004). Interpolation-based Q-learning. In Proceedings 21st international conference on machine learning, ICML-04(pp. 791-798). Bannf, Canada, 4-8 July.
-
-
-
-
26
-
-
0029752470
-
Feature-based methods for large scale dynamic programming
-
Tsitsiklis J.N., and Van Roy B. Feature-based methods for large scale dynamic programming. Machine Learning 22 1-3 (1996) 59-94
-
(1996)
Machine Learning
, vol.22
, Issue.1-3
, pp. 59-94
-
-
Tsitsiklis, J.N.1
Van Roy, B.2
|