-
4
-
-
84904099309
-
GridLAB-D: An agent-based simulation framework for smart grids
-
Chassin, D. P., Fuller, J. C., & Djilali, N. (2014). GridLAB-D: An agent-based simulation framework for smart grids. Journal of Applied Mathematics, 2014.
-
(2014)
Journal of Applied Mathematics
, vol.2014
-
-
Chassin, D.P.1
Fuller, J.C.2
Djilali, N.3
-
7
-
-
34147120474
-
A note on two problems in connexion with graphs
-
Dijkstra, E. (1959). A note on two problems in connexion with graphs. Numerische Mathematik, 1 (1), 269-271.
-
(1959)
Numerische Mathematik
, vol.1
, Issue.1
, pp. 269-271
-
-
Dijkstra, E.1
-
8
-
-
58449110583
-
Regularized fitted Q-iteration: Application to planning
-
Springer
-
Farahmand, A., Ghavamzadeh, M., Szepesvári, C., & Mannor, S. (2008). Regularized fitted Q-iteration: Application to planning. In Recent Advances in Reinforcement Learning, pp. 55-68. Springer.
-
(2008)
Recent Advances in Reinforcement Learning
, pp. 55-68
-
-
Farahmand, A.1
Ghavamzadeh, M.2
Szepesvári, C.3
Mannor, S.4
-
11
-
-
84899829959
-
A formal basis for the heuristic determination of minimum cost paths
-
Hart, P., Nilsson, N., & Raphael, B. (1968). A formal basis for the heuristic determination of minimum cost paths. Systems Science and Cybernetics, IEEE Transactions on, 4 (2), 100-107.
-
(1968)
Systems Science and Cybernetics, IEEE Transactions on
, vol.4
, Issue.2
, pp. 100-107
-
-
Hart, P.1
Nilsson, N.2
Raphael, B.3
-
12
-
-
79956361567
-
Efficient planning under uncertainty with macroactions
-
He, R., Brunskill, E., & Roy, N. (2011). Efficient planning under uncertainty with macroactions. Journal of Artificial Intelligence Research, 40, 523-570.
-
(2011)
Journal of Artificial Intelligence Research
, vol.40
, pp. 523-570
-
-
He, R.1
Brunskill, E.2
Roy, N.3
-
13
-
-
0002956570
-
SPUDD: Stochastic Planning Using Decision Diagrams
-
Hoey, J., St-Aubin, R., Hu, A. J., & Boutilier, C. (1999). SPUDD: Stochastic Planning Using Decision Diagrams. In Proceedings of Uncertainty in Artificial Intelligence, Stockholm, Sweden.
-
(1999)
Proceedings of Uncertainty in Artificial Intelligence, Stockholm, Sweden
-
-
Hoey, J.1
St-Aubin, R.2
Hu, A.J.3
Boutilier, C.4
-
14
-
-
0000148778
-
A heuristic approach to the discovery of macro-operators
-
Iba, G. A. (1989). A heuristic approach to the discovery of macro-operators. Machine Learning, 3, 285-317.
-
(1989)
Machine Learning
, vol.3
, pp. 285-317
-
-
Iba, G.A.1
-
16
-
-
0036832951
-
A sparse sampling algorithm for near-optimal planning in large Markov decision processes
-
Kearns, M., Mansour, Y., & Ng, A. Y. (2002). A sparse sampling algorithm for near-optimal planning in large Markov decision processes. Machine Learning, 49 (2-3), 193-208.
-
(2002)
Machine Learning
, vol.49
, Issue.2-3
, pp. 193-208
-
-
Kearns, M.1
Mansour, Y.2
Ng, A.Y.3
-
18
-
-
80055032021
-
Skill discovery in continuous reinforcement learning domains using skill chaining
-
Konidaris, G., & Barto, A. (2009). Skill discovery in continuous reinforcement learning domains using skill chaining. In Advances in Neural Information Processing Systems 22, pp. 1015-1023.
-
(2009)
Advances in Neural Information Processing Systems
, vol.22
, pp. 1015-1023
-
-
Konidaris, G.1
Barto, A.2
-
20
-
-
85162033542
-
Constructing skill trees for reinforcement learning agents from demonstration trajectories
-
Konidaris, G., Kuindersma, S., Barto, A., & Grupen, R. (2010). Constructing skill trees for reinforcement learning agents from demonstration trajectories. In Advances in Neural Information Processing Systems, pp. 1162-1170.
-
(2010)
Advances in Neural Information Processing Systems
, pp. 1162-1170
-
-
Konidaris, G.1
Kuindersma, S.2
Barto, A.3
Grupen, R.4
-
25
-
-
84938531572
-
-
Accessed: 2015-06-29
-
Mann, T. A. (2014). Cyclic Inventory Management (CIM). https://code.google.com/p/rddlsim/source/browse/trunk/files/rddl2/examples/cim.rddl2. Accessed: 2015-06-29.
-
(2014)
Cyclic Inventory Management (CIM)
-
-
Mann, T.A.1
-
27
-
-
14344250635
-
Dynamic abstraction in reinforcement learning via clustering
-
New York, NY, USA. ACM
-
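Mannor, S., Menache, I., Hoze, A., & Klein, U. (2004). Dynamic abstraction in reinforcement learning via clustering. In Proceedings of the 21st International Conference on Machine Learning, ICML '04, p. 71, New York, NY, USA. ACM.
-
(2004)
Proceedings of the 21st International Conference on Machine Learning, ICML '04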
, pp. 71
-
-
Mannor, S.1
Menache, I.2
Hoze, A.3
Klein, U.4
-
32
-
-
0002499613
-
Graph spanners
-
Peleg, D., & Schäffer, A. A. (1989). Graph spanners. Journal of Graph Theory, 13 (1), 99-116.
-
(1989)
Journal of Graph Theory
, vol.13
, Issue.1
, pp. 99-116
-
-
Peleg, D.1
Schäffer, A.A.2
-
33
-
-
44949241322
-
Reinforcement learning of motor skills with policy gradients
-
Peters, J., & Schaal, S. (2008). Reinforcement learning of motor skills with policy gradients. Neural Networks, 21, 682-691.
-
(2008)
Neural Networks
, vol.21
, pp. 682-691
-
-
Peters, J.1
Schaal, S.2
-
35
-
-
84957069070
-
Theoretical results on reinforcement learning with temporally abstract options
-
Springer.
-
Precup, D., Sutton, R. S., & Singh, S. (1998). Theoretical results on reinforcement learning with temporally abstract options. In Machine Learning: ECML-1998, pp. 382-393. Springer.
-
(1998)
Machine Learning: ECML-1998
, pp. 382-393
-
-
Precup, D.1
Sutton, R.S.2
Singh, S.3
-
37
-
-
33646398129
-
Neural fitted Q iteration – first experiences with a data efficient neural reinforcement learning method
-
Springer.
-
Riedmiller, M. (2005). Neural fitted Q iteration – first experiences with a data efficient neural reinforcement learning method. In Machine Learning: ECML-2005, pp. 317-328. Springer.
-
(2005)
Machine Learning: ECML-2005
, pp. 317-328
-
-
Riedmiller, M.1
-
38
-
-
27144482716
-
Highway hierarchies hasten exact shortest path queries
-
Brodal, G., & Leonardi, S. (Eds.) Algorithms: ESA-2005 Springer Berlin Heidelberg
-
Sanders, P., & Schultes, D. (2005). Highway hierarchies hasten exact shortest path queries. In Brodal, G., & Leonardi, S. (Eds.), Algorithms: ESA-2005, Vol. 3669 of Lecture Notes in Computer Science, pp. 568-579. Springer Berlin Heidelberg.
-
(2005)
Lecture Notes in Computer Science
, vol.3669
, pp. 568-579
-
-
Sanders, P.1
Schultes, D.2
-
41
-
-
0031277069
-
Optimality of (s,S) policies in inventory models with Markovian demand
-
Sethi, S. P., & Cheng, F. (1997). Optimality of (s,S) policies in inventory models with Markovian demand. Operations Research, 45 (6), 931-939.
-
(1997)
Operations Research
, vol.45
, Issue.6
, pp. 931-939
-
-
Sethi, S.P.1
Cheng, F.2
-
42
-
-
80054721180
-
Connectionist reinforcement learning for intelligent unit micro management in StarCraft
-
IEEE
-
Shantia, A., Begue, E., & Wiering, M. (2011). Connectionist reinforcement learning for intelligent unit micro management in StarCraft. In Proceedings of the International Joint Conference on Neural Networks, pp. 1794-1801. IEEE.
-
(2011)
Proceedings of the International Joint Conference on Neural Networks
, pp. 1794-1801
-
-
Shantia, A.1
Begue, E.2
Wiering, M.3
-
46
-
-
84912073624
-
Learning options in reinforcement learning
-
Springer
-
Stolle, M., & Precup, D. (2002). Learning options in reinforcement learning. In Abstraction, Reformulation, and Approximation, pp. 212-223. Springer.
-
(2002)
Abstraction, Reformulation, and Approximation
, pp. 212-223
-
-
Stolle, M.1
Precup, D.2
-
47
-
-
27544506565
-
Reinforcement learning for RoboCup soccer keepaway
-
Stone, P., Sutton, R. S., & Kuhlmann, G. (2005). Reinforcement learning for RoboCup soccer keepaway. Adaptive Behavior, 13 (3), 165-188.
-
(2005)
Adaptive Behavior
, vol.13
, Issue.3
, pp. 165-188
-
-
Stone, P.1
Sutton, R.S.2
Kuhlmann, G.3
-
48
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
Sutton, R. S., Precup, D., & Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112 (1), 181-211.
-
(1999)
Artificial Intelligence
, vol.112
, Issue.1
, pp. 181-211
-
-
Sutton, R.S.1
Precup, D.2
Singh, S.3
-
51
-
-
58349118462
-
FF-Replan: A Baseline for Probabilistic Planning
-
Yoon, S. W., Fern, A., & Givan, R. (2007). FF-Replan: A Baseline for Probabilistic Planning. In Proceedings of the International Conference on Automated Planning and Scheduling, Vol. 7, pp. 352-359.
-
(2007)
Proceedings of the International Conference on Automated Planning and Scheduling
, vol.7
, pp. 352-359
-
-
Yoon, S.W.1
Fern, A.2
Givan, R.3