-
5
-
-
58049128403
-
Npclu: An approach for clustering spatially extended objects
-
December
-
M. Halkidi and M. Vazirgiannis. Npclu: An approach for clustering spatially extended objects. Intell. Data Anal., 12:587-606, December 2008.
-
(2008)
Intell. Data Anal.
, vol.12
, pp. 587-606
-
-
Halkidi, M.1
Vazirgiannis, M.2
-
6
-
-
52649148744
-
Self-optimizing memory controllers: A reinforcement learning approach
-
Engin Ipek, Onur Mutlu, Jose F. Martinez, and Rich Caruana. Self-optimizing memory controllers: A reinforcement learning approach. Computer Architecture, International Symposium on, 0:39-50, 2008.
-
(2008)
Computer Architecture International Symposium on
, pp. 39-50
-
-
Ipek, E.1
Mutlu, O.2
Martinez, J.F.3
Caruana, R.4
-
7
-
-
33750705246
-
Causal graph based decomposition of factored mdps
-
December
-
Anders Jonsson and Andrew Barto. Causal graph based decomposition of factored mdps. J. Mach. Learn. Res., 7:2259-2301, December 2006.
-
(2006)
J. Mach. Learn. Res.
, vol.7
, pp. 2259-2301
-
-
Jonsson, A.1
Barto, A.2
-
10
-
-
80055032021
-
Skill discovery in continuous reinforcement learning domains using skill chaining
-
George Konidaris and Andrew G. Barto. Skill discovery in continuous reinforcement learning domains using skill chaining. In Advances in Neural Information Processing Systems 22, pages 1015-1023, 2009.
-
(2009)
Advances in Neural Information Processing Systems
, vol.22
, pp. 1015-1023
-
-
Konidaris, G.1
Barto, A.G.2
-
11
-
-
0013465187
-
Automatic discovery of subgoals in reinforcement learning using diverse density
-
Amy McGovern and Andrew G. Barto. Automatic discovery of subgoals in reinforcement learning using diverse density. In ICML, pages 361-368, 2001.
-
(2001)
ICML
, pp. 361-368
-
-
McGovern, A.1
Barto, A.G.2
-
12
-
-
77958566186
-
Reinforcement learning for closed-loop propofol anesthesia: A human volunteer study
-
Brett Moore, Periklis Panousis, Vivek Kulkarni, Larry Pyeatt, and Anthony Doufas. Reinforcement learning for closed-loop propofol anesthesia: A human volunteer study. In Innovative Applications of Artificial Intelligence, 2010.
-
(2010)
Innovative Applications of Artificial Intelligence
-
-
Moore, B.1
Panousis, P.2
Kulkarni, V.3
Pyeatt, L.4
Doufas, A.5
-
13
-
-
77950032550
-
Markov chain sampling methods for Dirichlet process mixture models
-
R.M. Neal. Markov chain sampling methods for Dirichlet process mixture models. Journal of computational and graphical statistics, 9(2):249-265, 2000.
-
(2000)
Journal of Computational and Graphical Statistics
, vol.9
, Issue.2
, pp. 249-265
-
-
Neal, R.M.1
-
15
-
-
14344250461
-
Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning
-
Marc Pickett and Andrew G. Barto. Policyblocks: An algorithm for creating useful macro-actions in reinforcement learning. In ICML, pages 506-513, 2002.
-
(2002)
ICML
, pp. 506-513
-
-
Pickett, M.1
Barto, A.G.2
-
18
-
-
78651097494
-
Skill characterization based on betweenness
-
Özgür Ş imşek and Andrew G. Barto. Skill characterization based on betweenness. In NIPS, pages 1497-1504, 2008.
-
(2008)
NIPS
, pp. 1497-1504
-
-
Şimşek, O.1
Barto, A.G.2
-
19
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
Richard Sutton, Doina Precup, and Satinder Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112:181-211, 1999.
-
(1999)
Artificial Intelligence
, vol.112
, pp. 181-211
-
-
Sutton, R.1
Precup, D.2
Singh, S.3
-
22
-
-
34547994508
-
Multi-task reinforcement learning: A hierarchical bayesian approach
-
ACM Press
-
Aaron Wilson, Alan Fern, Soumya Ray, and Prasad Tadepalli. Multi-task reinforcement learning: A hierarchical bayesian approach. In In: ICML 07: Proceedings of the 24th international conference on Machine learning, page 1015. ACM Press, 2007.
-
(2007)
ICML 07: Proceedings of the 24th International Conference on Machine Learning
, pp. 1015
-
-
Wilson, A.1
Fern, A.2
Ray, S.3
Tadepalli, P.4
|