-
1
-
-
0242456818
-
Relational Markov models and their application to adaptive web navigation
-
Edmonton, Alberta, Canada: ACM SIGKDD
-
Anderson, C. R.; Domingos, P.; and Weld, D. S. 2002. Relational Markov models and their application to adaptive web navigation. In Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2002). Edmonton, Alberta, Canada: ACM SIGKDD.
-
(2002)
Proceedings of the Eighth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD-2002)
-
-
Anderson, C.R.1
Domingos, P.2
Weld, D.S.3
-
2
-
-
0034248853
-
Stochastic dynamic programming with factored representations
-
Boutilier, C.; Dearden, R.; and Goldszmidt, M. 2000. Stochastic dynamic programming with factored representations. Artificial Intelligence 121(1-2):49-107.
-
(2000)
Artificial Intelligence
, vol.121
, Issue.1-2
, pp. 49-107
-
-
Boutilier, C.1
Dearden, R.2
Goldszmidt, M.3
-
5
-
-
84942867726
-
An overview of MAXQ hierarchical reinforcement learning
-
Choueiry, B. Y., and Walsh, T., eds., Lecture Notes in Artificial Intelligence. New York: Springer Verlag
-
Dietterich, T. G. 2000. An overview of MAXQ hierarchical reinforcement learning. In Choueiry, B. Y., and Walsh, T., eds., Proceedings of the Symposium on Abstraction, Reformulation and Approximation (SARA 2000), Lecture Notes in Artificial Intelligence. New York: Springer Verlag.
-
(2000)
Proceedings of the Symposium on Abstraction, Reformulation and Approximation (SARA 2000)
-
-
Dietterich, T.G.1
-
7
-
-
0041779094
-
Learning Probabilistic Relational Models
-
Dzeroski, S. and Lavrac, N., eds.. Springer-Verlag
-
Getoor, L.; Friedman, N.; Koller, D.; and Pfeffer, A. 2001. Learning Probabilistic Relational Models. In Dzeroski, S. and Lavrac, N., eds., Relational Data Mining. Springer-Verlag.
-
(2001)
Relational Data Mining
-
-
Getoor, L.1
Friedman, N.2
Koller, D.3
Pfeffer, A.4
-
8
-
-
0006419533
-
Hierarchical solution of Markov decision processes using macro-actions
-
Cooper, G. F., and Moral, S., eds.. Morgan Kaufmann
-
Hauskrecht, M.; Meuleau, N.; Boutilier, C.; Kaelbling, L. P.; and Dean, T. 1998. Hierarchical solution of Markov decision processes using macro-actions. In Cooper, G. F., and Moral, S., eds., Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98). Morgan Kaufmann.
-
(1998)
Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98)
-
-
Hauskrecht, M.1
Meuleau, N.2
Boutilier, C.3
Kaelbling, L.P.4
Dean, T.5
-
10
-
-
0346738900
-
Flexible decomposition algorithms for weakly coupled Markov decision problems
-
Cooper, G. F., and Moral, S., eds.. Morgan Kaufmann
-
Parr, R. 1998a. Flexible decomposition algorithms for weakly coupled Markov decision problems. In Cooper, G. F., and Moral, S., eds., Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98). Morgan Kaufmann.
-
(1998)
Proceedings of the Fourteenth Conference on Uncertainty in Artificial Intelligence (UAI-98)
-
-
Parr, R.1
-
12
-
-
0003392384
-
-
Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst, MA
-
Precup, D. 2000. Temporal Abstraction in Reinforcement Learning. Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst, MA.
-
(2000)
Temporal Abstraction in Reinforcement Learning
-
-
Precup, D.1
-
14
-
-
1942514642
-
Model minimization in hierarchical reinforcement learning
-
Holte, R., ed.
-
Ravindran, B., and Barto, A. G. 2002. Model minimization in hierarchical reinforcement learning. In Holte, R., ed., Proceedings of the 2002 Symposium on Abstraction, Reformulation, and Approximation (SARA-200229.
-
(2002)
Proceedings of the 2002 Symposium on Abstraction, Reformulation, and Approximation SARA-200229
-
-
Ravindran, B.1
Barto, A.G.2
-
15
-
-
1942484796
-
Relativized options: Choosing the right transformation
-
Fawcett, T., and Mishra, N., eds. Washington, DC: AAAI Press
-
Ravindran, B., and Barto, A. G. 2003. Relativized options: Choosing the right transformation. In Fawcett, T., and Mishra, N., eds., Proceedings of the Twentieth International Conference on Machine Learning, 608-615. Washington, DC: AAAI Press.
-
(2003)
Proceedings of the Twentieth International Conference on Machine Learning
, pp. 608-615
-
-
Ravindran, B.1
Barto, A.G.2
-
16
-
-
32844454706
-
-
Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst, MA
-
Ravindran, B. 2004. An Algebraic Approach to Abstraction in Reinforcement Learning. Ph.D. Dissertation, Department of Computer Science, University of Massachusetts, Amherst, MA.
-
(2004)
An Algebraic Approach to Abstraction in Reinforcement Learning
-
-
Ravindran, B.1
|