-
1
-
-
0037262814
-
An introduction to MCMC for machine learning
-
Andrieu, C., de Freitas, N., Doucet, A., & Jordan, M. (2003). An introduction to MCMC for machine learning. Machine Learning, 50, 5-43.
-
(2003)
Machine Learning
, vol.50
, pp. 5-43
-
-
Andrieu, C.1
De Freitas, N.2
Doucet, A.3
Jordan, M.4
-
2
-
-
50549213583
-
Optimal control of Markov processes with incomplete state information
-
Astrom, K. (1965). Optimal control of Markov processes with incomplete state information. Journal of Mathematical Analysis and Applications, 10(1), 174-205.
-
(1965)
Journal of Mathematical Analysis and Applications
, vol.10
, Issue.1
, pp. 174-205
-
-
Astrom, K.1
-
3
-
-
85012688561
-
-
Princeton University Press, Princeton, NJ
-
Bellman, R. (1957). Dynamic Programming. Princeton University Press, Princeton, NJ.
-
(1957)
Dynamic Programming
-
-
Bellman, R.1
-
4
-
-
84968468700
-
Polynomial approximation - A new computational technique in dynamic programming: Allocation processes
-
Bellman, R., Kalaba, R., & Kotkin, B. (1963). Polynomial approximation - a new computational technique in dynamic programming: Allocation processes. Mathematics of Computation, 17(82), 155-161.
-
(1963)
Mathematics of Computation
, vol.17
, Issue.82
, pp. 155-161
-
-
Bellman, R.1
Kalaba, R.2
Kotkin, B.3
-
5
-
-
0000268954
-
A counterexample for temporal differences learning
-
Bertsekas, D. (1995). A counterexample for temporal differences learning. Neural Computation, 7(2), 270-279.
-
(1995)
Neural Computation
, vol.7
, Issue.2
, pp. 270-279
-
-
Bertsekas, D.1
-
8
-
-
85166207010
-
Exploiting structure in policy construction
-
Boutilier, C., Dearden, R., & Goldszmidt, M. (1995). Exploiting structure in policy construction. In Proceedings of the 14th International Joint Conference on Artificial Intelligence, pp. 1104-1111.
-
(1995)
Proceedings of the 14th International Joint Conference on Artificial Intelligence
, pp. 1104-1111
-
-
Boutilier, C.1
Dearden, R.2
Goldszmidt, M.3
-
9
-
-
3042524845
-
Planning under continuous time and resource uncertainty: A challenge for AI
-
Bresina, J., Dearden, R., Meuleau, N., Ramakrishnan, S., Smith, D., & Washington, R. (2002). Planning under continuous time and resource uncertainty: A challenge for AI. In Proceedings of the 18th Conference on Uncertainty in Artificial Intelligence, pp. 77-84.
-
(2002)
Proceedings of the 18th Conference on Uncertainty in Artificial Intelligence
, pp. 77-84
-
-
Bresina, J.1
Dearden, R.2
Meuleau, N.3
Ramakrishnan, S.4
Smith, D.5
Washington, R.6
-
10
-
-
0002205556
-
Rao-Blackwellisation of sampling schemes
-
Casella, G., & Robert, C. (1996). Rao-Blackwellisation of sampling schemes. Biometrika, 83(1), 81-94.
-
(1996)
Biometrika
, vol.83
, Issue.1
, pp. 81-94
-
-
Casella, G.1
Robert, C.2
-
11
-
-
0026206780
-
An optimal one-way multigrid algorithm for discrete-time stochastic control
-
Chow, C.-S., & Tsitsiklis, J. (1991). An optimal one-way multigrid algorithm for discrete-time stochastic control. IEEE Transactions on Automatic Control, 36(8), 898-914.
-
(1991)
IEEE Transactions on Automatic Control
, vol.36
, Issue.8
, pp. 898-914
-
-
Chow, C.-S.1
Tsitsiklis, J.2
-
14
-
-
0348090400
-
The linear programming approach to approximate dynamic programming
-
de Farias, D. P., & Van Roy, B. (2003). The linear programming approach to approximate dynamic programming. Operations Research, 51(6), 850-856.
-
(2003)
Operations Research
, vol.51
, Issue.6
, pp. 850-856
-
-
De Farias, D.P.1
Van Roy, B.2
-
15
-
-
5544258192
-
On constraint sampling for the linear programming approach to approximate dynamic programming
-
de Farias, D. P., & Van Roy, B. (2004). On constraint sampling for the linear programming approach to approximate dynamic programming. Mathematics of Operations Research, 29(3), 462-478.
-
(2004)
Mathematics of Operations Research
, vol.29
, Issue.3
, pp. 462-478
-
-
De Farias, D.P.1
Van Roy, B.2
-
16
-
-
84990553353
-
A model for reasoning about persistence and causation
-
Dean, T., & Kanazawa, K. (1989). A model for reasoning about persistence and causation. Computational Intelligence, 5, 142-150.
-
(1989)
Computational Intelligence
, vol.5
, pp. 142-150
-
-
Dean, T.1
Kanazawa, K.2
-
18
-
-
4243137056
-
Hybrid Monte Carlo
-
Duane, S., Kennedy, A. D., Pendleton, B., & Roweth, D. (1987). Hybrid Monte Carlo. Physics Letters B, 195(2), 216-222.
-
(1987)
Physics Letters B
, vol.195
, Issue.2
, pp. 216-222
-
-
Duane, S.1
Kennedy, A.D.2
Pendleton, B.3
Roweth, D.4
-
19
-
-
29344460055
-
Dynamic programming for structured continuous Markov decision problems
-
Feng, Z., Dearden, R., Meuleau, N., & Washington, R. (2004). Dynamic programming for structured continuous Markov decision problems. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, pp. 154-161.
-
(2004)
Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence
, pp. 154-161
-
-
Feng, Z.1
Dearden, R.2
Meuleau, N.3
Washington, R.4
-
21
-
-
0021518209
-
Stochastic relaxation, Gibbs distribution, and the Bayesian restoration of images
-
Geman, S., & Geman, D. (1984). Stochastic relaxation, Gibbs distribution, and the Bayesian restoration of images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 6(6), 721-741.
-
(1984)
IEEE Transactions on Pattern Analysis and Machine Intelligence
, vol.6
, Issue.6
, pp. 721-741
-
-
Geman, S.1
Geman, D.2
-
24
-
-
29344475738
-
Solving factored MDPs with continuous and discrete variables
-
Guestrin, C., Hauskrecht, M., & Kveton, B. (2004). Solving factored MDPs with continuous and discrete variables. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, pp. 235-242.
-
(2004)
Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence
, pp. 235-242
-
-
Guestrin, C.1
Hauskrecht, M.2
Kveton, B.3
-
25
-
-
84880803349
-
Generalizing plans to new environments in relational MDPs
-
Guestrin, C., Koller, D., Gearhart, C., & Kanodia, N. (2003). Generalizing plans to new environments in relational MDPs. In Proceedings of the 18th International Joint Conference on Artificial Intelligence, pp. 1003-1010.
-
(2003)
Proceedings of the 18th International Joint Conference on Artificial Intelligence
, pp. 1003-1010
-
-
Guestrin, C.1
Koller, D.2
Gearhart, C.3
Kanodia, N.4
-
26
-
-
84880898477
-
Max-norm projections for factored MDPs
-
Guestrin, C., Koller, D., & Parr, R. (2001). Max-norm projections for factored MDPs. In Proceedings of the 17th International Joint Conference on Artificial Intelligence, pp. 673-682.
-
(2001)
Proceedings of the 17th International Joint Conference on Artificial Intelligence
, pp. 673-682
-
-
Guestrin, C.1
Koller, D.2
Parr, R.3
-
27
-
-
84899028010
-
Multiagent planning with factored MDPs
-
Guestrin, C., Koller, D., & Parr, R. (2002). Multiagent planning with factored MDPs. In Advances in Neural Information Processing Systems 14, pp. 1523-1530.
-
(2002)
Advances in Neural Information Processing Systems
, vol.14
, pp. 1523-1530
-
-
Guestrin, C.1
Koller, D.2
Parr, R.3
-
28
-
-
4544318426
-
Efficient solution algorithms for factored MDPs
-
Guestrin, C., Koller, D., Parr, R., & Venkataraman, S. (2003). Efficient solution algorithms for factored MDPs. Journal of Artificial Intelligence Research, 19, 399-468.
-
(2003)
Journal of Artificial Intelligence Research
, vol.19
, pp. 399-468
-
-
Guestrin, C.1
Koller, D.2
Parr, R.3
Venkataraman, S.4
-
29
-
-
0036923118
-
Context specific multiagent coordination and planning with factored MDPs
-
Guestrin, C., Venkataraman, S., & Koller, D. (2002). Context specific multiagent coordination and planning with factored MDPs. In Proceedings of the 18th National Conference on Artificial Intelligence, pp. 253-259.
-
(2002)
Proceedings of the 18th National Conference on Artificial Intelligence
, pp. 253-259
-
-
Guestrin, C.1
Venkataraman, S.2
Koller, D.3
-
30
-
-
77956890234
-
Monte Carlo sampling methods using Markov chains and their applications
-
Hastings, W. K. (1970). Monte Carlo sampling methods using Markov chains and their applications. Biometrika, 57, 97-109.
-
(1970)
Biometrika
, vol.57
, pp. 97-109
-
-
Hastings, W.K.1
-
31
-
-
0001770240
-
Value-function approximations for partially observable Markov decision processes
-
Hauskrecht, M. (2000). Value-function approximations for partially observable Markov decision processes. Journal of Artificial Intelligence Research, 13, 33-94.
-
(2000)
Journal of Artificial Intelligence Research
, vol.13
, pp. 33-94
-
-
Hauskrecht, M.1
-
32
-
-
84898970468
-
Linear program approximations for factored continuous-state Markov decision processes
-
Hauskrecht, M., & Kveton, B. (2004). Linear program approximations for factored continuous-state Markov decision processes. In Advances in Neural Information Processing Systems 16, pp. 895-902.
-
(2004)
Advances in Neural Information Processing Systems
, vol.16
, pp. 895-902
-
-
Hauskrecht, M.1
Kveton, B.2
-
33
-
-
0032398552
-
Auxiliary variable methods for Markov chain Monte Carlo with applications
-
Higdon, D. (1998). Auxiliary variable methods for Markov chain Monte Carlo with applications. Journal of the American Statistical Association, 93(442), 585-595.
-
(1998)
Journal of the American Statistical Association
, vol.93
, Issue.442
, pp. 585-595
-
-
Higdon, D.1
-
34
-
-
0000086731
-
Influence diagrams
-
Strategic Decisions Group, Menlo Park, CA
-
Howard, R., & Matheson, J. (1984). Influence diagrams. In Readings on the Principles and Applications of Decision Analysis, Vol. 2, pp. 719-762. Strategic Decisions Group, Menlo Park, CA.
-
(1984)
Readings on the Principles and Applications of Decision Analysis
, vol.2
, pp. 719-762
-
-
Howard, R.1
Matheson, J.2
-
36
-
-
0000305280
-
From influence diagrams to junction trees
-
Jensen, F., Jensen, F., & Dittmer, S. (1994). From influence diagrams to junction trees. In Proceedings of the 10th Conference on Uncertainty in Artificial Intelligence, pp. 367-373.
-
(1994)
Proceedings of the 10th Conference on Uncertainty in Artificial Intelligence
, pp. 367-373
-
-
Jensen, F.1
Jensen, F.2
Dittmer, S.3
-
37
-
-
0000564361
-
A polynomial algorithm in linear programming
-
Khachiyan, L. (1979). A polynomial algorithm in linear programming. Doklady Akademii Nauk SSSR, 244, 1093-1096.
-
(1979)
Doklady Akademii Nauk SSSR
, vol.244
, pp. 1093-1096
-
-
Khachiyan, L.1
-
38
-
-
26444479778
-
Optimization by simulated annealing
-
Kirkpatrick, S., Gelatt, C. D., & Vecchi, M. P. (1983). Optimization by simulated annealing. Science, 220(4598), 671-680.
-
(1983)
Science
, vol.220
, Issue.4598
, pp. 671-680
-
-
Kirkpatrick, S.1
Gelatt, C.D.2
Vecchi, M.P.3
-
44
-
-
29344433509
-
Samuel meets Amarel: Automating value function approximation using global state space analysis
-
Mahadevan, S. (2005). Samuel meets Amarel: Automating value function approximation using global state space analysis. In Proceedings of the 20th National Conference on Artificial Intelligence, pp. 1000-1005.
-
(2005)
Proceedings of the 20th National Conference on Artificial Intelligence
, pp. 1000-1005
-
-
Mahadevan, S.1
-
45
-
-
77957901577
-
Value function approximation with diffusion wavelets and Laplacian eigenfunctions
-
Mahadevan, S., & Maggioni, M. (2006). Value function approximation with diffusion wavelets and Laplacian eigenfunctions. In Advances in Neural Information Processing Systems 18, pp. 843-850.
-
(2006)
Advances in Neural Information Processing Systems
, vol.18
, pp. 843-850
-
-
Mahadevan, S.1
Maggioni, M.2
-
46
-
-
33750591731
-
Learning representation and control in continuous Markov decision processes
-
Mahadevan, S., Maggioni, M., Ferguson, K., & Osentoski, S. (2006). Learning representation and control in continuous Markov decision processes. In Proceedings of the 21st National Conference on Artificial Intelligence.
-
(2006)
Proceedings of the 21st National Conference on Artificial Intelligence
-
-
Mahadevan, S.1
Maggioni, M.2
Ferguson, K.3
Osentoski, S.4
-
47
-
-
0001257766
-
Linear programming and sequential decisions
-
Manne, A. (1960). Linear programming and sequential decisions. Management Science, 6(3), 259-267.
-
(1960)
Management Science
, vol.6
, Issue.3
, pp. 259-267
-
-
Manne, A.1
-
48
-
-
5744249209
-
Equation of state calculations by fast computing machines
-
Metropolis, N., Rosenbluth, A., Rosenbluth, M., Teller, A., & Teller, E. (1953). Equation of state calculations by fast computing machines. Journal of Chemical Physics, 21, 1087-1092.
-
(1953)
Journal of Chemical Physics
, vol.21
, pp. 1087-1092
-
-
Metropolis, N.1
Rosenbluth, A.2
Rosenbluth, M.3
Teller, A.4
Teller, E.5
-
49
-
-
0036832953
-
Variable resolution discretization in optimal control
-
Munos, R., & Moore, A. (2002). Variable resolution discretization in optimal control. Machine Learning, 49, 291-323.
-
(2002)
Machine Learning
, vol.49
, pp. 291-323
-
-
Munos, R.1
Moore, A.2
-
53
-
-
0036927202
-
Greedy linear value-approximation for factored Markov decision processes
-
Patrascu, R., Poupart, P., Schuurmans, D., Boutilier, C., & Guestrin, C. (2002). Greedy linear value-approximation for factored Markov decision processes. In Proceedings of the 18th National Conference on Artificial Intelligence, pp. 285-291.
-
(2002)
Proceedings of the 18th National Conference on Artificial Intelligence
, pp. 285-291
-
-
Patrascu, R.1
Poupart, P.2
Schuurmans, D.3
Boutilier, C.4
Guestrin, C.5
-
55
-
-
0001509947
-
Using randomization to break the curse of dimensionality
-
Rust, J. (1997). Using randomization to break the curse of dimensionality. Econometrica, 65(3), 487-516.
-
(1997)
Econometrica
, vol.65
, Issue.3
, pp. 487-516
-
-
Rust, J.1
-
61
-
-
0001046225
-
Practical issues in temporal difference learning
-
Tesauro, G. (1992). Practical issues in temporal difference learning. Machine Learning, 8(3-4), 257-277.
-
(1992)
Machine Learning
, vol.8
, Issue.3-4
, pp. 257-277
-
-
Tesauro, G.1
-
62
-
-
0000985504
-
TD-Gammon, a self-teaching backgammon program, achieves master-level play
-
Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2), 215-219.
-
(1994)
Neural Computation
, vol.6
, Issue.2
, pp. 215-219
-
-
Tesauro, G.1
-
63
-
-
0029276036
-
Temporal difference learning and TD-Gammon
-
Tesauro, G. (1995). Temporal difference learning and TD-Gammon. Communications of the ACM, 38(3), 58-68.
-
(1995)
Communications of the ACM
, vol.38
, Issue.3
, pp. 58-68
-
-
Tesauro, G.1
-
66
-
-
33750590218
-
Annealed MAP
-
Yuan, C., Lu, T.-C., & Druzdzel, M. (2004). Annealed MAP. In Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence, pp. 628-635.
-
(2004)
Proceedings of the 20th Conference on Uncertainty in Artificial Intelligence
, pp. 628-635
-
-
Yuan, C.1
Lu, T.-C.2
Druzdzel, M.3