-
1
-
-
0029210635
-
Learning to act using real-time dynamic programming
-
A. Barto, S. Bradtke, and S. Singh. Learning to act using real-time dynamic programming. Artificial Intelligence, 72(1):81-138, 1995.
-
(1995)
Artificial Intelligence
, vol.72
, Issue.1
, pp. 81-138
-
-
Barto, A.1
Bradtke, S.2
Singh, S.3
-
6
-
-
80053145429
-
Heuristic search for planning under uncertainty
-
H. G. R. Dechter and J. Halpern, editors, College Publications, London, UK
-
B. Bonet and E. Hansen. Heuristic search for planning under uncertainty. In H. G. R. Dechter and J. Halpern, editors, Heuristics, Probability and Causality: A Tribute to Judea Pearl, pages 3-22. College Publications, London, UK, 2010.
-
(2010)
Heuristics, Probability and Causality: A Tribute to Judea Pearl
, pp. 3-22
-
-
Bonet, B.1
Hansen, E.2
-
7
-
-
0037840849
-
On the undecidability of probabilistic planning and related stochastic optimization problems
-
O. Madani, S. Hanks, and A. Condon. On the undecidability of probabilistic planning and related stochastic optimization problems. Artificial Intelligence, 147:5-34, 2003.
-
(2003)
Artificial Intelligence
, vol.147
, pp. 5-34
-
-
Madani, O.1
Hanks, S.2
Condon, A.3
-
8
-
-
40449123123
-
Decentralized algorithms for netcentric force protection against antiship missiles
-
DOI 10.1109/TAES.2007.4441744
-
M. Maskery, V. Krishnamurthy, and C. O'Regan. Decentralized algorithms for netcentric force protection against antiship missiles. IEEE Transactions on Aerospace and Electronic Systems, 43(4):1351-1372, 2008. (Pubitemid 351344361)
-
(2007)
IEEE Transactions on Aerospace and Electronic Systems
, vol.43
, Issue.4
, pp. 1351-1372
-
-
Maskery, M.1
Krishnamurthy, V.2
O'Regan, C.3
-
10
-
-
0035452408
-
On terminating Markov decision processes with a risk-averse objective function
-
DOI 10.1016/S0005-1098(01)00084-X, PII S000510980100084X
-
S. Patek. On terminating Markov decision processes with a risk averse objective function. Automatica, 37(9):1379-1386, 2001. (Pubitemid 32691033)
-
(2001)
Automatica
, vol.37
, Issue.9
, pp. 1379-1386
-
-
Patek, S.D.1
-
11
-
-
34548284981
-
Partially observed stochastic shortest path problems with approximate solution by neurodynamic programming
-
DOI 10.1109/TSMCA.2007.902662
-
S. Patek. Partially observed stochastic shortest path problems with approximate solution by neurodynamic programming. IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans, 37(5):710-720, 2007. (Pubitemid 47323535)
-
(2007)
IEEE Transactions on Systems, Man, and Cybernetics Part A:Systems and Humans
, vol.37
, Issue.5
, pp. 710-720
-
-
Patek, S.D.1
-
15
-
-
0028497630
-
Asynchronous stochastic approximation and Q-learning
-
J. Tsitsiklis. Asynchronous stochastic approximation and Q-learning. Machine Learning, 16:185-202, 1994.
-
(1994)
Machine Learning
, vol.16
, pp. 185-202
-
-
Tsitsiklis, J.1
-
16
-
-
0036374229
-
Speeding up the convergence of value iteration in partially observable Markov decision processes
-
N. Zhang and W. Zhang. Speeding up the convergence of value iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research, 14:29-51, 2001. (Pubitemid 33738058)
-
(2001)
Journal of Artificial Intelligence Research
, vol.14
, pp. 29-51
-
-
Zhang, N.L.1
Zhang, W.2
|