-
3
-
-
0024680419
-
Adaptive Aggregation for Infinite Horizon Dynamic Programming
-
Bertsekas, D. P., & Castanon, D. C. (1989). Adaptive Aggregation for Infinite Horizon Dynamic Programming. IEEE trans. on auto. control, vol 34, No 6.
-
(1989)
IEEE Trans. on Auto. Control
, vol.34
, Issue.6
-
-
Bertsekas, D.P.1
Castanon, D.C.2
-
4
-
-
85166207010
-
Exploiting structures in policy construction
-
Boutilier, C, Dearden, R., & Goldszmidt, M. (1995). Exploiting structures in policy construction. In Proceedings of IJCAI-95, 1104-1111.
-
(1995)
Proceedings of IJCAI-95
, pp. 1104-1111
-
-
Boutilier, C.1
Dearden, R.2
Goldszmidt, M.3
-
5
-
-
0030349220
-
Computing optimal policies for partially observable decision processes using compact representations
-
Boutilier, C, & Poole, D. (1996). Computing optimal policies for partially observable decision processes using compact representations. In Proceedings of AAAI-96, 1168-1175.
-
(1996)
Proceedings of AAAI-96
, pp. 1168-1175
-
-
Boutilier, C.1
Poole, D.2
-
6
-
-
0031385391
-
A heuristic variable grid solution method for POMDPs
-
Brafman, R. I. (1997). A heuristic variable grid solution method for POMDPs. In Proceedings of AAAI-97, 727-733.
-
(1997)
Proceedings of AAAI-97
, pp. 727-733
-
-
Brafman, R.I.1
-
7
-
-
0008084202
-
-
TR CS-94-14, Department of Computer Science, Brown University, Providence, Rhode Island 02912, USA
-
Cassandra, A. R. (1994). Optimal polices for partially observable Markov decision processes. TR CS-94-14, Department of Computer Science, Brown University, Providence, Rhode Island 02912, USA.
-
(1994)
Optimal Polices for Partially Observable Markov Decision Processes
-
-
Cassandra, A.R.1
-
8
-
-
0028564629
-
Acting optimally in partially observable stochastic domains
-
Cassandra, A. R., Kaelbling, L. P., & Littman, M. L. (1994). Acting optimally in partially observable stochastic domains. In Proceedings of AAAI-94, 1023-1028.
-
(1994)
Proceedings of AAAI-94
, pp. 1023-1028
-
-
Cassandra, A.R.1
Kaelbling, L.P.2
Littman, M.L.3
-
10
-
-
0001909869
-
Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes
-
Cassandra, A. R., Littman, M. L., & Zhang, N. L. (1997). Incremental pruning: A simple, fast, exact method for partially observable Markov decision processes. In Proceedings of Thirteenth Conference on Uncertainty in Artificial Intelligence, 54-61.
-
(1997)
Proceedings of Thirteenth Conference on Uncertainty in Artificial Intelligence
, pp. 54-61
-
-
Cassandra, A.R.1
Littman, M.L.2
Zhang, N.L.3
-
11
-
-
0003818801
-
-
PhD thesis, University of British Columbia, Vancouver, BC, Canada
-
Cheng, H. T. (1988). Algorithms for partially observable Markov decision processes. PhD thesis, University of British Columbia, Vancouver, BC, Canada.
-
(1988)
Algorithms for Partially Observable Markov Decision Processes
-
-
Cheng, H.T.1
-
12
-
-
0000746330
-
Model reduction techniques for computing approximately optimal solution for Markov decision processes
-
Dean, T. L., Givan, R., & Leach, S. (1997). Model reduction techniques for computing approximately optimal solution for Markov decision processes. In Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence, 124-131.
-
(1997)
Proceedings of the Thirteenth Conference on Uncertainty in Artificial Intelligence
, pp. 124-131
-
-
Dean, T.L.1
Givan, R.2
Leach, S.3
-
13
-
-
0027708037
-
Planning with deadlines in stochastic domains
-
Dean, T. L., Kaelbling, L. P., Kirman, J., & Nicholson A. (1993). Planning with deadlines in stochastic domains. In Proceedings of AAAI-93, 574-579.
-
(1993)
Proceedings of AAAI-93
, pp. 574-579
-
-
Dean, T.L.1
Kaelbling, L.P.2
Kirman, J.3
Nicholson, A.4
-
14
-
-
0006424007
-
-
TR CS-95-10, Department of Computer Science, Brown University, Provi-dence, Rhode Island 02912, USA
-
Dean T. L., & Lin, S. H. (1995). Decomposition techniques for planning in stochastic domains. TR CS-95-10, Department of Computer Science, Brown University, Provi-dence, Rhode Island 02912, USA.
-
(1995)
Decomposition Techniques for Planning in Stochastic Domains
-
-
Dean, T.L.1
Lin, S.H.2
-
16
-
-
0021486586
-
The optimal search for a moving target when the search path is constrained
-
Eagle, J. N. (1984). The optimal search for a moving target when the search path is constrained. Operations Research, 32(5), 1107-1115.
-
(1984)
Operations Research
, vol.32
, Issue.5
, pp. 1107-1115
-
-
Eagle, J.N.1
-
17
-
-
0031385618
-
Incremental methods for computing bounds in partially observable Markov decision processes
-
Hauskrecht, M. (1997). Incremental methods for computing bounds in partially observable Markov decision processes. In Proceedings of AAAI-97, 734-739.
-
(1997)
Proceedings of AAAI-97
, pp. 734-739
-
-
Hauskrecht, M.1
-
18
-
-
0008038484
-
-
TR CS-94-40, Department of Computer Science, Brown University, Providence, Rhode Island 02912, USA
-
Littman, M. L. (1994). The witness algorithm: Solving partially observable Markov decision processes. TR CS-94-40, Department of Computer Science, Brown University, Providence, Rhode Island 02912, USA.
-
(1994)
The Witness Algorithm: Solving Partially Observable Markov Decision Processes
-
-
Littman, M.L.1
-
19
-
-
0003596835
-
-
TR CS-95-19, Department of Computer Science, Brown University, Providence, Rhode Island 02912, USA
-
Littman, M. L., Cassandra, A. R., & Kaelbling, L. P. (1995). Efficient dynamic-programming updates in partially observable Markov decision processes. TR CS-95-19, Department of Computer Science, Brown University, Providence, Rhode Island 02912, USA.
-
(1995)
Efficient Dynamic-programming Updates in Partially Observable Markov Decision Processes
-
-
Littman, M.L.1
Cassandra, A.R.2
Kaelbling, L.P.3
-
20
-
-
0002679852
-
A survey of algorithmic methods for solving partially observable Markov decision processes
-
Lovejoy, W. S. (1991a). A survey of algorithmic methods for solving partially observable Markov decision processes. Annals of Operations Research, 28 (1), 47-65.
-
(1991)
Annals of Operations Research
, vol.28
, Issue.1
, pp. 47-65
-
-
Lovejoy, W.S.1
-
21
-
-
0000494894
-
Computationally feasible bounds for partially observed Markov decision processes
-
Lovejoy, W. S. (1991b). Computationally feasible bounds for partially observed Markov decision processes. Operations Research, 39 (1), 162-175.
-
(1991)
Operations Research
, vol.39
, Issue.1
, pp. 162-175
-
-
Lovejoy, W.S.1
-
22
-
-
0019909899
-
A survey of partially observable Markov decision processes: Theory, models, and algorithms
-
Monahan, G. E. (1982). A survey of partially observable Markov decision processes: theory, models, and algorithms. Management Science, 28 (1), 1-16.
-
(1982)
Management Science
, vol.28
, Issue.1
, pp. 1-16
-
-
Monahan, G.E.1
-
23
-
-
85168129602
-
Approximating optimal polices for partially observable stochastic domains
-
Parr, R., & Russell, S. (1995). Approximating optimal polices for partially observable stochastic domains. In Proceedings of IJCAI-95, 1088-1094.
-
(1995)
Proceedings of IJCAI-95
, pp. 1088-1094
-
-
Parr, R.1
Russell, S.2
-
24
-
-
0003438819
-
-
Ph.D. Thesis, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology
-
Platzman, L. K. (1977). Finite-memory estimation and control of finite probabilistic systems. Ph.D. Thesis, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology.
-
(1977)
Finite-memory Estimation and Control of Finite Probabilistic Systems
-
-
Platzman, L.K.1
-
25
-
-
77957101448
-
Markov decision processes
-
D. P. Heyman and M. J. Sobel (eds.) Elsevier Science Publishers
-
Puterman, M. L. (1990). Markov decision processes. In D. P. Heyman and M. J. Sobel (eds.), Handbooks in OR & MS., Elsevier Science Publishers, Vol. 2, 331-434.
-
(1990)
Handbooks in or & MS
, vol.2
, pp. 331-434
-
-
Puterman, M.L.1
-
26
-
-
0003871607
-
-
PhD thesis, Stanford University, Stanford, California, USA
-
Sondik, E. J. (1971). The optimal control of partially observable Markov processes. PhD thesis, Stanford University, Stanford, California, USA.
-
(1971)
The Optimal Control of Partially Observable Markov Processes
-
-
Sondik, E.J.1
-
27
-
-
84936534549
-
-
Southwest Fisheries Center Administrative Report H-79-13, National Marine Fisheries Service, Honolulu, Hawaii
-
Sondik, E. J., & Mendelssohn, R. (1979). Information seeking in Markov decision processes, Southwest Fisheries Center Administrative Report H-79-13, National Marine Fisheries Service, Honolulu, Hawaii.
-
(1979)
Information Seeking in Markov Decision Processes
-
-
Sondik, E.J.1
Mendelssohn, R.2
-
28
-
-
0000893414
-
Partially observed Markov decision processes: A survey
-
White III, C. C. (1991). Partially observed Markov decision processes: A survey. Annals of Operations Research, 32.
-
(1991)
Annals of Operations Research
, vol.32
-
-
White, I.I.I.C.C.1
-
30
-
-
0005951145
-
Finite-memory suboptimal design for partially observed Markov decision processes
-
White III, C. C, & Scherer, W. T., (1994). Finite-memory suboptimal design for partially observed Markov decision processes. Operations Research, 42(3), 440-455.
-
(1994)
Operations Research
, vol.42
, Issue.3
, pp. 440-455
-
-
White, I.I.I.C.C.1
Scherer, W.T.2
|