-
1
-
-
1942514241
-
Scalable internal-state policy-gradient methods for POMDPs
-
Douglas Aberdeen and Jonathan Baxter. Scalable internal-state policy-gradient methods for POMDPs. In Proc. of ML-02, pages 3-10, 2002.
-
(2002)
Proc. of ML-02
, pp. 3-10
-
-
Aberdeen, D.1
Baxter, J.2
-
2
-
-
0036930295
-
A POMDP formulation of preference elicitation problems
-
Edmonton
-
Craig Boutilier. A POMDP formulation of preference elicitation problems. In Proc. of AAAI-2002, pages 239-246, Edmonton, 2002.
-
(2002)
Proc. of AAAI-2002
, pp. 239-246
-
-
Boutilier, C.1
-
4
-
-
0028564629
-
Acting optimally in partially observable stochastic domains
-
Seattle
-
Anthony R. Cassandra, Leslie Pack Kaelbling, and Michael L. Littman. Acting optimally in partially observable stochastic domains. In Proc. of AAAI-94, pages 1023-1028, Seattle, 1994.
-
(1994)
Proc. of AAAI-94
, pp. 1023-1028
-
-
Cassandra, A.R.1
Kaelbling, L.P.2
Littman, M.L.3
-
5
-
-
0001909869
-
Incremental pruning: A simple, fast, exact method for POMDPs
-
Providence, RI
-
Anthony R. Cassandra, Michael L. Littman, and Nevin L. Zhang. Incremental pruning: A simple, fast, exact method for POMDPs. In Proc. of UAI-97, pages 54-61, Providence, RI, 1997.
-
(1997)
Proc. of UAI-97
, pp. 54-61
-
-
Cassandra, A.R.1
Littman, M.L.2
Zhang, N.L.3
-
7
-
-
0000411214
-
Tabu search - Part I
-
Fred Glover. Tabu search - part I. ORSA Journal on Computing, 1(3):190-206, 1989.
-
(1989)
ORSA Journal on Computing
, vol.1
, Issue.3
, pp. 190-206
-
-
Glover, F.1
-
8
-
-
0003125478
-
Solving POMDPs by searching in policy space
-
Madison, WI
-
Eric A. Hansen. Solving POMDPs by searching in policy space. In Proc. of UAI-98, pages 211-219, Madison, WI, 1998.
-
(1998)
Proc. of UAI-98
, pp. 211-219
-
-
Hansen, E.A.1
-
9
-
-
0004097542
-
-
PhD thesis, TU Darmstadt, Darmstadt, Germany
-
Holger H. Hoos. Stochastic Local Search - Methods, Models, Applications. PhD thesis, TU Darmstadt, Darmstadt, Germany, 1998.
-
(1998)
Stochastic Local Search - Methods, Models, Applications.
-
-
Hoos, H.H.1
-
10
-
-
0003272035
-
Memoryless policies: Theoretical limitations and practical results. Dave Cliff, Philip Husbands, Jean-Arcady Meyer, and Stewart W. Wilson, editors
-
Cambridge, MA, The MIT Press
-
Michael L. Littman. Memoryless policies: Theoretical limitations and practical results. In Dave Cliff, Philip Husbands, Jean-Arcady Meyer, and Stewart W. Wilson, editors, Proceedings of the Third International Conference on Simulation of Adaptive Behavior, Cambridge, MA, 1994. The MIT Press.
-
(1994)
Proceedings of the Third International Conference on Simulation of Adaptive Behavior
-
-
Littman, M.L.1
-
11
-
-
85138579181
-
Learning policies for partially observable environments: Scaling up
-
Lake Tahoe
-
Michael L. Littman, Anthony R. Cassandra, and Leslie Pack Kaelbling. Learning policies for partially observable environments: Scaling up. In Proc. of ML-95, pages 362-370, Lake Tahoe, 1995.
-
(1995)
Proc. of ML-95
, pp. 362-370
-
-
Littman, M.L.1
Cassandra, A.R.2
Kaelbling, L.P.3
-
12
-
-
0002500946
-
Solving POMDPs by searching the space of finite policies
-
Stockholm
-
Nicolas Meuleau, Kee-Eung Kim, Leslie Pack Kaelbling, and Anthony R. Cassandra. Solving POMDPs by searching the space of finite policies. In Proc. of UAI-99, pages 417-426, Stockholm, 1999.
-
(1999)
Proc. of UAI-99
, pp. 417-426
-
-
Meuleau, N.1
Kim, K.-E.2
Kaelbling, L.P.3
Cassandra, A.R.4
-
13
-
-
0002103968
-
Learning finite-state controllers for partially observable environments
-
Stockholm
-
Nicolas Meuleau, Leonid Peshkin, Kee-Eung Kim, and Leslie Pack Kaelbling. Learning finite-state controllers for partially observable environments. In Proc. of UAI-99, pages 427-436, Stockholm, 1999.
-
(1999)
Proc. of UAI-99
, pp. 427-436
-
-
Meuleau, N.1
Peshkin, L.2
Kim, K.-E.3
Kaelbling, L.P.4
-
14
-
-
84880772945
-
Point-based value iteration: An anytime algorithm for POMDPs
-
Acapulco
-
Joelle Pineau, Geoff Gordon, and Sebastian Thrun. Point-based value iteration: An anytime algorithm for POMDPs. In Proc. of IJCAI-03, pages 1025-1030, Acapulco, 2003.
-
(2003)
Proc. of IJCAI-03
, pp. 1025-1030
-
-
Pineau, J.1
Gordon, G.2
Thrun, S.3
-
16
-
-
0015658957
-
The optimal control of partially observable Markov processes over a finite horizon
-
Richard D. Smallwood and Edward J. Sondik. The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21:1071-1088, 1973.
-
(1973)
Operations Research
, vol.21
, pp. 1071-1088
-
-
Smallwood, R.D.1
Sondik, E.J.2
|