SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Proceedings of the National Conference on Artificial Intelligence

Volumn , Issue , 2004, Pages 690-696

Stochastic local search for POMDP controllers

(2) Braziunas, Darius a Boutilier, Craig a

a UNIVERSITY OF TORONTO (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

FUNCTION SPACES; PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES (POMDP); PRECISE SEQUENCES; RESTRICTED SPACES;

ALGORITHMS; COMPUTATIONAL METHODS; DECISION MAKING; DYNAMIC PROGRAMMING; HEURISTIC METHODS; MATHEMATICAL MODELS;

MARKOV PROCESSES;

EID: 9444288081 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (37)

References (18)

1
- 1942514241
- Scalable internal-state policy-gradient methods for POMDPs
- Douglas Aberdeen and Jonathan Baxter. Scalable internal-state policy-gradient methods for POMDPs. In Proc. of ML-02, pages 3-10, 2002.
- (2002) Proc. of ML-02 , pp. 3-10
- Aberdeen, D.¹ Baxter, J.²

2
- 0036930295
- A POMDP formulation of preference elicitation problems
- Edmonton
- Craig Boutilier. A POMDP formulation of preference elicitation problems. In Proc. of AAAI-2002, pages 239-246, Edmonton, 2002.
- (2002) Proc. of AAAI-2002 , pp. 239-246
- Boutilier, C.¹

3
- 0003989210
- PhD thesis, Brown University, Providence, RI
- Anthony R. Cassandra. Exact and Approximate Algorithms for Partially Observable Markov Decision Processes. PhD thesis, Brown University, Providence, RI, 1998.
- (1998) Exact and Approximate Algorithms for Partially Observable Markov Decision Processes.
- Cassandra, A.R.¹

4
- 0028564629
- Acting optimally in partially observable stochastic domains
- Seattle
- Anthony R. Cassandra, Leslie Pack Kaelbling, and Michael L. Littman. Acting optimally in partially observable stochastic domains. In Proc. of AAAI-94, pages 1023-1028, Seattle, 1994.
- (1994) Proc. of AAAI-94 , pp. 1023-1028
- Cassandra, A.R.¹ Kaelbling, L.P.² Littman, M.L.³

5
- 0001909869
- Incremental pruning: A simple, fast, exact method for POMDPs
- Providence, RI
- Anthony R. Cassandra, Michael L. Littman, and Nevin L. Zhang. Incremental pruning: A simple, fast, exact method for POMDPs. In Proc. of UAI-97, pages 54-61, Providence, RI, 1997.
- (1997) Proc. of UAI-97 , pp. 54-61
- Cassandra, A.R.¹ Littman, M.L.² Zhang, N.L.³

6
- 0012252088
- Solving large POMDPs by real time dynamic programming
- Hector Geffner and Blai Bonet. Solving large POMDPs by real time dynamic programming. In Working Notes, Fall AAAI Symposium on POMDPs, 1998.
- (1998) Working Notes, Fall AAAI Symposium on POMDPs
- Geffner, H.¹ Bonet, B.²

7
- 0000411214
- Tabu search - Part I
- Fred Glover. Tabu search - part I. ORSA Journal on Computing, 1(3):190-206, 1989.
- (1989) ORSA Journal on Computing , vol.1 , Issue.3 , pp. 190-206
- Glover, F.¹

8
- 0003125478
- Solving POMDPs by searching in policy space
- Madison, WI
- Eric A. Hansen. Solving POMDPs by searching in policy space. In Proc. of UAI-98, pages 211-219, Madison, WI, 1998.
- (1998) Proc. of UAI-98 , pp. 211-219
- Hansen, E.A.¹

9
- 0004097542
- PhD thesis, TU Darmstadt, Darmstadt, Germany
- Holger H. Hoos. Stochastic Local Search - Methods, Models, Applications. PhD thesis, TU Darmstadt, Darmstadt, Germany, 1998.
- (1998) Stochastic Local Search - Methods, Models, Applications.
- Hoos, H.H.¹

10
- 0003272035
- Memoryless policies: Theoretical limitations and practical results. Dave Cliff, Philip Husbands, Jean-Arcady Meyer, and Stewart W. Wilson, editors
- Cambridge, MA, The MIT Press
- Michael L. Littman. Memoryless policies: Theoretical limitations and practical results. In Dave Cliff, Philip Husbands, Jean-Arcady Meyer, and Stewart W. Wilson, editors, Proceedings of the Third International Conference on Simulation of Adaptive Behavior, Cambridge, MA, 1994. The MIT Press.
- (1994) Proceedings of the Third International Conference on Simulation of Adaptive Behavior
- Littman, M.L.¹

11
- 85138579181
- Learning policies for partially observable environments: Scaling up
- Lake Tahoe
- Michael L. Littman, Anthony R. Cassandra, and Leslie Pack Kaelbling. Learning policies for partially observable environments: Scaling up. In Proc. of ML-95, pages 362-370, Lake Tahoe, 1995.
- (1995) Proc. of ML-95 , pp. 362-370
- Littman, M.L.¹ Cassandra, A.R.² Kaelbling, L.P.³

12
- 0002500946
- Solving POMDPs by searching the space of finite policies
- Stockholm
- Nicolas Meuleau, Kee-Eung Kim, Leslie Pack Kaelbling, and Anthony R. Cassandra. Solving POMDPs by searching the space of finite policies. In Proc. of UAI-99, pages 417-426, Stockholm, 1999.
- (1999) Proc. of UAI-99 , pp. 417-426
- Meuleau, N.¹ Kim, K.-E.² Kaelbling, L.P.³ Cassandra, A.R.⁴

13
- 0002103968
- Learning finite-state controllers for partially observable environments
- Stockholm
- Nicolas Meuleau, Leonid Peshkin, Kee-Eung Kim, and Leslie Pack Kaelbling. Learning finite-state controllers for partially observable environments. In Proc. of UAI-99, pages 427-436, Stockholm, 1999.
- (1999) Proc. of UAI-99 , pp. 427-436
- Meuleau, N.¹ Peshkin, L.² Kim, K.-E.³ Kaelbling, L.P.⁴

14
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- Acapulco
- Joelle Pineau, Geoff Gordon, and Sebastian Thrun. Point-based value iteration: An anytime algorithm for POMDPs. In Proc. of IJCAI-03, pages 1025-1030, Acapulco, 2003.
- (2003) Proc. of IJCAI-03 , pp. 1025-1030
- Pineau, J.¹ Gordon, G.² Thrun, S.³

15
- 84898959164
- Bounded finite state controllers
- Vancouver
- Pascal Poupart and Craig Boutilier. Bounded finite state controllers. In Advances in Neural Information Processing Systems 16 (NIPS-2003), Vancouver, 2003.
- (2003) Advances in Neural Information Processing Systems 16 (NIPS-2003)
- Poupart, P.¹ Boutilier, C.²

16
- 0015658957
- The optimal control of partially observable Markov processes over a finite horizon
- Richard D. Smallwood and Edward J. Sondik. The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21:1071-1088, 1973.
- (1973) Operations Research , vol.21 , pp. 1071-1088
- Smallwood, R.D.¹ Sondik, E.J.²

17
- 3042527666
- A point-based POMDP algorithm for robot planning
- New Orleans, to appear
- Matthijs T. J. Spaan and Nikos Vlassis. A point-based POMDP algorithm for robot planning. In IEEE International Conference on Robotics and Automation, New Orleans, 2004. to appear.
- (2004) IEEE International Conference on Robotics and Automation
- Spaan, M.T.J.¹ Vlassis, N.²

18
- 0031215211
- HQ-learning
- Marco Wiering and Juergen Schmidhuber. HQ-learning. Adaptive Behavior, 6(2):219-246, 1997.
- (1997) Adaptive Behavior , vol.6 , Issue.2 , pp. 219-246
- Wiering, M.¹ Schmidhuber, J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.