메뉴 건너뛰기




Volumn , Issue , 2004, Pages 690-696

Stochastic local search for POMDP controllers

Author keywords

[No Author keywords available]

Indexed keywords

FUNCTION SPACES; PARTIALLY OBSERVABLE MARKOV DECISION PROCESSES (POMDP); PRECISE SEQUENCES; RESTRICTED SPACES;

EID: 9444288081     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (37)

References (18)
  • 1
    • 1942514241 scopus 로고    scopus 로고
    • Scalable internal-state policy-gradient methods for POMDPs
    • Douglas Aberdeen and Jonathan Baxter. Scalable internal-state policy-gradient methods for POMDPs. In Proc. of ML-02, pages 3-10, 2002.
    • (2002) Proc. of ML-02 , pp. 3-10
    • Aberdeen, D.1    Baxter, J.2
  • 2
    • 0036930295 scopus 로고    scopus 로고
    • A POMDP formulation of preference elicitation problems
    • Edmonton
    • Craig Boutilier. A POMDP formulation of preference elicitation problems. In Proc. of AAAI-2002, pages 239-246, Edmonton, 2002.
    • (2002) Proc. of AAAI-2002 , pp. 239-246
    • Boutilier, C.1
  • 4
    • 0028564629 scopus 로고
    • Acting optimally in partially observable stochastic domains
    • Seattle
    • Anthony R. Cassandra, Leslie Pack Kaelbling, and Michael L. Littman. Acting optimally in partially observable stochastic domains. In Proc. of AAAI-94, pages 1023-1028, Seattle, 1994.
    • (1994) Proc. of AAAI-94 , pp. 1023-1028
    • Cassandra, A.R.1    Kaelbling, L.P.2    Littman, M.L.3
  • 5
    • 0001909869 scopus 로고    scopus 로고
    • Incremental pruning: A simple, fast, exact method for POMDPs
    • Providence, RI
    • Anthony R. Cassandra, Michael L. Littman, and Nevin L. Zhang. Incremental pruning: A simple, fast, exact method for POMDPs. In Proc. of UAI-97, pages 54-61, Providence, RI, 1997.
    • (1997) Proc. of UAI-97 , pp. 54-61
    • Cassandra, A.R.1    Littman, M.L.2    Zhang, N.L.3
  • 7
    • 0000411214 scopus 로고
    • Tabu search - Part I
    • Fred Glover. Tabu search - part I. ORSA Journal on Computing, 1(3):190-206, 1989.
    • (1989) ORSA Journal on Computing , vol.1 , Issue.3 , pp. 190-206
    • Glover, F.1
  • 8
    • 0003125478 scopus 로고    scopus 로고
    • Solving POMDPs by searching in policy space
    • Madison, WI
    • Eric A. Hansen. Solving POMDPs by searching in policy space. In Proc. of UAI-98, pages 211-219, Madison, WI, 1998.
    • (1998) Proc. of UAI-98 , pp. 211-219
    • Hansen, E.A.1
  • 10
    • 0003272035 scopus 로고
    • Memoryless policies: Theoretical limitations and practical results. Dave Cliff, Philip Husbands, Jean-Arcady Meyer, and Stewart W. Wilson, editors
    • Cambridge, MA, The MIT Press
    • Michael L. Littman. Memoryless policies: Theoretical limitations and practical results. In Dave Cliff, Philip Husbands, Jean-Arcady Meyer, and Stewart W. Wilson, editors, Proceedings of the Third International Conference on Simulation of Adaptive Behavior, Cambridge, MA, 1994. The MIT Press.
    • (1994) Proceedings of the Third International Conference on Simulation of Adaptive Behavior
    • Littman, M.L.1
  • 11
    • 85138579181 scopus 로고
    • Learning policies for partially observable environments: Scaling up
    • Lake Tahoe
    • Michael L. Littman, Anthony R. Cassandra, and Leslie Pack Kaelbling. Learning policies for partially observable environments: Scaling up. In Proc. of ML-95, pages 362-370, Lake Tahoe, 1995.
    • (1995) Proc. of ML-95 , pp. 362-370
    • Littman, M.L.1    Cassandra, A.R.2    Kaelbling, L.P.3
  • 12
    • 0002500946 scopus 로고    scopus 로고
    • Solving POMDPs by searching the space of finite policies
    • Stockholm
    • Nicolas Meuleau, Kee-Eung Kim, Leslie Pack Kaelbling, and Anthony R. Cassandra. Solving POMDPs by searching the space of finite policies. In Proc. of UAI-99, pages 417-426, Stockholm, 1999.
    • (1999) Proc. of UAI-99 , pp. 417-426
    • Meuleau, N.1    Kim, K.-E.2    Kaelbling, L.P.3    Cassandra, A.R.4
  • 13
    • 0002103968 scopus 로고    scopus 로고
    • Learning finite-state controllers for partially observable environments
    • Stockholm
    • Nicolas Meuleau, Leonid Peshkin, Kee-Eung Kim, and Leslie Pack Kaelbling. Learning finite-state controllers for partially observable environments. In Proc. of UAI-99, pages 427-436, Stockholm, 1999.
    • (1999) Proc. of UAI-99 , pp. 427-436
    • Meuleau, N.1    Peshkin, L.2    Kim, K.-E.3    Kaelbling, L.P.4
  • 14
    • 84880772945 scopus 로고    scopus 로고
    • Point-based value iteration: An anytime algorithm for POMDPs
    • Acapulco
    • Joelle Pineau, Geoff Gordon, and Sebastian Thrun. Point-based value iteration: An anytime algorithm for POMDPs. In Proc. of IJCAI-03, pages 1025-1030, Acapulco, 2003.
    • (2003) Proc. of IJCAI-03 , pp. 1025-1030
    • Pineau, J.1    Gordon, G.2    Thrun, S.3
  • 16
    • 0015658957 scopus 로고
    • The optimal control of partially observable Markov processes over a finite horizon
    • Richard D. Smallwood and Edward J. Sondik. The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21:1071-1088, 1973.
    • (1973) Operations Research , vol.21 , pp. 1071-1088
    • Smallwood, R.D.1    Sondik, E.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.