SCOPUS 정보 검색 플랫폼 - 논문 보기

메뉴 건너뛰기

Advances in Neural Information Processing Systems

Volumn , Issue , 2004, Pages

Bounded finite state controllers

(2) Poupart, Pascal a Boutilier, Craig a

a UNIVERSITY OF TORONTO (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

APPROXIMATION ALGORITHMS; ITERATIVE METHODS;

FINITE-STATE CONTROLLERS; GRADIENT ASCENT; LOCAL OPTIMA; POLICY ITERATION;

CONTROLLERS;

EID: 84898959164 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (71)

References (14)

1
- 1942514241
- Scaling internal-state policy-gradient methods for POMDPs
- Sydney, Australia
- D. Aberdeen and J. Baxter. Scaling internal-state policy-gradient methods for POMDPs. Proc. ICML-02, pp.3-10, Sydney, Australia, 2002.
- (2002) Proc. ICML-02 , pp. 3-10
- Aberdeen, D.¹ Baxter, J.²

2
- 0030349220
- Computing optimal policies for partially observable decision processes using compact representations
- Portland, OR
- C. Boutilier and D. Poole. Computing optimal policies for partially observable decision processes using compact representations. Proc. AAAI-96, pp.1168-1175, Portland, OR, 1996.
- (1996) Proc. AAAI-96 , pp. 1168-1175
- Boutilier, C.¹ Poole, D.²

3
- 84898995397
- Master's thesis, University of Toronto, Toronto
- D. Braziunas. Stochastic local search for POMDP controllers. Master's thesis, University of Toronto, Toronto, 2003.
- (2003) Stochastic Local Search for POMDP Controllers
- Braziunas, D.¹

4
- 0001909869
- Incremental pruning: A simple, fast, exact method for POMDPs
- Providence, RI
- A. R. Cassandra, M. L. Littman, and N. L. Zhang. Incremental pruning: A simple, fast, exact method for POMDPs. Proc.UAI-97, pp.54-61, Providence, RI, 1997.
- (1997) Proc.UAI-97 , pp. 54-61
- Cassandra, A.R.¹ Littman, M.L.² Zhang, N.L.³

5
- 0003818801
- PhD thesis, University of British Columbia, Vancouver
- H.-T. Cheng. Algorithms for Partially Observable Markov Decision Processes. PhD thesis, University of British Columbia, Vancouver, 1988.
- (1988) Algorithms for Partially Observable Markov Decision Processes
- Cheng, H.-T.¹

6
- 58349094926
- Approximate planning for factored POMDPs
- Toledo, Spain
- Z. Feng and E. A. Hansen. Approximate planning for factored POMDPs. Proc. ECP-01, Toledo, Spain, 2001.
- (2001) Proc. ECP-01
- Feng, Z.¹ Hansen, E.A.²

7
- 0003125478
- Solving POMDPs by searching in policy space
- Madison, Wisconsin
- E. A. Hansen. Solving POMDPs by searching in policy space. Proc. UAI-98, pp.211-219, Madison, Wisconsin, 1998.
- (1998) Proc. UAI-98 , pp. 211-219
- Hansen, E.A.¹

8
- 0001770240
- Value-function approximations for partially observable Markov decision processes
- M. Hauskrecht. Value-function approximations for partially observable Markov decision processes. Journal of Artificial Intelligence Research, 13:33-94, 2000.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 33-94
- Hauskrecht, M.¹

9
- 0032073263
- Planning and acting in partially observable stochastic domains
- L. P. Kaelbling, M. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101:99-134, 1998.
- (1998) Artificial Intelligence , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.² Cassandra, A.R.³

10
- 0002500946
- Solving POMDPs by searching the space of finite policies
- Stockholm
- N. Meuleau, K.-E. Kim, L. P. Kaelbling, and A. R. Cassandra. Solving POMDPs by searching the space of finite policies. Proc. UAI-99, pp.417-426, Stockholm, 1999.
- (1999) Proc. UAI-99 , pp. 417-426
- Meuleau, N.¹ Kim, K.-E.² Kaelbling, L.P.³ Cassandra, A.R.⁴

11
- 0002103968
- Learning finite-state controllers for partially observable environments
- Stockholm
- N. Meuleau, L. Peshkin, K.-E. Kim, and L. P. Kaelbling. Learning finite-state controllers for partially observable environments. Proc. UAI-99, pp.427-436, Stockholm, 1999.
- (1999) Proc. UAI-99 , pp. 427-436
- Meuleau, N.¹ Peshkin, L.² Kim, K.-E.³ Kaelbling, L.P.⁴

12
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- Acapulco, Mexico
- J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: An anytime algorithm for POMDPs. In Proc. IJCAI-03, Acapulco, Mexico, 2003.
- (2003) Proc. IJCAI-03
- Pineau, J.¹ Gordon, G.² Thrun, S.³

13
- 33748561594
- Value-directed compressions of POMDPs
- Vancouver, Canada
- P. Poupart and C. Boutilier. Value-directed compressions of POMDPs. Proc. NIPS-02, pp.1547- 1554, Vancouver, Canada, 2002.
- (2002) Proc. NIPS-02 , pp. 1547-1554
- Poupart, P.¹ Boutilier, C.²

14
- 0036374229
- Speeding up the convergence of value-iteration in partially observable Markov decision processes
- N. L. Zhang and W. Zhang. Speeding up the convergence of value-iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research, 14:29-51, 2001.
- (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 29-51
- Zhang, N.L.¹ Zhang, W.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.