SCOPUS Information Search Platform - View Paper

IJCAI International Joint Conference on Artificial Intelligence

2009, Pages 1641-1646

Solving POMDPs: RTDP-Bel vs. point-based algorithms

Bonet, Blai (a); Geffner, Héctor (b)

(a) Universidad Simón Bolívar (Venezuela)

(b) Universitat Pompeu Fabra (Spain)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE;

APPROXIMATE METHODS; DISCRETIZATIONS; POINT-BASED ALGORITHM; REPRESENTATIONAL GAP; TABULAR REPRESENTATIONS; VALUE FUNCTIONS; VALUE ITERATION;

ITERATIVE METHODS;

EID: 78751689703
PISSN: 10450823
EISSN: None
Source Type: Conference Proceeding
DOI: None
Document Type: Conference Paper

Times cited: 102

References (19)

1
- 50549213583
- Optimal control of Markov Decision Processes with incomplete state estimation
- K. Astrom. Optimal control of Markov Decision Processes with incomplete state estimation. J. Math. Anal. Appl., 10:174-205, 1965.
- (1965) J. Math. Anal. Appl. , vol.10 , pp. 174-205
- Astrom, K.¹

2
- 0029210635
- Learning to act using real-time dynamic programming
- A. Barto, S. Bradtke, and S. Singh. Learning to act using real-time dynamic programming. Art. Int., 72:81-138, 1995.
- (1995) Art. Int. , vol.72 , pp. 81-138
- Barto, A.¹ Bradtke, S.² Singh, S.³

3
- 0003487482
- Athena Scientific
- D. Bertsekas and J. Tsitsiklis. Neuro-Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

4
- 0003565783
- (2 Vols). Athena Scientific
- D. Bertsekas. Dynamic Programming and Optimal Control, (2 Vols). Athena Scientific, 1995.
- (1995) Dynamic Programming and Optimal Control
- Bertsekas, D.¹

5
- 0002098456
- Learning sorting and decision trees with POMDPs
- B. Bonet and H. Geffner. Learning sorting and decision trees with POMDPs. In Proc. ICML, pages 73-81, 1998.
- (1998) Proc. ICML , pp. 73-81
- Bonet, B.¹ Geffner, H.²

6
- 0012252088
- Solving large POMDPs using real time dynamic programming
- AAAI Press
- B. Bonet and H. Geffner. Solving large POMDPs using real time dynamic programming. In Proc. AAAI Fall Symp. on POMDPs. AAAI Press, 1998.
- (1998) Proc. AAAI Fall Symp. on POMDPs
- Bonet, B.¹ Geffner, H.²

7
- 85166261608
- Planning with incomplete information as heuristic search in belief space
- B. Bonet and H. Geffner. Planning with incomplete information as heuristic search in belief space. In Proc. ICAPS, pages 52-61, 2000.
- (2000) Proc. ICAPS , pp. 52-61
- Bonet, B.¹ Geffner, H.²

8
- 31144460375
- An ε-optimal grid-based algorithm for Partially Observable Markov Decision Processes
- B. Bonet. An ε-optimal grid-based algorithm for Partially Observable Markov Decision Processes. In Proc. ICML, pages 51-58, 2002.
- (2002) Proc. ICML , pp. 51-58
- Bonet, B.¹

9
- 0001770240
- Value-Function Approximations for Partially Observable Markov Decision Processes
- M. Hauskrecht. Value-function approximations for partially observable Markov decision processes. JAIR, 13:33-94, 2000.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 33-94
- Hauskrecht, M.¹

10
- 0032073263
- Planning and acting in partially observable stochastic domains
- L. P. Kaelbling, M. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Art. Int., 101:99-134, 1999.
- (1999) Art. Int. , vol.101 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.² Cassandra, A.R.³

11
- 0025400088
- Real-time heuristic search
- R. Korf. Real-time heuristic search. Art. Int., 42(2-3):189-211, 1990.
- (1990) Art. Int. , vol.42 , Issue.2-3 , pp. 189-211
- Korf, R.¹

12
- 0019909899
- A survey of partially observable Markov decision processes: Theory, models and algorithms
- G. Monahan. A survey of partially observable Markov decision processes: Theory, models and algorithms. Management Science, 28(1):1-16, 1983.
- (1983) Management Science , vol.28 , Issue.1 , pp. 1-16
- Monahan, G.¹

13
- 52249090123
- Anytime point-based approximations for large POMDPs
- J. Pineau, G. J. Gordon, and S. Thrun. Anytime point-based approximations for large POMDPs. JAIR, 27:335-380, 2006.
- (2006) JAIR , vol.27 , pp. 335-380
- Pineau, J.¹ Gordon, G.J.² Thrun, S.³

14
- 84880906197
- Forward search value iteration for POMDPs
- G. Shani, R. I. Brafman, and S. E. Shimony. Forward search value iteration for POMDPs. In Proc. IJCAI, pages 2619-2624, 2007.
- (2007) Proc. IJCAI , pp. 2619-2624
- Shani, G.¹ Brafman, R.I.² Shimony, S.E.³

15
- 80053262864
- Point-based POMDP algorithms: Improved analysis and implementation
- T. Smith and R. Simmons. Point-based POMDP algorithms: Improved analysis and implementation. In Proc. UAI, pages 542-547, 2005.
- (2005) Proc. UAI , pp. 542-547
- Smith, T.¹ Simmons, R.²

16
- 0003871607
- PhD thesis, Stanford University
- E. Sondik. The Optimal Control of Partially Observable Markov Decision Processes. PhD thesis, Stanford University, 1971.
- (1971) The Optimal Control of Partially Observable Markov Decision Processes
- Sondik, E.¹

17
- 0017943242
- The optimal control of partially observable Markov decision processes over the infinite horizon: Discounted costs
- E. Sondik. The optimal control of partially observable Markov decision processes over the infinite horizon: discounted costs. Oper. Res., 26(2), 1978.
- (1978) Oper. Res. , vol.26 , Issue.2
- Sondik, E.¹

18
- 31144472319
- Perseus: Randomized point-based value iteration for POMDPs
- M. T. J. Spaan and N. A. Vlassis. Perseus: Randomized point-based value iteration for POMDPs. JAIR, 24:195-220, 2005.
- (2005) JAIR , vol.24 , pp. 195-220
- Spaan, M.T.J.¹ Vlassis, N.A.²

19
- 0004102479
- MIT Press
- R. Sutton and A. Barto. Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.