Volume 27, Issue 1, 2013, Pages 1-51

A survey of point-based POMDP solvers

Author keywords

Decision theoretic planning; Partially observable Markov decision processes; Reinforcement learning

Indexed keywords

ITERATIVE METHODS; REINFORCEMENT LEARNING; SURVEYS

EID: 84873747053     PISSN: 1387-2532     EISSN: 1573-7454     Source Type: Journal
DOI: 10.1007/s10458-012-9200-2     Document Type: Article
Times cited: 427

References (70; selected entries shown)
  • 4. Barto, A. G., Bradtke, S. J., & Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138. doi:10.1016/0004-3702(94)00011-O
  • 5. Bellman, R. (1957). Dynamic programming. Princeton: Princeton University Press.
  • 11. Cassandra, A., Littman, M. L., & Zhang, N. L. (1997). Incremental Pruning: A simple, fast, exact method for partially observable Markov decision processes. In Conference on Uncertainty in Artificial Intelligence (UAI) (pp. 54-61). http://www.cs.duke.edu/~mlittman/docs/uai97-pomdp.ps
  • 18. Hauskrecht, M. (1997). Incremental methods for computing bounds in partially observable Markov decision processes. In National Conference on Artificial Intelligence (pp. 734-739).
  • 19. Hauskrecht, M. (2000). Value-function approximations for partially observable Markov decision processes. Journal of Artificial Intelligence Research (JAIR), 13, 33-94. http://www.cs.washington.edu/research/jair/abstracts/hauskrecht00a.html
  • 20. Hauskrecht, M., & Fraser, H. S. F. (2000). Planning treatment of ischemic heart disease with partially observable Markov decision processes. Artificial Intelligence in Medicine, 18(3), 221-244.
  • 21. Hoey, J., Poupart, P., von Bertoldi, A., Craig, T., Boutilier, C., & Mihailidis, A. (2010). Automated handwashing assistance for persons with dementia using video and a partially observable Markov decision process. Computer Vision and Image Understanding, 114(5), 503-519.
  • 28. Kaelbling, L., Littman, M., & Cassandra, A. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2), 99-134.
  • 30. Kurniawati, H., Hsu, D., & Lee, W. (2008). SARSOP: Efficient point-based POMDP planning by approximating optimally reachable belief spaces. In Robotics: Science and Systems (RSS).
  • 31. Littman, M. L. (1996). Algorithms for sequential decision making. PhD thesis, Department of Computer Science, Brown University, Providence, RI. Also Technical Report CS-96-09. ftp://ftp.cs.brown.edu/pub/techreports/96/cs96-09.ps.Z
  • 35. Lovejoy, W. S. (1991). Computationally feasible bounds for partially observed Markov decision processes. Operations Research, 39(1), 162-175.
  • 56. Singh, S. P., & Sutton, R. S. (1996). Reinforcement learning with replacing eligibility traces. Machine Learning, 22, 123-158.
  • 60. Sondik, E. J. (1978). The optimal control of partially observable Markov processes over the infinite horizon: Discounted costs. Operations Research, 26, 282-304.
  • 67. Williams, J. D., & Young, S. (2007). Partially observable Markov decision processes for spoken dialog systems. Computer Speech & Language, 21(2), 393-422.
  • 69. Zhang, N. L., & Zhang, S. (2001). Speeding up the convergence of value iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research (JAIR), 14, 29-51.
  • 70. Zilberstein, S. (1996). Using anytime algorithms in intelligent systems. AI Magazine, 17, 73-83.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.