1. Abbeel, P., & Ng, A. (2004). Apprenticeship learning via inverse reinforcement learning. International Conference on Machine Learning (ICML), Banff, Alberta, pp. 1-8.
4. Amato, C., Bernstein, D. S., & Zilberstein, S. (2010). Optimizing fixed-size stochastic controllers for POMDPs and decentralized POMDPs. Autonomous Agents and Multi-Agent Systems, 21(3), 293-320. doi:10.1007/s10458-009-9103-z
5. Bellman, R. (1957). A Markovian decision process. Indiana University Mathematics Journal, 6(4), 679-684. doi:10.1512/iumj.1957.6.06038
6. Bengio, Y., & Frasconi, P. (1996). Input/output HMMs for sequence processing. IEEE Transactions on Neural Networks, 7(5), 1231-1249. doi:10.1109/72.536317
7. Bernstein, D. S., Amato, C., Hansen, E. A., & Zilberstein, S. (2009). Policy iteration for decentralized control of Markov decision processes. Journal of Artificial Intelligence Research, 34, 89-132.
8. Braziunas, D., & Boutilier, C. (2004). Stochastic local search for POMDP controllers. National Conference on Artificial Intelligence (AAAI), San Jose, California, pp. 690-696.
9. Choi, J., & Kim, K.-E. (2011). Inverse reinforcement learning in partially observable domains. Journal of Machine Learning Research, 12, 691-730.
10. Delgado, K. V., Sanner, S., & Nunes de Barros, L. (2011). Efficient solutions to factored MDPs with imprecise transition probabilities. Artificial Intelligence, 175(9-10), 1498-1527. doi:10.1016/j.artint.2011.01.001
11. Dempster, A., Laird, N., & Rubin, D. (1977). Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society, Series B (Methodological), 39(1), 1-38.
13. Elizalde, F., Sucar, L. E., Luque, M., Diez, F. J., & Reyes, A. (2008). Policy explanation in factored Markov decision processes. European Workshop on Probabilistic Graphical Models, pp. 97-104.
14. Fard, M. M., & Pineau, J. (2011). Non-deterministic policies in Markovian decision processes. Journal of Artificial Intelligence Research, 40, 1-24.
15. Feng, Z., Dearden, R., Meuleau, N., & Washington, R. (2004). Dynamic programming for structured continuous Markov decision problems. International Conference on Uncertainty in Artificial Intelligence (UAI), pp. 154-161.
17. Guestrin, C., Koller, D., Parr, R., & Venkataraman, S. (2003). Efficient solution algorithms for factored MDPs. Journal of Artificial Intelligence Research, 19, 399-468.
19. Hansen, E. A. (1997). An improved policy iteration algorithm for partially observable MDPs. Advances in Neural Information Processing Systems (NIPS), Denver, Colorado, pp. 1015-1021.
23. Hoey, J., St-Aubin, R., Hu, A., & Boutilier, C. (1999). SPUDD: Stochastic planning using decision diagrams. International Conference on Uncertainty in Artificial Intelligence (UAI), pp. 279-288.
25. Kaelbling, L. P., Littman, M., & Cassandra, A. R. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101, 99-134. doi:10.1016/S0004-3702(98)00023-X
26. Kearns, M., Mansour, Y., & Ng, A. Y. (1999). A sparse sampling algorithm for near-optimal planning in large Markov decision processes. International Joint Conference on Artificial Intelligence (IJCAI), Stockholm, Sweden, pp. 1324-1331.
28. Kim, D., Lee, J., Kim, K.-E., & Poupart, P. (2011). Point-based value iteration for constrained POMDPs. International Joint Conference on Artificial Intelligence (IJCAI), pp. 1968-1974.
30. Kveton, B., Hauskrecht, M., & Guestrin, C. (2006). Solving factored MDPs with hybrid state and action variables. Journal of Artificial Intelligence Research, 27, 153-201.
31. Lusena, C., Goldsmith, J., & Mundhenk, M. (2001). Nonapproximability results for partially observable Markov decision processes. Journal of Artificial Intelligence Research, 14, 83-103.
32. Meuleau, N., Benazera, E., Brafman, R. I., Hansen, E. A., & Mausam (2009). A heuristic search approach to planning with continuous resources in stochastic domains. Journal of Artificial Intelligence Research, 34, 27-59.
33. Meuleau, N., Kim, K.-E., Kaelbling, L. P., & Cassandra, A. R. (1999). Solving POMDPs by searching the space of finite policies. International Conference on Uncertainty in Artificial Intelligence (UAI), Stockholm, Sweden, pp. 417-426.
34. Neu, G., & Szepesvari, C. (2007). Apprenticeship learning using inverse reinforcement learning and gradient methods. International Conference on Uncertainty in Artificial Intelligence (UAI), Vancouver, Canada, pp. 295-302.
35. Ng, A., & Russell, S. (2000). Algorithms for inverse reinforcement learning. International Conference on Machine Learning (ICML), Stanford, California, pp. 663-670.
37. Papadimitriou, C. H., & Tsitsiklis, J. N. (1987). The complexity of Markov decision processes. Mathematics of Operations Research, 12(3), 441-450. doi:10.1287/moor.12.3.441
39. Pineau, J., Gordon, G., & Thrun, S. (2006). Anytime point-based approximations for large POMDPs. Journal of Artificial Intelligence Research, 27, 335-380.
40. Pineau, J., Gordon, G. J., & Thrun, S. (2003). Policy-contingent abstraction for robust robot control. International Conference on Uncertainty in Artificial Intelligence (UAI), pp. 477-484.
41. Piunovskiy, A. B., & Mao, X. (2000). Constrained Markovian decision processes: The dynamic programming approach. Operations Research Letters, 27(3), 119-126. doi:10.1016/S0167-6377(00)00039-0
42. Porta, J. M., Vlassis, N. A., Spaan, M. T. J., & Poupart, P. (2006). Point-based value iteration for continuous POMDPs. Journal of Machine Learning Research, 7, 2329-2367.
44. Poupart, P., Kim, K.-E., & Kim, D. (2011). Closing the gap: Improved bounds on optimal POMDP solutions. International Conference on Automated Planning and Scheduling (ICAPS), Freiburg, Germany.
45. Poupart, P., Lang, T., & Toussaint, M. (2011). Analyzing and escaping local optima in planning as inference for partially observable domains. European Conference on Machine Learning (ECML), Athens, Greece.
48. Regan, K., & Boutilier, C. (2011a). Eliciting additive reward functions for Markov decision processes. International Joint Conference on Artificial Intelligence (IJCAI), Barcelona, Spain, pp. 2159-2164.
49. Regan, K., & Boutilier, C. (2011b). Robust online optimization of reward-uncertain MDPs. International Joint Conference on Artificial Intelligence (IJCAI), Barcelona, Spain, pp. 2165-2171.
50. Ross, S., Pineau, J., Paquet, S., & Chaib-Draa, B. (2008). Online planning algorithms for POMDPs. Journal of Artificial Intelligence Research, 32, 663-704.
51. Sanner, S., Delgado, K. V., & de Barros, L. N. (2011). Symbolic dynamic programming for discrete and continuous state MDPs. International Conference on Uncertainty in Artificial Intelligence (UAI), Barcelona, Spain.
53. Satia, J. K., & Lave, R. E., Jr. (1973). Markovian decision processes with uncertain transition probabilities. Operations Research, 21(3), 728-740. doi:10.1287/opre.21.3.728
56. Shani, G., Poupart, P., Brafman, R. I., & Shimony, S. E. (2008). Efficient ADD operations for point-based algorithms. International Conference on Automated Planning and Scheduling (ICAPS), pp. 330-337.
58. Sim, H. S., Kim, K.-E., Kim, J. H., Chang, D.-S., & Koo, M.-Y. (2008). Symbolic heuristic search value iteration for factored POMDPs. National Conference on Artificial Intelligence (AAAI), pp. 1088-1093.
59. Smallwood, R. D., & Sondik, E. J. (1973). The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21, 1071-1088. doi:10.1287/opre.21.5.1071
62. Sutton, R., Precup, D., & Singh, S. P. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2), 181-211. doi:10.1016/S0004-3702(99)00052-1
64. Toussaint, M., Harmeling, S., & Storkey, A. (2006). Probabilistic inference for solving (PO)MDPs. Technical Report EDI-INF-RR-0934, School of Informatics, University of Edinburgh.
65. White, C. C., III, & El-Deib, H. K. (1994). Markov decision processes with imprecise transition probabilities. Operations Research, 42(4), 739-749. doi:10.1287/opre.42.4.739
66. Zhang, N. L., & Liu, W. (1996). Planning in stochastic domains: Problem characteristics and approximation. Technical Report HKUST-CS96-31, Hong Kong University of Science and Technology.