Volume 7, 2006, Pages 2329-2367

Point-based value iteration for continuous POMDPs

Author keywords

Continuous action space; Continuous observation space; Continuous state space; Partially observable Markov decision processes; Planning under uncertainty; Point based value iteration

Indexed keywords

COMPUTATIONAL COMPLEXITY; ITERATIVE METHODS; LEARNING ALGORITHMS; MARKOV PROCESSES; OPTIMIZATION;

EID: 33750724397     PISSN: 15337928     EISSN: 15337928     Source Type: Journal    
DOI: None     Document Type: Article
Times cited: 210

References (59)
  • 1. K. J. Åström. Optimal control of Markov decision processes with incomplete state estimation. Journal of Mathematical Analysis and Applications, 10:174-205, 1965.
  • 10. C. Boutilier and D. Poole. Computing optimal policies for partially observable decision processes using compact representations. In Proceedings of the National Conference on Artificial Intelligence, pages 1168-1175, Portland, OR, 1996.
  • 15. T. Darrell and A. P. Pentland. Active gesture recognition using partially observable Markov decision processes. In IEEE International Conference on Pattern Recognition, pages 984-988, Vienna, Austria, 1996.
  • 21. D. Fox. Adapting the sample size in particle filters through KLD-sampling. International Journal of Robotics Research, 22(10-11):985-1004, 2003.
  • 24. M. Isard and A. Blake. Condensation - conditional density propagation for visual tracking. International Journal of Computer Vision, 29(1):5-28, 1998.
  • 25. L. P. Kaelbling, M. L. Littman, and A. R. Cassandra. Planning and acting in partially observable stochastic domains. Artificial Intelligence, 101(1-2):99-134, 1998.
  • 27. O. Madani, S. Hanks, and A. Condon. On the undecidability of probabilistic planning and infinite-horizon partially observable Markov decision problems. In Proceedings of the National Conference on Artificial Intelligence, pages 541-548, Orlando, FL, 1999.
  • 29. G. E. Monahan. A survey of partially observable Markov decision processes: Theory, models, and algorithms. Management Science, 28(1):1-16, 1982.
  • 37. J. M. Porta, M. T. J. Spaan, and N. Vlassis. Robot planning in partially observable continuous domains. In Robotics: Science and Systems I, pages 217-224, MIT, Cambridge, MA, 2005.
  • 56. S. D. Whitehead and L.-J. Lin. Reinforcement learning of non-Markov decision processes. Artificial Intelligence, 73(1-2):271-306, 1995.
  • 58. B. Zhang, Q. Cai, J. Mao, and B. Guo. Planning and acting under uncertainty: A new model for spoken dialogue systems. In Proceedings of Uncertainty in Artificial Intelligence, pages 572-579, Seattle, WA, 2001.
  • 59. N. L. Zhang and W. Zhang. Speeding up the convergence of value iteration in partially observable Markov decision processes. Journal of Artificial Intelligence Research, 14:29-51, 2001.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.