Volume 14, 2001, Pages 29-51

Speeding up the convergence of value iteration in partially observable Markov decision processes

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; CONVERGENCE OF NUMERICAL METHODS; ITERATIVE METHODS; MARKOV PROCESSES; MATHEMATICAL MODELS;

EID: 0036374229     PISSN: 10769757     EISSN: None     Source Type: Journal    
DOI: 10.1613/jair.761     Document Type: Article
Times cited: 105

References (30)
  • 1. Aström, K. J. (1965). Optimal control of Markov processes with incomplete state information. Journal of Mathematical Analysis and Applications, 10, 174-205.
  • 7. Eagle, J. N. (1984). The optimal search for a moving target when the search path is constrained. Operations Research, 32(5), 1107-1115.
  • 12. Hauskrecht, M. (2000). Value function approximations for partially observable Markov decision processes. Journal of Artificial Intelligence Research, 13, 33-95.
  • 13. Littman, M. L., Cassandra, A. R., and Kaelbling, L. P. (1995a). Efficient dynamic-programming updates in partially observable Markov decision processes. Technical Report CS-95-19, Brown University.
  • 17. Lovejoy, W. S. (1991). Computationally feasible bounds for partially observed Markov decision processes. Operations Research, 39(1), 162-175.
  • 18. Lovejoy, W. S. (1993). Suboptimal policies with bounds for parameter adaptive decision processes. Operations Research, 41, 583-599.
  • 19. Monahan, G. E. (1982). A survey of partially observable Markov decision processes: Theory, models, and algorithms. Management Science, 28(1), 1-16.
  • 22. Platzman, L. K. (1980). Optimal infinite-horizon undiscounted control of finite probabilistic systems. SIAM Journal on Control and Optimization, 18, 362-380.
  • 23. Puterman, M. L. (1990). Markov decision processes. In D. P. Heyman and M. J. Sobel (Eds.), Handbooks in OR & MS, Vol. 2, pp. 331-434. Elsevier Science Publishers.
  • 24. Smallwood, R. D., and Sondik, E. J. (1973). The optimal control of partially observable Markov processes over a finite horizon. Operations Research, 21, 1071-1088.
  • 26. Sondik, E. J. (1978). The optimal control of partially observable Markov processes over the infinite horizon. Operations Research, 26, 282-304.
  • 27. White, C. C., III, and Scherer, W. T. (1989). Solution procedures for partially observed Markov decision processes. Operations Research, 37(5), 791-797.
  • 29. Zhang, N. L., and Liu, W. (1997). A model approximation scheme for planning in stochastic domains. Journal of Artificial Intelligence Research, 7, 199-230.
  • 30. Zubek, V. B., and Dietterich, T. G. (2000). A POMDP approximation algorithm that anticipates the need to observe. To appear in Proceedings of the Pacific Rim Conference on Artificial Intelligence (PRICAI-2000), Lecture Notes in Computer Science. New York: Springer-Verlag.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.