Volume 33, Issue 1, 2008, Pages 1-11

On near optimality of the set of finite-state controllers for average cost POMDP

Author keywords

Average cost criterion; Finite state and control models; Optimality conditions; Partially observable Markov decision processes

Indexed keywords

CONTROLLERS; COST FUNCTIONS; OPTIMIZATION; STOCHASTIC CONTROL SYSTEMS;

EID: 61349089285     PISSN: 0364-765X     EISSN: 1526-5471     Source Type: Journal
DOI: 10.1287/moor.1070.0279     Document Type: Article
Times cited: 44

References (19)
  • 4. Bierth, K. J. 1987. An expected average reward criterion. Stochastic Process. Appl. 26 123-140.
  • 6. Feinberg, E. A. 1980. An ε-optimal control of a finite Markov chain with an average reward criterion. Theory Probab. Appl. 25 70-81.
  • 7. Feinberg, E. A. 1982. Controlled Markov processes with arbitrary numerical criteria. Theory Probab. Appl. 27 486-503.
  • 8. Feinberg, E. A. 1982. Nonrandomized Markov and semi-Markov strategies in dynamic programming. Theory Probab. Appl. 27 116-126.
  • 9. Fernández-Gaucherand, E., A. Arapostathis, S. I. Marcus. 1991. On the average cost optimality equation and the structure of optimal policies for partially observable Markov decision processes. Ann. Oper. Res. 29 439-470.
  • 10. Hsu, S.-P., D.-M. Chuang, A. Arapostathis. 2006. On the existence of stationary optimal policies for partially observed MDPs under the long-run average cost criterion. Systems Control Lett. 55 165-173.
  • 11. Jaakkola, T. S., S. P. Singh, M. I. Jordan. 1995. Reinforcement learning algorithm for partially observable Markov decision problems. Proc. Neural Inform. Processing Systems Conf., Denver, CO. MIT Press, Cambridge, MA.
  • 12. Lauritzen, S. L. 1996. Graphical Models. Oxford University Press, Oxford, UK.
  • 14. Platzman, L. K. 1980. Optimal infinite-horizon undiscounted control of finite probabilistic systems. SIAM J. Control Optim. 18(4) 362-380.
  • 16. Ross, S. M. 1968. Arbitrary state Markovian decision processes. Ann. Math. Statist. 39(6) 2118-2122.
  • 18. Yu, H. 2005. A function approximation approach to estimation of policy gradient for POMDP with structured policies. Proc. 21st Conf. Uncertainty in Artificial Intelligence, Edinburgh, UK. AUAI Press.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.