SCOPUS 정보 검색 플랫폼

Volumn 53, Issue 1, 2005, Pages 126-139

An adaptive sampling algorithm for solving Markov decision processes

Author keywords

Dynamic programming optimal control: Markov finite state

Indexed keywords

ADAPTIVE SAMPLING; DYNAMIC PROGRAMMING/OPTIMAL CONTROL; MARKOV DECISION PROCESSES; MARKOV FINITE STATE;

ADAPTIVE ALGORITHMS; ALGORITHMS; DYNAMIC PROGRAMMING; INVENTORY CONTROL; OPTIMAL CONTROL SYSTEMS; PROBLEM SOLVING; RANDOM PROCESSES;

MARKOV PROCESSES;

EID: 14644444172 PISSN: 0030364X EISSN: None Source Type: Journal
DOI: 10.1287/opre.1040.0145 Document Type: Article

Times cited : (124)

References (17)

1
- 0000616723
- Sample mean based index policies with O(log n) regret for the multiarmed bandit problem
- Agrawal, R. 1995. Sample mean based index policies with O(log n) regret for the multiarmed bandit problem. Advances Appl. Probab. 27 1054-1078.
- (1995) Advances Appl. Probab. , vol.27 , pp. 1054-1078
- Agrawal, R.¹

2
- 0024886640
- Asymptotically efficient adaptive allocation schemes for controlled Markov chains: Finite parameter space
- Agrawal, R., D. Teneketzis, V. Anantharam. 1989. Asymptotically efficient adaptive allocation schemes for controlled Markov chains: Finite parameter space. IEEE Trans. Automat. Control 34 1249-1259.
- (1989) IEEE Trans. Automat. Control , vol.34 , pp. 1249-1259
- Agrawal, R.¹ Teneketzis, D.² Anantharam, V.³

3
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- Auer, P., N. Cesa-Bianchi, P. Fisher. 2002. Finite-time analysis of the multiarmed bandit problem. Machine Learning 47 235-256.
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fisher, P.³

4
- 0003565783
- Athena Scientific, Belmont, MA
- Bertsekas, D. P. 1995. Dynamic Programming and Optimal Control, Vols. 1 and 2. Athena Scientific, Belmont, MA.
- (1995) Dynamic Programming and Optimal Control , vol.1-2
- Bertsekas, D.P.¹

6
- 0034264701
- A survey of computational complexity results in systems and control
- Blondel, V. D., J. Tsitsiklis. 2000. A survey of computational complexity results in systems and control. Automatica 36 1249-1274.
- (2000) Automatica , vol.36 , pp. 1249-1274
- Blondel, V.D.¹ Tsitsiklis, J.²

7
- 0031590025
- Pricing American-style securities using simulation
- Broadie, M., P. Glasserman. 1997. Pricing American-style securities using simulation. J. Econom. Dynamics Control 21 1323-1352.
- (1997) J. Econom. Dynamics Control , vol.21 , pp. 1323-1352
- Broadie, M.¹ Glasserman, P.²

9
- 0004116989
- MIT Press, Cambridge, MA
- Cormen, T. H., C. E. Leiserson, R. L. Rivest. 1990. Introduction to Algorithms. MIT Press, Cambridge, MA.
- (1990) Introduction to Algorithms
- Cormen, T.H.¹ Leiserson, C.E.² Rivest, R.L.³

10
- 0031145551
- Asymptotically efficient adaptive choice of control laws in controlled Markov chains
- Graves, T. L., T. L. Lai. 1997. Asymptotically efficient adaptive choice of control laws in controlled Markov chains. SIAM J. Control Optim. 35 715-743.
- (1997) SIAM J. Control Optim. , vol.35 , pp. 715-743
- Graves, T.L.¹ Lai, T.L.²

11
- 0004253974
- Oxford University Press, New York
- Grimmett, G., D. Stirzaker. 2001. Probability and Random Processes, 3rd ed. Oxford University Press, New York.
- (2001) Probability and Random Processes, 3rd Ed.
- Grimmett, G.¹ Stirzaker, D.²

12
- 0025502594
- Error bounds for rolling horizon policies in discrete-time Markov control processes
- Hernández-Lerma, O., J. B. Lasserre. 1990. Error bounds for rolling horizon policies in discrete-time Markov control processes. IEEE Trans. Automat. Control 35 1118-1124.
- (1990) IEEE Trans. Automat. Control , vol.35 , pp. 1118-1124
- Hernández-Lerma, O.¹ Lasserre, J.B.²

13
- 84947403595
- Probability inequalities for sums of bounded random variables
- Hoeffding, W. 1963. Probability inequalities for sums of bounded random variables. J. Amer. Statist. Assoc. 58 13-30.
- (1963) J. Amer. Statist. Assoc. , vol.58 , pp. 13-30
- Hoeffding, W.¹

14
- 0036832951
- A sparse sampling algorithm for near-optimal planning in large Markov decision processes
- Kearns, M., Y. Mansour, A. Y. Ng. 2001. A sparse sampling algorithm for near-optimal planning in large Markov decision processes. Machine Learning 49 193-208.
- (2001) Machine Learning , vol.49 , pp. 193-208
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

15
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Lai, T., H. Robbins. 1985. Asymptotically efficient adaptive allocation rules. Advances Appl. Math. 6 4-22.
- (1985) Advances Appl. Math. , vol.6 , pp. 4-22
- Lai, T.¹ Robbins, H.²

16
- 0004312378
- McGraw-Hill, New York
- Law, A. M., W. D. Kelton. 2000. Simulation Modeling and Analysis, 3rd ed. McGraw-Hill, New York.
- (2000) Simulation Modeling and Analysis, 3rd Ed.
- Law, A.M.¹ Kelton, W.D.²

17
- 0004284430
- John Wiley and Sons, New York
- Ross, S. 1995. Stochastic Process, 2nd ed. John Wiley and Sons, New York.
- (1995) Stochastic Process, 2nd Ed.
- Ross, S.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.