SCOPUS Information Search Platform

IEEE SSCI 2011: Symposium Series on Computational Intelligence - ADPRL 2011: 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning

Volume , Issue , 2011, Pages 48-55

Optimistic planning for sparsely stochastic systems

(4) Buşoniu, Lucian (a); Munos, Rémi (b); De Schutter, Bart (a); Babuška, Robert (a)

a DELFT UNIVERSITY OF TECHNOLOGY (Netherlands)

b INRIA (France)

Author keywords

Markov decision processes; model predictive control; online planning; optimistic planning; stochastic systems

Indexed keywords

HIV INFECTION; MARKOV DECISION PROCESSES; NOVEL ALGORITHM; NUMERICAL RESULTS; ON-LINE CONTROLS; ON-LINE PLANNING; OPTIMISTIC PLANNING; RANDOM STATE; SELECTION METHODS;

ALGORITHMS; ARTIFICIAL INTELLIGENCE; DYNAMIC PROGRAMMING; MARKOV PROCESSES; MODEL PREDICTIVE CONTROL; ONLINE SYSTEMS; REINFORCEMENT LEARNING; STOCHASTIC CONTROL SYSTEMS; TREES (MATHEMATICS);

STOCHASTIC SYSTEMS;

EID: 80052220117 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ADPRL.2011.5967375 Document Type: Conference Paper

Times cited : (15)

References (24)

1
- 34548331001
- Cambridge University Press
- S. M. LaValle, Planning Algorithms. Cambridge University Press, 2006.
- (2006) Planning Algorithms
- LaValle, S.M.¹

2
- 0036832951
- A sparse sampling algorithm for near-optimal planning in large Markov decision processes
- DOI 10.1023/A:1017932429737
- M. J. Kearns, Y. Mansour, and A. Y. Ng, "A sparse sampling algorithm for near-optimal planning in large Markov decision processes," Machine Learning, vol. 49, no. 2-3, pp. 193-208, 2002. (Pubitemid 34325686)
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 193-208
- Kearns, M.¹ Mansour, Y.² Ng, A.Y.³

3
- 80052243319
- Online resolution techniques
- O. Sigaud and O. Buffet, Eds. Wiley, ch. 6
- L. Péret and F. Garcia, "Online resolution techniques," in Markov Decision Processes in Artificial Intelligence, O. Sigaud and O. Buffet, Eds. Wiley, 2010, ch. 6, pp. 153-183.
- (2010) Markov Decision Processes in Artificial Intelligence , pp. 153-183
- Péret, L.¹ Garcia, F.²

4
- 58449098161
- Lazy planning under uncertainties by optimizing decisions on an ensemble of incomplete disturbance trees
- S. Girgin, M. Loth, R. Munos, P. Preux, and D. Ryabko, Eds. Springer
- B. Defourny, D. Ernst, and L. Wehenkel, "Lazy planning under uncertainties by optimizing decisions on an ensemble of incomplete disturbance trees," in Recent Advances in Reinforcement Learning, ser. Lecture Notes in Computer Science, S. Girgin, M. Loth, R. Munos, P. Preux, and D. Ryabko, Eds. Springer, 2008, vol. 5323, pp. 1-14.
- (2008) Recent Advances in Reinforcement Learning, Ser. Lecture Notes in Computer Science , vol.5323 , pp. 1-14
- Defourny, B.¹ Ernst, D.² Wehenkel, L.³

5
- 0004268529
- Prentice Hall
- J. M. Maciejowski, Predictive Control with Constraints. Prentice Hall, 2002.
- (2002) Predictive Control with Constraints
- Maciejowski, J.M.¹

6
- 0003517858
- Springer-Verlag
- E. F. Camacho and C. Bordons, Model Predictive Control. Springer-Verlag, 2004.
- (2004) Model Predictive Control
- Camacho, E.F.¹ Bordons, C.²

7
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- DOI 10.1023/A:1013689704352, Computational Learning Theory
- P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite-time analysis of the multiarmed bandit problem," Machine Learning, vol. 47, no. 2-3, pp. 235-256, 2002. (Pubitemid 34126111)
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

8
- 70349275222
- Bandit algorithms for tree search
- Vancouver, Canada 19-22 July
- P.-A. Coquelin and R. Munos, "Bandit algorithms for tree search," in Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence (UAI-07), Vancouver, Canada, 19-22 July 2007, pp. 67-74.
- (2007) Proceedings of the 23rd Conference on Uncertainty in Artificial Intelligence (UAI-07) , pp. 67-74
- Coquelin, P.-A.¹ Munos, R.²

9
- 77952027689
- Online optimization in X-armed bandits
- D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, Eds. MIT Press
- S. Bubeck, R. Munos, G. Stoltz, and C. Szepesvári, "Online optimization in X-armed bandits," in Advances in Neural Information Processing Systems 21, D. Koller, D. Schuurmans, Y. Bengio, and L. Bottou, Eds. MIT Press, 2009, pp. 201-208.
- (2009) Advances in Neural Information Processing Systems , vol.21 , pp. 201-208
- Bubeck, S.¹ Munos, R.² Stoltz, G.³ Szepesvári, C.⁴

10
- 33750698453
- Planning for Markov decision processes with sparse stochasticity
- MIT Press
- M. Likhachev, G. J. Gordon, and S. Thrun, "Planning for Markov decision processes with sparse stochasticity," in Advances in Neural Information Processing Systems 17. MIT Press, 2004.
- (2004) Advances in Neural Information Processing Systems , vol.17
- Likhachev, M.¹ Gordon, G.J.² Thrun, S.³

11
- 34547120053
- Springer
- H. S. Chang, M. C. Fu, J. Hu, and S. I. Marcus, Simulation-Based Algorithms for Markov Decision Processes. Springer, 2007.
- (2007) Simulation-Based Algorithms for Markov Decision Processes
- Chang, H.S.¹ Fu, M.C.² Hu, J.³ Marcus, S.I.⁴

12
- 58449106591
- Optimistic planning of deterministic systems
- Villeneuve d'Ascq, France, 30 June-3 July
- J.-F. Hren and R. Munos, "Optimistic planning of deterministic systems," in Proceedings 8th European Workshop on Reinforcement Learning (EWRL-08), Villeneuve d'Ascq, France, 30 June-3 July 2008, pp. 151-164.
- (2008) Proceedings 8th European Workshop on Reinforcement Learning (EWRL-08) , pp. 151-164
- Hren, J.-F.¹ Munos, R.²

13
- 67650469377
- Planning under uncertainty, ensembles of disturbance trees and kernelized discrete action spaces
- Nashville, US, 30 March-2 April 2009
- B. Defourny, D. Ernst, and L. Wehenkel, "Planning under uncertainty, ensembles of disturbance trees and kernelized discrete action spaces," in Proceedings 2009 IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL-09), Nashville, US, 30 March-2 April 2009, pp. 145-152.
- (2009) Proceedings 2009 IEEE International Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL-09) , pp. 145-152
- Defourny, B.¹ Ernst, D.² Wehenkel, L.³

14
- 84888141227
- Open loop optimistic planning
- Haifa, Israel 27-29 June
- S. Bubeck and R. Munos, "Open loop optimistic planning," in Proceedings 23rd Annual Conference on Learning Theory (COLT-10), Haifa, Israel, 27-29 June 2010, pp. 477-489.
- (2010) Proceedings 23rd Annual Conference on Learning Theory (COLT-10) , pp. 477-489
- Bubeck, S.¹ Munos, R.²

15
- 0004102479
- MIT Press
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. MIT Press, 1998.
- (1998) Reinforcement Learning: An Introduction.
- Sutton, R.S.¹ Barto, A.G.²

16
- 0003565783
- 3rd ed. Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control, 3rd ed. Athena Scientific, 2007, vol. 2.
- (2007) Dynamic Programming and Optimal Control , vol.2
- Bertsekas, D.P.¹

17
- 79955859296
- Morgan & Claypool Publishers
- Cs. Szepesvári, Algorithms for Reinforcement Learning. Morgan & Claypool Publishers, 2010.
- (2010) Algorithms for Reinforcement Learning
- Szepesvári, C.¹

18
- 80052257863
- Wiley
- O. Sigaud and O. Buffet, Eds., Markov Decision Processes in Artificial Intelligence. Wiley, 2010.
- (2010) Markov Decision Processes in Artificial Intelligence
- Sigaud, O.¹ Buffet, O.²

19
- 77955814101
- Reinforcement learning and dynamic programming using function approximators
- Taylor & Francis CRC Press
- L. Buşoniu, R. Babuška, B. De Schutter, and D. Ernst, Reinforcement Learning and Dynamic Programming Using Function Approximators, ser. Automation and Control Engineering. Taylor & Francis CRC Press, 2010.
- (2010) Automation and Control Engineering
- Buşoniu, L.¹ Babuška, R.² De Schutter, B.³ Ernst, D.⁴

20
- 77950867376
- Approximate dynamic programming with a fuzzy parameterization
- L. Buşoniu, D. Ernst, B. De Schutter, and R. Babuška, "Approximate dynamic programming with a fuzzy parameterization," Automatica, vol. 46, no. 5, pp. 804-814, 2010.
- (2010) Automatica , vol.46 , Issue.5 , pp. 804-814
- Buşoniu, L.¹ Ernst, D.² De Schutter, B.³ Babuška, R.⁴

21
- 28544448294
- Dynamic multidrug therapies for HIV: Optimal and STI control approaches
- B. Adams, H. Banks, H.-D. Kwon, and H. Tran, "Dynamic multidrug therapies for HIV: Optimal and STI control approaches," Mathematical Biosciences and Engineering, vol. 1, no. 2, pp. 223-241, 2004.
- (2004) Mathematical Biosciences and Engineering , vol.1 , Issue.2 , pp. 223-241
- Adams, B.¹ Banks, H.² Kwon, H.-D.³ Tran, H.⁴

22
- 0033609174
- Control of HIV despite the discontinuation of antiretroviral therapy [2]
- DOI 10.1056/NEJM199905273402114
- J. Lisziewicz, E. Rosenberg, and J. Lieberman, "Control of HIV despite the discontinuation of antiretroviral therapy," New England Journal of Medicine, vol. 340, pp. 1683-1684, 1999. (Pubitemid 29249442)
- (1999) New England Journal of Medicine , vol.340 , Issue.21 , pp. 1683-1684
- Lisziewicz, J.¹ Rosenberg, E.² Lieberman, J.³ Jessen, H.⁴ Lopalco, L.⁵ Siliciano, R.⁶ Walker, B.⁷ Lori, F.⁸

23
- 39649096058
- Clinical data based optimal STI strategies for HIV: A reinforcement learning approach
- 4177178, Proceedings of the 45th IEEE Conference on Decision and Control 2006, CDC
- D. Ernst, G.-B. Stan, J. Gonçalves, and L. Wehenkel, "Clinical data based optimal STI strategies for HIV: A reinforcement learning approach," in Proceedings 45th IEEE Conference on Decision & Control, San Diego, US, 13-15 December 2006, pp. 667-672. (Pubitemid 351283311)
- (2006) Proceedings of the IEEE Conference on Decision and Control , pp. 667-672
- Ernst, D.¹ Stan, G.-B.² Goncalves, J.³ Wehenkel, L.⁴

24
- 79551686776
- Cross-entropy optimization of control policies with adaptive basis functions
- accepted for publication, available online
- L. Buşoniu, D. Ernst, B. De Schutter, and R. Babuška, "Cross-entropy optimization of control policies with adaptive basis functions," IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, vol. 41, no. 1, 2011, accepted for publication, available online.
- (2011) IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics , vol.41 , Issue.1
- Buşoniu, L.¹ Ernst, D.² De Schutter, B.³ Babuška, R.⁴

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.