SCOPUS Information Retrieval Platform

Volume , Issue , 2007, Pages 185-192

Randomly sampling actions in dynamic programming

Author keywords

[No Author keywords available]

Indexed keywords

DISCRETE TIME CONTROL SYSTEMS; OPTIMIZATION; RANDOM PROCESSES; SAMPLING;

CONTINUOUS ACTIONS; DISCRETIZED STATES; INVARIANT CONTROL;

DYNAMIC PROGRAMMING;

EID: 34548784023 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ADPRL.2007.368187 Document Type: Conference Paper

Times cited : (13)

References (22)

1
- 34547932110
- Dover
- R. Bellman, Dynamic Programming. Dover, 2003.
- (2003) Dynamic Programming
- Bellman, R.¹

2
- 0003565783
- Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control. Athena Scientific, 1995.
- (1995) Dynamic Programming and Optimal Control
- Bertsekas, D.P.¹

3
- 84921399937
- J. Si, A. Barto, W. B. Powell, and D. Wunsch II, Handbook of Learning and Approximate Dynamic Programming. IEEE, 2004.

4
- 80054015435
- S. Davies, "Multidimensional triangulation and interpolation for reinforcement learning," 1996. [Online]. Available: citeseer.comp.nus.edu.sg/56687.html
- (1996) Multidimensional triangulation and interpolation for reinforcement learning
- Davies, S.¹

5
- 0031074521
- Locally weighted learning
- C. G. Atkeson, A. W. Moore, and S. Schaal, "Locally weighted learning," Artificial Intelligence Review, vol. 11, pp. 11-73, 1997.
- (1997) Artificial Intelligence Review , vol.11 , pp. 11-73
- Atkeson, C.G.¹ Moore, A.W.² Schaal, S.³

6
- 9944230646
- Wiley-Interscience
- F. L. Lewis and V. L. Syrmos, Optimal Control, 2nd Edition (Hardcover). Wiley-Interscience, 1995.
- (1995) Optimal Control, 2nd Edition (Hardcover)
- Lewis, F.L.¹ Syrmos, V.L.²

7
- 0004291983
- Elsevier
- D. H. Jacobson and D. Q. Mayne, Differential Dynamic Programming. Elsevier, 1970.
- (1970) Differential Dynamic Programming
- Jacobson, D.H.¹ Mayne, D.Q.²

8
- 0004276055
- Academic, NY
- P. Dyer and S. McReynolds, The Computational Theory of Optimal Control. Academic, NY, 1970.
- (1970) The Computational Theory of Optimal Control
- Dyer, P.¹ McReynolds, S.²

9
- 28644446278
- Evolutionary policy iteration for solving Markov decision processes
- H. S. Chang, H. G. Lee, M. C. Fu, and S. I. Marcus, "Evolutionary policy iteration for solving Markov decision processes," IEEE Transactions on Automatic Control, vol. 50, pp. 1804-1808, 2005.
- (2005) IEEE Transactions on Automatic Control , vol.50 , pp. 1804-1808
- Chang, H.S.¹ Lee, H.G.² Fu, M.C.³ Marcus, S.I.⁴

11
- 14644444172
- An adaptive sampling algorithm for solving Markov decision processes
- H. S. Chang, M. C. Fu, J. Hu, and S. I. Marcus, "An adaptive sampling algorithm for solving Markov decision processes," Operations Research, vol. 53, pp. 126-139, 2005.
- (2005) Operations Research , vol.53 , pp. 126-139
- Chang, H.S.¹ Fu, M.C.² Hu, J.³ Marcus, S.I.⁴

12
- 0003636164
- Prentice Hall
- D. P. Bertsekas and J. N. Tsitsiklis, Parallel and Distributed Computation - Numerical Methods. Prentice Hall, 1989.
- (1989) Parallel and Distributed Computation - Numerical Methods
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

13
- 0003487482
- Athena Scientific, Belmont, MA
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro-Dynamic Programming. Athena Scientific, Belmont, MA, 1996.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

14
- 0004102479
- MIT Press, Cambridge, MA
- R. S. Sutton and A. G. Barto, Reinforcement Learning: An Introduction. MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

16
- 0003692801
- Wiley, New York
- A. S. Nemirovsky and D. Yudin, Problem Complexity and Method Efficiency in Optimization. Wiley, New York, 1983.
- (1983) Problem Complexity and Method Efficiency in Optimization
- Nemirovsky, A.S.¹ Yudin, D.²

17
- 78650743758
- Dynamic programming
- J. Rust, "Dynamic programming," in New Palgrave Dictionary of Economics, 2006.
- (2006) New Palgrave Dictionary of Economics
- Rust, J.¹

18
- 34548751619
- G. Gordon, "Approximate solutions to Markov decision processes," Ph.D. dissertation, Carnegie Mellon University, 1999. [Online]. Available: citeseer.ist.psu.edu/gordon99approximate.html

19
- 34548750791
- R. J. Williams and L. C. Baird, III, "Analysis of some incremental variants of policy iteration: First steps toward understanding actor-critic learning systems," Northeastern University, Tech. Rep. NU-CCS-93-11, 1993. [Online]. Available: citeseer.ist.psu.edu/williams93analysis.html

20
- 34249833101
- Q-learning
- C. Watkins and P. Dayan, "Q-learning," Machine Learning, vol. 8, no. 3, pp. 279-292, 1992.
- (1992) Machine Learning , vol.8 , Issue.3 , pp. 279-292
- Watkins, C.¹ Dayan, P.²

22
- 33746071551
- RRT-Plan: A randomized algorithm for STRIPS planning
- D. Burfoot, J. Pineau, and G. Dudek, "RRT-Plan: a randomized algorithm for STRIPS planning," in International Conference on Automated Planning and Scheduling (ICAPS), 2006.
- (2006) International Conference on Automated Planning and Scheduling (ICAPS)
- Burfoot, D.¹ Pineau, J.² Dudek, G.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.