SCOPUS 정보 검색 플랫폼

Volumn 2, Issue , 2004, Pages 1643-1648

An adaptation of particle swarm optimization for markov decision processes

Author keywords

Markov decision process; Particle swarm optimization; Reinforcement learning

Indexed keywords

MARKOV DECISION PROCESS; MULTIDEMENSIONAL SEARCHING; PARTICLE SWARM OPTIMIZATION; REINFORCEMENT LEARNING;

ALGORITHMS; FUNCTIONS; HEURISTIC METHODS; MARKOV PROCESSES; OPTIMIZATION; PROBLEM SOLVING; SET THEORY; STATISTICS;

DECISION THEORY;

EID: 15744369589 PISSN: 1062922X EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICSMC.2004.1399867 Document Type: Conference Paper

Times cited : (6)

References (15)

1
- 0003565783
- Athena Scientific
- D. P. Bertsekas, Dynamic Programming and Optimal Control, Volumes 1 and 2. Athena Scientific, 1995.
- (1995) Dynamic Programming and Optimal Control , vol.1-2
- Bertsekas, D.P.¹

2
- 0003487482
- Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro Dynamic Programming. Athena Scientific, 1996.
- (1996) Neuro Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

3
- 1542319250
- An asymptotically efficient algorithm for finite horizon stochastic dynamic programming problems
- H. S. Chang, M. Fu, and S. I. Marcus, "An asymptotically efficient algorithm for finite horizon stochastic dynamic programming problems," in Proc. of the 42nd IEEE Conf. Decision and Control, 2003.
- (2003) Proc. of the 42nd IEEE Conf. Decision and Control
- Chang, H.S.¹ Fu, M.² Marcus, S.I.³

5
- 3543128853
- Parallel rollout for on-line solution of partially observable Markov decision processes
- H. S. Chang, R. Givan, and E. K. P. Chong, "Parallel rollout for on-line solution of partially observable Markov decision processes," Discrete Event Dynamic Systems: Theory and Application, Vol. 14, No. 3, pp. 309-341, 2004.
- (2004) Discrete Event Dynamic Systems: Theory and Application , vol.14 , Issue.3 , pp. 309-341
- Chang, H.S.¹ Givan, R.² Chong, E.K.P.³

6
- 8744240256
- An ant system approach to markov decision processes
- H. S. Chang, W. J. Gutjahr, J. Yang, and S. Park, "An Ant System Approach to Markov Decision Processes," in Proc. of the American Control Conference, 2004.
- (2004) Proc. of the American Control Conference
- Chang, H.S.¹ Gutjahr, W.J.² Yang, J.³ Park, S.⁴

7
- 15744397544
- An ant system based exploration-exploitation for reinforcement learning
- H. S. Chang, "An Ant System Based Exploration-Exploitation for Reinforcement Learning," in Proc. of the IEEE Conf. on Systems, Man, and Cybernetics, 2004.
- (2004) Proc. of the IEEE Conf. on Systems, Man, and Cybernetics
- Chang, H.S.¹

8
- 0003871635
- Ph.D. Thesis, Univ. of Michigan, Ann Arbor, MI
- K. A. De Jong, An Analysis of the Behavior of a Class of Genetic Adaptive Systems, Ph.D. Thesis, Univ. of Michigan, Ann Arbor, MI, 1975.
- (1975) An Analysis of the Behavior of a Class of Genetic Adaptive Systems
- De Jong, K.A.¹

9
- 0004222346
- Morgan Kaufmann
- R. C. Eberhart and J. Kennedy, Swarm Intelligence. Morgan Kaufmann, 2001.
- (2001) Swarm Intelligence
- Eberhart, R.C.¹ Kennedy, J.²

10
- 0031352450
- A discrete binary version of the particle swarm algorithm
- J. Kennedy and R. C. Eberhart, "A discrete binary version of the particle swarm algorithm," in Proc. of the IEEE Conf. on Systems, Man, and Cybernetics, 1997, pp. 4104-4109.
- (1997) Proc. of the IEEE Conf. on Systems, Man, and Cybernetics , pp. 4104-4109
- Kennedy, J.¹ Eberhart, R.C.²

11
- 0029535737
- Particle swarm optimization
- J. Kennedy and R. C. Eberhart, "Particle swarm optimization," in Proc. of the IEEE Conf. on Neural Networks, 1995, pp. 1942-1948.
- (1995) Proc. of the IEEE Conf. on Neural Networks , pp. 1942-1948
- Kennedy, J.¹ Eberhart, R.C.²

12
- 85102627959
- Wiley, New York
- M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

13
- 0033901602
- Convergence results for single-step on-policy reinforcement learning algorithms
- S. Singh, T. Jaakkola, M. Littman, and C. Szepesvari, "Convergence results for single-step on-policy reinforcement learning algorithms," Machine Learning, Vol. 38, pp. 287-308, 2000.
- (2000) Machine Learning , vol.38 , pp. 287-308
- Singh, S.¹ Jaakkola, T.² Littman, M.³ Szepesvari, C.⁴

14
- 0004007508
- MIT Press
- R. Sutton and A. Barto, Reinforcement Learning. MIT Press, 2000.
- (2000) Reinforcement Learning
- Sutton, R.¹ Barto, A.²

15
- 0028497630
- Asynchronous stochastic approximation and Q-learning
- J. N. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Machine Learning, Vol. 16, pp. 185-202, 1994.
- (1994) Machine Learning , vol.16 , pp. 185-202
- Tsitsiklis, J.N.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.