SCOPUS 정보 검색 플랫폼

Volumn 4, Issue , 2004, Pages 3805-3810

An ant system based exploration-exploitation for reinforcement learning

Author keywords

Ant colony system; Markov decision process; Reinforcement learning

Indexed keywords

ANT COLONY SYSTEM; MARKOV DECISION PROCESS; REINFORCEMENT LEARNING; SIMULATED TRAJECTORIES;

COMPUTER SIMULATION; DECISION THEORY; GRAPH THEORY; MARKOV PROCESSES; OPTIMIZATION; PROBABILITY; RANDOM PROCESSES; THEOREM PROVING; TRAJECTORIES;

LEARNING SYSTEMS;

EID: 15744397544 PISSN: 1062922X EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ICSMC.2004.1400937 Document Type: Conference Paper

Times cited : (16)

References (15)

1
- 0003487482
- Athena Scientific
- D. P. Bertsekas and J. N. Tsitsiklis, Neuro Dynamic Programming, Athena Scientific, 1996.
- (1996) Neuro Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

2
- 0003492070
- Oxford Press
- E. Bonabeau, M. Dorigo, and G. Theralaz, Swarm Intelligence: From Natural to Artifical Systems. Oxford Press, 1999.
- (1999) Swarm Intelligence: From Natural to Artifical Systems
- Bonabeau, E.¹ Dorigo, M.² Theralaz, G.³

3
- 8744240256
- An ant system approach to Markov decision processes
- H. S. Chang, W. J. Gutjahr, J. Yang, and S. Park, "An ant system approach to Markov decision processes," in Proc. of the American Control Conference, 2004.
- (2004) Proc. of the American Control Conference
- Chang, H.S.¹ Gutjahr, W.J.² Yang, J.³ Park, S.⁴

4
- 15744369589
- An adaptation of particle swarm optimization for Markov decision processes
- H. S. Chang, "An adaptation of particle swarm optimization for Markov decision processes," in Proc. of the IEEE Conf. on Systems, Man, and Cybernetics, 2004.
- (2004) Proc. of the IEEE Conf. on Systems, Man, and Cybernetics
- Chang, H.S.¹

5
- 0004033139
- McGraw-Hill
- D. Corne, Fl Glover, and M. Dorigo (eds.), New Ideas in Optimization, McGraw-Hill, 1999.
- (1999) New Ideas in Optimization
- Corne, D.¹ Glover, F.² Dorigo, M.³

7
- 0004222346
- Morgan Kaufmann
- R. C. Eberhart and J. Kennedy, Swarm Intelligence. Morgan Kaufmann, 2001.
- (2001) Swarm Intelligence
- Eberhart, R.C.¹ Kennedy, J.²

8
- 85148633156
- Ant-Q: A reinforcement learning approach to the traveling salesman problem
- L. M. Gambardella and M. Dorigo, "Ant-Q: a reinforcement learning approach to the traveling salesman problem," in Proc. of the 12th Int. Conf. on Machine Learning, 1995, pp. 252-260.
- (1995) Proc. of the 12th Int. Conf. on Machine Learning , pp. 252-260
- Gambardella, L.M.¹ Dorigo, M.²

9
- 0033738317
- A graph-based ant system and its convergence
- W. J. Gutjahr, "A graph-based ant system and its convergence," Future Generation Computer Systems, vol. 16, pp. 873-888, 2000.
- (2000) Future Generation Computer Systems , vol.16 , pp. 873-888
- Gutjahr, W.J.¹

10
- 84948130944
- DQL: A new updating strategy for reinforcement Learning Based on Q-learning
- C. Mariano and E. Morales, "DQL: a New Updating Strategy for Reinforcement Learning based on Q-learning," in Proc. of the 12th European Conference on Machine Learning, 2001, pp. 324-335.
- (2001) Proc. of the 12th European Conference on Machine Learning , pp. 324-335
- Mariano, C.¹ Morales, E.²

11
- 2942522701
- The analysis and performance evaluation of the pheromone-Q-learning algorithm
- N. Monekosso and P. Remagnino, "The analysis and performance evaluation of the pheromone-Q-learning algorithm," Expert Systems, vol. 21, no. 2, pp. 80-91, 2004.
- (2004) Expert Systems , vol.21 , Issue.2 , pp. 80-91
- Monekosso, N.¹ Remagnino, P.²

12
- 85102627959
- Wiley, New York
- M. L. Puterman, Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York, 1994.
- (1994) Markov Decision Processes: Discrete Stochastic Dynamic Programming
- Puterman, M.L.¹

13
- 0033901602
- Convergence results for single-step on-policy reinforcement learning algorithms
- S. Singh, T. Jaakkola, M. Littman, and C. Szepesvari, "Convergence results for single-step on-policy reinforcement learning algorithms," Machine Learning, vol. 38, pp. 287-308, 2000.
- (2000) Machine Learning , vol.38 , pp. 287-308
- Singh, S.¹ Jaakkola, T.² Littman, M.³ Szepesvari, C.⁴

14
- 0004007508
- MIT Press
- R. Sutton and A. Barto, Reinforcement Learning. MIT Press, 2000.
- (2000) Reinforcement Learning
- Sutton, R.¹ Barto, A.²

15
- 0028497630
- Asynchronous stochastic approximation and Q-learning
- J. N. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Machine Learning, vol. 16, pp. 185-202, 1994.
- (1994) Machine Learning , vol.16 , pp. 185-202
- Tsitsiklis, J.N.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.