메뉴 건너뛰기




Volumn 4, Issue , 2004, Pages 3805-3810

An ant system based exploration-exploitation for reinforcement learning

Author keywords

Ant colony system; Markov decision process; Reinforcement learning

Indexed keywords

ANT COLONY SYSTEM; MARKOV DECISION PROCESS; REINFORCEMENT LEARNING; SIMULATED TRAJECTORIES;

EID: 15744397544     PISSN: 1062922X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICSMC.2004.1400937     Document Type: Conference Paper
Times cited : (16)

References (15)
  • 6
    • 0002012598 scopus 로고    scopus 로고
    • The ant colony optimization metaheuristic
    • D. Corne, M. Dorigo (eds.), McGraw-Hill, NY, USA
    • M. Dorigo and G. Di Caro, "The ant colony optimization metaheuristic," New Ideas in Optimization, D. Corne, M. Dorigo (eds.), pp. 11-32, McGraw-Hill, NY, USA, 1999.
    • (1999) New Ideas in Optimization , pp. 11-32
    • Dorigo, M.1    Di Caro, G.2
  • 9
    • 0033738317 scopus 로고    scopus 로고
    • A graph-based ant system and its convergence
    • W. J. Gutjahr, "A graph-based ant system and its convergence," Future Generation Computer Systems, vol. 16, pp. 873-888, 2000.
    • (2000) Future Generation Computer Systems , vol.16 , pp. 873-888
    • Gutjahr, W.J.1
  • 11
    • 2942522701 scopus 로고    scopus 로고
    • The analysis and performance evaluation of the pheromone-Q-learning algorithm
    • N. Monekosso and P. Remagnino, "The analysis and performance evaluation of the pheromone-Q-learning algorithm," Expert Systems, vol. 21, no. 2, pp. 80-91, 2004.
    • (2004) Expert Systems , vol.21 , Issue.2 , pp. 80-91
    • Monekosso, N.1    Remagnino, P.2
  • 13
    • 0033901602 scopus 로고    scopus 로고
    • Convergence results for single-step on-policy reinforcement learning algorithms
    • S. Singh, T. Jaakkola, M. Littman, and C. Szepesvari, "Convergence results for single-step on-policy reinforcement learning algorithms," Machine Learning, vol. 38, pp. 287-308, 2000.
    • (2000) Machine Learning , vol.38 , pp. 287-308
    • Singh, S.1    Jaakkola, T.2    Littman, M.3    Szepesvari, C.4
  • 15
    • 0028497630 scopus 로고
    • Asynchronous stochastic approximation and Q-learning
    • J. N. Tsitsiklis, "Asynchronous stochastic approximation and Q-learning," Machine Learning, vol. 16, pp. 185-202, 1994.
    • (1994) Machine Learning , vol.16 , pp. 185-202
    • Tsitsiklis, J.N.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.