Volume 4, 2003, Pages 2910-2915

Tabu Search Exploration for on-Policy Reinforcement Learning

Author keywords

[No Author keywords available]

Indexed keywords

DYNAMIC PROGRAMMING; LEARNING ALGORITHMS; PROBLEM SOLVING; VECTOR QUANTIZATION

EID: 0141704192     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited: 9

References (20)
  • 3
    • 0000430514
    • The convergence of TD(lambda) for general lambda
    • P. Dayan. The convergence of TD(lambda) for general lambda. Machine Learning, 8:341-362, 1992.
    • (1992) Machine Learning , vol.8 , pp. 341-362
    • Dayan, P.1
  • 4
    • 0028388685
    • TD(lambda) converges with probability 1
    • P. Dayan and T. J. Sejnowski. TD(lambda) converges with probability 1. Machine Learning, 14:295-301, 1994.
    • (1994) Machine Learning , vol.14 , pp. 295-301
    • Dayan, P.1    Sejnowski, T.J.2
  • 5
    • 0141629190
    • Hidden strengths and limitations: An empirical investigation of reinforcement learning
    • Morgan Kaufmann
    • G. DeJong. Hidden strengths and limitations: an empirical investigation of reinforcement learning. In International Conference on Machine Learning. Morgan Kaufmann, 2000.
    • (2000) International Conference on Machine Learning
    • DeJong, G.1
  • 9
    • 0029751419
    • The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms
    • S. Koenig and R. G. Simmons. The effect of representation and knowledge on goal-directed exploration with reinforcement-learning algorithms. Machine Learning, 22:227-250, 1996.
    • (1996) Machine Learning , vol.22 , pp. 227-250
    • Koenig, S.1    Simmons, R.G.2
  • 12
    • 0003636089
    • On-line Q-learning using connectionist systems
    • Cambridge University Engineering Dept.
    • G. A. Rummery and M. Niranjan. On-line Q-learning using connectionist systems. Technical report, Cambridge University Engineering Dept., 1994.
    • (1994) Technical Report
    • Rummery, G.A.1    Niranjan, M.2
  • 14
    • 0033901602
    • Convergence results for single-step on-policy reinforcement learning algorithms
    • S. Singh, T. Jaakkola, M. L. Littman, and C. Szepesvari. Convergence results for single-step on-policy reinforcement learning algorithms. Machine Learning, 38(3):287-308, 2000.
    • (2000) Machine Learning , vol.38 , Issue.3 , pp. 287-308
    • Singh, S.1    Jaakkola, T.2    Littman, M.L.3    Szepesvari, C.4
  • 17
    • 0003411271
    • Efficient exploration in reinforcement learning
    • Carnegie Mellon University
    • S. B. Thrun. Efficient exploration in reinforcement learning. Technical Report TR CMU-CS-92-102, Carnegie Mellon University, 1992.
    • (1992) Technical Report TR CMU-CS-92-102
    • Thrun, S.B.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.