메뉴 건너뛰기




Volumn 7, Issue 2, 2000, Pages 125-138

Learning scheduling control knowledge through reinforcements

Author keywords

Case based function approximation; Reinforcement learning; Scheduling problem; Search control knowledge

Indexed keywords


EID: 84969379788     PISSN: 09696016     EISSN: 14753995     Source Type: Journal    
DOI: 10.1111/j.1475-3995.2000.tb00190.x     Document Type: Article
Times cited : (20)

References (13)
  • 2
    • 85156187730 scopus 로고    scopus 로고
    • Improving elevator performance using reinforcement learning
    • Touretzky, D.S., Mozer, M.C., Hasselmo, M.E., (Eds.), MIT Press, Cambridge MA
    • Crites, R.H., Barto, A.G. 1996. Improving elevator performance using reinforcement learning. In: Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.), Advances in Neural Information Processing Systems 8. MIT Press, Cambridge, MA, pp. 1017-1023.
    • (1996) Advances in Neural Information Processing Systems 8 , pp. 1017-1023
    • Crites, R.H.1    Barto, A.G.2
  • 7
    • 0029332288 scopus 로고
    • CABINS: a framework of knowledge acquisition and iterative revision for schedule improvement and reactive repair
    • Miyashita, K., Sycara, K., 1995. CABINS: a framework of knowledge acquisition and iterative revision for schedule improvement and reactive repair. Artificial Intelligence 76 (1-2), 377-426.
    • (1995) Artificial Intelligence , vol.76 , Issue.1-2 , pp. 377-426
    • Miyashita, K.1    Sycara, K.2
  • 9
    • 0031231885 scopus 로고    scopus 로고
    • Experiments with reinforcement learning in problems with continuous state and action spaces
    • Santamaría, J.C., Sutton, R.S., Ram, A., 1998. Experiments with reinforcement learning in problems with continuous state and action spaces. Adaptive Behavior 6 2, 163-218.
    • (1998) Adaptive Behavior , vol.6 , Issue.2 , pp. 163-218
    • Santamaría, J.C.1    Sutton, R.S.2    Ram, A.3
  • 10
    • 33847202724 scopus 로고
    • Learning to predict by the method of temporal differences
    • Sutton, R.S., 1988. Learning to predict by the method of temporal differences. Machine Learning 3 1, 9-44.
    • (1988) Machine Learning , vol.3 , Issue.1 , pp. 9-44
    • Sutton, R.S.1
  • 12
    • 0004049893 scopus 로고
    • Learning from delayed rewards
    • Cambridge University, UK
    • Watkins, C.J.C.H., 1989. Learning from delayed rewards. Ph.D. Thesis. Cambridge University, UK.
    • (1989) Ph.D. Thesis
    • Watkins, C.J.C.H.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.