메뉴 건너뛰기




Volumn , Issue , 1995, Pages 1017-1023

Improving Elevator Performance Using Reinforcement Learning

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS TIME SYSTEMS; ELEVATORS; HEURISTIC ALGORITHMS; STOCHASTIC SYSTEMS;

EID: 85156187730     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (258)

References (8)
  • 2
    • 0000409272 scopus 로고
    • Reinforcement Learning Methods for Continuous-Time Markov Decision Problems
    • G. Tesauro, D. S. Touretzky and T. K. Leen, eds., MIT Press, Cambridge, MA
    • S. J. Bradtke and M. O. Duff. (1995) Reinforcement Learning Methods for Continuous-Time Markov Decision Problems. In: G. Tesauro, D. S. Touretzky and T. K. Leen, eds., Advances in Neural Information Processing Systems 7, MIT Press, Cambridge, MA.
    • (1995) Advances in Neural Information Processing Systems , vol.7
    • Bradtke, S. J.1    Duff, M. O.2
  • 4
    • 0343920388 scopus 로고
    • Efficient Learning of Multiple Degree-of-Freedom Control Problems with Quasi-independent Q-agents
    • M. C. Mozer, P. Smolensky, D. S. Touretzky, J. L. Elman and A. S. Weigend, eds., Hillsdale, NJ
    • K. L. Markey. (1994) Efficient Learning of Multiple Degree-of-Freedom Control Problems with Quasi-independent Q-agents. In: M. C. Mozer, P. Smolensky, D. S. Touretzky, J. L. Elman and A. S. Weigend, eds., Proceedings of the 1993 Connectionist Models Summer School Erlbaum Associates, Hillsdale, NJ.
    • (1994) Proceedings of the 1993 Connectionist Models Summer School Erlbaum Associates
    • Markey, K. L.1
  • 5
    • 0001046225 scopus 로고
    • Practical Issues in Temporal Difference Learning
    • G. Tesauro. (1992) Practical Issues in Temporal Difference Learning. Machine Learning 8:257-277.
    • (1992) Machine Learning , vol.8 , pp. 257-277
    • Tesauro, G.1
  • 6
    • 0000985504 scopus 로고
    • TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play
    • G. Tesauro. (1994) TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play. Neural Computation 6:215-219.
    • (1994) Neural Computation , vol.6 , pp. 215-219
    • Tesauro, G.1
  • 7
    • 0029276036 scopus 로고
    • Temporal Difference Learning and TD-Gammon
    • G. Tesauro. (1995) Temporal Difference Learning and TD-Gammon. Communications of the ACM38:58-68.
    • (1995) Communications of the ACM38 , pp. 58-68
    • Tesauro, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.