SCOPUS Information Search Platform

NIPS 1995: Proceedings of the 8th International Conference on Neural Information Processing Systems

Volume: None, Issue: None, 1995, Pages 1017-1023

Improving Elevator Performance Using Reinforcement Learning

(2) Crites, Robert H. (a); Barto, Andrew G. (a)

(a) UNIVERSITY OF MASSACHUSETTS (United States)

Author keywords

[No Author keywords available]

Indexed keywords

CONTINUOUS TIME SYSTEMS; ELEVATORS; HEURISTIC ALGORITHMS; STOCHASTIC SYSTEMS;

ARRIVAL RATES; CONTINUOUS TIME; CONTINUOUS STATE SPACE; DISCRETE EVENT DYNAMIC SYSTEMS; ELEVATOR SYSTEMS; NONSTATIONARY; PERFORMANCE; REAL-WORLD PROBLEM; REINFORCEMENT LEARNING AGENT; REINFORCEMENT LEARNINGS;

REINFORCEMENT LEARNING;

EID: 85156187730; PISSN: None; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited: 258

References (8)

1
- 0011385502
- Technical Report, ECE Department, University of Massachusetts, Amherst, MA
- G. Bao, C. G. Cassandras, T. E. Djaferis, A. D. Gandhi, and D. P. Loose. (1994) Elevator Dispatchers for Down Peak Traffic. Technical Report, ECE Department, University of Massachusetts, Amherst, MA.
- (1994) Elevator Dispatchers for Down Peak Traffic
- Bao, G.¹ Cassandras, C. G.² Djaferis, T. E.³ Gandhi, A. D.⁴ Loose, D. P.⁵

2
- 0000409272
- Reinforcement Learning Methods for Continuous-Time Markov Decision Problems
- G. Tesauro, D. S. Touretzky and T. K. Leen, eds., MIT Press, Cambridge, MA
- S. J. Bradtke and M. O. Duff. (1995) Reinforcement Learning Methods for Continuous-Time Markov Decision Problems. In: G. Tesauro, D. S. Touretzky and T. K. Leen, eds., Advances in Neural Information Processing Systems 7, MIT Press, Cambridge, MA.
- (1995) Advances in Neural Information Processing Systems , vol.7
- Bradtke, S. J.¹ Duff, M. O.²

3
- 2542443721
- PhD thesis, University of Massachusetts, Amherst, MA
- J. Lewis. (1991) A Dynamic Load Balancing Approach to the Control of Multiserver Polling Systems with Applications to Elevator System Dispatching. PhD thesis, University of Massachusetts, Amherst, MA.
- (1991) A Dynamic Load Balancing Approach to the Control of Multiserver Polling Systems with Applications to Elevator System Dispatching
- Lewis, J.¹

4
- 0343920388
- Efficient Learning of Multiple Degree-of-Freedom Control Problems with Quasi-independent Q-agents
- M. C. Mozer, P. Smolensky, D. S. Touretzky, J. L. Elman and A. S. Weigend, eds., Hillsdale, NJ
- K. L. Markey. (1994) Efficient Learning of Multiple Degree-of-Freedom Control Problems with Quasi-independent Q-agents. In: M. C. Mozer, P. Smolensky, D. S. Touretzky, J. L. Elman and A. S. Weigend, eds., Proceedings of the 1993 Connectionist Models Summer School, Erlbaum Associates, Hillsdale, NJ.
- (1994) Proceedings of the 1993 Connectionist Models Summer School, Erlbaum Associates
- Markey, K. L.¹

5
- 0001046225
- Practical Issues in Temporal Difference Learning
- G. Tesauro. (1992) Practical Issues in Temporal Difference Learning. Machine Learning 8:257-277.
- (1992) Machine Learning , vol.8 , pp. 257-277
- Tesauro, G.¹

6
- 0000985504
- TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play
- G. Tesauro. (1994) TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play. Neural Computation 6:215-219.
- (1994) Neural Computation , vol.6 , pp. 215-219
- Tesauro, G.¹

7
- 0029276036
- Temporal Difference Learning and TD-Gammon
- G. Tesauro. (1995) Temporal Difference Learning and TD-Gammon. Communications of the ACM 38:58-68.
- (1995) Communications of the ACM , vol.38 , pp. 58-68
- Tesauro, G.¹

8
- 0004049893
- PhD thesis, Cambridge University
- C. J. C. H. Watkins. (1989) Learning from Delayed Rewards. PhD thesis, Cambridge University.
- (1989) Learning from Delayed Rewards
- Watkins, C. J. C. H.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.