SCOPUS 정보 검색 플랫폼

Machine Learning

Volumn 12, Issue 4, 1998, Pages 235-262

Elevator Group Control Using Multiple Reinforcement Learning Agents

(2) Crites, Robert H a Barto, Andrew G b

a Unica Technologies (United States)

b University of Massachusetts (United States)

Author keywords

Discrete event dynamic systems; Elevator group control; Multiple agents; Reinforcement learning; Teams

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTER SIMULATION; DISCRETE TIME CONTROL SYSTEMS; DYNAMIC PROGRAMMING; ELEVATORS; LEARNING ALGORITHMS; PROBLEM SOLVING; RANDOM PROCESSES;

DISCRETE EVENT DYNAMIC SYSTEMS; ELEVATOR GROUP CONTROL; REINFORCEMENT LEARNING (RL);

LEARNING SYSTEMS;

EID: 0032208335 PISSN: 08856125 EISSN: None Source Type: Journal
DOI: 10.1023/a:1007518724497 Document Type: Article

Times cited : (218)

References (43)

1
- 84936824515
- New York, NY: Basic Books
- Axelrod, R.M. (1984). The Evolution of Cooperation. New York, NY: Basic Books.
- (1984) The Evolution of Cooperation
- Axelrod, R.M.¹

2
- 0011385502
- ECE Department Technical Report, University of Massachusetts
- Bao, G., Cassandras, G.G., Djaferis, T.E., Gandhi, A.D., & Looze, D.P. (1994). Elevator dispatchers for down peak traffic. ECE Department Technical Report, University of Massachusetts.
- (1994) Elevator Dispatchers for Down Peak Traffic
- Bao, G.¹ Cassandras, G.G.² Djaferis, T.E.³ Gandhi, A.D.⁴ Looze, D.P.⁵

3
- 0010367132
- From chemotaxis to cooperativity: Abstract exercises in neuronal learning strategies
- R. Durbin, C. Miall, and G. Mitchison, (Eds.), Wokingham, England: Addison-Wesley
- Barto, A.G. (1989). From chemotaxis to cooperativity: Abstract exercises in neuronal learning strategies. In R. Durbin, C. Miall, and G. Mitchison, (Eds.), The Computing Neuron. Wokingham, England: Addison-Wesley.
- (1989) The Computing Neuron
- Barto, A.G.¹

4
- 0029210635
- Learning to act using real-time dynamic programming
- Barto, A.G., Bradtke, S.J., & Singh, S.P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence, 72, 81-138.
- (1995) Artificial Intelligence , vol.72 , pp. 81-138
- Barto, A.G.¹ Bradtke, S.J.² Singh, S.P.³

5
- 0003487482
- Belmont, MA: Athena Scientific Press
- Bertsekas, D.P. & Tsitsiklis, J.N. (1996). Neuro-Dynamic Programming. Belmont, MA: Athena Scientific Press.
- (1996) Neuro-Dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

6
- 0343920391
- Unpublished manuscript
- Bradtke, S.J. (1993). Distributed adaptive optimal control of flexible structures. Unpublished manuscript.
- (1993) Distributed Adaptive Optimal Control of Flexible Structures
- Bradtke, S.J.¹

7
- 0000409272
- Reinforcement learning methods for continuous-time Markov decision problems
- G. Tesauro, D. Touretzky, and T. Leen, (Eds.), Cambridge, MA: MIT Press
- Bradtke, S.J. & Duff, M. O. (1995). Reinforcement learning methods for continuous-time Markov decision problems. In G. Tesauro, D. Touretzky, and T. Leen, (Eds.), Advances in Neural Information Processing Systems 7. Cambridge, MA: MIT Press.
- (1995) Advances in Neural Information Processing Systems , vol.7
- Bradtke, S.J.¹ Duff, M.O.²

8
- 0003864134
- Homewood, IL: Aksen Associates
- Cassandras, C.G. (1993). Discrete Event Systems: Modeling and Performance Analysis. Homewood, IL: Aksen Associates.
- (1993) Discrete Event Systems: Modeling and Performance Analysis
- Cassandras, C.G.¹

9
- 0003456153
- PhD thesis, University of Massachusetts
- Crites, R.H. (1996). Large-Scale Dynamic Optimization Using Teams of Reinforcement Learning Agents. PhD thesis, University of Massachusetts.
- (1996) Large-Scale Dynamic Optimization Using Teams of Reinforcement Learning Agents
- Crites, R.H.¹

10
- 2542462345
- Forming control policies from simulation models using reinforcement learning
- Crites, R.H. & Barto, A.G. (1996). Forming control policies from simulation models using reinforcement learning. Proceedings of the Ninth Yale Workshop on Adaptive and Learning Systems.
- (1996) Proceedings of the Ninth Yale Workshop on Adaptive and Learning Systems
- Crites, R.H.¹ Barto, A.G.²

11
- 0003259931
- Improving elevator performance using reinforcement learning
- D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, (Eds.), Cambridge, MA: MIT Press
- Crites, R. H. & Barto, A.G. (1996). Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, (Eds.), Advances in Neural Information Processing Systems 8. Cambridge, MA: MIT Press.
- (1996) Advances in Neural Information Processing Systems , vol.8
- Crites, R.H.¹ Barto, A.G.²

12
- 0001234682
- Feudal reinforcement learning
- S. J. Hanson, J. D. Cowan, and C. L. Giles, (Eds.), San Mateo, CA: Morgan Kaufmann
- Dayan, P. & Hinton, G.E. (1993). Feudal reinforcement learning. In S. J. Hanson, J. D. Cowan, and C. L. Giles, (Eds.), Advances in Neural Information Processing Systems 5. San Mateo, CA: Morgan Kaufmann.
- (1993) Advances in Neural Information Processing Systems , vol.5
- Dayan, P.¹ Hinton, G.E.²

13
- 33845333431
- An on-line tuning method for multi-objective control of elevator group
- Fujino, A., Tobita, T., & Yoneda, K. (1992). An on-line tuning method for multi-objective control of elevator group. Proceedings of the International Conference on Industrial Electronics, Control, Instrumentation, and Automation, (pp. 795-800).
- (1992) Proceedings of the International Conference on Industrial Electronics, Control, Instrumentation, and Automation , pp. 795-800
- Fujino, A.¹ Tobita, T.² Yoneda, K.³

14
- 2542472632
- A fuzzy neural network and its application to elevator group control
- T. Terano, M. Sugeno, M. Mukaidono, and K. Shigemasu, (Eds.), Amsterdam: IOS Press
- Imasaki, N., Kiji, J., & Endo, T. (1992). A fuzzy neural network and its application to elevator group control. In T. Terano, M. Sugeno, M. Mukaidono, and K. Shigemasu, (Eds.), Fuzzy Engineering Toward Human Friendly Systems. Amsterdam: IOS Press.
- (1992) Fuzzy Engineering Toward Human Friendly Systems
- Imasaki, N.¹ Kiji, J.² Endo, T.³

15
- 0017472789
- Optimal control of elevators
- Levy, D., Yadin, M., & Alexandrovitz, A. (1977). Optimal control of elevators. International Journal of Systems Science, 8, 301-320.
- (1977) International Journal of Systems Science , vol.8 , pp. 301-320
- Levy, D.¹ Yadin, M.² Alexandrovitz, A.³

16
- 2542443721
- PhD thesis, ECE department, University of Massachusetts
- Lewis, J. (1991). A Dynamic Load Balancing Approach to the Control of Multiserver Polling Systems with Applications to Elevator System Dispatching. PhD thesis, ECE department, University of Massachusetts.
- (1991) A Dynamic Load Balancing Approach to the Control of Multiserver Polling Systems with Applications to Elevator System Dispatching
- Lewis, J.¹

17
- 0343048727
- Technical Report CMU-CS-93-165, Carnegie Mellon University
- Littman, M. & Boyan, J. (1993). A distributed reinforcement learning scheme for network routing. Technical Report CMU-CS-93-165, Carnegie Mellon University.
- (1993) A Distributed Reinforcement Learning Scheme for Network Routing
- Littman, M.¹ Boyan, J.²

18
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- San Mateo, CA: Morgan Kaufmann
- Littman, M.L. (1994). Markov games as a framework for multi-agent reinforcement learning. Proceedings of the Eleventh International Conference on Machine Learning. San Mateo, CA: Morgan Kaufmann.
- (1994) Proceedings of the Eleventh International Conference on Machine Learning
- Littman, M.L.¹

19
- 0003861655
- PhD thesis, Brown University
- Littman, M.L. (1996). Algorithms for Sequential Decision Making. PhD thesis, Brown University.
- (1996) Algorithms for Sequential Decision Making
- Littman, M.L.¹

20
- 0343920388
- Efficient learning of multiple degree-of-freedom control problems with quasi-independent Q-agents
- M. C. Mozer, P. Smolensky, D. S. Touretzky, J. L. Elman, and A. S. Weigend, (Eds.), Hillsdale, NJ: Erlbaum Associates
- Markey, K.L. (1994). Efficient learning of multiple degree-of-freedom control problems with quasi-independent Q-agents. In M. C. Mozer, P. Smolensky, D. S. Touretzky, J. L. Elman, and A. S. Weigend, (Eds.), Proceedings of the 1993 Connectionist Models Summer School. Hillsdale, NJ: Erlbaum Associates.
- (1994) Proceedings of the 1993 Connectionist Models Summer School
- Markey, K.L.¹

21
- 0004952188
- Adaptive optimal elevator group control by use of neural networks
- Markon, S., Kita, H., & Nishikawa, Y. (1994). Adaptive optimal elevator group control by use of neural networks. Transactions of the Institute of Systems, Control, and Information Engineers, 7, 487-497.
- (1994) Transactions of the Institute of Systems, Control, and Information Engineers , vol.7 , pp. 487-497
- Markon, S.¹ Kita, H.² Nishikawa, Y.³

22
- 0003891507
- Englewood Cliffs, NJ: Prentice-Hall
- Narendra, K.S. & Thathachar, M.A.L. (1989). Learning Automata: An Introduction. Englewood Cliffs, NJ: Prentice-Hall.
- (1989) Learning Automata: An Introduction
- Narendra, K.S.¹ Thathachar, M.A.L.²

23
- 38249015417
- Electronics and information technology in high-range elevator systems
- Ovaska, S.J. (1992). Electronics and information technology in high-range elevator systems. Mechatronics, 2, 89-99.
- (1992) Mechatronics , vol.2 , pp. 89-99
- Ovaska, S.J.¹

24
- 0031275456
- Optimal dispatching control for elevator systems during uppeak traffic
- Pepyne, D.L. & Cassandras, C.G. (1997). Optimal dispatching control for elevator systems during uppeak traffic. IEEE Transactions on Control Systems Technology, 5, 629-643.
- (1997) IEEE Transactions on Control Systems Technology , vol.5 , pp. 629-643
- Pepyne, D.L.¹ Cassandras, C.G.²

25
- 0003444646
- Cambridge, MA: MIT Press
- Rumelhart, D.E., McClelland, J.L., & the PDP Research Group. (1986). Parallel Distributed Processing: Explorations in the Microstructure of Cognition. Cambridge, MA: MIT Press.
- (1986) Parallel Distributed Processing: Explorations in the Microstructure of Cognition
- Rumelhart, D.E.¹ McClelland, J.L.²

26
- 0021371436
- Development of elevator supervisory group control system with artificial intelligence
- Sakai, Y & Kurosawa, K. (1984). Development of elevator supervisory group control system with artificial intelligence. Hitachi Review, 33, 25-30.
- (1984) Hitachi Review , vol.33 , pp. 25-30
- Sakai, Y.¹ Kurosawa, K.²

27
- 0003297918
- Some studies in machine learning using the game of checkers
- E. Feigenbaum and J. Feldman, (Eds.), New York, NY: McGraw-Hill
- Samuel, A.L. (1963). Some studies in machine learning using the game of checkers. In E. Feigenbaum and J. Feldman, (Eds.), Computers and Thought. New York, NY: McGraw-Hill.
- (1963) Computers and Thought
- Samuel, A.L.¹

28
- 0030050933
- Multiagent reinforcement learning in the iterated prisoner's dilemma
- Sandholm, T.W. & Crites, R.H. (1996). Multiagent reinforcement learning in the iterated prisoner's dilemma. Biosystems, 37, 147-166.
- (1996) Biosystems , vol.37 , pp. 147-166
- Sandholm, T.W.¹ Crites, R.H.²

29
- 0343048713
- Shoham, Y. & Tennenholtz, M. (1993). Co-learning and the evolution of coordinated multi-agent activity.
- (1993) Co-learning and the Evolution of Coordinated Multi-agent Activity
- Shoham, Y.¹ Tennenholtz, M.²

30
- 0027684588
- Elevator traffic simulation
- Siikonen, M.L. (1993). Elevator traffic simulation. Simulation, 61, 257-267.
- (1993) Simulation , vol.61 , pp. 257-267
- Siikonen, M.L.¹

31
- 0004251305
- New York, NY: Wiley and Sons
- Strakosch, G.R. (1983). Vertical Transportation: Elevators and Escalators. New York, NY: Wiley and Sons.
- (1983) Vertical Transportation: Elevators and Escalators
- Strakosch, G.R.¹

32
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R.S. & Barto, A.G. (1998). Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

33
- 85152198941
- Multi-agent reinforcement learning: Independent vs. cooperative agents
- Tan, M. (1993). Multi-agent reinforcement learning: Independent vs. cooperative agents. Proceedings of the Tenth International Conference on Machine Learning.
- (1993) Proceedings of the Tenth International Conference on Machine Learning
- Tan, M.¹

34
- 0001046225
- Practical issues in temporal difference learning
- Tesauro, G. (1992). Practical issues in temporal difference learning. Machine Learning, 8, 257-277.
- (1992) Machine Learning , vol.8 , pp. 257-277
- Tesauro, G.¹

35
- 0000985504
- TD-Gammon, a self-teaching backgammon program, achieves master-level play
- Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6, 215-219.
- (1994) Neural Computation , vol.6 , pp. 215-219
- Tesauro, G.¹

36
- 0029276036
- Temporal difference learning and TD-Gammon
- Tesauro, G. (1995). Temporal difference learning and TD-Gammon. Communications of the ACM, 38, 58-68.
- (1995) Communications of the ACM , vol.38 , pp. 58-68
- Tesauro, G.¹

37
- 0026401514
- An elevator characterized group supervisory control system
- Tobita, T., Fujino, A., Inaba, H., Yoneda, K., & Ueshima, T. (1991). An elevator characterized group supervisory control system. Proceedings of IECON, (pp. 1972-1976).
- (1991) Proceedings of IECON , pp. 1972-1976
- Tobita, T.¹ Fujino, A.² Inaba, H.³ Yoneda, K.⁴ Ueshima, T.⁵

38
- 0004162272
- New York, NY: Academic Press
- Tsetlin, M. L. (1973). Automaton Theory and Modeling of Biological Systems. New York, NY: Academic Press.
- (1973) Automaton Theory and Modeling of Biological Systems
- Tsetlin, M.L.¹

39
- 0028448240
- The latest elevator group-control system
- Ujihara, H. & Amano, M. (1994). The latest elevator group-control system. Mitsubishi Electric Advance, 67, 10-12.
- (1994) Mitsubishi Electric Advance , vol.67 , pp. 10-12
- Ujihara, H.¹ Amano, M.²

40
- 0024141340
- The revolutionary AI-2100 elevator-group control system and the new intelligent option series
- Ujihara, H. & Tsuji, S. (1988). The revolutionary AI-2100 elevator-group control system and the new intelligent option series. Mitsubishi Electric Advance, 45, 5-8.
- (1988) Mitsubishi Electric Advance , vol.45 , pp. 5-8
- Ujihara, H.¹ Tsuji, S.²

41
- 0004049893
- PhD thesis, Cambridge University
- Watkins, C. J. C. H. (1989). Learning from Delayed Rewards. PhD thesis, Cambridge University.
- (1989) Learning from Delayed Rewards
- Watkins, C.J.C.H.¹

42
- 84949977009
- Adaptation and Learning in Multi-Agent Systems
- Berlin: Springer Verlag
- Weiss, G. & Sen, S. (1996). Adaptation and Learning in Multi-Agent Systems. Lecture Notes in Artificial Intelligence, Volume 1042. Berlin: Springer Verlag.
- (1996) Lecture Notes in Artificial Intelligence , vol.1042
- Weiss, G.¹ Sen, S.²

43
- 0004113431
- Englewood Cliffs, NJ: Prentice-Hall
- Widrow, B. & Stearns, S.D. (1985). Adaptive Signal Processing. Englewood Cliffs, NJ: Prentice-Hall.
- (1985) Adaptive Signal Processing
- Widrow, B.¹ Stearns, S.D.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.