Volume 12, Issue 4, 1998, Pages 235-262

Elevator Group Control Using Multiple Reinforcement Learning Agents

Author keywords

Discrete event dynamic systems; Elevator group control; Multiple agents; Reinforcement learning; Teams

Indexed keywords

ARTIFICIAL INTELLIGENCE; COMPUTER SIMULATION; DISCRETE TIME CONTROL SYSTEMS; DYNAMIC PROGRAMMING; ELEVATORS; LEARNING ALGORITHMS; PROBLEM SOLVING; RANDOM PROCESSES;

EID: 0032208335     PISSN: 08856125     EISSN: None     Source Type: Journal    
DOI: 10.1023/a:1007518724497     Document Type: Article
Times cited: 218

References (43)
  • 3
    • 0010367132
    • From chemotaxis to cooperativity: Abstract exercises in neuronal learning strategies
    • R. Durbin, C. Miall, and G. Mitchison, (Eds.), Wokingham, England: Addison-Wesley
    • Barto, A.G. (1989). From chemotaxis to cooperativity: Abstract exercises in neuronal learning strategies. In R. Durbin, C. Miall, and G. Mitchison, (Eds.), The Computing Neuron. Wokingham, England: Addison-Wesley.
    • (1989) The Computing Neuron
    • Barto, A.G.1
  • 4
  • 7
    • 0000409272
    • Reinforcement learning methods for continuous-time Markov decision problems
    • G. Tesauro, D. Touretzky, and T. Leen, (Eds.), Cambridge, MA: MIT Press
    • Bradtke, S.J. & Duff, M. O. (1995). Reinforcement learning methods for continuous-time Markov decision problems. In G. Tesauro, D. Touretzky, and T. Leen, (Eds.), Advances in Neural Information Processing Systems 7. Cambridge, MA: MIT Press.
    • (1995) Advances in Neural Information Processing Systems , vol.7
    • Bradtke, S.J.1    Duff, M.O.2
  • 11
    • 0003259931
    • Improving elevator performance using reinforcement learning
    • D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, (Eds.), Cambridge, MA: MIT Press
    • Crites, R. H. & Barto, A.G. (1996). Improving elevator performance using reinforcement learning. In D. S. Touretzky, M. C. Mozer, and M. E. Hasselmo, (Eds.), Advances in Neural Information Processing Systems 8. Cambridge, MA: MIT Press.
    • (1996) Advances in Neural Information Processing Systems , vol.8
    • Crites, R.H.1    Barto, A.G.2
  • 12
    • 0001234682
    • Feudal reinforcement learning
    • S. J. Hanson, J. D. Cowan, and C. L. Giles, (Eds.), San Mateo, CA: Morgan Kaufmann
    • Dayan, P. & Hinton, G.E. (1993). Feudal reinforcement learning. In S. J. Hanson, J. D. Cowan, and C. L. Giles, (Eds.), Advances in Neural Information Processing Systems 5. San Mateo, CA: Morgan Kaufmann.
    • (1993) Advances in Neural Information Processing Systems , vol.5
    • Dayan, P.1    Hinton, G.E.2
  • 14
    • 2542472632
    • A fuzzy neural network and its application to elevator group control
    • T. Terano, M. Sugeno, M. Mukaidono, and K. Shigemasu, (Eds.), Amsterdam: IOS Press
    • Imasaki, N., Kiji, J., & Endo, T. (1992). A fuzzy neural network and its application to elevator group control. In T. Terano, M. Sugeno, M. Mukaidono, and K. Shigemasu, (Eds.), Fuzzy Engineering Toward Human Friendly Systems. Amsterdam: IOS Press.
    • (1992) Fuzzy Engineering Toward Human Friendly Systems
    • Imasaki, N.1    Kiji, J.2    Endo, T.3
  • 20
    • 0343920388
    • Efficient learning of multiple degree-of-freedom control problems with quasi-independent Q-agents
    • M. C. Mozer, P. Smolensky, D. S. Touretzky, J. L. Elman, and A. S. Weigend, (Eds.), Hillsdale, NJ: Erlbaum Associates
    • Markey, K.L. (1994). Efficient learning of multiple degree-of-freedom control problems with quasi-independent Q-agents. In M. C. Mozer, P. Smolensky, D. S. Touretzky, J. L. Elman, and A. S. Weigend, (Eds.), Proceedings of the 1993 Connectionist Models Summer School. Hillsdale, NJ: Erlbaum Associates.
    • (1994) Proceedings of the 1993 Connectionist Models Summer School
    • Markey, K.L.1
  • 23
    • 38249015417
    • Electronics and information technology in high-range elevator systems
    • Ovaska, S.J. (1992). Electronics and information technology in high-range elevator systems. Mechatronics, 2, 89-99.
    • (1992) Mechatronics , vol.2 , pp. 89-99
    • Ovaska, S.J.1
  • 26
    • 0021371436
    • Development of elevator supervisory group control system with artificial intelligence
    • Sakai, Y. & Kurosawa, K. (1984). Development of elevator supervisory group control system with artificial intelligence. Hitachi Review, 33, 25-30.
    • (1984) Hitachi Review , vol.33 , pp. 25-30
    • Sakai, Y.1    Kurosawa, K.2
  • 27
    • 0003297918
    • Some studies in machine learning using the game of checkers
    • E. Feigenbaum and J. Feldman, (Eds.), New York, NY: McGraw-Hill
    • Samuel, A.L. (1963). Some studies in machine learning using the game of checkers. In E. Feigenbaum and J. Feldman, (Eds.), Computers and Thought. New York, NY: McGraw-Hill.
    • (1963) Computers and Thought
    • Samuel, A.L.1
  • 28
    • 0030050933
    • Multiagent reinforcement learning in the iterated prisoner's dilemma
    • Sandholm, T.W. & Crites, R.H. (1996). Multiagent reinforcement learning in the iterated prisoner's dilemma. Biosystems, 37, 147-166.
    • (1996) Biosystems , vol.37 , pp. 147-166
    • Sandholm, T.W.1    Crites, R.H.2
  • 30
    • 0027684588
    • Elevator traffic simulation
    • Siikonen, M.L. (1993). Elevator traffic simulation. Simulation, 61, 257-267.
    • (1993) Simulation , vol.61 , pp. 257-267
    • Siikonen, M.L.1
  • 34
    • 0001046225
    • Practical issues in temporal difference learning
    • Tesauro, G. (1992). Practical issues in temporal difference learning. Machine Learning, 8, 257-277.
    • (1992) Machine Learning , vol.8 , pp. 257-277
    • Tesauro, G.1
  • 35
    • 0000985504
    • TD-Gammon, a self-teaching backgammon program, achieves master-level play
    • Tesauro, G. (1994). TD-Gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6, 215-219.
    • (1994) Neural Computation , vol.6 , pp. 215-219
    • Tesauro, G.1
  • 36
    • 0029276036
    • Temporal difference learning and TD-Gammon
    • Tesauro, G. (1995). Temporal difference learning and TD-Gammon. Communications of the ACM, 38, 58-68.
    • (1995) Communications of the ACM , vol.38 , pp. 58-68
    • Tesauro, G.1
  • 39
    • 0028448240
    • The latest elevator group-control system
    • Ujihara, H. & Amano, M. (1994). The latest elevator group-control system. Mitsubishi Electric Advance, 67, 10-12.
    • (1994) Mitsubishi Electric Advance , vol.67 , pp. 10-12
    • Ujihara, H.1    Amano, M.2
  • 40
    • 0024141340
    • The revolutionary AI-2100 elevator-group control system and the new intelligent option series
    • Ujihara, H. & Tsuji, S. (1988). The revolutionary AI-2100 elevator-group control system and the new intelligent option series. Mitsubishi Electric Advance, 45, 5-8.
    • (1988) Mitsubishi Electric Advance , vol.45 , pp. 5-8
    • Ujihara, H.1    Tsuji, S.2
  • 42
    • 84949977009
    • Adaptation and Learning in Multi-Agent Systems
    • Berlin: Springer Verlag
    • Weiss, G. & Sen, S. (1996). Adaptation and Learning in Multi-Agent Systems. Lecture Notes in Artificial Intelligence, Volume 1042. Berlin: Springer Verlag.
    • (1996) Lecture Notes in Artificial Intelligence , vol.1042
    • Weiss, G.1    Sen, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.