메뉴 건너뛰기




Volumn 18, Issue 1, 2005, Pages 73-82

Application of reinforcement learning for agent-based production scheduling

Author keywords

Dispatching rule selection; Q learning; Reinforcement learning

Indexed keywords

ALGORITHMS; AUTONOMOUS AGENTS; NUMERICAL CONTROL SYSTEMS; PROBLEM SOLVING; PRODUCTION CONTROL; REINFORCEMENT; SCHEDULING;

EID: 10844274625     PISSN: 09521976     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.engappai.2004.08.018     Document Type: Article
Times cited : (223)

References (27)
  • 1
    • 0034325416 scopus 로고    scopus 로고
    • Dynamic job-shop scheduling using reinforcement learning agents
    • Aydin, M.E., Oztemel, E., 2000. Dynamic job-shop scheduling using reinforcement learning agents. Robotics and Autonomous Systems 33, 169-178.
    • (2000) Robotics and Autonomous Systems , vol.33 , pp. 169-178
    • Aydin, M.E.1    Oztemel, E.2
  • 4
    • 85156187730 scopus 로고    scopus 로고
    • Improving elevator performance using reinforcement learning
    • Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.). MIT Press, Cambridge, MA
    • Crites, R.H., Barto, A.G., 1996. Improving elevator performance using reinforcement learning. In: Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.), Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference. MIT Press, Cambridge, MA, pp. 1017-1023.
    • (1996) Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference , pp. 1017-1023
    • Crites, R.H.1    Barto, A.G.2
  • 5
    • 0242665905 scopus 로고
    • A proposed structure for distributed shop floor control
    • Crowe, T.J., Stahlman, E.J., 1995. A proposed structure for distributed shop floor control. Integrated Manufacturing Systems 6 (6), 31-36.
    • (1995) Integrated Manufacturing Systems , vol.6 , Issue.6 , pp. 31-36
    • Crowe, T.J.1    Stahlman, E.J.2
  • 6
    • 0032643313 scopus 로고    scopus 로고
    • Solving semi-Markov decision problems using average reward reinforcement learning
    • Das, T.K., Gosavi, A., Mahadevan, S., Marchalleck, N., 1999. Solving semi-Markov decision problems using average reward reinforcement learning. Management Science 45 (4), 560-574.
    • (1999) Management Science , vol.45 , Issue.4 , pp. 560-574
    • Das, T.K.1    Gosavi, A.2    Mahadevan, S.3    Marchalleck, N.4
  • 7
    • 44949284972 scopus 로고
    • The evolution of control architectures for automated manufacturing systems
    • Dilts, D.M., Boyd, N.P., Whorms, H.H., 1991. The evolution of control architectures for automated manufacturing systems. Journal of Manufacturing Systems 10 (1), 79-93.
    • (1991) Journal of Manufacturing Systems , vol.10 , Issue.1 , pp. 79-93
    • Dilts, D.M.1    Boyd, N.P.2    Whorms, H.H.3
  • 8
    • 0028590749 scopus 로고
    • Real-time distributed scheduling of heterarchical manufacturing systems
    • Duffie, N.A., Prabhu, V.V., 1994. Real-time distributed scheduling of heterarchical manufacturing systems. Journal of Manufacturing Systems 13 (2), 94-107.
    • (1994) Journal of Manufacturing Systems , vol.13 , Issue.2 , pp. 94-107
    • Duffie, N.A.1    Prabhu, V.V.2
  • 9
    • 17144419347 scopus 로고    scopus 로고
    • The NSF Workshop on Reinforcement Learning: Summary and Observations
    • Mahadevan, S., Kaelbling, L.P., 1996. The NSF Workshop on Reinforcement Learning: Summary and Observations. AI Magazine Winter, 89-97.
    • (1996) AI Magazine , vol.WINTER , pp. 89-97
    • Mahadevan, S.1    Kaelbling, L.P.2
  • 16
    • 0035124331 scopus 로고    scopus 로고
    • Intelligent dynamic control policies for serial production lines
    • Paternina-Arboleda, C.D., Das, T.K., 2001. Intelligent dynamic control policies for serial production lines. IIE Transactions 33, 65-77.
    • (2001) IIE Transactions , vol.33 , pp. 65-77
    • Paternina-Arboleda, C.D.1    Das, T.K.2
  • 21
    • 0019180974 scopus 로고
    • The contract net protocol: High level communication and control in distributed problem solver
    • Smith, R., 1980. The contract net protocol: high level communication and control in distributed problem solver. IEEE Transactions on Computers 29 (12), 1104-1113.
    • (1980) IEEE Transactions on Computers , vol.29 , Issue.12 , pp. 1104-1113
    • Smith, R.1
  • 22
    • 85156221438 scopus 로고    scopus 로고
    • Generalization in reinforcement learning: Successful examples using spare coarse coding
    • Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.). MIT Press, Cambridge, MA
    • Sutton, R.S., 1996. Generalization in reinforcement learning: successful examples using spare coarse coding. In: Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.), Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference. MIT Press, Cambridge, MA, pp. 1038-1044.
    • (1996) Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference , pp. 1038-1044
    • Sutton, R.S.1
  • 24
    • 0029276036 scopus 로고
    • Temporal difference learning and TD-Gammon
    • Tesauro, G., 1995. Temporal difference learning and TD-Gammon. Communications of the ACM 38 (3), 58-67.
    • (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-67
    • Tesauro, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.