SCOPUS 정보 검색 플랫폼

Engineering Applications of Artificial Intelligence

Volumn 18, Issue 1, 2005, Pages 73-82

Application of reinforcement learning for agent-based production scheduling

(2) Wang, Yi Chi a,b Usher, John M a

a MISSISSIPPI STATE UNIVERSITY (United States)

b KUN SHAN UNIVERSITY (Taiwan)

Author keywords

Dispatching rule selection; Q learning; Reinforcement learning

Indexed keywords

ALGORITHMS; AUTONOMOUS AGENTS; NUMERICAL CONTROL SYSTEMS; PROBLEM SOLVING; PRODUCTION CONTROL; REINFORCEMENT; SCHEDULING;

DISPATCHING RULE SELECTION; PRODUCTION SCHEDULING; Q-LEARNING; REINFORCEMENT LEARNING (RL);

LEARNING SYSTEMS;

EID: 10844274625 PISSN: 09521976 EISSN: None Source Type: Journal
DOI: 10.1016/j.engappai.2004.08.018 Document Type: Article

Times cited : (223)

References (27)

1
- 0034325416
- Dynamic job-shop scheduling using reinforcement learning agents
- Aydin, M.E., Oztemel, E., 2000. Dynamic job-shop scheduling using reinforcement learning agents. Robotics and Autonomous Systems 33, 169-178.
- (2000) Robotics and Autonomous Systems , vol.33 , pp. 169-178
- Aydin, M.E.¹ Oztemel, E.²

2
- 0004215051
- Springer, Berlin
- Brenner, W., Zarnekow, R., Witting, H., 1998. Intelligent Software Agents. Springer, Berlin, pp. 19-117.
- (1998) Intelligent Software Agents , pp. 19-117
- Brenner, W.¹ Zarnekow, R.² Witting, H.³

3
- 84891459569
- Springer, Berlin
- Brucker, P., 2001. Scheduling Algorithms. Springer, Berlin.
- (2001) Scheduling Algorithms
- Brucker, P.¹

4
- 85156187730
- Improving elevator performance using reinforcement learning
- Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.). MIT Press, Cambridge, MA
- Crites, R.H., Barto, A.G., 1996. Improving elevator performance using reinforcement learning. In: Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.), Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference. MIT Press, Cambridge, MA, pp. 1017-1023.
- (1996) Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference , pp. 1017-1023
- Crites, R.H.¹ Barto, A.G.²

5
- 0242665905
- A proposed structure for distributed shop floor control
- Crowe, T.J., Stahlman, E.J., 1995. A proposed structure for distributed shop floor control. Integrated Manufacturing Systems 6 (6), 31-36.
- (1995) Integrated Manufacturing Systems , vol.6 , Issue.6 , pp. 31-36
- Crowe, T.J.¹ Stahlman, E.J.²

6
- 0032643313
- Solving semi-Markov decision problems using average reward reinforcement learning
- Das, T.K., Gosavi, A., Mahadevan, S., Marchalleck, N., 1999. Solving semi-Markov decision problems using average reward reinforcement learning. Management Science 45 (4), 560-574.
- (1999) Management Science , vol.45 , Issue.4 , pp. 560-574
- Das, T.K.¹ Gosavi, A.² Mahadevan, S.³ Marchalleck, N.⁴

7
- 44949284972
- The evolution of control architectures for automated manufacturing systems
- Dilts, D.M., Boyd, N.P., Whorms, H.H., 1991. The evolution of control architectures for automated manufacturing systems. Journal of Manufacturing Systems 10 (1), 79-93.
- (1991) Journal of Manufacturing Systems , vol.10 , Issue.1 , pp. 79-93
- Dilts, D.M.¹ Boyd, N.P.² Whorms, H.H.³

8
- 0028590749
- Real-time distributed scheduling of heterarchical manufacturing systems
- Duffie, N.A., Prabhu, V.V., 1994. Real-time distributed scheduling of heterarchical manufacturing systems. Journal of Manufacturing Systems 13 (2), 94-107.
- (1994) Journal of Manufacturing Systems , vol.13 , Issue.2 , pp. 94-107
- Duffie, N.A.¹ Prabhu, V.V.²

9
- 17144419347
- The NSF Workshop on Reinforcement Learning: Summary and Observations
- Mahadevan, S., Kaelbling, L.P., 1996. The NSF Workshop on Reinforcement Learning: Summary and Observations. AI Magazine Winter, 89-97.
- (1996) AI Magazine , vol.WINTER , pp. 89-97
- Mahadevan, S.¹ Kaelbling, L.P.²

10
- 0002228390
- Optimizing production manufacturing using reinforcement learning
- AAAI Press
- Mahadevan, S., Theocharous, G., 1998. Optimizing production manufacturing using reinforcement learning. The Eleventh International FLAIRS Conference, AAAI Press, pp. 372-377.
- (1998) The Eleventh International FLAIRS Conference , pp. 372-377
- Mahadevan, S.¹ Theocharous, G.²

11
- 4944257989
- Designing agent controllers using discrete-event Markov models
- MIT, Cambridge
- Mahadevan, S., Khaleeli, N., Marchalleck, N., 1997a. Designing agent controllers using discrete-event Markov models. AAAI Fall Symposium on Model-Directed Autonomous Systems, MIT, Cambridge.
- (1997) AAAI Fall Symposium on Model-directed Autonomous Systems
- Mahadevan, S.¹ Khaleeli, N.² Marchalleck, N.³

12
- 0001963197
- Self-improving factory simulation using continuous-time average-reward reinforcement learning
- Mahadevan, S., Marchalleck, N., Das, T.K., Gosavi, A., 1997b. Self-improving factory simulation using continuous-time average-reward reinforcement learning. Proceedings of the Fourth International Machine Learning Conference, pp. 202-210.
- (1997) Proceedings of the Fourth International Machine Learning Conference , pp. 202-210
- Mahadevan, S.¹ Marchalleck, N.² Das, T.K.³ Gosavi, A.⁴

13
- 0003949709
- Wiley, New York
- Morton, T.E., Pentico, D.W., 1993. Heuristic Scheduling Systems. Wiley, New York.
- (1993) Heuristic Scheduling Systems
- Morton, T.E.¹ Pentico, D.W.²

14
- 0004215048
- Prentice-Hall, Englewood Cliffs, NJ
- Murch, R., Johnson, T., 1998. Intelligent Software Agents. Prentice-Hall, Englewood Cliffs, NJ.
- (1998) Intelligent Software Agents
- Murch, R.¹ Johnson, T.²

15
- 0003054955
- A survey of scheduling rules
- Panwalkar, S.S., Iskander, W., 1977. A survey of scheduling rules. Operations Research 25 (1), 45-62.
- (1977) Operations Research , vol.25 , Issue.1 , pp. 45-62
- Panwalkar, S.S.¹ Iskander, W.²

16
- 0035124331
- Intelligent dynamic control policies for serial production lines
- Paternina-Arboleda, C.D., Das, T.K., 2001. Intelligent dynamic control policies for serial production lines. IIE Transactions 33, 65-77.
- (2001) IIE Transactions , vol.33 , pp. 65-77
- Paternina-Arboleda, C.D.¹ Das, T.K.²

17
- 0003438602
- Prentice-Hall, Englewood Cliffs, NJ
- Pinedo, M., 1995. Scheduling Theory, Algorithms, and Systems. Prentice-Hall, Englewood Cliffs, NJ.
- (1995) Scheduling Theory, Algorithms, and Systems
- Pinedo, M.¹

18
- 85133749042
- Agent-based systems for intelligent manufacturing: A state-of-the-art survey
- Shen, W., Norrie, D.H., 1999. Agent-based systems for intelligent manufacturing: a state-of-the-art survey. Knowledge and Information Systems: an International Journal 1 (2), 129-156.
- (1999) Knowledge and Information Systems: An International Journal , vol.1 , Issue.2 , pp. 129-156
- Shen, W.¹ Norrie, D.H.²

19
- 0003573348
- Taylor & Francis, London
- Shen, W., Norrie, D.H., Barthés, J.-P.A., 2000. Multi-Agent Systems for Concurrent Intelligent Design and Manufacturing. Taylor & Francis, London.
- (2000) Multi-Agent Systems for Concurrent Intelligent Design and Manufacturing
- Shen, W.¹ Norrie, D.H.² Barthès, J.-P.A.³

20
- 84898972974
- Reinforcement learning for dynamic channel allocation in cellular telephone systems
- MIT Press, Cambridge, MA
- Singh, S.P., Bertsekas, D., 1997. Reinforcement learning for dynamic channel allocation in cellular telephone systems. Advances in Neural Information Processing Systems: Proceedings of the 1996 Conference. MIT Press, Cambridge, MA, pp. 974-980.
- (1997) Advances in Neural Information Processing Systems: Proceedings of the 1996 Conference , pp. 974-980
- Singh, S.P.¹ Bertsekas, D.²

21
- 0019180974
- The contract net protocol: High level communication and control in distributed problem solver
- Smith, R., 1980. The contract net protocol: high level communication and control in distributed problem solver. IEEE Transactions on Computers 29 (12), 1104-1113.
- (1980) IEEE Transactions on Computers , vol.29 , Issue.12 , pp. 1104-1113
- Smith, R.¹

22
- 85156221438
- Generalization in reinforcement learning: Successful examples using spare coarse coding
- Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.). MIT Press, Cambridge, MA
- Sutton, R.S., 1996. Generalization in reinforcement learning: successful examples using spare coarse coding. In: Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.), Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference. MIT Press, Cambridge, MA, pp. 1038-1044.
- (1996) Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference , pp. 1038-1044
- Sutton, R.S.¹

23
- 0004102479
- The MIT Press, Cambridge, MA
- Sutton, R.S., Barto, A.G., 1999. Reinforcement Learning: An Introduction. The MIT Press, Cambridge, MA.
- (1999) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

24
- 0029276036
- Temporal difference learning and TD-Gammon
- Tesauro, G., 1995. Temporal difference learning and TD-Gammon. Communications of the ACM 38 (3), 58-67.
- (1995) Communications of the ACM , vol.38 , Issue.3 , pp. 58-67
- Tesauro, G.¹

25
- 34249833101
- Q-learning
- Watkins, C.J.C.H., Dayan, P., 1992. Q-learning. Machine Learning 8, 279-292.
- (1992) Machine Learning , vol.8 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.²

26
- 0003744207
- MIT Press, Cambridge, MA
- Weiss, G., 1999. Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence. MIT Press, Cambridge, MA.
- (1999) Multiagent Systems: A Modern Approach to Distributed Artificial Intelligence
- Weiss, G.¹

27
- 84918834208
- A reinforcement learning approach to job-shop scheduling
- Zhang, W., Dietterich, T.G., 1995. A reinforcement learning approach to job-shop scheduling. Proceedings of the 14th International Joint Conference on Artificial Intelligence, pp. 1114-1120.
- (1995) Proceedings of the 14th International Joint Conference on Artificial Intelligence , pp. 1114-1120
- Zhang, W.¹ Dietterich, T.G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.