-
1
-
-
0034325416
-
Dynamic job-shop scheduling using reinforcement learning agents
-
Aydin, M.E., Oztemel, E., 2000. Dynamic job-shop scheduling using reinforcement learning agents. Robotics and Autonomous Systems 33, 169-178.
-
(2000)
Robotics and Autonomous Systems
, vol.33
, pp. 169-178
-
-
Aydin, M.E.1
Oztemel, E.2
-
2
-
-
0004215051
-
-
Springer, Berlin
-
Brenner, W., Zarnekow, R., Witting, H., 1998. Intelligent Software Agents. Springer, Berlin, pp. 19-117.
-
(1998)
Intelligent Software Agents
, pp. 19-117
-
-
Brenner, W.1
Zarnekow, R.2
Witting, H.3
-
4
-
-
85156187730
-
Improving elevator performance using reinforcement learning
-
Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.). MIT Press, Cambridge, MA
-
Crites, R.H., Barto, A.G., 1996. Improving elevator performance using reinforcement learning. In: Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.), Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference. MIT Press, Cambridge, MA, pp. 1017-1023.
-
(1996)
Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference
, pp. 1017-1023
-
-
Crites, R.H.1
Barto, A.G.2
-
5
-
-
0242665905
-
A proposed structure for distributed shop floor control
-
Crowe, T.J., Stahlman, E.J., 1995. A proposed structure for distributed shop floor control. Integrated Manufacturing Systems 6 (6), 31-36.
-
(1995)
Integrated Manufacturing Systems
, vol.6
, Issue.6
, pp. 31-36
-
-
Crowe, T.J.1
Stahlman, E.J.2
-
6
-
-
0032643313
-
Solving semi-Markov decision problems using average reward reinforcement learning
-
Das, T.K., Gosavi, A., Mahadevan, S., Marchalleck, N., 1999. Solving semi-Markov decision problems using average reward reinforcement learning. Management Science 45 (4), 560-574.
-
(1999)
Management Science
, vol.45
, Issue.4
, pp. 560-574
-
-
Das, T.K.1
Gosavi, A.2
Mahadevan, S.3
Marchalleck, N.4
-
7
-
-
44949284972
-
The evolution of control architectures for automated manufacturing systems
-
Dilts, D.M., Boyd, N.P., Whorms, H.H., 1991. The evolution of control architectures for automated manufacturing systems. Journal of Manufacturing Systems 10 (1), 79-93.
-
(1991)
Journal of Manufacturing Systems
, vol.10
, Issue.1
, pp. 79-93
-
-
Dilts, D.M.1
Boyd, N.P.2
Whorms, H.H.3
-
8
-
-
0028590749
-
Real-time distributed scheduling of heterarchical manufacturing systems
-
Duffie, N.A., Prabhu, V.V., 1994. Real-time distributed scheduling of heterarchical manufacturing systems. Journal of Manufacturing Systems 13 (2), 94-107.
-
(1994)
Journal of Manufacturing Systems
, vol.13
, Issue.2
, pp. 94-107
-
-
Duffie, N.A.1
Prabhu, V.V.2
-
9
-
-
17144419347
-
The NSF Workshop on Reinforcement Learning: Summary and Observations
-
Mahadevan, S., Kaelbling, L.P., 1996. The NSF Workshop on Reinforcement Learning: Summary and Observations. AI Magazine Winter, 89-97.
-
(1996)
AI Magazine
, vol.WINTER
, pp. 89-97
-
-
Mahadevan, S.1
Kaelbling, L.P.2
-
11
-
-
4944257989
-
Designing agent controllers using discrete-event Markov models
-
MIT, Cambridge
-
Mahadevan, S., Khaleeli, N., Marchalleck, N., 1997a. Designing agent controllers using discrete-event Markov models. AAAI Fall Symposium on Model-Directed Autonomous Systems, MIT, Cambridge.
-
(1997)
AAAI Fall Symposium on Model-directed Autonomous Systems
-
-
Mahadevan, S.1
Khaleeli, N.2
Marchalleck, N.3
-
12
-
-
0001963197
-
Self-improving factory simulation using continuous-time average-reward reinforcement learning
-
Mahadevan, S., Marchalleck, N., Das, T.K., Gosavi, A., 1997b. Self-improving factory simulation using continuous-time average-reward reinforcement learning. Proceedings of the Fourth International Machine Learning Conference, pp. 202-210.
-
(1997)
Proceedings of the Fourth International Machine Learning Conference
, pp. 202-210
-
-
Mahadevan, S.1
Marchalleck, N.2
Das, T.K.3
Gosavi, A.4
-
14
-
-
0004215048
-
-
Prentice-Hall, Englewood Cliffs, NJ
-
Murch, R., Johnson, T., 1998. Intelligent Software Agents. Prentice-Hall, Englewood Cliffs, NJ.
-
(1998)
Intelligent Software Agents
-
-
Murch, R.1
Johnson, T.2
-
15
-
-
0003054955
-
A survey of scheduling rules
-
Panwalkar, S.S., Iskander, W., 1977. A survey of scheduling rules. Operations Research 25 (1), 45-62.
-
(1977)
Operations Research
, vol.25
, Issue.1
, pp. 45-62
-
-
Panwalkar, S.S.1
Iskander, W.2
-
16
-
-
0035124331
-
Intelligent dynamic control policies for serial production lines
-
Paternina-Arboleda, C.D., Das, T.K., 2001. Intelligent dynamic control policies for serial production lines. IIE Transactions 33, 65-77.
-
(2001)
IIE Transactions
, vol.33
, pp. 65-77
-
-
Paternina-Arboleda, C.D.1
Das, T.K.2
-
17
-
-
0003438602
-
-
Prentice-Hall, Englewood Cliffs, NJ
-
Pinedo, M., 1995. Scheduling Theory, Algorithms, and Systems. Prentice-Hall, Englewood Cliffs, NJ.
-
(1995)
Scheduling Theory, Algorithms, and Systems
-
-
Pinedo, M.1
-
18
-
-
85133749042
-
Agent-based systems for intelligent manufacturing: A state-of-the-art survey
-
Shen, W., Norrie, D.H., 1999. Agent-based systems for intelligent manufacturing: a state-of-the-art survey. Knowledge and Information Systems: an International Journal 1 (2), 129-156.
-
(1999)
Knowledge and Information Systems: An International Journal
, vol.1
, Issue.2
, pp. 129-156
-
-
Shen, W.1
Norrie, D.H.2
-
19
-
-
0003573348
-
-
Taylor & Francis, London
-
Shen, W., Norrie, D.H., Barthés, J.-P.A., 2000. Multi-Agent Systems for Concurrent Intelligent Design and Manufacturing. Taylor & Francis, London.
-
(2000)
Multi-Agent Systems for Concurrent Intelligent Design and Manufacturing
-
-
Shen, W.1
Norrie, D.H.2
Barthès, J.-P.A.3
-
20
-
-
84898972974
-
Reinforcement learning for dynamic channel allocation in cellular telephone systems
-
MIT Press, Cambridge, MA
-
Singh, S.P., Bertsekas, D., 1997. Reinforcement learning for dynamic channel allocation in cellular telephone systems. Advances in Neural Information Processing Systems: Proceedings of the 1996 Conference. MIT Press, Cambridge, MA, pp. 974-980.
-
(1997)
Advances in Neural Information Processing Systems: Proceedings of the 1996 Conference
, pp. 974-980
-
-
Singh, S.P.1
Bertsekas, D.2
-
21
-
-
0019180974
-
The contract net protocol: High level communication and control in distributed problem solver
-
Smith, R., 1980. The contract net protocol: high level communication and control in distributed problem solver. IEEE Transactions on Computers 29 (12), 1104-1113.
-
(1980)
IEEE Transactions on Computers
, vol.29
, Issue.12
, pp. 1104-1113
-
-
Smith, R.1
-
22
-
-
85156221438
-
Generalization in reinforcement learning: Successful examples using spare coarse coding
-
Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.). MIT Press, Cambridge, MA
-
Sutton, R.S., 1996. Generalization in reinforcement learning: successful examples using spare coarse coding. In: Touretzky, D.S., Mozer, M.C., Hasselmo, M.E. (Eds.), Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference. MIT Press, Cambridge, MA, pp. 1038-1044.
-
(1996)
Advances in Neural Information Processing Systems: Proceedings of the 1995 Conference
, pp. 1038-1044
-
-
Sutton, R.S.1
-
23
-
-
0004102479
-
-
The MIT Press, Cambridge, MA
-
Sutton, R.S., Barto, A.G., 1999. Reinforcement Learning: An Introduction. The MIT Press, Cambridge, MA.
-
(1999)
Reinforcement Learning: An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
24
-
-
0029276036
-
Temporal difference learning and TD-Gammon
-
Tesauro, G., 1995. Temporal difference learning and TD-Gammon. Communications of the ACM 38 (3), 58-67.
-
(1995)
Communications of the ACM
, vol.38
, Issue.3
, pp. 58-67
-
-
Tesauro, G.1
|