-
1
-
-
0037616356
-
Reinforcement learning for true adaptive traffic signal control
-
DOI 10.1061/(ASCE)0733-947X(2003)129:3(278)
-
ABDULHAI, B., PRINGLE, R., AND KARAKOULAS, G. 2003. Reinforcement learning for the true adaptive traffic signal control. J. Trans. Engin. 129, 3, 278-285. (Pubitemid 36594889)
-
(2003)
Journal of Transportation Engineering
, vol.129
, Issue.3
, pp. 278-285
-
-
Abdulhai, B.1
Pringle, R.2
Karakoulas, G.J.3
-
2
-
-
33644809850
-
A distributed approach for coordination of traffic signal agents
-
BAZZAN, A. L. 2005. A distributed approach for coordination of traffic signal agents. Auton. Agents Multi-Agent Syst. 10, 1, 131-164.
-
(2005)
Auton. Agents Multi-Agent Syst.
, vol.10
, Issue.1
, pp. 131-164
-
-
Bazzan, A.L.1
-
4
-
-
73649088207
-
Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces
-
CUAYAHUITL, H., RENALS, S., LEMON, O., AND SHIMODAIRA, H. 2006. Learning multi-goal dialogue strategies using reinforcement learning with reduced state-action spaces. Int. J. Game Theory, 547-565.
-
(2006)
Int. J. Game Theory
, pp. 547-565
-
-
Cuayahuitl, H.1
Renals, S.2
Lemon, O.3
Shimodaira, H.4
-
5
-
-
33749262176
-
Dealing with non-stationary environments using context detection
-
ACM, New York
-
DA SILVA, B. C., BASSO, E. W., BAZZAN, A. L. C., AND ENGEL, P. M. 2006. Dealing with non-stationary environments using context detection. In Proceedings of the 23rd International Conference on Machine Learning (ICML'06). ACM, New York, 217-224.
-
(2006)
Proceedings of the 23rd International Conference on Machine Learning (ICML'06)
, pp. 217-224
-
-
Da Silva, B.C.1
Basso, E.W.2
Bazzan, A.L.C.3
Engel, P.M.4
-
7
-
-
33750270145
-
Building autonomic systems using collaborative reinforcement learning
-
DOI 10.1017/S0269888906000956, PII S0269888906000956
-
DOWLING, J., CUNNINGHAM, R., CURRAN, E., AND CAHILL, V. 2006. Building autonomic systems using collaborative reinforcement learning. Knowl. Engin. Rev. 21, 3, 231-238. (Pubitemid 44610665)
-
(2006)
Knowledge Engineering Review
, vol.21
, Issue.3
, pp. 231-238
-
-
Dowling, J.1
Cunningham, R.2
Curran, E.3
Cahill, V.4
-
9
-
-
70350650156
-
Using reinforcement learning for multi-policy optimization in decentralized autonomic systems - An experimental evaluation
-
W. Reif, G. Wang, and J. Indulska, Eds. Lecture Notes in Computer Science Springer
-
DUSPARIC, I. AND CAHILL, V. 2009b. Using reinforcement learning for multi-policy optimization in decentralized autonomic systems - An experimental evaluation. In Proceedings of the 6th International Conference on Autonomic and Trusted Computing, W. Reif, G. Wang, and J. Indulska, Eds. Lecture Notes in Computer Science, vol. 5586. Springer, 105-119.
-
(2009)
Proceedings of the 6th International Conference on Autonomic and Trusted Computing
, vol.5586
, pp. 105-119
-
-
Dusparic, I.1
Cahill, V.2
-
10
-
-
10644231952
-
Urban traffic control structure based on hybrid petri nets
-
FEBBRARO, A. D., GIGLIO, D., AND SACCO, N. 2004. Urban traffic control structure based on hybrid petri nets. IEEE Trans. Intell. Trans. Syst. 5, 4, 224-237.
-
(2004)
IEEE Trans. Intell. Trans. Syst.
, vol.5
, Issue.4
, pp. 224-237
-
-
Febbraro, A.D.1
Giglio, D.2
Sacco, N.3
-
12
-
-
84901424438
-
Evolutionary swarm traffic: If ant roads had traffic lights
-
IEEE Computer Society, Washington, DC
-
HOAR, R., PENNER, J., AND JACOB, C. 2002. Evolutionary swarm traffic: If ant roads had traffic lights. In (CEC'02) Proceedings of the Evolutionary Computation (CEC '02). Proceedings of the 2002 Congress. IEEE Computer Society, Washington, DC, 1910-1915.
-
(2002)
(CEC'02) Proceedings of the Evolutionary Computation (CEC '02). Proceedings of the 2002 Congress
, pp. 1910-1915
-
-
Hoar, R.1
Penner, J.2
Jacob, C.3
-
16
-
-
0037253062
-
The vision of autonomic computing
-
KEPHART, J. O. AND CHESS, D. M. 2003. The vision of autonomic computing. Comput. 36, 1, 41-50.
-
(2003)
Comput.
, vol.36
, Issue.1
, pp. 41-50
-
-
Kephart, J.O.1
Chess, D.M.2
-
17
-
-
40949099898
-
Utile coordination: Learning interdependencies among cooperative agents
-
KOK, J. R., 'T HOEN, P. J., BAKKER, B., AND VLASSIS, N. 2005. Utile coordination: Learning interdependencies among cooperative agents. In Proceedings of the IEEE Symposium on Computational Intelligence and Games (CIG). 29-36.
-
(2005)
Proceedings of the IEEE Symposium on Computational Intelligence and Games (CIG)
, pp. 29-36
-
-
Kok, J.R.1
Hoen P J, '.T.2
Bakker, B.3
Vlassis, N.4
-
18
-
-
4544266153
-
Reinforcement learning for autonomic network repair
-
IEEE Computer Society, Washington, DC
-
LITTMAN, M. L., RAVI, N., FENSON, E., AND HOWARD, R. 2004. Reinforcement learning for autonomic network repair. In Proceedings of the 1st International Conference on Autonomic Computing (ICAC'04). IEEE Computer Society, Washington, DC, 284-285.
-
(2004)
Proceedings of the 1st International Conference on Autonomic Computing (ICAC'04)
, pp. 284-285
-
-
Littman, M.L.1
Ravi, N.2
Fenson, E.3
Howard, R.4
-
21
-
-
50649087556
-
Grid differentiated services: A reinforcement learning approach
-
IEEE Computer Society, Washington, DC
-
PEREZ, J., GERMAIN-RENAUD, C., KEGL, B., AND LOOMIS, C. 2008. Grid differentiated services: A reinforcement learning approach. In Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGRID '08). IEEE Computer Society, Washington, DC, 287-294.
-
(2008)
Proceedings of the 8th IEEE International Symposium on Cluster Computing and the Grid (CCGRID '08)
, pp. 287-294
-
-
Perez, J.1
Germain-Renaud, C.2
Kegl, B.3
Loomis, C.4
-
22
-
-
48249132784
-
Organic control of traffic lights
-
Springer
-
PROTHMANN, H., ROCHNER, F., TOMFORDE, S., BRANKE, J., MÜLLER-SCHLOER, C., AND SCHMECK, H. 2008. Organic control of traffic lights. In Proceedings of the 5th International Conference on Autonomic and Trusted Computing (ATC '08). Springer, 219-233.
-
(2008)
Proceedings of the 5th International Conference on Autonomic and Trusted Computing (ATC '08)
, pp. 219-233
-
-
Prothmann, H.1
Rochner, F.2
Tomforde, S.3
Branke, J.4
Müller-Schloer, C.5
Schmeck, H.6
-
23
-
-
34250751835
-
Requirements for an ubiquitous computing simulation and emulation environment
-
ACM, New York
-
REYNOLDS, V., CAHILL, V., AND SENART, A. 2006. Requirements for an ubiquitous computing simulation and emulation environment. In Proceedings of the InterSense '06 Conference. ACM, New York.
-
(2006)
Proceedings of the InterSense '06 Conference
-
-
Reynolds, V.1
Cahill, V.2
Senart, A.3
-
25
-
-
84864064043
-
Natural actor-critic for road traffic optimisation
-
The MIT Press, Cambridge, MA
-
RICHTER, S., ABERDEEN, D., AND YU, J. 2007. Natural actor-critic for road traffic optimisation. Adv. Neural Inf. Process. Syst. 19. The MIT Press, Cambridge, MA.
-
(2007)
Adv. Neural Inf. Process. Syst.
, vol.19
-
-
Richter, S.1
Aberdeen, D.2
J, Y.U.3
-
27
-
-
62949112174
-
A collaborative reinforcement learning approach to urban traffic control optimization
-
SALKHAM, A., CUNNINGHAM, R., GARG, A., AND CAHILL, V. 2008. A collaborative reinforcement learning approach to urban traffic control optimization. In Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT). Vol. 2. 560-566.
-
(2008)
Proceedings of the IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT)
, vol.2
, pp. 560-566
-
-
Salkham, A.1
Cunningham, R.2
Garg, A.3
Cahill, V.4
-
28
-
-
0001395498
-
Distributed value functions
-
SCHNEIDER, J., WONG, W.-K., MOORE, A., AND RIEDMILLER, M. 1999. Distributed value functions. In Proceedings of the 16th International Conference on Machine Learning. Morgan Kaufmann, 371-378.
-
(1999)
Proceedings of the 16th International Conference on Machine Learning. Morgan Kaufmann
, pp. 371-378
-
-
Schneider, J.1
Wong, W.-K.2
Moore, A.3
Riedmiller, M.4
-
29
-
-
0008321896
-
Reinforcement learning: An introduction
-
The MIT Press, Cambridge, MA
-
SUTON, R. S. AND BARTO, A. G. 1998. Reinforcement Learning: An Introduction. A Bradford Book. The MIT Press, Cambridge, MA.
-
(1998)
A Bradford Book
-
-
Suton, R.S.1
Barto, A.G.2
-
30
-
-
0032096675
-
Multiagent systems
-
SYCARA, K. 1998. Multiagent systems. AI Mag. 19, 2.
-
(1998)
AI Mag.
, vol.19
, pp. 2
-
-
Sycara, K.1
-
32
-
-
33847379922
-
Reinforcement learning in autonomic computing: A manifesto and case studies
-
DOI 10.1109/MIC.2007.21
-
TESAURO, G. 2007. Reinforcement learning in autonomic computing: A manifesto and case studies. IEEE Internet Comput. 11, 1, 22-30. (Pubitemid 46335538)
-
(2007)
IEEE Internet Computing
, vol.11
, Issue.1
, pp. 22-30
-
-
Tesauro, G.1
-
33
-
-
4544234137
-
A multi-agent systems approach to autonomic computing
-
TESAURO, G., CHESS, D. M., WALSH, W. E., DAS, R., SEGAL, A., WHALLEY, I., KEPHART, J. O., AND WHITE, S. R. 2004. A multi-agent systems approach to autonomic computing. In Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems. 464-471.
-
(2004)
Proceedings of the International Joint Conference on Autonomous Agents and Multiagent Systems
, pp. 464-471
-
-
Tesauro, G.1
Chess, D.M.2
Walsh, W.E.3
Das, R.4
Segal, A.5
Whalley, I.6
Kephart, J.O.7
White, S.R.8
-
34
-
-
33745506664
-
Utility-function-driven resource allocation in autonomic systems
-
DOI 10.1109/ICAC.2005.65, 1498088, Proceedings - Second International Conference on Autonomic Computing, ICAC 2005
-
TESAURO, G., DAS, R., WALSH, W. E., AND KEPHART, J. O. 2005. Utility-Function-Driven resource allocation in autonomic systems. In Proceedings of the International Conference on Autonomic Computing. 342-343. (Pubitemid 43959647)
-
(2005)
Proceedings - Second International Conference on Autonomic Computing, ICAC 2005
, vol.2005
, pp. 342-343
-
-
Tesauro, G.1
Das, R.2
Walsh, W.E.3
Kephart, J.O.4
-
35
-
-
34247560904
-
A hybrid reinforcement learning approach to autonomic resource allocation
-
1662383, Proceedings - 3rd International Conference on Autonomic Computing, ICAC 2006
-
TESAURO, G., JONG, N. K., DAS, R., AND BENNANI, M. N. 2006. A hybrid reinforcement learning approach to autonomic resource allocation. In Proceedings of the IEEE International Conference on Autonomic Computing (ICAC '06). IEEE Computer Society, Washington, DC, 65-73. (Pubitemid 46666907)
-
(2006)
Proceedings - 3rd International Conference on Autonomic Computing, ICAC 2006
, vol.2006
, pp. 65-73
-
-
Tesauro, G.1
Jong, N.K.2
Das, R.3
Bennani, M.N.4
-
36
-
-
34249833101
-
Technical note: Q-learning
-
WATKINS, C. J. C. H. AND DAYAN, P. 1992. Technical note: Q-learning. Mach. Learn. 8, 3, 279-292.
-
(1992)
Mach. Learn.
, vol.8
, Issue.3
, pp. 279-292
-
-
Watkins, C.J.C.H.1
Dayan, P.2
-
37
-
-
34247215062
-
-
Tech. rep., Institute of Information and Computing Sciences, Utrecht University
-
WIERING, M., VAN VEENEN, J., VREEKEN, J., AND KOOPMAN, A. 2004. Intelligent traffic light control. Tech. rep., Institute of Information and Computing Sciences, Utrecht University.
-
(2004)
Intelligent Traffic Light Control
-
-
Wiering, M.1
Van Veenen, J.2
Vreeken, J.3
Koopman, A.4
-
38
-
-
28444438872
-
Intelligent cooperation control of urban traffic networks
-
2005 International Conference on Machine Learning and Cybernetics, ICMLC 2005
-
YANG, Z., CHEN, X., TANG, Y., AND SUN, J. 2005. Intelligent cooperation control of urban traffic networks. In Proceedings of the International Conference on Machine Learning and Cybernetics. 1482-1486. (Pubitemid 41734160)
-
(2005)
2005 International Conference on Machine Learning and Cybernetics, ICMLC 2005
, pp. 1482-1486
-
-
Yang, Z.-S.1
Chen, X.2
Tang, Y.-S.3
Sun, J.-P.4
|