메뉴 건너뛰기




Volumn 10, Issue 3, 2007, Pages 287-299

On the use of hybrid reinforcement learning for autonomic resource allocation

Author keywords

Performance management; Policy learning; Reinforcement learning; Resource allocation

Indexed keywords

APPROXIMATION THEORY; CLOSED LOOP CONTROL SYSTEMS; DATA STRUCTURES; QUEUEING THEORY; RESOURCE ALLOCATION;

EID: 34548031419     PISSN: 13867857     EISSN: 15737543     Source Type: Journal    
DOI: 10.1007/s10586-007-0035-6     Document Type: Article
Times cited : (86)

References (36)
  • 1
    • 34548037620 scopus 로고    scopus 로고
    • Model-based and model-free approaches to autonomic resource allocation
    • Tech. Rep. RC23802
    • Das, R., Tesauro, G., Walsh, W.E.: Model-based and model-free approaches to autonomic resource allocation. IBM Research, Tech. Rep. RC23802 (2005)
    • (2005) IBM Research
    • Das, R.1    Tesauro, G.2    Walsh, W.E.3
  • 2
    • 29344462255 scopus 로고    scopus 로고
    • Online resource allocation using decompositional reinforcement learning
    • Tesauro, G.: Online resource allocation using decompositional reinforcement learning. In: Proc. of AAAI-05, pp. 886-891 (2005)
    • (2005) Proc. of AAAI-05 , pp. 886-891
    • Tesauro, G.1
  • 3
    • 33745506664 scopus 로고    scopus 로고
    • Utility-function-driven resource allocation in autonomic systems
    • Tesauro, G., Das, R., Walsh, W.E., Kephart, J.O.: Utility-function-driven resource allocation in autonomic systems. In: Proc. of ICAC-05, pp. 342-343 (2005)
    • (2005) Proc. of ICAC-05 , pp. 342-343
    • Tesauro, G.1    Das, R.2    Walsh, W.E.3    Kephart, J.O.4
  • 7
    • 0037253062 scopus 로고    scopus 로고
    • The vision of autonomic computing
    • Kephart, J.O., Chess, D.M.: The vision of autonomic computing. Computer 36(1), 41-52 (2003)
    • (2003) Computer , vol.36 , Issue.1 , pp. 41-52
    • Kephart, J.O.1    Chess, D.M.2
  • 8
    • 34247560904 scopus 로고    scopus 로고
    • A hybrid reinforcement learning approach to autonomic resource allocation
    • Tesauro, G., Jong, N.K., Das, R., Bennani, M.N.: A hybrid reinforcement learning approach to autonomic resource allocation. In: Proc. of ICAC-06, pp. 65-73 (2006)
    • (2006) Proc. of ICAC-06 , pp. 65-73
    • Tesauro, G.1    Jong, N.K.2    Das, R.3    Bennani, M.N.4
  • 9
    • 33745504762 scopus 로고    scopus 로고
    • A reinforcement learning framework for dynamic resource allocation: First results
    • Vengerov, D., Iakovlev, N.: A reinforcement learning framework for dynamic resource allocation: First results. In: Proc. of ICAC-05, pp. 339-340 (2005)
    • (2005) Proc. of ICAC-05 , pp. 339-340
    • Vengerov, D.1    Iakovlev, N.2
  • 10
    • 33847351708 scopus 로고    scopus 로고
    • A reinforcement learning framework for utility-based scheduling in resource-constrained systems
    • Tech. Rep. TR-2005-141
    • Vengerov, D.: A reinforcement learning framework for utility-based scheduling in resource-constrained systems. Sun Microsystems, Tech. Rep. TR-2005-141 (2005)
    • (2005) Sun Microsystems
    • Vengerov, D.1
  • 11
    • 9944265743 scopus 로고    scopus 로고
    • Adaptive job routing and scheduling
    • Whiteson, S., Stone, P.: Adaptive job routing and scheduling. Eng. Appl. Artif. Intell. 17 (7), 855-869 (2004)
    • (2004) Eng. Appl. Artif. Intell. , vol.17 , Issue.7 , pp. 855-869
    • Whiteson, S.1    Stone, P.2
  • 13
    • 0029276036 scopus 로고
    • Temporal difference learning and TD-Gammon
    • Tesauro, G.: Temporal difference learning and TD-Gammon. Commun. ACM 38(3), 58-68 (1995)
    • (1995) Commun. ACM , vol.38 , Issue.3 , pp. 58-68
    • Tesauro, G.1
  • 14
    • 0035391755 scopus 로고    scopus 로고
    • Learning to trade via direct reinforcement
    • Moody, J., Saffell, M.: Learning to trade via direct reinforcement. IEEE Trans. Neural Netw. 12(4), 875-889 (2001)
    • (2001) IEEE Trans. Neural Netw. , vol.12 , Issue.4 , pp. 875-889
    • Moody, J.1    Saffell, M.2
  • 15
    • 31844443291 scopus 로고    scopus 로고
    • Inverted autonomous helicopter flight via reinforcement learning
    • Ng, A.Y., et al.: Inverted autonomous helicopter flight via reinforcement learning. In: Intl. Symposium on Experimental Robotics (2004)
    • (2004) Intl. Symposium on Experimental Robotics
    • Ng, A.Y.1
  • 18
    • 34548038353 scopus 로고    scopus 로고
    • IBM: Websphere benchmark sample, http://www-306.ibm.com/software/ webservers/appserv/benchmark3.html (2004)
    • (2004) IBM: Websphere Benchmark Sample
  • 20
    • 84899022377 scopus 로고    scopus 로고
    • How to dynamically merge Markov decision processes
    • In: Jordan, M.I., Kearns, M.J., Solla, S.A. (eds.) MIT
    • Singh, S., Cohn, D.: How to dynamically merge Markov decision processes. In: Jordan, M.I., Kearns, M.J., Solla, S.A. (eds.) Advances in Neural Information Processing Systems, vol. 10, pp. 1057-1063. MIT (1998)
    • (1998) Advances in Neural Information Processing Systems , vol.10 , pp. 1057-1063
    • Singh, S.1    Cohn, D.2
  • 21
    • 84898995067 scopus 로고    scopus 로고
    • Learning from demonstration
    • In: Mozer, M.C., et al. (eds.) MIT
    • Schaal, S.: Learning from demonstration. In: Mozer, M.C., et al. (eds.) Advances in Neural Information Processing Systems, vol. 9, pp. 1040-1046. MIT (1997)
    • (1997) Advances in Neural Information Processing Systems , vol.9 , pp. 1040-1046
    • Schaal, S.1
  • 23
    • 27344432348 scopus 로고    scopus 로고
    • Accelerating reinforcement learning through implicit imitation
    • Price, B., Boutilier, C.: Accelerating reinforcement learning through implicit imitation. J. AI Res. 19, 569-629 (2003)
    • (2003) J. AI Res. , vol.19 , pp. 569-629
    • Price, B.1    Boutilier, C.2
  • 24
    • 0001347323 scopus 로고
    • Complexity regularization with application to artificial neural networks
    • In: Roussas, G. (ed.)
    • Barron, A.R.: Complexity regularization with application to artificial neural networks. In: Roussas, G. (ed.) Nonparametric Functional Estimation and Related Topics (1991)
    • (1991) Nonparametric Functional Estimation and Related Topics
    • Barron, A.R.1
  • 26
    • 84962045565 scopus 로고    scopus 로고
    • Multi-agent Q-learning and regression trees for automated pricing decisions
    • In: Kaufmann, San Francisco
    • Sridharan, M., Tesauro, G.: Multi-agent Q-learning and regression trees for automated pricing decisions. In: Proc. 17th Intl. Conf. on Machine Learning, pp. 927-934. Kaufmann, San Francisco (2000)
    • (2000) Proc. 17th Intl. Conf. on Machine Learning , pp. 927-934
    • Sridharan, M.1    Tesauro, G.2
  • 27
    • 0000646059 scopus 로고
    • Learning internal representations by error propagation
    • In: Rumelhart, D.E., McClelland, J.L., et al.(eds.) MIT, Cambridge
    • Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning internal representations by error propagation. In: Rumelhart, D.E., McClelland, J.L., et al.(eds.) Foundations. Parallel Distributed Processing, vol. 1, pp. 318-362. MIT, Cambridge (1987)
    • (1987) Foundations. Parallel Distributed Processing , vol.1 , pp. 318-362
    • Rumelhart, D.E.1    Hinton, G.E.2    Williams, R.J.3
  • 28
    • 85151728371 scopus 로고
    • Residual algorithms: Reinforcement learning with function approximation
    • Baird, L.: Residual algorithms: Reinforcement learning with function approximation. In: Proc. of ICML-95 (1995)
    • (1995) Proc. of ICML-95
    • Baird, L.1
  • 29
    • 0004049893 scopus 로고
    • Learning from delayed rewards
    • Ph.D. dissertation, Cambridge University
    • Watkins, C.: Learning from delayed rewards. Ph.D. dissertation, Cambridge University (1989)
    • (1989)
    • Watkins, C.1
  • 32
    • 4544322107 scopus 로고    scopus 로고
    • Assessing the robustness of self-managing computer systems under variable workloads
    • Bennani, M.N., Menascé, D.A.: Assessing the robustness of self-managing computer systems under variable workloads. In: Proc. of ICAC-04, pp. 62-69 (2004)
    • (2004) Proc. of ICAC-04 , pp. 62-69
    • Bennani, M.N.1    Menascé, D.A.2
  • 33
    • 33745483310 scopus 로고    scopus 로고
    • Resource allocation for autonomic data centers using analytic performance models
    • Bennani, M.N., Menascé, D.A.: Resource allocation for autonomic data centers using analytic performance models. In: Proc. of ICAC-05, pp. 229-240 (2005)
    • (2005) Proc. of ICAC-05 , pp. 229-240
    • Bennani, M.N.1    Menascé, D.A.2
  • 36
    • 34548042719 scopus 로고    scopus 로고
    • IBM: PowerExecutive, www.ibm.com/systems/management/director/extensions/ powerexec.html (2006)
    • (2006) IBM: PowerExecutive


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.