SCOPUS Information Retrieval Platform

Cluster Computing

Volume 10, Issue 3, 2007, Pages 287-299

On the use of hybrid reinforcement learning for autonomic resource allocation

Authors (4): Tesauro, Gerald (a); Jong, Nicholas K (b); Das, Rajarshi (a); Bennani, Mohamed N (c)

a IBM T. J. Watson Research Center (United States)

b University of Texas at Austin (United States)

c Oracle (United States)

Author keywords

Performance management; Policy learning; Reinforcement learning; Resource allocation

Indexed keywords

APPROXIMATION THEORY; CLOSED LOOP CONTROL SYSTEMS; DATA STRUCTURES; QUEUEING THEORY; RESOURCE ALLOCATION;

PERFORMANCE MODEL; TRAFFIC MODEL;

REINFORCEMENT LEARNING;

EID: 34548031419 PISSN: 13867857 EISSN: 15737543 Source Type: Journal
DOI: 10.1007/s10586-007-0035-6 Document Type: Article

Times cited: (86)

References (36)

1
- 34548037620
- Model-based and model-free approaches to autonomic resource allocation
- Tech. Rep. RC23802
- Das, R., Tesauro, G., Walsh, W.E.: Model-based and model-free approaches to autonomic resource allocation. IBM Research, Tech. Rep. RC23802 (2005)
- (2005) IBM Research
- Das, R.¹ Tesauro, G.² Walsh, W.E.³

2
- 29344462255
- Online resource allocation using decompositional reinforcement learning
- Tesauro, G.: Online resource allocation using decompositional reinforcement learning. In: Proc. of AAAI-05, pp. 886-891 (2005)
- (2005) Proc. of AAAI-05 , pp. 886-891
- Tesauro, G.¹

3
- 33745506664
- Utility-function-driven resource allocation in autonomic systems
- Tesauro, G., Das, R., Walsh, W.E., Kephart, J.O.: Utility-function-driven resource allocation in autonomic systems. In: Proc. of ICAC-05, pp. 342-343 (2005)
- (2005) Proc. of ICAC-05 , pp. 342-343
- Tesauro, G.¹ Das, R.² Walsh, W.E.³ Kephart, J.O.⁴

4
- 6944233859
- Wiley
- Hellerstein, J.L., Diao, Y., Parekh, S., Tilbury, D.M.: Feedback Control of Computing Systems. Wiley (2004)
- (2004) Feedback Control of Computing Systems
- Hellerstein, J.L.¹ Diao, Y.² Parekh, S.³ Tilbury, D.M.⁴

5
- 1842440583
- Prentice Hall, Upper Saddle River
- Menascé, D.A., Almeida, V.A.F., Dowdy, L.W.: Performance by Design: Computer Capacity Planning by Example. Prentice Hall, Upper Saddle River (2004)
- (2004) Performance By Design: Computer Capacity Planning By Example
- Menascé, D.A.¹ Almeida, V.A.F.² Dowdy, L.W.³

6
- 33244472642
- An analytical model for multi-tier internet services and its applications
- Urgaonkar, B., Pacifici, G., Shenoy, P., Spreitzer, M., Tantawi, A.: An analytical model for multi-tier internet services and its applications. In: Proc. of SIGMETRICS-05, pp. 291-302 (2005)
- (2005) Proc. of SIGMETRICS-05
- Urgaonkar, B.¹ Pacifici, G.² Shenoy, P.³ Spreitzer, M.⁴ Tantawi, A.⁵

7
- 0037253062
- The vision of autonomic computing
- Kephart, J.O., Chess, D.M.: The vision of autonomic computing. Computer 36(1), 41-52 (2003)
- (2003) Computer , vol.36 , Issue.1 , pp. 41-52
- Kephart, J.O.¹ Chess, D.M.²

8
- 34247560904
- A hybrid reinforcement learning approach to autonomic resource allocation
- Tesauro, G., Jong, N.K., Das, R., Bennani, M.N.: A hybrid reinforcement learning approach to autonomic resource allocation. In: Proc. of ICAC-06, pp. 65-73 (2006)
- (2006) Proc. of ICAC-06 , pp. 65-73
- Tesauro, G.¹ Jong, N.K.² Das, R.³ Bennani, M.N.⁴

9
- 33745504762
- A reinforcement learning framework for dynamic resource allocation: First results
- Vengerov, D., Iakovlev, N.: A reinforcement learning framework for dynamic resource allocation: First results. In: Proc. of ICAC-05, pp. 339-340 (2005)
- (2005) Proc. of ICAC-05 , pp. 339-340
- Vengerov, D.¹ Iakovlev, N.²

10
- 33847351708
- A reinforcement learning framework for utility-based scheduling in resource-constrained systems
- Tech. Rep. TR-2005-141
- Vengerov, D.: A reinforcement learning framework for utility-based scheduling in resource-constrained systems. Sun Microsystems, Tech. Rep. TR-2005-141 (2005)
- (2005) Sun Microsystems
- Vengerov, D.¹

11
- 9944265743
- Adaptive job routing and scheduling
- Whiteson, S., Stone, P.: Adaptive job routing and scheduling. Eng. Appl. Artif. Intell. 17(7), 855-869 (2004)
- (2004) Eng. Appl. Artif. Intell. , vol.17 , Issue.7 , pp. 855-869
- Whiteson, S.¹ Stone, P.²

12
- 0004102479
- MIT, Cambridge
- Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction. MIT, Cambridge (1998)
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

13
- 0029276036
- Temporal difference learning and TD-Gammon
- Tesauro, G.: Temporal difference learning and TD-Gammon. Commun. ACM 38(3), 58-68 (1995)
- (1995) Commun. ACM , vol.38 , Issue.3 , pp. 58-68
- Tesauro, G.¹

14
- 0035391755
- Learning to trade via direct reinforcement
- Moody, J., Saffell, M.: Learning to trade via direct reinforcement. IEEE Trans. Neural Netw. 12(4), 875-889 (2001)
- (2001) IEEE Trans. Neural Netw. , vol.12 , Issue.4 , pp. 875-889
- Moody, J.¹ Saffell, M.²

15
- 31844443291
- Inverted autonomous helicopter flight via reinforcement learning
- Ng, A.Y., et al.: Inverted autonomous helicopter flight via reinforcement learning. In: Intl. Symposium on Experimental Robotics (2004)
- (2004) Intl. Symposium on Experimental Robotics
- Ng, A.Y.¹

16
- 85012688561
- Princeton University Press
- Bellman, R.E.: Dynamic Programming. Princeton University Press (1957)
- (1957) Dynamic Programming
- Bellman, R.E.¹

17
- 4544366889
- Utility functions in autonomic systems
- Walsh, W.E., Tesauro, G., Kephart, J.O., Das, R.: Utility functions in autonomic systems. In: Proc. of ICAC-04, pp. 70-77 (2004)
- (2004) Proc. of ICAC-04 , pp. 70-77
- Walsh, W.E.¹ Tesauro, G.² Kephart, J.O.³ Das, R.⁴

18
- 34548038353
- IBM: Websphere benchmark sample, http://www-306.ibm.com/software/webservers/appserv/benchmark3.html (2004)
- (2004) IBM: Websphere Benchmark Sample

19
- 0004308740
- Internet traffic: Periodicity, tail behavior and performance implications
- In: Gelenbe, E. (ed.) CRC
- Squillante, M.S., Yao, D.D., Zhang, L.: Internet traffic: Periodicity, tail behavior and performance implications. In: Gelenbe, E. (ed.) System Performance Evaluation: Methodologies and Applications, pp. 23-37. CRC (1999)
- (1999) System Performance Evaluation: Methodologies and Applications , pp. 23-37
- Squillante, M.S.¹ Yao, D.D.² Zhang, L.³

20
- 84899022377
- How to dynamically merge Markov decision processes
- In: Jordan, M.I., Kearns, M.J., Solla, S.A. (eds.) MIT
- Singh, S., Cohn, D.: How to dynamically merge Markov decision processes. In: Jordan, M.I., Kearns, M.J., Solla, S.A. (eds.) Advances in Neural Information Processing Systems, vol. 10, pp. 1057-1063. MIT (1998)
- (1998) Advances in Neural Information Processing Systems , vol.10 , pp. 1057-1063
- Singh, S.¹ Cohn, D.²

21
- 84898995067
- Learning from demonstration
- In: Mozer, M.C., et al. (eds.) MIT
- Schaal, S.: Learning from demonstration. In: Mozer, M.C., et al. (eds.) Advances in Neural Information Processing Systems, vol. 9, pp. 1040-1046. MIT (1997)
- (1997) Advances in Neural Information Processing Systems , vol.9 , pp. 1040-1046
- Schaal, S.¹

22
- 0036058423
- Effective reinforcement learning for mobile robots
- Smart, W.D., Kaelbling, L.P.: Effective reinforcement learning for mobile robots. In: Proc. of Intl. Conf. on Robotics and Automation (ICRA-02) (2002)
- (2002) Proc. of Intl. Conf. on Robotics and Automation (ICRA-02)
- Smart, W.D.¹ Kaelbling, L.P.²

23
- 27344432348
- Accelerating reinforcement learning through implicit imitation
- Price, B., Boutilier, C.: Accelerating reinforcement learning through implicit imitation. J. AI Res. 19, 569-629 (2003)
- (2003) J. AI Res. , vol.19 , pp. 569-629
- Price, B.¹ Boutilier, C.²

24
- 0001347323
- Complexity regularization with application to artificial neural networks
- In: Roussas, G. (ed.)
- Barron, A.R.: Complexity regularization with application to artificial neural networks. In: Roussas, G. (ed.) Nonparametric Functional Estimation and Related Topics (1991)
- (1991) Nonparametric Functional Estimation and Related Topics
- Barron, A.R.¹

25
- 4644323293
- Least-squares policy iteration
- Lagoudakis, M.G., Parr, R.: Least-squares policy iteration. J. Mach. Learn. Res. 4, 1107-1149 (2003)
- (2003) J. Mach. Learn. Res. , vol.4 , pp. 1107-1149
- Lagoudakis, M.G.¹ Parr, R.²

26
- 84962045565
- Multi-agent Q-learning and regression trees for automated pricing decisions
- In: Kaufmann, San Francisco
- Sridharan, M., Tesauro, G.: Multi-agent Q-learning and regression trees for automated pricing decisions. In: Proc. 17th Intl. Conf. on Machine Learning, pp. 927-934. Kaufmann, San Francisco (2000)
- (2000) Proc. 17th Intl. Conf. on Machine Learning , pp. 927-934
- Sridharan, M.¹ Tesauro, G.²

27
- 0000646059
- Learning internal representations by error propagation
- In: Rumelhart, D.E., McClelland, J.L., et al. (eds.) MIT, Cambridge
- Rumelhart, D.E., Hinton, G.E., Williams, R.J.: Learning internal representations by error propagation. In: Rumelhart, D.E., McClelland, J.L., et al. (eds.) Foundations. Parallel Distributed Processing, vol. 1, pp. 318-362. MIT, Cambridge (1987)
- (1987) Foundations. Parallel Distributed Processing , vol.1 , pp. 318-362
- Rumelhart, D.E.¹ Hinton, G.E.² Williams, R.J.³

28
- 85151728371
- Residual algorithms: Reinforcement learning with function approximation
- Baird, L.: Residual algorithms: Reinforcement learning with function approximation. In: Proc. of ICML-95 (1995)
- (1995) Proc. of ICML-95
- Baird, L.¹

29
- 0004049893
- Learning from delayed rewards
- Ph.D. dissertation, Cambridge University
- Watkins, C.: Learning from delayed rewards. Ph.D. dissertation, Cambridge University (1989)
- (1989)
- Watkins, C.¹

30
- 84948401557
- An observation-based approach towards self-managing web servers
- Pradhan, P., Tewari, R., Sahu, S., Chandra, C., Shenoy, P.: An observation-based approach towards self-managing web servers. In: Proc. of Intl. Workshop on Quality of Service, pp. 13-22 (2002)
- (2002) Proc. of Intl. Workshop on Quality of Service , pp. 13-22
- Pradhan, P.¹ Tewari, R.² Sahu, S.³ Chandra, C.⁴ Shenoy, P.⁵

31
- 35248816508
- Dynamic resource allocation for shared data centers using online measurements
- Chandra, A., Gong, W., Shenoy, P.: Dynamic resource allocation for shared data centers using online measurements. In: Proc. of ACM/IEEE Intl. Workshop on Quality of Service (IWQoS), pp. 381-400 (2003)
- (2003) Proc. of ACM/IEEE Intl. Workshop on Quality of Service (IWQoS) , pp. 381-400
- Chandra, A.¹ Gong, W.² Shenoy, P.³

32
- 4544322107
- Assessing the robustness of self-managing computer systems under variable workloads
- Bennani, M.N., Menascé, D.A.: Assessing the robustness of self-managing computer systems under variable workloads. In: Proc. of ICAC-04, pp. 62-69 (2004)
- (2004) Proc. of ICAC-04 , pp. 62-69
- Bennani, M.N.¹ Menascé, D.A.²

33
- 33745483310
- Resource allocation for autonomic data centers using analytic performance models
- Bennani, M.N., Menascé, D.A.: Resource allocation for autonomic data centers using analytic performance models. In: Proc. of ICAC-05, pp. 229-240 (2005)
- (2005) Proc. of ICAC-05 , pp. 229-240
- Bennani, M.N.¹ Menascé, D.A.²

34
- 34247632800
- IBM: WebSphere Extended Deployment, www.ibm.com/software/webservers/appserv/extend/ (2006)
- (2006) IBM: WebSphere Extended Deployment

35
- 34247611659
- IBM: Tivoli Intelligent Orchestrator product overview, http://www.ibm.com/software/tivoli/products/intell-orch (2005)
- (2005) IBM: Tivoli Intelligent Orchestrator Product Overview

36
- 34548042719
- IBM: PowerExecutive, www.ibm.com/systems/management/director/extensions/powerexec.html (2006)
- (2006) IBM: PowerExecutive

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.