메뉴 건너뛰기




Volumn 20, Issue 3, 2007, Pages 383-390

A reinforcement learning approach to dynamic resource allocation

Author keywords

Reinforcement learning; Resource allocation; Utility computing

Indexed keywords

BLOCK CODES; DECISION THEORY; PROGRAM PROCESSORS; REINFORCEMENT LEARNING; STOCHASTIC CONTROL SYSTEMS; UTILITY PROGRAMS;

EID: 33847768224     PISSN: 09521976     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.engappai.2006.06.019     Document Type: Article
Times cited : (59)

References (13)
  • 1
    • 0034440921 scopus 로고    scopus 로고
    • Bartelo, F., Paradells, J., 2000. Performance evaluation of public access mobile radio (PAMR) systems with priority calls. In: Proceedings of the 11th IEEE International Symposium on Personal, Indoor and Mobile Radio Communications, September 18-21.
  • 3
    • 33847775546 scopus 로고    scopus 로고
    • Byde, A., Salle, M., Bartolini, C., 2003. Market-based resource allocation for utility data centers. HP Technical Report HPL-2003-188, 〈http://www.hpl.hp.com/techreports/2003/HPL-2003-188.pdf〉.
  • 4
    • 0000439891 scopus 로고
    • On the convergence of stochastic iterative dynamic programming algorithms
    • Jaakkola T., Jordan M.I., and Singh S.P. On the convergence of stochastic iterative dynamic programming algorithms. Neural Comput. 6 6 (1994) 1185-1201
    • (1994) Neural Comput. , vol.6 , Issue.6 , pp. 1185-1201
    • Jaakkola, T.1    Jordan, M.I.2    Singh, S.P.3
  • 6
    • 33847779578 scopus 로고    scopus 로고
    • "Solaris Containers" (2004). A Sun Microsystems white paper. Available electronically at 〈http://wwws.sun.com/software/whitepapers/solaris10/grid_containers .pdf〉.
  • 8
    • 33745506664 scopus 로고    scopus 로고
    • Tesauro, G., Das, R., Walsh, W.E., Kephart, J.O., 2005. Utility-function-driven resource allocation in autonomic systems. In: Proceedings of the Second IEEE International Conference on Autonomic Computing (ICAC-05).
  • 9
    • 0031143730 scopus 로고    scopus 로고
    • An analysis of temporal-difference learning with function approximation
    • Tsitsiklis J.N., and Van Roy B. An analysis of temporal-difference learning with function approximation. IEEE Trans. Automat. Control 42 5 (1997) 674-690
    • (1997) IEEE Trans. Automat. Control , vol.42 , Issue.5 , pp. 674-690
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 10
    • 33745504762 scopus 로고    scopus 로고
    • Vengerov, D., Iakovlev, N., 2005. A reinforcement learning framework for dynamic resource allocation: first results. In: Proceedings of the Second IEEE International Conference on Autonomic Computing (ICAC-05).
  • 11
    • 24644466803 scopus 로고    scopus 로고
    • A fuzzy reinforcement learning approach to power control in wireless transmitters
    • Vengerov D., Bambos N., and Berenji H.R. A fuzzy reinforcement learning approach to power control in wireless transmitters. IEEE Trans. Syst. Man Cybern. B 35 4 (2005)
    • (2005) IEEE Trans. Syst. Man Cybern. B , vol.35 , Issue.4
    • Vengerov, D.1    Bambos, N.2    Berenji, H.R.3
  • 12
    • 4544366889 scopus 로고    scopus 로고
    • Walsh, W.E., Tesauro, G., Kephart, J.O., Das, R., 2004. Utility functions in autonomic systems. In: Proceedings of International Conference on Autonomic Computing, 〈http://www.research.ibm.com/people/w/wwalsh1/Papers/icac04NeDAR.pd f〉.
  • 13
    • 0026994365 scopus 로고    scopus 로고
    • Wang, L.-X., 1992. Fuzzy systems are universal approximators. In: Proceedings of the IEEE International Conference on Fuzzy Systems (FUZZ-IEEE '92), pp. 1163-1169.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.