메뉴 건너뛰기




Volumn , Issue , 2006, Pages 68-72

Aggregation of reinforcement learning algorithms

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATIONAL COMPLEXITY; MATHEMATICAL MODELS; OPTIMAL CONTROL SYSTEMS; REINFORCEMENT LEARNING; ROBUSTNESS (CONTROL SYSTEMS);

EID: 40649089465     PISSN: 10987576     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (8)

References (18)
  • 1
    • 0020970738 scopus 로고
    • Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems
    • Andrew G. Barto, Richard S. Sutton, and Charles W. Anderson, "Neuronlike Adaptive Elements That Can Solve Difficult Learning Control Problems", IEEE Transactions on Systems, Man, and Cybernetics, Vol. SMC-13, No.5 1983, pp834-846.
    • (1983) IEEE Transactions on Systems, Man, and Cybernetics , vol.SMC-13 , Issue.5 , pp. 834-846
    • Barto, A.G.1    Sutton, R.S.2    Anderson, C.W.3
  • 2
    • 85156187730 scopus 로고    scopus 로고
    • Improving Elevator Performance Using Reinforcement Learning
    • 8, Cambridge, MA: The MIT Press
    • R.H. Crites and A.G. Barto, "Improving Elevator Performance Using Reinforcement Learning", Advances in Neural Information Processing Systems. 8, 1996, pp1017-1023, Cambridge, MA: The MIT Press.
    • (1996) Advances in Neural Information Processing Systems , pp. 1017-1023
    • Crites, R.H.1    Barto, A.G.2
  • 4
    • 13844311438 scopus 로고    scopus 로고
    • Combining a Stability and a Performance-oriented Control in Power Systems
    • Feb
    • Mevlidin Glavic. Damien Ernst, and Louis Wehenkel, "Combining a Stability and a Performance-oriented Control in Power Systems", IEEE Transactions on Power Systems. Vol. 20, No. 1 Feb. 2005
    • (2005) IEEE Transactions on Power Systems , vol.20 , Issue.1
    • Glavic, M.1    Ernst, D.2    Wehenkel, L.3
  • 7
    • 0029230267 scopus 로고
    • A method of combining multiple experts for the recognition of unconstrained handwritten numerals
    • Y. S. Huang and C. Y. Suen, "A method of combining multiple experts for the recognition of unconstrained handwritten numerals." IEEE Trans. On Pattern Analysis and Machine Intelligence 17(1), 1995, pp90-94.
    • (1995) IEEE Trans. On Pattern Analysis and Machine Intelligence , vol.17 , Issue.1 , pp. 90-94
    • Huang, Y.S.1    Suen, C.Y.2
  • 8
    • 15744363553 scopus 로고    scopus 로고
    • Ju Jiang, Mohamed Kamel, and Lei Chen, Reinforcement Learning and Aggregation , Proceedings of IEEE International Conference on Systems, Man, and Cybernetics 04, 2004, pp1303-1308.
    • Ju Jiang, Mohamed Kamel, and Lei Chen, "Reinforcement Learning and Aggregation ", Proceedings of IEEE International Conference on Systems, Man, and Cybernetics 04, 2004, pp1303-1308.
  • 11
    • 0034830461 scopus 로고    scopus 로고
    • Decision Templates for Multiple Classifier Fusion: An Experimental Comparison
    • Ludmila I. Kuncheva. James C. Bezdek, and Robert P.W. Duin, "Decision Templates for Multiple Classifier Fusion: An Experimental Comparison", Pattern Recognition, 34, (2), 2001, pp299-314.
    • (2001) Pattern Recognition , vol.34 , Issue.2 , pp. 299-314
    • Kuncheva, L.I.1    Bezdek, J.C.2    Duin, R.P.W.3
  • 12
    • 40649109566 scopus 로고    scopus 로고
    • Sharping and Policy Search in Reinforcement Learning
    • Ph.D. dissertation. University of California, Berkeley
    • Andrew Y. Ng, "Sharping and Policy Search in Reinforcement Learning", Ph.D. dissertation. University of California, Berkeley, 2003.
    • (2003)
    • Ng, A.Y.1
  • 14
    • 85058603073 scopus 로고    scopus 로고
    • Handbook of Learning and Approximate Dynamic Programming
    • Publication, IEEE Press
    • Jennie Si. Andy Barto, Warren Powell, and Donald Wunsch, Handbook of Learning and Approximate Dynamic Programming. A John Wiley and Sons, INC. Publication, IEEE Press, 2004
    • (2004) A John Wiley and Sons, INC
    • Andy Barto, J.S.1    Powell, W.2    Wunsch, D.3
  • 15
    • 84898972974 scopus 로고    scopus 로고
    • Satinder Singh, Dimitri Bertsekas, Reinforcement learning for dynamic channel allocation in cellular telephone systems. In M. C. Mozer, M. I. Jordan, and T. Petsche, editors, N1PS-9, The MIT Press, 1997, pp974-980.
    • Satinder Singh, Dimitri Bertsekas, Reinforcement learning for dynamic channel allocation in cellular telephone systems. In M. C. Mozer, M. I. Jordan, and T. Petsche, editors, N1PS-9, The MIT Press, 1997, pp974-980.
  • 16
    • 0004102479 scopus 로고    scopus 로고
    • A Bradford Book. The MIT Press, Cambridge, Massachusetts, London, England, ISBN 0-262-19398-1
    • Richard S. Sutton, and Andrew G. Barto, Reinforcement Learning, An Introduction. A Bradford Book. The MIT Press, Cambridge, Massachusetts, London, England, ISBN 0-262-19398-1, 1998.
    • (1998) Reinforcement Learning, An Introduction
    • Sutton, R.S.1    Barto, A.G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.