메뉴 건너뛰기




Volumn , Issue , 2008, Pages 81-84

Strategy entropy as a measure of strategy convergence in reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

EDUCATION; INTELLIGENT CONTROL; INTELLIGENT NETWORKS; INTELLIGENT SYSTEMS; REINFORCEMENT; REINFORCEMENT LEARNING;

EID: 58049179535     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ICINIS.2008.94     Document Type: Conference Paper
Times cited : (8)

References (16)
  • 3
    • 58049153138 scopus 로고    scopus 로고
    • th International FLAIRS Conference, AAAI Press, 1998, pp.372-377.
    • th International FLAIRS Conference, AAAI Press, 1998, pp.372-377.
  • 4
    • 1442265466 scopus 로고    scopus 로고
    • Power Systems Stability Control: Reinforcement Learning Framework
    • February
    • D. Ernst, M. Glavic and L. Wehenkel, "Power Systems Stability Control: Reinforcement Learning Framework", IEEE Transactions on Power Systems, February 2004, Vol. 19, pp. 427-435.
    • (2004) IEEE Transactions on Power Systems , vol.19 , pp. 427-435
    • Ernst, D.1    Glavic, M.2    Wehenkel, L.3
  • 5
    • 0000719863 scopus 로고
    • Packet routing in dynamically changing networks: A reinforcement learning approach
    • J. Boyan and M. L. Littman, "Packet routing in dynamically changing networks: a reinforcement learning approach", Advances in Neural Information Processing Systems, 1994, Vol. 7, pp. 671-678.
    • (1994) Advances in Neural Information Processing Systems , vol.7 , pp. 671-678
    • Boyan, J.1    Littman, M.L.2
  • 6
    • 0000417520 scopus 로고    scopus 로고
    • Low power wireless communication via reinforcement learning
    • Systems, MIT Press
    • T.X Brown, "Low power wireless communication via reinforcement learning", Advances in Neural Information Processing Systems, MIT Press, 2000, Vol. 12, pp. 893-899.
    • (2000) Advances in Neural Information Processing , vol.12 , pp. 893-899
    • Brown, T.X.1
  • 7
    • 0026880130 scopus 로고
    • Automatic Programming of Behavior-based Robots using Reinforcement Learning
    • June
    • Mahadevan, Sridhar and Jonathan Connell, "Automatic Programming of Behavior-based Robots using Reinforcement Learning", Artificial Intelligence, June, 1992, Vol. 55, pp. 311-365.
    • (1992) Artificial Intelligence , vol.55 , pp. 311-365
    • Mahadevan, S.1    Connell, J.2
  • 8
    • 0037288370 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • A. G. Barto and S. Mahadevan, "Recent advances in hierarchical reinforcement learning", Discrete Event Dynamic Systems, 13(4), pp.41-77, 2003
    • (2003) Discrete Event Dynamic Systems , vol.13 , Issue.4 , pp. 41-77
    • Barto, A.G.1    Mahadevan, S.2
  • 9
    • 33748309557 scopus 로고    scopus 로고
    • Decision tree function approximation in reinforcement learning
    • Colorado State University, October
    • Larry D. Pyeatt and Adele E. Howe, "Decision tree function approximation in reinforcement learning", Tech Report, TR CS-98-112, Colorado State University, October 1998.
    • (1998) Tech Report, TR CS-98-112
    • Pyeatt, L.D.1    Howe, A.E.2
  • 12
    • 0001930137 scopus 로고    scopus 로고
    • The physics and mathematics of the second law of thermodynamics
    • cond-mat/9708200
    • Elliott H. Lieb and J. Yngvason, "The physics and mathematics of the second law of thermodynamics". Physics Report, cond-mat/9708200, 1999, 310: pp.1-96.
    • (1999) Physics Report , vol.310 , pp. 1-96
    • Lieb, E.H.1    Yngvason, J.2
  • 13
    • 84856043672 scopus 로고
    • A mathematical theory of communication
    • Claude E. Shannon, "A mathematical theory of communication", Bell System Technical Journal, 1948, Vol. 27, pp. 379-423, 623-656.
    • (1948) Bell System Technical Journal , vol.27
    • Shannon, C.E.1
  • 14
    • 13644276244 scopus 로고    scopus 로고
    • Entropy - A Measure of Uncertainty of Random Variable
    • Zhang Dianhu, Fang Shaohui and Ding Xiaojun, "Entropy - A Measure of Uncertainty of Random Variable", Systems Engineering and Electronics, 1997, No. 11, pp. 1-3.
    • (1997) Systems Engineering and Electronics , Issue.11 , pp. 1-3
    • Zhang, D.1    Fang, S.2    Ding, X.3
  • 15
    • 1842722362 scopus 로고    scopus 로고
    • M. Costa, AL Goldberger, CK Peng, Multi-scale Entropy Analysis of Complex Physiologic Time Series, Physical Review Letters 89, 2002, 0681022002.
    • M. Costa, AL Goldberger, CK Peng, "Multi-scale Entropy Analysis of Complex Physiologic Time Series", Physical Review Letters 89, 2002, 0681022002.
  • 16
    • 84944486544 scopus 로고
    • Prediction and Entropy of Printed English
    • Claude E. Shannon, "Prediction and Entropy of Printed English", Bell Systems Technical Journal, 1951, Vol. 30, pp. 50-64.
    • (1951) Bell Systems Technical Journal , vol.30 , pp. 50-64
    • Shannon, C.E.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.