메뉴 건너뛰기




Volumn 213, Issue , 2012, Pages 39-49

Induced states in a decision tree constructed by Q-learning

Author keywords

CART; Critic actor model; Decision trees; Q learning; Reinforcement learning; State space partition

Indexed keywords

ACTION POLICIES; ADAPTIVE HEURISTIC CRITICS; BENCHMARK DATASETS; CART; CLASSIFICATION AND REGRESSION TREE; DISCRETE STATE SPACE; INFORMATION GAIN; LEARNING CONTROL; LONG-TERM EVALUATION; MECHATRONIC SYSTEMS; NEAR-OPTIMAL POLICIES; NEURAL NETWORK MODEL; OPTIMAL POLICIES; POLICY ITERATION; Q-LEARNING; REINFORCEMENT LEARNING METHOD; SECOND PHASE; SEQUENTIAL PROCESS; SPLITTING CRITERION; STATE AGGREGATION; STATE PARTITION; STATE SPACE; TRAINING PATTERNS; TREE GROWING; TREE INDUCTION; TREE-BASED;

EID: 84863723501     PISSN: 00200255     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.ins.2012.06.009     Document Type: Article
Times cited : (8)

References (32)
  • 4
    • 78650535641 scopus 로고    scopus 로고
    • Autonomic tracing of production processes with mobile and agent-based computing
    • M. Cimino, and F. Marcelloni Autonomic tracing of production processes with mobile and agent-based computing Information Sciences 181 5 2010 935 953
    • (2010) Information Sciences , vol.181 , Issue.5 , pp. 935-953
    • Cimino, M.1    Marcelloni, F.2
  • 5
    • 77954595557 scopus 로고    scopus 로고
    • On the potential contributions of hybrid intelligent approaches to multicomponent robotic system development
    • R.J. Duro, M. Graña, and J. de Lope On the potential contributions of hybrid intelligent approaches to multicomponent robotic system development Information Sciences 180 14 2010 2635 2648
    • (2010) Information Sciences , vol.180 , Issue.14 , pp. 2635-2648
    • Duro, R.J.1    Graña, M.2    De Lope, J.3
  • 6
    • 0032643313 scopus 로고    scopus 로고
    • Solving semi-markov decision problems using average reward reinforcement learning
    • T.K. Das, A. Gosavi, S. Mahadevan, and N. Marchalleck Solving semi-markov decision problems using average reward reinforcement learning Management Science 45 1999 560 574
    • (1999) Management Science , vol.45 , pp. 560-574
    • Das, T.K.1    Gosavi, A.2    Mahadevan, S.3    Marchalleck, N.4
  • 7
    • 0035361059 scopus 로고    scopus 로고
    • Look-ahead based fuzzy decision tree induction
    • M. Dong, and R. Kothari Look-ahead based fuzzy decision tree induction IEEE Transactions on Fuzzy Systems 9 3 2001 461 468
    • (2001) IEEE Transactions on Fuzzy Systems , vol.9 , Issue.3 , pp. 461-468
    • Dong, M.1    Kothari, R.2
  • 9
    • 67649794787 scopus 로고    scopus 로고
    • A novel approach for multi-agent-based intelligent manufacturing system
    • Q. Guo, and M. Zhang A novel approach for multi-agent-based intelligent manufacturing system Information Sciences 179 18 2009 3079 3090
    • (2009) Information Sciences , vol.179 , Issue.18 , pp. 3079-3090
    • Guo, Q.1    Zhang, M.2
  • 11
    • 79953906172 scopus 로고    scopus 로고
    • Self-organizing state aggregation for architecture design of Q-learning
    • K.S. Hwang, H.Y. Lin, Y.P. Hsu, and H.H. Yu Self-organizing state aggregation for architecture design of Q-learning Information Sciences 181 13 2011 2813 2822
    • (2011) Information Sciences , vol.181 , Issue.13 , pp. 2813-2822
    • Hwang, K.S.1    Lin, H.Y.2    Hsu, Y.P.3    Yu, H.H.4
  • 12
    • 0001815269 scopus 로고
    • Constructing optimal binary decision trees is NP-complete
    • L. Hyafil, and R.L. Rivest Constructing optimal binary decision trees is NP-complete Information Processing Letter 5 1 1976 15 17
    • (1976) Information Processing Letter , vol.5 , Issue.1 , pp. 15-17
    • Hyafil, L.1    Rivest, R.L.2
  • 13
    • 78049242443 scopus 로고    scopus 로고
    • Optimal fuzzy control system using the cross-entropy method: A case study of a drilling process
    • R.E. Haber, R.M. del Toro, and A. Gajate Optimal fuzzy control system using the cross-entropy method: a case study of a drilling process Information Sciences 180 14 2010 2777 2792
    • (2010) Information Sciences , vol.180 , Issue.14 , pp. 2777-2792
    • Haber, R.E.1    Del Toro, R.M.2    Gajate, A.3
  • 14
    • 77953649722 scopus 로고    scopus 로고
    • ANGLE: An autonomous, normative and guidable agent with changing knowledge
    • B. Liao, and H. Huang ANGLE: an autonomous, normative and guidable agent with changing knowledge Information Sciences 180 17 2010 3117 3139
    • (2010) Information Sciences , vol.180 , Issue.17 , pp. 3117-3139
    • Liao, B.1    Huang, H.2
  • 15
    • 81355127243 scopus 로고    scopus 로고
    • Nonlinear systems design by a novel fuzzy neural system via hybridization of electromagnetism-like mechanism and particle swarm optimisation algorithms
    • C.H. Lee, and Y.C. Lee Nonlinear systems design by a novel fuzzy neural system via hybridization of electromagnetism-like mechanism and particle swarm optimisation algorithms Information Sciences 186 1 2012 59 72
    • (2012) Information Sciences , vol.186 , Issue.1 , pp. 59-72
    • Lee, C.H.1    Lee, Y.C.2
  • 16
    • 0034274591 scopus 로고    scopus 로고
    • A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms
    • T.S. Lim, W.Y. Loh, and Y.S. Shih A comparison of prediction accuracy, complexity, and training time of thirty-three old and new classification algorithms Machine Learning 40 2000 203 228
    • (2000) Machine Learning , vol.40 , pp. 203-228
    • Lim, T.S.1    Loh, W.Y.2    Shih, Y.S.3
  • 21
    • 19744380110 scopus 로고    scopus 로고
    • A multi-agent reinforcement learning approach to obtaining dynamic control policies for stochastic lot scheduling problem
    • C.D. Paternina-Arboledaa, and T.K. Das A multi-agent reinforcement learning approach to obtaining dynamic control policies for stochastic lot scheduling problem Simulation Modelling Practice and Theory 13 5 2005 389 406
    • (2005) Simulation Modelling Practice and Theory , vol.13 , Issue.5 , pp. 389-406
    • Paternina-Arboledaa, C.D.1    Das, T.K.2
  • 27
    • 79952312120 scopus 로고    scopus 로고
    • Hessian matrix distribution for bayesian policy gradient reinforcement learning
    • N.A. Vien, H. Yu, and T.C. Chung Hessian matrix distribution for bayesian policy gradient reinforcement learning Information Sciences 181 9 2011 1671 1685
    • (2011) Information Sciences , vol.181 , Issue.9 , pp. 1671-1685
    • Vien, N.A.1    Yu, H.2    Chung, T.C.3
  • 29
    • 76349113332 scopus 로고    scopus 로고
    • A modified gradient-based neuro-fuzzy learning algorithm and its convergence
    • W. Wu, L. Li, J. Yang, and Y. Liu A modified gradient-based neuro-fuzzy learning algorithm and its convergence Information Sciences 180 9 2010 1630 1642
    • (2010) Information Sciences , vol.180 , Issue.9 , pp. 1630-1642
    • Wu, W.1    Li, L.2    Yang, J.3    Liu, Y.4
  • 32
    • 77953650410 scopus 로고    scopus 로고
    • Searching for overlapping coalitions in multiple virtual organizations
    • Guofu Zhang, Jianguo Jiang, Zhaopin Su, Meibin Qi, and Hua Fang Searching for overlapping coalitions in multiple virtual organizations Information Sciences 180 17 2010 3140 3156
    • (2010) Information Sciences , vol.180 , Issue.17 , pp. 3140-3156
    • Zhang, G.1    Jiang, J.2    Su, Z.3    Qi, M.4    Fang, H.5


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.