Volume 42, Issue 4, 2012, Pages 999-1004

Fusion of multiple behaviors using layered reinforcement learning

Author keywords

Behavior based control; intelligent robots; reinforcement learning

Indexed keywords

BEHAVIOR-BASED CONTROL; COMPLEX BEHAVIOR; CONTROL POLICY; DYNAMIC ENVIRONMENTS; POST PROCESS; Q-LEARNING; Q-LEARNING ALGORITHMS; STATE SPACE; TREE INDUCTION;

EID: 84862798631     PISSN: 10834427     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCA.2012.2183349     Document Type: Article
Times cited: 14

References (18)
  • 3
    • 13644265156
    • Reinforcement learning-based output feedback control of nonlinear systems with input constraints
    • P. He and S. Jagannathan, "Reinforcement learning-based output feedback control of nonlinear systems with input constraints," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 35, no. 1, pp. 150-154, Feb. 2005.
    • (2005) IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 35, no. 1, pp. 150-154
    • He, P.; Jagannathan, S.
  • 4
    • 0026154509
    • A survey of decision tree classifier methodology
    • S. R. Safavian and D. Landgrebe, "A survey of decision tree classifier methodology," IEEE Trans. Syst., Man, Cybern., vol. 21, no. 3, pp. 660-674, May/Jun. 1991.
    • (1991) IEEE Trans. Syst., Man, Cybern., vol. 21, no. 3, pp. 660-674
    • Safavian, S.R.; Landgrebe, D.
  • 5
    • 0002431740
    • Automatic construction of decision trees from data: A multi-disciplinary survey
    • S. K. Murthy, "Automatic construction of decision trees from data: A multi-disciplinary survey," Data Mining Knowl. Disc., vol. 2, no. 4, pp. 345-389, Dec. 1998.
    • (1998) Data Mining Knowl. Disc., vol. 2, no. 4, pp. 345-389
    • Murthy, S.K.
  • 6
    • 0022688781
    • A robust layered control system for a mobile robot
    • R. Brooks, "A robust layered control system for a mobile robot," IEEE J. Robot. Autom., vol. RA-2, no. 1, pp. 14-23, Mar. 1986.
    • (1986) IEEE J. Robot. Autom., vol. RA-2, no. 1, pp. 14-23
    • Brooks, R.
  • 7
    • 69549083130
    • Efficient behavior learning based on state value estimation of self and others
    • Y. Takahashi, K. Noma, and M. Asada, "Efficient behavior learning based on state value estimation of self and others," Adv. Robot., vol. 22, no. 12, pp. 1379-1395, 2008.
    • (2008) Adv. Robot., vol. 22, no. 12, pp. 1379-1395
    • Takahashi, Y.; Noma, K.; Asada, M.
  • 8
    • 0012075670
    • Towards a life-long learning soccer agent
    • A. Kleiner, M. Dietl, and B. Nebel, "Towards a life-long learning soccer agent," in Proc. Int. RoboCup Symp., Jun. 2002, pp. 119-127.
    • (2002) Proc. Int. RoboCup Symp., pp. 119-127
    • Kleiner, A.; Dietl, M.; Nebel, B.
  • 11
    • 40949147745
    • A comprehensive survey of multiagent reinforcement learning
    • L. Busoniu, R. Babuska, and B. Schutter, "A comprehensive survey of multiagent reinforcement learning," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 2, pp. 156-172, Mar. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 2, pp. 156-172
    • Busoniu, L.; Babuska, R.; Schutter, B.
  • 12
    • 0001815269
    • Constructing optimal binary decision trees is NP-complete
    • L. Hyafil and R. L. Rivest, "Constructing optimal binary decision trees is NP-complete," Inf. Process. Lett., vol. 5, no. 1, pp. 15-17, 1976.
    • (1976) Inf. Process. Lett., vol. 5, no. 1, pp. 15-17
    • Hyafil, L.; Rivest, R.L.
  • 13
    • 0037365123
    • Decision forest: Combining the predictions of multiple independent decision tree models
    • W. Tong, H. Hong, H. Fang, Q. Xie, and R. Perkins, "Decision forest: Combining the predictions of multiple independent decision tree models," J. Chem. Inf. Comput. Sci., vol. 43, no. 2, pp. 525-531, Mar./Apr. 2003.
    • (2003) J. Chem. Inf. Comput. Sci., vol. 43, no. 2, pp. 525-531
    • Tong, W.; Hong, H.; Fang, H.; Xie, Q.; Perkins, R.
  • 14
  • 15
    • 14344261491
    • Using relative novelty to identify useful temporal abstractions in reinforcement learning
    • O. Simsek and A. G. Barto, "Using relative novelty to identify useful temporal abstractions in reinforcement learning," in Proc. 21st Int. Conf. Mach. Learn., 2004, p. 95.
    • (2004) Proc. 21st Int. Conf. Mach. Learn., p. 95
    • Simsek, O.; Barto, A.G.
  • 16
    • 46449122257
    • Behavior cloning by a self-organizing decision tree
    • K. S. Hwang, Y. J. Chen, and T. H. Yang, "Behavior cloning by a self-organizing decision tree," in Proc. IEEE ICIT, 2007, pp. 20-24.
    • (2007) Proc. IEEE ICIT, pp. 20-24
    • Hwang, K.S.; Chen, Y.J.; Yang, T.H.
  • 17
    • 34548139025
    • Self organizing decision tree based on reinforcement learning and its application on state space partition
    • K. S. Hwang, T. W. Yang, and C. J. Lin, "Self organizing decision tree based on reinforcement learning and its application on state space partition," in Proc. IEEE Int. Conf. Syst., Man, Cybern., 2006, pp. 5088-5093.
    • (2006) Proc. IEEE Int. Conf. Syst., Man, Cybern., pp. 5088-5093
    • Hwang, K.S.; Yang, T.W.; Lin, C.J.
  • 18
    • 0035361059
    • Look-ahead based fuzzy decision tree induction
    • M. Dong and R. Kothari, "Look-ahead based fuzzy decision tree induction," IEEE Trans. Fuzzy Syst., vol. 9, no. 3, pp. 461-468, Jun. 2001.
    • (2001) IEEE Trans. Fuzzy Syst., vol. 9, no. 3, pp. 461-468
    • Dong, M.; Kothari, R.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.