Volume 4865 LNAI, 2008, Pages 27-43

Continuous-state reinforcement learning with fuzzy approximation

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE SYSTEMS; AGENTS; ALGORITHMS; APPROXIMATION ALGORITHMS; CONTROL THEORY; EDUCATION; FUZZY CONTROL; INTELLIGENT AGENTS; ITERATIVE METHODS; LEARNING ALGORITHMS; LEARNING SYSTEMS; POLYNOMIAL APPROXIMATION; REINFORCEMENT; REINFORCEMENT LEARNING;

EID: 49949101369     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-77949-0_3     Document Type: Conference Paper
Times cited: 17

References (21)
  • 7
    • 0026923465
    • Learning and tuning fuzzy logic controllers through reinforcements
    • Berenji, H.R., Khedkar, P.: Learning and tuning fuzzy logic controllers through reinforcements. IEEE Transactions on Neural Networks 3(5), 724-740 (1992)
    • (1992) IEEE Transactions on Neural Networks , vol.3 , Issue.5 , pp. 724-740
    • Berenji, H.R.; Khedkar, P.
  • 8
    • 0041877717
    • A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
    • Berenji, H.R., Vengerov, D.: A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters. IEEE Transactions on Fuzzy Systems 11(4), 478-485 (2003)
    • (2003) IEEE Transactions on Fuzzy Systems , vol.11 , Issue.4 , pp. 478-485
    • Berenji, H.R.; Vengerov, D.
  • 10
    • 0041876271
    • A reinforcement learning adaptive fuzzy controller for robots
    • Lin, C.K.: A reinforcement learning adaptive fuzzy controller for robots. Fuzzy Sets and Systems 137, 339-352 (2003)
    • (2003) Fuzzy Sets and Systems , vol.137 , pp. 339-352
    • Lin, C.K.
  • 11
    • 0029752470
    • Feature-based methods for large scale dynamic programming
    • Tsitsiklis, J.N., Van Roy, B.: Feature-based methods for large scale dynamic programming. Machine Learning 22(1-3), 59-94 (1996)
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 59-94
    • Tsitsiklis, J.N.; Van Roy, B.
  • 14
    • 22944460232
    • Wiering, M.: Convergence and divergence in standard and averaging reinforcement learning. In: Boulicaut, J.-F., Esposito, F., Giannotti, F., Pedreschi, D. (eds.) ECML 2004. LNCS (LNAI), vol. 3201, pp. 477-488. Springer, Heidelberg (2004)
  • 15
    • 0036832956
    • Kernel-based reinforcement learning
    • Ormoneit, D., Sen, S.: Kernel-based reinforcement learning. Machine Learning 49(2-3), 161-178 (2002)
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
    • Ormoneit, D.; Sen, S.
  • 19
    • 1442288723
    • Near Optimal Closed-loop Control. Application to Electric Power Systems
    • Ernst, D.: Near Optimal Closed-loop Control. Application to Electric Power Systems. PhD thesis, University of Liège, Belgium (March 2003)
    • (2003) PhD thesis, University of Liège, Belgium
    • Ernst, D.
  • 20
    • 0036832953
    • Variable-resolution discretization in optimal control
    • Munos, R., Moore, A.: Variable-resolution discretization in optimal control. Machine Learning 49(2-3), 291-323 (2002)
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 291-323
    • Munos, R.; Moore, A.
  • 21
    • 26944466214
    • Sherstov, A., Stone, P.: Function approximation via tile coding: Automating parameter choice. In: Zucker, J.-D., Saitta, L. (eds.) SARA 2005. LNCS (LNAI), vol. 3607, pp. 194-205. Springer, Heidelberg (2005)


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.