메뉴 건너뛰기




Volumn , Issue , 2008, Pages 518-524

Consistency of fuzzy model-based reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

FUZZY LOGIC; FUZZY SYSTEMS; LEARNING ALGORITHMS; LEARNING SYSTEMS; POLYNOMIAL APPROXIMATION; PROBABILITY DENSITY FUNCTION; REINFORCEMENT; REINFORCEMENT LEARNING; SOLUTIONS;

EID: 55249099118     PISSN: 10987584     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/FUZZY.2008.4630417     Document Type: Conference Paper
Times cited : (7)

References (23)
  • 4
    • 49949101369 scopus 로고    scopus 로고
    • _, Continuous-state reinforcement learning with fuzzy approximation, in Adaptive Agents and Multi-Agent Systems III, ser. Lecture Notes in Computer Science, K. Tuyls. A. Nowé, Z. Guessoum, and D. Kudenko. Eds. Springer. 2008, 4865, pp. 27-43.
    • _, "Continuous-state reinforcement learning with fuzzy approximation," in Adaptive Agents and Multi-Agent Systems III, ser. Lecture Notes in Computer Science, K. Tuyls. A. Nowé, Z. Guessoum, and D. Kudenko. Eds. Springer. 2008, vol. 4865, pp. 27-43.
  • 8
    • 0026923465 scopus 로고
    • Learning and tuning fuzzy logic controllers through reinforcements
    • H. R. Berenji and P. Khedkar, "Learning and tuning fuzzy logic controllers through reinforcements." IEEE Transactions on Neural Networks, vol. 3, no. 5. pp. 724-740, 1992.
    • (1992) IEEE Transactions on Neural Networks , vol.3 , Issue.5 , pp. 724-740
    • Berenji, H.R.1    Khedkar, P.2
  • 9
    • 0041877717 scopus 로고    scopus 로고
    • A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters
    • H. R. Berenji and D. Vengerov, "A convergent actor-critic-based FRL algorithm with application to power management of wireless transmitters." IEEE Transactions on Fuzzy Systems, vol. 11, no. 4, pp. 478-485, 2003.
    • (2003) IEEE Transactions on Fuzzy Systems , vol.11 , Issue.4 , pp. 478-485
    • Berenji, H.R.1    Vengerov, D.2
  • 11
    • 0041876271 scopus 로고    scopus 로고
    • A reinforcement learning adaptive fuzzy controller for robots
    • C.-K. Lin. "A reinforcement learning adaptive fuzzy controller for robots," Fuzzy Sets and Systems, vol. 137, no. 3, pp. 339-352, 2003.
    • (2003) Fuzzy Sets and Systems , vol.137 , Issue.3 , pp. 339-352
    • Lin, C.-K.1
  • 12
    • 0026206780 scopus 로고
    • An optimal one-way multigrid algorithm for discrete-time stochastic control
    • 12
    • [ 12] C.-S. Chow and J. Tsitsiklis, "An optimal one-way multigrid algorithm for discrete-time stochastic control," IEEE Transactions on Automatic Control, vol. 36, no. 8, pp. 898-914, 1991.
    • (1991) IEEE Transactions on Automatic Control , vol.36 , Issue.8 , pp. 898-914
    • Chow, C.-S.1    Tsitsiklis, J.2
  • 14
    • 0029752470 scopus 로고    scopus 로고
    • Feature-based methods for large scale dynamic programming
    • J. N. Tsitsiklis and B. Van Roy. "Feature-based methods for large scale dynamic programming," Machine Learning, vol. 22, no. 1-3, pp. 59-94, 1996.
    • (1996) Machine Learning , vol.22 , Issue.1-3 , pp. 59-94
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 15
    • 0008872081 scopus 로고    scopus 로고
    • Analysis of a numerical dynamic programming algorithm applied to economic models
    • M. S. Santos and J. Vigo-Aguiar, "Analysis of a numerical dynamic programming algorithm applied to economic models." Econometrica, vol. 66, no. 2, pp. 409-426, 1998.
    • (1998) Econometrica , vol.66 , Issue.2 , pp. 409-426
    • Santos, M.S.1    Vigo-Aguiar, J.2
  • 17
    • 22944460232 scopus 로고    scopus 로고
    • Convergence and divergence in standard and averaging reinforcement learning
    • Pisa. Italy, 20-24 September
    • M. Wiering, "Convergence and divergence in standard and averaging reinforcement learning," in Proceedings 15th European Conference on Machine Learning (ECML'04), Pisa. Italy, 20-24 September 2004, pp. 477-488.
    • (2004) Proceedings 15th European Conference on Machine Learning (ECML'04) , pp. 477-488
    • Wiering, M.1
  • 18
    • 0036832956 scopus 로고    scopus 로고
    • Kernel-based reinforcement learning
    • D. Ormoneit and S. Sen. "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2-3, pp. 161-178, 2002.
    • (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
    • Ormoneit, D.1    Sen, S.2
  • 21
    • 85153965130 scopus 로고
    • Reinforcement learning with soft state aggregation
    • G. Tesauro, D. S. Touretzky, and T. K. Leen, Eds
    • S. P. Singh. T. Jaakkola, and M. I. Jordan, "Reinforcement learning with soft state aggregation." in Advances in Neural Information Processing Systems 7, G. Tesauro, D. S. Touretzky, and T. K. Leen, Eds., 1995, pp. 361-368.
    • (1995) Advances in Neural Information Processing Systems , vol.7 , pp. 361-368
    • Singh, S.P.1    Jaakkola, T.2    Jordan, M.I.3
  • 22
    • 1442288723 scopus 로고    scopus 로고
    • Near optimal closed-loop control. Application to electric power systems,
    • Ph.D. dissertation, University of Liège, Belgium. March
    • D. Ernst. "Near optimal closed-loop control. Application to electric power systems," Ph.D. dissertation, University of Liège, Belgium. March 2003.
    • (2003)
    • Ernst, D.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.