메뉴 건너뛰기




Volumn 847 LNAI, Issue , 1994, Pages 1-9

Fuzzy reinforcement learning and dynamic programming

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTER CIRCUITS; DECISION MAKING; FUZZY LOGIC; MACHINE LEARNING; REINFORCEMENT LEARNING;

EID: 0004987125     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/3-540-58409-9_1     Document Type: Conference Paper
Times cited : (5)

References (14)
  • 3
    • 0003787146 scopus 로고
    • Princeton University Press, Princeton, NJ
    • R. Bellman. Dynamic Programming. Princeton University Press, Princeton, NJ, 1957.
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 4
    • 0346636333 scopus 로고
    • Decision-making in a fuzzy environment
    • R.E. Bellman and L.A. Zadeh. Decision-making in a fuzzy environment. Management Science, 17(4):B-141:B-164, 1970.
    • (1970) Management Science , vol.17 , Issue.4
    • Bellman, R.E.1    Zadeh, L.A.2
  • 8
    • 85027124419 scopus 로고    scopus 로고
    • Prioritized sweeping: Reinforcement learning with less data and less real time
    • page to appear
    • A. Moore and C. Atkeson. Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning, page to appear.
    • Machine Learning
    • Moore, A.1    Atkeson, C.2
  • 9
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • R.S. Sutton. Learning to predict by the methods of temporal differences. Machine Learning, 3:9-44, 1988.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 11
    • 0001046225 scopus 로고
    • Practical issues in temporal difference learning
    • G. Tesauro. Practical issues in temporal difference learning. Machine Learning, (8):257-277, 1992.
    • (1992) Machine Learning , Issue.8 , pp. 257-277
    • Tesauro, G.1
  • 12
    • 0000985504 scopus 로고
    • Td-gammon, a self-teaching backgammon program, achieves master-level play
    • G. Tesauro. Td-gammon, a self-teaching backgammon program, achieves master-level play. Neural Computation, 6(2):215-219, 1994.
    • (1994) Neural Computation , vol.6 , Issue.2 , pp. 215-219
    • Tesauro, G.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.