메뉴 건너뛰기




Volumn 1886 LNAI, Issue , 2000, Pages 125-135

Experience-based reinforcement learning to acquire effective behavior in a multi-agent domain

Author keywords

[No Author keywords available]

Indexed keywords

INTELLIGENT AGENTS; MACHINE LEARNING; MONTE CARLO METHODS; PROFITABILITY; REINFORCEMENT LEARNING; WAGES; ARTIFICIAL INTELLIGENCE; COMPENSATION (PERSONNEL); MULTI AGENT SYSTEMS;

EID: 84867798807     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/3-540-44533-1_16     Document Type: Conference Paper
Times cited : (23)

References (17)
  • 6
    • 0000146518 scopus 로고
    • Credit assignment in rule discovery systems based on genetic algorithms
    • Grefenstette, J. J.: Credit Assignment in Rule Discovery Systems Based on Genetic Algorithms, Machine Learning Vol.3, pp.225-245(1988).
    • (1988) Machine Learning , vol.3 , pp. 225-245
    • Grefenstette, J.J.1
  • 8
    • 85153938292 scopus 로고
    • Reinforcement learning algorithm for partially observable Markov decision problems
    • Jaakkola, T., Singh, S.P. and Jordan, M.I.: Reinforcement Learning Algorithm for Partially Observable Markov decision Problems, Advances in Neural Information Processing Systems 7(NIPS-94), pp.345-352 (1994).
    • (1994) Advances in Neural Information Processing Systems , vol.7 , Issue.NIPS-94 , pp. 345-352
    • Jaakkola, T.1    Singh, S.P.2    Jordan, M.I.3
  • 10
    • 0030647149 scopus 로고    scopus 로고
    • Reinforcement learning in the multi-robot domain
    • Mataric, J.M.: Reinforcement Learning in the Multi-Robot Domain, Autonomous Robots 4(1), pp.77-83(1997).
    • (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 77-83
    • Mataric, J.M.1
  • 12
    • 0004762325 scopus 로고
    • Multiagent coordination with learning classifier systems
    • Weiss, G. and Sen, S.(eds.) Berlin, Heidelberg. Springer Verlag
    • Sen, S. and Sekaran, M.: Multiagent Coordination with Learning Classifier Systems, in Weiss, G. and Sen, S.(eds.), Adaption and Learning in Multi-agent systems, Berlin, Heidelberg. Springer Verlag, pp.218-233(1995).
    • (1995) Adaption and Learning in Multi-agent Systems , pp. 218-233
    • Sen, S.1    Sekaran, M.2
  • 13
    • 0029753630 scopus 로고    scopus 로고
    • Reinforcement learning with replacing eligibility traces
    • Singh, S.P. and Sutton, R.S.: Reinforcement Learning with Replacing Eligibility Traces, Machine Learning Vol.22, pp.1-37(1996).
    • (1996) Machine Learning , vol.22 , pp. 1-37
    • Singh, S.P.1    Sutton, R.S.2
  • 15
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal differences
    • Sutton, R.S.: Learning to Predict by the Methods of Temporal Differences, Machine Learning, Vol. 3, pp.9-44(1988).
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 16
    • 34249833101 scopus 로고
    • Technical note: Q-learning
    • Watkins, C. J. H., and Dayan, P.: Technical note: Q-learning, Machine Learning Vol.8, pp.55-68(1992).
    • (1992) Machine Learning , vol.8 , pp. 55-68
    • Watkins, C.J.H.1    Dayan, P.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.