메뉴 건너뛰기




Volumn 5177 LNAI, Issue PART 1, 2008, Pages 182-193

Using generalized learning automata for state space aggregation in MAS

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATA THEORY; KNOWLEDGE BASED SYSTEMS; LARGE SCALE SYSTEMS; LEARNING SYSTEMS; REINFORCEMENT LEARNING; ROBOTS; SOFTWARE AGENTS;

EID: 57849155854     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-540-85563-7_28     Document Type: Conference Paper
Times cited : (1)

References (13)
  • 1
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R.S., Precup, D., Singh, S.P.: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112(1-2), 181-211 (1999)
    • (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.P.3
  • 2
    • 84912073624 scopus 로고    scopus 로고
    • Stolle, M., Precup, D.: Learning options in reinforcement learning. In: Koenig, S., Holte, R.C. (eds.) SARA 2002. LNCS (LNAI), 2371, pp. 212-223. Springer, Heidelberg (2002)
    • Stolle, M., Precup, D.: Learning options in reinforcement learning. In: Koenig, S., Holte, R.C. (eds.) SARA 2002. LNCS (LNAI), vol. 2371, pp. 212-223. Springer, Heidelberg (2002)
  • 3
    • 0010220982 scopus 로고    scopus 로고
    • Planning, learning and coordination in multiagent decision processes
    • Boutilier, C.: Planning, learning and coordination in multiagent decision processes. In: Theoretical Aspects of Rationality and Knowledge, pp. 195-201 (1996)
    • (1996) Theoretical Aspects of Rationality and Knowledge , pp. 195-201
    • Boutilier, C.1
  • 8
    • 33747670266 scopus 로고    scopus 로고
    • Learning factor graphs in polynomial time and sample complexity
    • Abbeel, P., Koller, D., Ng, A.Y.: Learning factor graphs in polynomial time and sample complexity. Journal of Machine Learning Research 7, 1743-1788 (2006)
    • (2006) Journal of Machine Learning Research , vol.7 , pp. 1743-1788
    • Abbeel, P.1    Koller, D.2    Ng, A.Y.3
  • 11
    • 0000337576 scopus 로고
    • Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning
    • Williams, R.: Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning. Reinforcement Learning 8, 229-256 (1992)
    • (1992) Reinforcement Learning , vol.8 , pp. 229-256
    • Williams, R.1
  • 13
    • 0011812680 scopus 로고
    • Local and global optimization algorithms for generalized learning automata
    • Phansalkar, V., Thathachar, M.: Local and global optimization algorithms for generalized learning automata. Neural Computation 7(5), 950-973 (1995)
    • (1995) Neural Computation , vol.7 , Issue.5 , pp. 950-973
    • Phansalkar, V.1    Thathachar, M.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.