SCOPUS Information Retrieval Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 5177 LNAI, Issue PART 1, 2008, Pages 182-193

Using generalized learning automata for state space aggregation in MAS

(3) De Hauwere, Yann Michaël (a); Vrancx, Peter (a); Nowé, Ann (a)

(a) VRIJE UNIVERSITEIT BRUSSEL (Belgium)

Author keywords

[No Author keywords available]

Indexed keywords

AUTOMATA THEORY; KNOWLEDGE BASED SYSTEMS; LARGE SCALE SYSTEMS; LEARNING SYSTEMS; REINFORCEMENT LEARNING; ROBOTS; SOFTWARE AGENTS;

DISCRETE STATE SPACE; GENERALIZED LEARNING; INDEPENDENT LEARNING; MULTI-AGENT ALGORITHMS; MULTI-AGENT ENVIRONMENT; MULTI-AGENT LEARNING; MULTI-AGENT REINFORCEMENT LEARNING; MULTI-AGENT SETTING;

MULTI AGENT SYSTEMS;

EID: 57849155854 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-540-85563-7_28 Document Type: Conference Paper

Times cited : (1)

References (13)

1
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Sutton, R.S., Precup, D., Singh, S.P.: Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence 112(1-2), 181-211 (1999)
- (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.P.³

2
- 84912073624
- Learning options in reinforcement learning
- Stolle, M., Precup, D.: Learning options in reinforcement learning. In: Koenig, S., Holte, R.C. (eds.) SARA 2002. LNCS (LNAI), vol. 2371, pp. 212-223. Springer, Heidelberg (2002)

3
- 0010220982
- Planning, learning and coordination in multiagent decision processes
- Boutilier, C.: Planning, learning and coordination in multiagent decision processes. In: Theoretical Aspects of Rationality and Knowledge, pp. 195-201 (1996)
- (1996) Theoretical Aspects of Rationality and Knowledge , pp. 195-201
- Boutilier, C.¹

4
- 85166207010
- Exploiting structure in policy construction
- Boutilier, C., Dearden, R., Goldszmidt, M.: Exploiting structure in policy construction. In: Mellish, C. (ed.) Proceedings of the 14th International Joint Conference on Artificial Intelligence, pp. 1104-1111. Morgan Kaufmann, San Francisco (1995)
- (1995) Proceedings of the 14th International Joint Conference on Artificial Intelligence , pp. 1104-1111
- Boutilier, C.¹ Dearden, R.² Goldszmidt, M.³

5
- 0012296128
- Multiagent planning with factored mdps
- Guestrin, C., Koller, D., Parr, R.: Multiagent planning with factored mdps. In: 14th Neural Information Processing Systems (NIPS-14) (2001)
- (2001) 14th Neural Information Processing Systems (NIPS-14)
- Guestrin, C.¹ Koller, D.² Parr, R.³

6
- 33749242809
- Learning the structure of factored markov decision processes in reinforcement learning problems
- Degris, T., Sigaud, O., Wuillemin, P.H.: Learning the structure of factored markov decision processes in reinforcement learning problems. In: Proceedings of the 23rd International Conference on Machine learning, New York, NY, USA, pp. 257-264 (2006)
- (2006) Proceedings of the 23rd International Conference on Machine learning , pp. 257-264
- Degris, T.¹ Sigaud, O.² Wuillemin, P.H.³

7
- 36348930987
- Efficient structure learning in factored-state mdps
- Strehl, A.L., Diuk, C., Littman, M.L.: Efficient structure learning in factored-state mdps. In: AAAI, pp. 645-650. AAAI Press, Menlo Park (2007)
- (2007) AAAI , pp. 645-650
- Strehl, A.L.¹ Diuk, C.² Littman, M.L.³

8
- 33747670266
- Learning factor graphs in polynomial time and sample complexity
- Abbeel, P., Koller, D., Ng, A.Y.: Learning factor graphs in polynomial time and sample complexity. Journal of Machine Learning Research 7, 1743-1788 (2006)
- (2006) Journal of Machine Learning Research , vol.7 , pp. 1743-1788
- Abbeel, P.¹ Koller, D.² Ng, A.Y.³

9
- 0002500351
- Planning, Learning and Coordination in Multiagent Decision Processes
- Boutilier, C.: Planning, Learning and Coordination in Multiagent Decision Processes. In: Proceedings of the Sixth Conference on Theoretical Aspects of Rationality and Knowledge, pp. 195-210 (1996)
- (1996) Proceedings of the Sixth Conference on Theoretical Aspects of Rationality and Knowledge , pp. 195-210
- Boutilier, C.¹

10
- 29344475738
- Solving factored MDPs with continuous and discrete variables
- Guestrin, C., Hauskrecht, M., Kveton, B.: Solving factored MDPs with continuous and discrete variables. In: Proceedings of the 20th conference on Uncertainty in artificial intelligence, pp. 235-242 (2004)
- (2004) Proceedings of the 20th conference on Uncertainty in artificial intelligence , pp. 235-242
- Guestrin, C.¹ Hauskrecht, M.² Kveton, B.³

11
- 0000337576
- Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning
- Williams, R.: Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning. Reinforcement Learning 8, 229-256 (1992)
- (1992) Reinforcement Learning , vol.8 , pp. 229-256
- Williams, R.¹

12
- 2942609194
- Thathachar, M., Sastry, P.: Networks of Learning Automata: Techniques for Online Stochastic Optimization. Kluwer Academic Pub., Dordrecht (2004)
- (2004) Networks of Learning Automata: Techniques for Online Stochastic Optimization
- Thathachar, M.¹ Sastry, P.²

13
- 0011812680
- Local and global optimization algorithms for generalized learning automata
- Phansalkar, V., Thathachar, M.: Local and global optimization algorithms for generalized learning automata. Neural Computation 7(5), 950-973 (1995)
- (1995) Neural Computation , vol.7 , Issue.5 , pp. 950-973
- Phansalkar, V.¹ Thathachar, M.²

* This information was extracted by KISTI through analysis of Elsevier's SCOPUS DB.