SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 2636, Issue , 2003, Pages 33-48

Cooperative learning using advice exchange

(2) Nunes, Luís a,b Oliveira, Eugénio a

a UNIVERSITY OF PORTO (Portugal)

b INSTITUTO UNIVERSITÁRIO DE LISBOA ISCTE IUL (Portugal)

Author keywords

[No Author keywords available]

Indexed keywords

BACKPROPAGATION; INTELLIGENT AGENTS; LEARNING SYSTEMS; SUPERVISED LEARNING; TRAFFIC CONTROL;

COOPERATIVE LEARNING; EXCHANGE MECHANISM; LEARNING PERFORMANCE; LEARNING PROCESS; LEARNING TECHNIQUES; MUTUAL INTERACTION; TRAFFIC SIMULATIONS; TRAFFIC-CONTROL SIMULATION;

MULTI AGENT SYSTEMS;

EID: 4544316833 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/3-540-44826-8 Document Type: Conference Paper

Times cited : (20)

References (39)

1
- 85152198941
- Multi-Agent Reinforcement Learning: Independent vs. Cooperative Agents
- Amherst, MA
- M. Tan. Multi-Agent Reinforcement Learning: Independent vs. Cooperative Agents. Proc. of the Tenth International Conference on Machine Learning, Amherst, MA, 330-337, 1993
- (1993) Proc. of the Tenth International Conference on Machine Learning , pp. 330-337
- Tan, M.¹

2
- 0000580224
- A Temporal-Difference Model of Classical Conditioning
- R. S. Sutton and A. G. Barto. A Temporal-Difference Model of Classical Conditioning. Tech Report GTE Labs. TR 87-509.2, 1987
- (1987) Tech Report GTE Labs , vol.2 , pp. 87-509
- Sutton, R.S.¹ Barto, A.G.²

3
- 85158158334
- A complexity Analisys of Cooperative Mechanisms in Reinforcement Learning
- S. D. Whitehead. A complexity Analisys of Cooperative Mechanisms in Reinforcement Learning. Proc. of the 9th National Conference on Artificial Inteligence (AAAI-91), 607-613, 1991
- (1991) Proc. of the 9th National Conference on Artificial Inteligence (AAAI-91) , pp. 607-613
- Whitehead, S.D.¹

4
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning and teaching
- Kluwer Academic publishers
- L.-J. Lin. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning 8: 293-321, Kluwer Academic publishers, 1992
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.-J.¹

5
- 34249833101
- Technical note: Q-learning
- Kluwer Academic publishers
- C. J. C. H. Watkins, P. D. Dayan. Technical note: Q-learning. Machine Learning 8, 3: 279-292, Kluwer Academic publishers, 1992
- (1992) Machine Learning 8 , vol.3 , pp. 279-292
- Watkins, C.J.C.H.¹ Dayan, P.D.²

6
- 0003812851
- A study of cooperative mechanisms for faster reinforcement learning
- Computer Science Department, University of Rochester
- S. D. Whitehead, D. H. Ballard. A study of cooperative mechanisms for faster reinforcement learning. TR 365, Computer Science Department, University of Rochester, 1991
- (1991) TR 365
- Whitehead, S.D.¹ Ballard, D.H.²

7
- 0342497766
- Technical Report CS-96-190, Brandeis University, Dept. of Computer Science
- M. J. Mataric. Using Communication to Reduce Locality in Distributed Multi-agent learning. Technical Report CS-96-190, Brandeis University, Dept. of Computer Science, 1996
- (1996) Using Communication to Reduce Locality in Distributed Multi-agent learning
- Mataric, M.J.¹

8
- 23144457147
- Teaching by shaping
- Workshop on Learning by Induction vs. Learning by Demonstration, Tahoe City, CA, USA
- C. Baroglio. Teaching by shaping. Proc. of ICML-95. Workshop on Learning by Induction vs. Learning by Demonstration, Tahoe City, CA, USA, 1995
- (1995) Proc. of ICML-95
- Baroglio, C.¹

9
- 0038849321
- Gerhard Weiß and Sandip Sen, editors, Adaptation and Learning in Multiagent Systems, Springer Verlag, Berlin
- J. A. Clouse. Learning from an automated training agent. Gerhard Weiß and Sandip Sen, editors, Adaptation and Learning in Multiagent Systems, Springer Verlag, Berlin, 1996
- (1996) Learning from an automated training agent
- Clouse, J.A.¹

10
- 0029678884
- On partially controlled multi-agent systems
- R. I. Brafman, M. Tennenholtz. On partially controlled multi-agent systems. Journal of Artificial Intelligence Research, 4: 477-507, 1996
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 477-507
- Brafman, R.I.¹ Tennenholtz, M.²

11
- 0010276944
- Implicit imitation in Multiagent Reinforcement Learning
- Bled, SI
- B. Price, C. Boutilier. Implicit imitation in Multiagent Reinforcement Learning. Proc. of the Sixteenth International Conference on Machine Learning, pp. 325-334. Bled, SI, 1999
- (1999) Proc. of the Sixteenth International Conference on Machine Learning , pp. 325-334
- Price, B.¹ Boutilier, C.²

12
- 0033685787
- Advantages of Cooperation Between Reinforcement Learning Agents in Difficult Stochastic Problems
- FUZZ-IEEE '00
- H. R. Berenji, D. Vengerov. Advantages of Cooperation Between Reinforcement Learning Agents in Difficult Stochastic Problems. Proc. Of the Ninth IEEE International Conference on Fuzzy Systems (FUZZ-IEEE '00), 2000
- (2000) Proc. Of the Ninth IEEE International Conference on Fuzzy Systems
- Berenji, H.R.¹ Vengerov, D.²

13
- 0031630561
- The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems
- July
- C. Claus, C. Boutilier. The Dynamics of Reinforcement Learning in Cooperative Multiagent Systems. Proc. of the Fifteenth National Conference on Artificial Intelligence (AAAI-98), 746-752, July 1998
- (1998) Proc. of the Fifteenth National Conference on Artificial Intelligence (AAAI-98) , pp. 746-752
- Claus, C.¹ Boutilier, C.²

14
- 0036932299
- Reinforcement learning of coordination in cooperative multiagent systems
- American Association for Artificial Intelligence
- S. Kapetanakis, D. Kudenko. Reinforcement learning of coordination in cooperative multiagent systems. Proc. of the Eighteenth National Conference on Artificial Intelligence, (AAAI02), 326-331, American Association for Artificial Intelligence 2002
- (2002) Proc. of the Eighteenth National Conference on Artificial Intelligence, (AAAI02) , pp. 326-331
- Kapetanakis, S.¹ Kudenko, D.²

15
- 0029732210
- Creating advicetaking reinforcement learners
- R. Maclin, J. Shavlik. Creating advicetaking reinforcement learners. Machine Learning 22: 251-281, 1997
- (1997) Machine Learning , vol.22 , pp. 251-281
- Maclin, R.¹ Shavlik, J.²

16
- 0002797521
- Learning in behaviour-based multi-robot systems: Policies, models and other agents
- Elsvier
- M. J. Mataric. Learning in behaviour-based multi-robot systems: Policies, models and other agents. Journal of Cognitive Systems Research 2: 81-93, Elsvier, 2001
- (2001) Journal of Cognitive Systems Research , vol.2 , pp. 81-93
- Mataric, M.J.¹

17
- 0003226481
- Primitive-based movement classification for humanoid imitation
- Cambridge, MA, MIT
- O. C. Jenkins, M. J. Mataric, S. Weber. Primitive-based movement classification for humanoid imitation. Proc. of the First International Conference on Humanoid Robotics (IEEE-RAS), Cambridge, MA, MIT, 2000
- (2000) Proc. of the First International Conference on Humanoid Robotics (IEEE-RAS)
- Jenkins, O.C.¹ Mataric, M.J.² Weber, S.³

18
- 0035438933
- Learning and interacting in human-robot domains
- K. Dau-tenhahn (Ed.)
- M. Nicoluescu, M. J. Mataric. Learning and interacting in human-robot domains. K. Dau-tenhahn (Ed.), IEEE Transactions on systems, Man Cybernetics, special issue on Socially Intelligent Agents - The Human In The Loop, 2001
- (2001) IEEE Transactions on systems, Man Cybernetics, special issue on Socially Intelligent Agents - The Human In The Loop
- Nicoluescu, M.¹ Mataric, M.J.²

19
- 0003200022
- Sensory-motor primitives as a basis for imitation: Linking perception to action and biology to robotics
- C. Nehaniv & K. Dautenhahn (Eds.), MIT Press
- M. J. Mataric. Sensory-motor primitives as a basis for imitation: Linking perception to action and biology to robotics. C. Nehaniv & K. Dautenhahn (Eds.), Imitation in animals and artifacts, MIT Press, 2001
- (2001) Imitation in animals and artifacts
- Mataric, M.J.¹

20
- 0030368639
- Scaling Up: Distributed Machine Learning with Cooperation
- F. J. Provost, D. N. Hennessy. Scaling Up: Distributed Machine Learning with Cooperation. Proc. of the Thirteenth National Conference on Artificial Intelligence, 1996
- (1996) Proc. of the Thirteenth National Conference on Artificial Intelligence
- Provost, F.J.¹ Hennessy, D.N.²

21
- 0003463297
- University of Michigan Press
- J. H. Holland. Adaptation in Natural and Artificial Systems. University of Michigan Press, 1975
- (1975) Adaptation in Natural and Artificial Systems
- Holland, J.H.¹

22
- 0003882343
- MIT Press, Cambridge MA
- J. R. Koza. Genetic programming: On the Programming of Computers by Means of Natural Selection. MIT Press, Cambridge MA, 1992
- (1992) Genetic programming: On the Programming of Computers by Means of Natural Selection
- Koza, J.R.¹

23
- 0000646059
- Learning internal representations by error propagation
- Foundations, Cambridge MA: MIT Press
- D. E. Rumelhart, G. E. Hinton, R. J. Wlliams. Learning internal representations by error propagation. Parallel Distributed Processing: Exploration in the Microstructure of Cognition, vol. 1: Foundations, 318-362, Cambridge MA: MIT Press, 1986
- (1986) Parallel Distributed Processing: Exploration in the Microstructure of Cognition , vol.1 , pp. 318-362
- Rumelhart, D.E.¹ Hinton, G.E.² Wlliams, R.J.³

24
- 0004355619
- PhD Thesis, Tech. Univ. Berlin
- R. Salustowicz. A Genetic Algorithm for the Topological Optimization of Neural Networks. PhD Thesis, Tech. Univ. Berlin, 1995
- (1995) A Genetic Algorithm for the Topological Optimization of Neural Networks
- Salustowicz, R.¹

25
- 0033362601
- Evolving artificial neural networks
- X. Yao. Evolving artificial neural networks. Proceedings of the IEEE, 87(9), 1423-1447, 1999
- (1999) Proceedings of the IEEE , vol.87 , Issue.9 , pp. 1423-1447
- Yao, X.¹

26
- 0040921099
- Fast learning in multilayered neural networks by means of hybrid evolutionary and gradient algorithms
- Moscow
- A.P. Topchy, O.A. Lebedko, V.V. Miagkikh. Fast learning in multilayered neural networks by means of hybrid evolutionary and gradient algorithms. Proc. of the International Conference on Evolutionary Computation and Its Applications, Moscow, 1996
- (1996) Proc. of the International Conference on Evolutionary Computation and Its Applications
- Topchy, A.P.¹ Lebedko, O.A.² Miagkikh, V.V.³

27
- 0030710786
- Exploring the effects of Lamarckian and Baldwinian learning in evolving recurrent neural networks
- K. W. C. Ku, M. W. Mak. Exploring the effects of Lamarckian and Baldwinian learning in evolving recurrent neural networks. Proc. of the IEEE International Conference on Evolutionary Computation, 617-621, 1997.
- (1997) Proc. of the IEEE International Conference on Evolutionary Computation , pp. 617-621
- Ku, K.W.C.¹ Mak, M.W.²

28
- 23144445904
- The Improvement and Comparison of different Algorithms for Optimizing Neural Networks on the MasPar {MP}-2
- ICSC Academic Press, Ed.M. Heiss
- W. Erhard, T. Fink, M. M. Gutzmann, C. Rahn, A. Doering, M. Galicki, The Improvement and Comparison of different Algorithms for Optimizing Neural Networks on the MasPar {MP}-2. Neural Computation {NC}'98, ICSC Academic Press, Ed.M. Heiss, 617-623, 1998
- (1998) Neural Computation {NC}'98 , pp. 617-623
- Erhard, W.¹ Fink, T.² Gutzmann, M.M.³ Rahn, C.⁴ Doering, A.⁵ Galicki, M.⁶

29
- 23144451497
- SA-Prop: Optimization of Multilayer Perceptron Parameters using Simulated Annealing
- P.A. Castillo, J. González, J.J. Merelo, V. Rivas, G. Romero, A. Prieto. SA-Prop: Optimization of Multilayer Perceptron Parameters using Simulated Annealing. Proc. of IWANN99, 1999
- (1999) Proc. of IWANN99
- Castillo, P.A.¹ González, J.² Merelo, J.J.³ Rivas, V.⁴ Romero, G.⁵ Prieto, A.⁶

30
- 0027708111
- Solving the Really Hard problems with Cooperative Search
- T. Hogg, C. P. Williams. Solving the Really Hard problems with Cooperative Search. Proc. of the Eleventh National Conference on Artificial Intelligence (AAAI-93), 231-236, 1993
- (1993) Proc. of the Eleventh National Conference on Artificial Intelligence (AAAI-93) , pp. 231-236
- Hogg, T.¹ Williams, C.P.²

31
- 0002831868
- Mutually supervised learning in multi-agent systems
- Montreal, CA, August
- C. Goldman, J. Rosenschein. Mutually supervised learning in multi-agent systems. Proc. of the IJCAI-95 Workshop on Adaptation and Learning in Multi-Agent Systems, Montreal, CA., August 1995
- (1995) Proc. of the IJCAI-95 Workshop on Adaptation and Learning in Multi-Agent Systems
- Goldman, C.¹ Rosenschein, J.²

32
- 1842538384
- Masters Thesis, Department of Computer Science, Colorado State University
- T. Thorpe. Vehicle Traffic Light Control Using SARSA. Masters Thesis, Department of Computer Science, Colorado State University, 1997
- (1997) Vehicle Traffic Light Control Using SARSA
- Thorpe, T.¹

33
- 0035509912
- Optimizing Traffic Lights in a Cellular Automaton Model for City Traffic
- E. Brockfeld, R. Barlovic, A. Schadschneider, M. Schreckenberg. Optimizing Traffic Lights in a Cellular Automaton Model for City Traffic. Physical Review E 64, 2001
- (2001) Physical Review E , vol.64
- Brockfeld, E.¹ Barlovic, R.² Schadschneider, A.³ Schreckenberg, M.⁴

34
- 84898400353
- On Learning By Exchanging advice
- Imperial College, London, April
- L. Nunes, E. Oliveira. On Learning By Exchanging advice. Symposium on Adaptive Agents and Multi-Agent Systems (AISB/AAMAS-II), Imperial College, London, April 2002
- (2002) Symposium on Adaptive Agents and Multi-Agent Systems (AISB/AAMAS-II)
- Nunes, L.¹ Oliveira, E.²

35
- 26444479778
- Science, Vol, May
- S. Kirkpatrick, C. D. Gelatt, M. P. Vecchi. Optimization by simulated Annealing. Science, Vol. 220: 671-680, May 1983
- (1983) Optimization by simulated Annealing , vol.220 , pp. 671-680
- Kirkpatrick, S.¹ Gelatt, C.D.² Vecchi, M.P.³

36
- 4544284544
- Evolution of Goal-Directed Behavior Using Limited Information in a Complex Environment
- July
- M. Glickman, K. Sycara. Evolution of Goal-Directed Behavior Using Limited Information in a Complex Environment. Proc. of the Genetic and Evolutionary Computation Conference (GECCO-99), July 1999
- (1999) Proc. of the Genetic and Evolutionary Computation Conference (GECCO-99)
- Glickman, M.¹ Sycara, K.²

37
- 85132026293
- Integrated architectures for learning planning and reacting based on approximating dynamic programming
- Morgan-Kaufman
- R. S. Sutton. Integrated architectures for learning planning and reacting based on approximating dynamic programming. Proc. of the Seventh International Conference on Machine Learning, 216-224, Morgan-Kaufman.
- Proc. of the Seventh International Conference on Machine Learning , pp. 216-224
- Sutton, R.S.¹

38
- 0001232636
- J. Phisique I
- K. Nagel, M Shreckenberg. A Cellular Automaton Model for Freeway Traffic. J. Phisique I, 2(12): 2221-2229, 1992
- (1992) A Cellular Automaton Model for Freeway Traffic , vol.2 , Issue.12 , pp. 2221-2229
- Nagel, K.¹ Shreckenberg, M.²

39
- 84962079179
- Believing others: Pros and Cons
- S. Sen, A. Biswas, S. Debnath. Believing others: Pros and Cons. Proc. of the Fourth International Conference on Multiagent Systems, 279-286, 2000
- (2000) Proc. of the Fourth International Conference on Multiagent Systems , pp. 279-286
- Sen, S.¹ Biswas, A.² Debnath, S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.