SCOPUS 정보 검색 플랫폼

Autonomous Agents and Multi-Agent Systems

Volumn 15, Issue 2, 2007, Pages 147-196

A framework for meta-level control in multi-agent systems

(2) Raja, Anita a Lesser, Victor b

a University of North Carolina at Charlotte (United States)

b Biologically Inspired Neural and Dynamical Systems Laboratory (United States)

Author keywords

Bounded rationality; Meta level control architecture; Multi agent systems

Indexed keywords

EID: 34548107638 PISSN: 13872532 EISSN: 15737454 Source Type: Journal
DOI: 10.1007/s10458-006-9008-z Document Type: Article

Times cited : (46)

References (61)

1
- 0020970738
- IEEE Transactions on Systems, Man, and Cybernetics, SMC-13
- Barto, A., Sutton, R., & Anderson, C. (1983). Neuronlike adaptive elements that can solve difficult learning control problems. IEEE Transactions on Systems, Man, and Cybernetics, SMC-13, 834-846.
- (1983) , pp. 834-846
- Barto, A.¹ Sutton, R.² Anderson, C.³

2
- 0003487482
- Athena Scientific, Belmont, MA
- Bertsekas, D., & Tsitsiklis, J. (1996). Neuro-dynamic programming. Athena Scientific, Belmont, MA.
- (1996) Neuro-dynamic programming
- Bertsekas, D.¹ Tsitsiklis, J.²

3
- 0028447220
- Decision-theoretic deliberation scheduling for problem solving in time-constrained environments
- Boddy, M., & Dean, T. (1994). Decision-theoretic deliberation scheduling for problem solving in time-constrained environments, Artificial Intelligence, 67(2), 245-286.
- (1994) Artificial Intelligence , vol.67 , Issue.2 , pp. 245-286
- Boddy, M.¹ Dean, T.²

4
- 84880690163
- Sequential, optimality and coordination in multiagent systems
- Boutilier, C. (1999). Sequential, optimality and coordination in multiagent systems. In Proceedings of the sixteenth international joint conferences on artificial intelligence. (IJCAI-99) pp.478-485.
- (1999) Proceedings of the sixteenth international joint conferences on artificial intelligence. (IJCAI-99) , pp. 478-485
- Boutilier, C.¹

5
- 0008458860
- Improving elevator performance using reinforcement learning, Multi-ag In :1017-1023
- Crites, R., & Barto, A. (1996). Improving elevator performance using reinforcement learning, Multi-ag In Advances in Neural Information Processing Systems, pages 8:1017-1023.
- (1996) Advances in Neural Information Processing Systems , pp. 8
- Crites, R.¹ Barto, A.²

6
- 84880655104
- An analysis of time-dependent planning
- Saint Paul, Minnesota, USA: AAAI Press/MIT Press
- Dean, T., & Boddy, M. (1988). An analysis of time-dependent planning. In Proceedings of the seventh national conference on artificial intelligence (AAAI-88) (pp. 49-54). Saint Paul, Minnesota, USA: AAAI Press/MIT Press.
- (1988) Proceedings of the seventh national conference on artificial intelligence (AAAI-88) , pp. 49-54
- Dean, T.¹ Boddy, M.²

7
- 0001700825
- Taems: A framework for environment centered analysis and design of coordination mechanisms
- G. O'Hare & N. Jennings, Eds, Wiley Inter-Science
- Decker, K. (1996). Taems: a framework for environment centered analysis and design of coordination mechanisms. In G. O'Hare & N. Jennings, (Eds.), Foundations of Distributed Artificial Intelligence, Chapter 16 (pp. 429-448). Wiley Inter-Science.
- (1996) Foundations of Distributed Artificial Intelligence , pp. 429-448
- Decker, K.¹

8
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- Dietterich, T. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.¹

9
- 9144222344
- What is rational psychology? toward a modern mental philosophy
- Doyle, J. (1983). What is rational psychology? toward a modern mental philosophy. AI Magazine, 4(3), 50-53.
- (1983) AI Magazine , vol.4 , Issue.3 , pp. 50-53
- Doyle, J.¹

10
- 0027701502
- Design-to-time real-time scheduling
- Garvey, A. & Lesser, V. (1993). Design-to-time real-time scheduling. IEEE Transactions on Systems, Man, and Cybernetics, 23 (6):1491-1502.
- (1993) IEEE Transactions on Systems, Man, and Cybernetics , vol.23 , Issue.6 , pp. 1491-1502
- Garvey, A.¹ Lesser, V.²

11
- 0000931858
- Reactive reasoning and planning
- Seattle, WA
- Georgeff, M. & Lansky, A. (1987). Reactive reasoning and planning. In Proceedings of the sixth national conference on artificial intelligence (AAAI-87) pp. 677-682 Seattle, WA.
- (1987) Proceedings of the sixth national conference on artificial intelligence (AAAI-87) , pp. 677-682
- Georgeff, M.¹ Lansky, A.²

12
- 33747182896
- Managing online self-adaptation in real-time environments
- Goldman, R., Musliner, D. & Krebsbach, K. (2003). Managing online self-adaptation in real-time environments. In LNCS, vol. 2614, SV, pp. 6-23.
- (2003) LNCS , vol.2614 , Issue.SV , pp. 6-23
- Goldman, R.¹ Musliner, D.² Krebsbach, K.³

13
- 0003106875
- Twenty-seven principles of rationality
- V. P. Godambe & D. A. Sprott, Eds, Toronto: Holt Rinehart Wilson
- Good, I. J. (1971). Twenty-seven principles of rationality. In V. P. Godambe & D. A. Sprott, (Eds.), Foundations of statistical inference (pp. 108-141). Toronto: Holt Rinehart Wilson.
- (1971) Foundations of statistical inference , pp. 108-141
- Good, I.J.¹

14
- 0004867057
- Monitoring anytime algorithms
- Hansen, E. & Zilberstein, S. (1996). Monitoring anytime algorithms. SIGART Bulletin, 7(2), 28-33.
- (1996) SIGART Bulletin , vol.7 , Issue.2 , pp. 28-33
- Hansen, E.¹ Zilberstein, S.²

15
- 0343224519
- Extended abstract: Learning search strategies
- Stanford, CA
- Harada, D. & Russell, S. (1999). Extended abstract: Learning search strategies. In Proceedings AAAI spring symposium on search techniques for problem solving under uncertainty and incom-plete information, Stanford, CA, 1999.
- (1999) Proceedings AAAI spring symposium on search techniques for problem solving under uncertainty and incom-plete information
- Harada, D.¹ Russell, S.²

16
- 0027694654
- Hayes-Roth, B. (1993). Opportunistic control of action in intelligent agents. In Proceedings of IEEE transactions on systems, man and cybernetics, pp. SMC-23(6), 1575-1587.
- Hayes-Roth, B. (1993). Opportunistic control of action in intelligent agents. In Proceedings of IEEE transactions on systems, man and cybernetics, pp. SMC-23(6), 1575-1587.

17
- 0028579426
- Guardian: A prototype intelligent agent for intensive-care monitoring
- Hayes-Roth, B., Uckun, S., Larsson, X E., Gaba, D., Barr, J. & Chien, J. (1994). Guardian: A prototype intelligent agent for intensive-care monitoring. In Proceedings of the national conference on artificial intelligence, pp. 1503-1511.
- (1994) Proceedings of the national conference on artificial intelligence , pp. 1503-1511
- Hayes-Roth, B.¹ Uckun, S.² Larsson, X.E.³ Gaba, D.⁴ Barr, J.⁵ Chien, J.⁶

18
- 14744296052
- Multi-agent system simulation framework
- Switzerland: EPFL, Lausanne
- Horling, B., Lesser, V. & Vincent, R. (2000). Multi-agent system simulation framework. In sixteenth IMACS World Congress 2000 on scientific computation, applied mathematics and simulation. Switzerland: EPFL, Lausanne.
- (2000) sixteenth IMACS World Congress 2000 on scientific computation, applied mathematics and simulation
- Horling, B.¹ Lesser, V.² Vincent, R.³

19
- 31344442254
- The soft real-time agent control architecture
- Horling, B., Lesser, V., Vincent, R. & Wagner, T. (2006). The soft real-time agent control architecture. Autonomous Agents and Multi-Agent Systems, 12(1), 35-92.
- (2006) Autonomous Agents and Multi-Agent Systems , vol.12 , Issue.1 , pp. 35-92
- Horling, B.¹ Lesser, V.² Vincent, R.³ Wagner, T.⁴

20
- 84959057080
- Reasoning under varying and uncertain resource constraints
- Horvitz, E. (1988). Reasoning under varying and uncertain resource constraints. In National conference on artificial intelligence of the american association for AI (AAAI-88), pp. 111-116.
- (1988) National conference on artificial intelligence of the american association for AI (AAAI-88) , pp. 111-116
- Horvitz, E.¹

21
- 0004280606
- PhD thesis, Stanford University
- Kaelbling, L. (1990). Learning in embedded systems. PhD thesis, Stanford University.
- (1990) Learning in embedded systems
- Kaelbling, L.¹

22
- 0032114497
- Learning communication strategies in multiagent systems
- Kinney, M. & Tsatsoulis, C. (1998). Learning communication strategies in multiagent systems. Applied intelligence, 9(1), 71-91.
- (1998) Applied intelligence , vol.9 , Issue.1 , pp. 71-91
- Kinney, M.¹ Tsatsoulis, C.²

23
- 70349465256
- Meta-level control of coordination protocols
- Kuwabara, K. (1996). Meta-level control of coordination protocols. In Proceedings of the third international conference on multi-agent systems (ICMAS96). pp. 104-111.
- (1996) Proceedings of the third international conference on multi-agent systems (ICMAS96) , pp. 104-111
- Kuwabara, K.¹

24
- 84949729204
- Reinforcement learning for algorithm selection
- Lagoudakis, M. & Littman, M. (2000). Reinforcement learning for algorithm selection. In Proceedings of the seventeenth national conference on artificial intelligence (AAAI-2000), pp. 1081.
- (2000) Proceedings of the seventeenth national conference on artificial intelligence (AAAI-2000) , pp. 1081
- Lagoudakis, M.¹ Littman, M.²

25
- 0343048727
- A distributed reinforcement learning scheme for network routing
- Technical Report CS-93-165
- Littman, M. & Boyan, J. (1993). A distributed reinforcement learning scheme for network routing. Technical Report CS-93-165.
- (1993)
- Littman, M.¹ Boyan, J.²

26
- 85149834820
- Markov games as a framework for multi-agent reinforcement learning
- Morgan Kaufmann: New Brunswick, NJ
- Littman, M. (1994). Markov games as a framework for multi-agent reinforcement learning. In Proceedings of the eleventh international conference on machine learning (ML-94) (pp. 157-163). Morgan Kaufmann: New Brunswick, NJ.
- (1994) Proceedings of the eleventh international conference on machine learning (ML-94) , pp. 157-163
- Littman, M.¹

27
- 0030647149
- Reinforcement learning in the multi-robot domain
- Mataric, M. (1997). Reinforcement learning in the multi-robot domain. Autonomous Robots, 4(1), 73-83.
- (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 73-83
- Mataric, M.¹

28
- 0029220270
- The Challenges of real-time AI
- Musliner, D. J., Hendler, J. A., Agrawala, A. K., Durfee, E. H., Strosnider, J. K. & Paul, C. J. (1995). The Challenges of real-time AI. IEEE Computer, 28(1), 58-66.
- (1995) IEEE Computer , vol.28 , Issue.1 , pp. 58-66
- Musliner, D.J.¹ Hendler, J.A.² Agrawala, A.K.³ Durfee, E.H.⁴ Strosnider, J.K.⁵ Paul, C.J.⁶

29
- 34548068694
- Plan execution in mission-critical domains
- Musliner, D. (1996). Plan execution in mission-critical domains. In Working notes of the AAAI fall symposium on plan execution-problems and issues.
- (1996) Working notes of the AAAI fall symposium on plan execution-problems and issues
- Musliner, D.¹

30
- 0028557848
- Increasing the efficiency of simulated annealing search by learning to recognize (un)promising runs
- Nakakuki, Y & Sadeh, N. (1994). Increasing the efficiency of simulated annealing search by learning to recognize (un)promising runs. In Proceedings of the twelfth national conference on artificial intelligence (AAAI-94), pp. 1316-1322.
- (1994) Proceedings of the twelfth national conference on artificial intelligence (AAAI-94) , pp. 1316-1322
- Nakakuki, Y.¹ Sadeh, N.²

31
- 0001070375
- Reinforcement learning with hierarchies of machines
- M. I. Jordan, M. J. Kearns, & S. A. Solla Eds, The MIT Press
- Parr, R. & Russell, S. (1997). Reinforcement learning with hierarchies of machines. In M. I. Jordan, M. J. Kearns, & S. A. Solla (Eds.), Advances in neural information processing systems, vol. 10, The MIT Press.
- (1997) Advances in neural information processing systems , vol.10
- Parr, R.¹ Russell, S.²

32
- 0003998452
- New York: John. Wiley and Sons, Inc
- Puterman, M. L. (1994). Markov decision processes - discrete stochastic dynamic programming. Games as a Framework for Multi-Agent Reinforcement Learning. New York: John. Wiley and Sons, Inc.
- (1994) Markov decision processes - discrete stochastic dynamic programming. Games as a Framework for Multi-Agent Reinforcement Learning
- Puterman, M.L.¹

33
- 3543071954
- PhD thesis, University of Massachusetts at Amherst, Amherst, Massachusetts
- Raja, A. (2003). Meta-level control in multi-agent systems. PhD thesis, University of Massachusetts at Amherst, Amherst, Massachusetts.
- (2003) Meta-level control in multi-agent systems
- Raja, A.¹

34
- 33747185917
- Leveraging problem classification in online meta-cognition
- Stanford
- Raja, A., Alexander, G. & Mappillai, V. (2006). Leveraging problem classification in online meta-cognition. In Proceedings of AAAI 2006 spring symposium on distributed plan and schedule management (pp. 97-104) Stanford.
- (2006) Proceedings of AAAI 2006 spring symposium on distributed plan and schedule management , pp. 97-104
- Raja, A.¹ Alexander, G.² Mappillai, V.³

35
- 0033700750
- Toward Robust Agent Control in Open Environments
- Barcelona, Catalonia, Spain: ACM Press
- Raja, A., Lesser, V., & Wagner, T. (2000). Toward Robust Agent Control in Open Environments. In Proceedings of the fourth international conference on autonomous agents (pp. 84-91). Barcelona, Catalonia, Spain: ACM Press.
- (2000) Proceedings of the fourth international conference on autonomous agents , pp. 84-91
- Raja, A.¹ Lesser, V.² Wagner, T.³

36
- 0003584577
- Prentice Hall
- Russell, S. & Norvig, P. (1995). Artificial intelligence: A modern approach. Prentice Hall.
- (1995) Artificial intelligence: A modern approach
- Russell, S.¹ Norvig, P.²

37
- 0003711660
- MIT press
- Russell, S. & Wefald, E. (1992). Do the right thing: studies in limited rationality. MIT press.
- (1992) Do the right thing: Studies in limited rationality
- Russell, S.¹ Wefald, E.²

38
- 0003086707
- Provably bounded optimal agents
- Russell, S. J., Subramanian, D. & Parr, R. (1993). Provably bounded optimal agents. In Proceedings of the thirteenth international joint conference on artificial intelligence (IJCAI-93), pp. 338-344.
- (1993) Proceedings of the thirteenth international joint conference on artificial intelligence (IJCAI-93) , pp. 338-344
- Russell, S.J.¹ Subramanian, D.² Parr, R.³

39
- 0038884984
- Principles of metareasoning
- Russell, S. & Wefald, E. (1989). Principles of metareasoning. In Proceedings of the first international conference on principles of knowledge representation and reasoning, pp. 400-411.
- (1989) Proceedings of the first international conference on principles of knowledge representation and reasoning , pp. 400-411
- Russell, S.¹ Wefald, E.²

40
- 0030050933
- Multiagent reinforcement learning in the iterated prisoner's dilemma
- Sandholm, T. & Crites, R. (1995). Multiagent reinforcement learning in the iterated prisoner's dilemma. Biosystems Journal, 37, 147-166.
- (1995) Biosystems Journal , vol.37 , pp. 147-166
- Sandholm, T.¹ Crites, R.²

41
- 0010271689
- The control of reasoning in resource-bounded agents
- Schut, M. & Wooldridge, M. (2001). The control of reasoning in resource-bounded agents. Knowledge Engineering Review, 16(3), 215-240.
- (2001) Knowledge Engineering Review , vol.16 , Issue.3 , pp. 215-240
- Schut, M.¹ Wooldridge, M.²

42
- 0028555752
- Learning to coordinate without sharing information
- Seattle, WA
- Sen, S., Sekaran, M. & Hale, J. (1994). Learning to coordinate without sharing information. In Proceedings of the twelfth national conference on artificial intelligence, (pp. 426-431), Seattle, WA.
- (1994) Proceedings of the twelfth national conference on artificial intelligence , pp. 426-431
- Sen, S.¹ Sekaran, M.² Hale, J.³

43
- 0002298346
- From substantive to procedural rationality
- Simon, H, Latsis, S. J, Ed, Cambridge University Press, pp
- Simon, H., Latsis, S. J. (Ed.) (1976). From substantive to procedural rationality. In Method and Appraisal in Economic. Cambridge University Press, pp. 129-148.
- (1976) Method and Appraisal in Economic , pp. 129-148

44
- 0016556911
- Optimal problem solving search: All-or-none solutions
- Simon, H. & Kadane, J. (1974). Optimal problem solving search: All-or-none solutions. Artificial Intelligence, 6, 235-247.
- (1974) Artificial Intelligence , vol.6 , pp. 235-247
- Simon, H.¹ Kadane, J.²

45
- 0004077471
- Cambridge, MA: The MIT Press
- Simon, H. (1982). Models of bounded rationality, vol. 1. Cambridge, MA: The MIT Press.
- (1982) Models of bounded rationality , vol.1
- Simon, H.¹

46
- 85158142417
- Empirical evaluation of a reinforcement learning spoken dialogue system
- Singh, S., Kearns, M., Litman, D. & Walker, M. (2000). Empirical evaluation of a reinforcement learning spoken dialogue system. In Proceedings of the seventeenth national conference on artificial intelligence, pp. 645-651.
- (2000) Proceedings of the seventeenth national conference on artificial intelligence , pp. 645-651
- Singh, S.¹ Kearns, M.² Litman, D.³ Walker, M.⁴

47
- 0042049192
- On-line learning of coordination plans
- Sugawara, T. & Lesser, V. (1993). On-line learning of coordination plans. In Proceedings of the twelth international workshop on distributed artificial intelligence, pp. 335-345, 371-377.
- (1993) Proceedings of the twelth international workshop on distributed artificial intelligence
- Sugawara, T.¹ Lesser, V.²

48
- 0004007508
- MIT Press
- Sutton, R. & Barto, A. (1998). Reinforcement learning. MIT Press.
- (1998) Reinforcement learning
- Sutton, R.¹ Barto, A.²

49
- 0003617454
- PhD thesis, University of Massachusetts Amherst
- Sutton, R. (1984). Temporal credit assignment in reinforcement learning. PhD thesis, University of Massachusetts Amherst.
- (1984) Temporal credit assignment in reinforcement learning
- Sutton, R.¹

50
- 33847202724
- Learning to predict by the method of temporal differences
- Sutton, R. (1988). Learning to predict by the method of temporal differences. Machine Learning, 3(1), 9-44.
- (1988) Machine Learning , vol.3 , Issue.1 , pp. 9-44
- Sutton, R.¹

51
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Sutton, R., Precup, D. & Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2), 181-211.
- (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
- Sutton, R.¹ Precup, D.² Singh, S.³

52
- 85152198941
- Multi-agent reinforcement learning: Independent vs. cooperative agents
- Tan, M. (1993). Multi-agent reinforcement learning: Independent vs. cooperative agents. In Proceedings of the tenth international conference on machine learning, pp. 330-337.
- (1993) Proceedings of the tenth international conference on machine learning , pp. 330-337
- Tan, M.¹

53
- 84937406535
- An agent infrastructure to build and evaluate multi-agent systems: The java agent framework and multi-agent system simulator
- Wagner and Rana Eds, Springer
- Vincent, R., Horling, B. & Lesser, V. (2001). An agent infrastructure to build and evaluate multi-agent systems: The java agent framework and multi-agent system simulator. In Wagner and Rana (Eds.), Lecture notes in artificial intelligence: infrastructure for agents, multi-agent systems, and scalable multi-agent systems, vol. 1887. Springer.
- (2001) Lecture notes in artificial intelligence: Infrastructure for agents, multi-agent systems, and scalable multi-agent systems , vol.1887
- Vincent, R.¹ Horling, B.² Lesser, V.³

54
- 0032116206
- International Journal of Approximate Reasoning, Special Issue on Scheduling, 19
- 1-2, A version also available as UMASS CS TR-97-59
- Wagner, T., Garvey, A. & Lesser, V. (1998). Criteria-directed heuristic task scheduling. International Journal of Approximate Reasoning, Special Issue on Scheduling, 19(1-2), 91-118. A version also available as UMASS CS TR-97-59.
- (1998) , pp. 91-118
- Wagner, T.¹ Garvey, A.² Lesser, V.³

55
- 0004049893
- PhD thesis, Cambridge, England
- Watkins, C. (1989). Learning from delayed rewards. PhD thesis, Cambridge, England.
- (1989) Learning from delayed rewards
- Watkins, C.¹

56
- 0002557085
- Learning to perceive and act by trial and error
- Whitehead, S. D. & Ballard, D. H. (1991). Learning to perceive and act by trial and error. Machine Learning, 7(1), 45-83.
- (1991) Machine Learning , vol.7 , Issue.1 , pp. 45-83
- Whitehead, S.D.¹ Ballard, D.H.²

57
- 0036355482
- Multi-linked negotiation in multi-agent system
- Zhang, X. & Lesser, V. (2002). Multi-linked negotiation in multi-agent system. In Proceedings of the first international joint conference on autonomous agents and multiagent systems (AAMAS 2002), pp. 1207-1214.
- (2002) Proceedings of the first international joint conference on autonomous agents and multiagent systems (AAMAS 2002) , pp. 1207-1214
- Zhang, X.¹ Lesser, V.²

58
- 84880663704
- Reactive control of dynamic progressive processing
- Zilberstein, S. & Mouaddib, A. (1999). Reactive control of dynamic progressive processing. In Proceedings of the sixth international joint conference on artificial intelligence, IJCAI, pp. 1268-1273.
- (1999) Proceedings of the sixth international joint conference on artificial intelligence, IJCAI , pp. 1268-1273
- Zilberstein, S.¹ Mouaddib, A.²

59
- 0026986776
- Zilberstein, S. & Russell, S. J. (.1.992). Efficient resource-bounded reasoning in AT-RALPH. In James Hendler, (Edn.), Proceedings of the first international conference of artificial intelligence planning systems (AIPS 92) (pp. 260-268) Morgan Kaufmann: College Park, Maryland, USA.
- Zilberstein, S. & Russell, S. J. (.1.992). Efficient resource-bounded reasoning in AT-RALPH. In James Hendler, (Edn.), Proceedings of the first international conference of artificial intelligence planning systems (AIPS 92) (pp. 260-268) Morgan Kaufmann: College Park, Maryland, USA.

60
- 0030122886
- Optimal composition of real-time systems
- Zilberstein, S. & Russell, S. J. (1996). Optimal composition of real-time systems. Artificial Intelligence, 82(1-2), 181-213.
- (1996) Artificial Intelligence , vol.82 , Issue.1-2 , pp. 181-213
- Zilberstein, S.¹ Russell, S.J.²

61
- 1942484421
- Online convex programming and generalized infinitesimal gradient ascent
- Zinkevich, M. (2003). Online convex programming and generalized infinitesimal gradient ascent. International Conference in Machine Learning, pp. 929-936.
- (2003) International Conference in Machine Learning , pp. 929-936
- Zinkevich, M.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.