SCOPUS 정보 검색 플랫폼

Autonomous Agents and Multi-Agent Systems

Volumn 6, Issue 3, 2003, Pages 287-316

Maximizing Reward in a Non-Stationary Mobile Robot Environment

(2) Goldberg, Dani a Matarić, Maja J b

a CARNEGIE MELLON UNIVERSITY (United States)

b University of Southern California ^* (United States)

Author keywords

Collection tasks; Mobile robots; Non stationary environments; On line modeling; Reward maximization

Indexed keywords

COLLECTION TASKS; NON-STATIONARY ENVIRONMENTS; ON-LINE MODELING;

ALGORITHMS; AUTONOMOUS AGENTS; MARKOV PROCESSES; MOBILE ROBOTS; STATE ESTIMATION; STOCHASTIC CONTROL SYSTEMS;

MULTI AGENT SYSTEMS;

EID: 0347132453 PISSN: 13872532 EISSN: None Source Type: Journal
DOI: 10.1023/A:1022935725296 Document Type: Article

Times cited : (21)

References (47)

1
- 0003869829
- The MIT Press: Cambridge, Massachusetts
- R. C. Arkin, Behavior-Based Robotics, The MIT Press: Cambridge, Massachusetts, 1998.
- (1998) Behavior-based Robotics
- Arkin, R.C.¹

2
- 0002270731
- Reactive and Telerobotic Control in Multi-Agent Systems
- Brighton, England
- R. C. Arkin and K. S. Ali, "Reactive and Telerobotic Control in Multi-Agent Systems," in From Animals to Animats 3: Proceedings of the Third International Conference on Simulation of Adpalive Behavior, Brighton, England, pp. 473-478, 1994.
- (1994) From Animals to Animats 3: Proceedings of the Third International Conference on Simulation of Adpalive Behavior , pp. 473-478
- Arkin, R.C.¹ Ali, K.S.²

3
- 0027153766
- Communication of Behavioral State in Multi-Agent Retrieval Tasks
- Atlanta
- R. C. Arkin, T. Balch, and E. Nitz, "Communication of Behavioral State in Multi-Agent Retrieval Tasks," in IEEE International Conference on Robotics and Automation, Atlanta, pp. 588-594, 1993.
- (1993) IEEE International Conference on Robotics and Automation , pp. 588-594
- Arkin, R.C.¹ Balch, T.² Nitz, E.³

4
- 0034204860
- Hierarchical Social Entropy: An Information Theoretic Measure of Robot Group Diversity
- T. Balch, "Hierarchical Social Entropy: An Information Theoretic Measure of Robot Group Diversity," Autonomous Robots, vol. 8, pp. 209-237, 2000.
- (2000) Autonomous Robots , vol.8 , pp. 209-237
- Balch, T.¹

5
- 0029209779
- A Dynamical Systems Perspective on Agent-Environment Interaction
- R. D. Beer, "A Dynamical Systems Perspective on Agent-Environment Interaction," Artificial Intelligence, vol. 72, pp. 173-215, 1993.
- (1993) Artificial Intelligence , vol.72 , pp. 173-215
- Beer, R.D.¹

6
- 0003645589
- The Behavior Language: User's Guide
- MIT AI Laboratory
- R. A. Brooks, "The Behavior Language: User's Guide," Technical Report AIM-1227, MIT AI Laboratory 1990.
- (1990) Technical Report , vol.AIM-1227
- Brooks, R.A.¹

7
- 0002439923
- Intelligence without Reason
- Sydney, Australia
- R. A. Brooks, "Intelligence Without Reason," in Proceedings of the Twelfth International Joint Conference on Artificial Intelligence (IJCAI-91), Sydney, Australia, pp. 569-590, 1991.
- (1991) Proceedings of the Twelfth International Joint Conference on Artificial Intelligence (IJCAI-91) , pp. 569-590
- Brooks, R.A.¹

8
- 84915880659
- Approximate Binomial Confidence Limits
- C. R. Blyth, "Approximate Binomial Confidence Limits," Journal of the American Statistical Association, vol. 81, no. 395, pp. 843-855, 1986.
- (1986) Journal of the American Statistical Association , vol.81 , Issue.395 , pp. 843-855
- Blyth, C.R.¹

9
- 0030674885
- Cooperative Mobile Robotics: Antecedents and Directions
- Y. U. Cao, A. S. Fukunaga, and A. B. Kahng, "Cooperative Mobile Robotics: Antecedents and Directions," Autonomous Robots, vol. 4, pp. 1-23, 1997.
- (1997) Autonomous Robots , vol.4 , pp. 1-23
- Cao, Y.U.¹ Fukunaga, A.S.² Kahng, A.B.³

10
- 0030388815
- Acting under Uncertainty: Discrete Bayesian Models for Mobile-Robot Navigation
- A. R. Cassandra, L. P. Kaelbling, and J. A. Kurien, "Acting under Uncertainty: Discrete Bayesian Models for Mobile-Robot Navigation," in Proceedings of the 1996 IEEE/RSJ International Conference on Intelligent Robots and Systems, vol. 2, pp. 963-972, 1996.
- (1996) Proceedings of the 1996 IEEE/RSJ International Conference on Intelligent Robots and Systems , vol.2 , pp. 963-972
- Cassandra, A.R.¹ Kaelbling, L.P.² Kurien, J.A.³

11
- 0000804982
- Estimating the Current Mean of a Normal Distribution which is Subjected to Changes in Time
- H. Chernoff and S. Zacks, "Estimating the Current Mean of a Normal Distribution which is Subjected to Changes in Time," Annals of Mathematical Statistics, vol. 35, no. 3, pp. 999-1018, 1964.
- (1964) Annals of Mathematical Statistics , vol.35 , Issue.3 , pp. 999-1018
- Chernoff, H.¹ Zacks, S.²

12
- 0004116989
- McGraw-Hill Book Company
- T. H. Cormen, C. E. Leiserson, and R. L. Rivest, Introduction to Algorithms, McGraw-Hill Book Company, 1990.
- (1990) Introduction to Algorithms
- Cormen, T.H.¹ Leiserson, C.E.² Rivest, R.L.³

13
- 0026998041
- Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach
- W. Swartout (ed.), San Jose, CA
- L. Chrisman, "Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach," in W. Swartout (ed.), Proceedings of the 10th National Conference on Artificial Intelligence, San Jose, CA, pp. 183-188, 1992.
- (1992) Proceedings of the 10th National Conference on Artificial Intelligence , pp. 183-188
- Chrisman, L.¹

14
- 0032187823
- Territorial Multi-Robot Task Division
- M. S. Fontán and M. J. Matarić, "Territorial Multi-Robot Task Division," IEEE Transactions on Robotics and Automation, vol. 14, no. 5, pp. 815-822, 1998.
- (1998) IEEE Transactions on Robotics and Automation , vol.14 , Issue.5 , pp. 815-822
- Fontán, M.S.¹ Matarić, M.J.²

15
- 0003901362
- Prentice Hall
- J. E. Freund, Mathematical Statistics, Fifth Edition, Prentice Hall, 1992.
- (1992) Mathematical Statistics, Fifth Edition
- Freund, J.E.¹

16
- 0003109864
- On Three-Layer Architectures
- D. Kortenkamp, R. P. Bonnasso, and R. Murphy (eds.), AAAI Press
- E. Gat, "On Three-Layer Architectures," in D. Kortenkamp, R. P. Bonnasso, and R. Murphy (eds.), Artificial Intelligence and Mobile Robotics: Case Studies of Successful Robot Systems, AAAI Press, pp. 195-210, 1998.
- (1998) Artificial Intelligence and Mobile Robotics: Case Studies of Successful Robot Systems , pp. 195-210
- Gat, E.¹

17
- 0345819123
- Ph.D. thesis, University of Southern California
- D. Goldberg, "Evaluating the Dynamics of Agent Environment Interaction," Ph.D. thesis, University of Southern California, 2001.
- (2001) Evaluating the Dynamics of Agent Environment Interaction
- Goldberg, D.¹

18
- 0032662270
- Coordinating Mobile Robot Group Behavior Using a Model of Interaction Dynamics
- Seattle, Washington
- D. Goldberg and M. J. Matarić, "Coordinating Mobile Robot Group Behavior Using a Model of Interaction Dynamics," in Proceedings, The Third International Conference on Autonomous Agents (Agents '99), Seattle, Washington, pp. 100-107, 1999.
- (1999) Proceedings, the Third International Conference on Autonomous Agents (Agents '99) , pp. 100-107
- Goldberg, D.¹ Matarić, M.J.²

19
- 0035558020
- Detecting Regime Changes with a Mobile Robot using Multiple Models
- Maui, Hawaii
- D. Goldberg and M. J. Matarić, "Detecting Regime Changes with a Mobile Robot using Multiple Models," in Proceedings of the 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems, Maui, Hawaii, pp. 619-624, 2001.
- (2001) Proceedings of the 2001 IEEE/RSJ International Conference on Intelligent Robots and Systems , pp. 619-624
- Goldberg, D.¹ Matarić, M.J.²

20
- 0346450316
- Automated Robot Behavior Recognition Applied to Robotic Soccer
- Snowbird, Utah
- K. Han and M. Veloso, "Automated Robot Behavior Recognition Applied to Robotic Soccer," in Robotics Research: the Ninth International Symposium, Snowbird, Utah, pp. 249-256, 2000.
- (2000) Robotics Research: The Ninth International Symposium , pp. 249-256
- Han, K.¹ Veloso, M.²

21
- 0000028318
- Meiosis Networks
- D. S. Touretzky (ed.), San Mateo, CA
- S. J. Hanson, "Meiosis Networks," in D. S. Touretzky (ed.), Advances in Neural Information Processing Systems 2, San Mateo, CA, pp. 533-541, 1990.
- (1990) Advances in Neural Information Processing Systems 2 , pp. 533-541
- Hanson, S.J.¹

22
- 0032073263
- Planning and Acting in Partially Observable Stochastic Domains
- L. P. Kaelbling, M. L. Littman, and A. R. Cassandra, "Planning and Acting in Partially Observable Stochastic Domains," Artificial Intelligence, vol. 101, no. 1-2, pp. 99-134, 1998.
- (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
- Kaelbling, L.P.¹ Littman, M.L.² Cassandra, A.R.³

23
- 0003415457
- D. Van Nostrand Company, Inc.
- J. G. Kemeny, J. L. Snell, and A. W. Knapp, Denumerable Markov Chains, D. Van Nostrand Company, Inc., 1966.
- (1966) Denumerable Markov Chains
- Kemeny, J.G.¹ Snell, J.L.² Knapp, A.W.³

24
- 0029697923
- Unsupervised Learning of Probabilistic Models for Robot Navigation
- S. Koenig and R. G. Simmons, "Unsupervised Learning of Probabilistic Models for Robot Navigation," in Proceedings of the IEEE International Conference on Robotics and Automation, vol. 3, pp. 2301-2308, 1996.
- (1996) Proceedings of the IEEE International Conference on Robotics and Automation , vol.3 , pp. 2301-2308
- Koenig, S.¹ Simmons, R.G.²

25
- 0010946014
- Discrete Event Systems for Autonomous Mobile Agents
- Zakopane, Poland
- J. Košecká and R. Bajcsy, "Discrete Event Systems for Autonomous Mobile Agents," in Proceedings of the First Workshop on Intelligent Robotic Systems, Zakopane, Poland, pp. 21-31, 1993.
- (1993) Proceedings of the First Workshop on Intelligent Robotic Systems , pp. 21-31
- Košecká, J.¹ Bajcsy, R.²

26
- 0001300222
- Sequential Changepoint Detection in Quality Control and Dynamical Systems
- T. L. Lai, "Sequential Changepoint Detection in Quality Control and Dynamical Systems," Journal of the Royal Statistical Society, Series B (Methodological), vol. 57, no. 4, pp. 613-658, 1995.
- (1995) Journal of the Royal Statistical Society, Series B (Methodological) , vol.57 , Issue.4 , pp. 613-658
- Lai, T.L.¹

27
- 0005849701
- 2, and F Tail Probabilities
- 2, and F Tail Probabilities," Journal of the American Statistical Association, vol. 73, no. 362, pp. 274-283, 1978.
- (1978) Journal of the American Statistical Association , vol.73 , Issue.362 , pp. 274-283
- Ling, R.F.¹

28
- 0002228390
- Optimizing Production Manufacturing using Reinforcement Learning
- Sanibel Island, Florida
- S. Mahadevan and G. Theocharous, "Optimizing Production Manufacturing using Reinforcement Learning," in Proceedings of the Eleventh International FLAIRS Conference, Sanibel Island, Florida, pp. 372-377, 1998.
- (1998) Proceedings of the Eleventh International FLAIRS Conference , pp. 372-377
- Mahadevan, S.¹ Theocharous, G.²

29
- 0001932539
- Behavior-Based Systems: Key Properties and Implications
- Nice, France
- M. J. Matarić, "Behavior-Based Systems: Key Properties and Implications," in IEEE International Conference on Robotics and Automation, Workshop on Architectures for Intelligent Control Systems, Nice, France, pp. 46-54, 1992.
- (1992) IEEE International Conference on Robotics and Automation, Workshop on Architectures for Intelligent Control Systems , pp. 46-54
- Matarić, M.J.¹

30
- 0029537980
- Issues and Approaches in the Design of Collective Autonomous Agents
- M. J. Matarić, "Issues and Approaches in the Design of Collective Autonomous Agents," Robotics and Autonomous Systems, vol. 16, no. 2-4, pp. 321-331, 1995.
- (1995) Robotics and Autonomous Systems , vol.16 , Issue.2-4 , pp. 321-331
- Matarić, M.J.¹

31
- 0031504223
- Behavior-Based Control: Examples from Navigation, Learning, and Group Behavior
- M. J. Matarić, "Behavior-Based Control: Examples from Navigation, Learning, and Group Behavior," Journal of Experimental and Theoretical Artificial Intelligence, vol. 9, no. 2-3, pp. 323-336, 1997.
- (1997) Journal of Experimental and Theoretical Artificial Intelligence , vol.9 , Issue.2-3 , pp. 323-336
- Matarić, M.J.¹

32
- 0003932121
- Ph.D. thesis, University of Rochester, Department of Computer Science
- A. K. McCallum, "Reinforcement Learning with Selective Perception and Hidden State," Ph.D. thesis, University of Rochester, Department of Computer Science, 1996.
- (1996) Reinforcement Learning with Selective Perception and Hidden State
- McCallum, A.K.¹

33
- 0032117054
- Learning from History for Behavior-Based Mobile Robots in Nonstationary Conditions
- F. Michaud and M. J. Matarić, "Learning from History for Behavior-Based Mobile Robots in Nonstationary Conditions," Autonomous Robots, vol. 5, no. 3-4, pp. 335-354, 1998.
- (1998) Autonomous Robots , vol.5 , Issue.3-4 , pp. 335-354
- Michaud, F.¹ Matarić, M.J.²

34
- 0004255908
- The McGraw-Hill Companies, Inc.
- T. M. Mitchell, Machine Learning, The McGraw-Hill Companies, Inc., 1997.
- (1997) Machine Learning
- Mitchell, T.M.¹

35
- 0003577275
- Ph.D. thesis, MIT
- L. E. Parker, "Heterogeneous Multi-Robot Cooperation," Ph.D. thesis, MIT, 1994.
- (1994) Heterogeneous Multi-robot Cooperation
- Parker, L.E.¹

36
- 84947375150
- A Normal Approximation for Binomial, F, Beta, and Other Common, Related Tail Probabilities, 1
- D. B. Peizer and J. W. Pratt, "A Normal Approximation for Binomial, F, Beta, and Other Common, Related Tail Probabilities, 1," Journal of the American Statistical Association, vol. 63, no. 324, pp. 1416-1456, 1968.
- (1968) Journal of the American Statistical Association , vol.63 , Issue.324 , pp. 1416-1456
- Peizer, D.B.¹ Pratt, J.W.²

37
- 0004008584
- Ph.D. thesis, Institute of Electronic Systems, Alborg University, Denmark
- P. Pirjanian, "Multiple Objective Action Selection & Behavior Fusion using Voting," Ph.D. thesis, Institute of Electronic Systems, Alborg University, Denmark, 1998.
- (1998) Multiple Objective Action Selection & Behavior Fusion Using Voting
- Pirjanian, P.¹

38
- 0004161838
- Cambridge University Press
- W. H. Press, S. A. Teukolsky, W. T. Vetterling, and B. P. Flannery, Numerical Recipes in C: The Art of Scientific Computing, Cambridge University Press, 1992.
- (1992) Numerical Recipes in C: The Art of Scientific Computing
- Press, W.H.¹ Teukolsky, S.A.² Vetterling, W.T.³ Flannery, B.P.⁴

39
- 0024610919
- A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition
- L. R. Rabiner, "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition," Proceedings of the IEEE, vol. 77, no. 2, pp. 257-285, 1989.
- (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-285
- Rabiner, L.R.¹

40
- 0003823587
- Prentice-Hall, Inc.
- F. S. Roberts, Discrete Mathematical Models: With Applications to Social, Biological, and Environmental Problems, Prentice-Hall, Inc., 1976.
- (1976) Discrete Mathematical Models: With Applications to Social, Biological, and Environmental Problems
- Roberts, F.S.¹

41
- 0003644137
- New York: Dover Publications, Inc.
- S. M. Ross, Applied Probability Models with Optimization Applications, New York: Dover Publications, Inc., 1992.
- (1992) Applied Probability Models with Optimization Applications
- Ross, S.M.¹

42
- 0037826642
- Learning Hidden Markov Model Structure for Information Extraction
- Orlando, FL
- K. Seymore, A. McCallum, and R. Rosenfeld, "Learning Hidden Markov Model Structure for Information Extraction," in Proceedings of the Sixteenth National Conference on Artificial Intelligence: Workshop on Machine Learning for Information Extraction, Orlando, FL, pp. 37-42, 1999.
- (1999) Proceedings of the Sixteenth National Conference on Artificial Intelligence: Workshop on Machine Learning for Information Extraction , pp. 37-42
- Seymore, K.¹ McCallum, A.² Rosenfeld, R.³

43
- 0346450313
- What the Dynamics of Adaptive Behavior and Cognition Might Look Like in Agent-Environment Interaction Systems
- Mt. Verita, Switzerland
- T. Smithers, "What the Dynamics of Adaptive Behavior and Cognition Might Look Like in Agent-Environment Interaction Systems," in Practice and Future of Autonomous Agents, Mt. Verita, Switzerland, 1995.
- (1995) Practice and Future of Autonomous Agents
- Smithers, T.¹

44
- 0002297358
- Hidden Markov Model Induction by Bayesian Model Merging
- S. J. Hanson, J. D. Cowan, and C. L. Giles (eds.)
- A. Stolcke and S. Omohundro, "Hidden Markov Model Induction by Bayesian Model Merging," in S. J. Hanson, J. D. Cowan, and C. L. Giles (eds.), Advances in Neural Information Processing Systems, vol. 5. pp. 11-18, 1993.
- (1993) Advances in Neural Information Processing Systems , vol.5 , pp. 11-18
- Stolcke, A.¹ Omohundro, S.²

45
- 0033170372
- Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning
- R. S. Sutton, D. Precup, and S. Singh, "Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning," Artificial Intelligence, vol. 112, pp. 181-211, 1999.
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

46
- 0003504917
- Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes
- Bled, Slovenia
- G. Wang and S. Mahadevan, "Hierarchical Optimization of Policy-Coupled Semi-Markov Decision Processes," in Proceedings of the Sixteenth International Conference on Machine Learning, Bled, Slovenia, pp. 464-473, 1999.
- (1999) Proceedings of the Sixteenth International Conference on Machine Learning , pp. 464-473
- Wang, G.¹ Mahadevan, S.²

47
- 0029250080
- Reinforcement Learning of Non-Markov Decision Processes
- S. D. Whitehead and L.-J. Lin, "Reinforcement Learning of Non-Markov Decision Processes," Artificial Intelligence, vol. 73, no. 1-2, pp. 271-306, 1995.
- (1995) Artificial Intelligence , vol.73 , Issue.1-2 , pp. 271-306
- Whitehead, S.D.¹ Lin, L.-J.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.