메뉴 건너뛰기




Volumn 6, Issue 3, 2003, Pages 287-316

Maximizing Reward in a Non-Stationary Mobile Robot Environment

Author keywords

Collection tasks; Mobile robots; Non stationary environments; On line modeling; Reward maximization

Indexed keywords

COLLECTION TASKS; NON-STATIONARY ENVIRONMENTS; ON-LINE MODELING;

EID: 0347132453     PISSN: 13872532     EISSN: None     Source Type: Journal    
DOI: 10.1023/A:1022935725296     Document Type: Article
Times cited : (21)

References (47)
  • 1
    • 0003869829 scopus 로고    scopus 로고
    • The MIT Press: Cambridge, Massachusetts
    • R. C. Arkin, Behavior-Based Robotics, The MIT Press: Cambridge, Massachusetts, 1998.
    • (1998) Behavior-based Robotics
    • Arkin, R.C.1
  • 4
    • 0034204860 scopus 로고    scopus 로고
    • Hierarchical Social Entropy: An Information Theoretic Measure of Robot Group Diversity
    • T. Balch, "Hierarchical Social Entropy: An Information Theoretic Measure of Robot Group Diversity," Autonomous Robots, vol. 8, pp. 209-237, 2000.
    • (2000) Autonomous Robots , vol.8 , pp. 209-237
    • Balch, T.1
  • 5
    • 0029209779 scopus 로고
    • A Dynamical Systems Perspective on Agent-Environment Interaction
    • R. D. Beer, "A Dynamical Systems Perspective on Agent-Environment Interaction," Artificial Intelligence, vol. 72, pp. 173-215, 1993.
    • (1993) Artificial Intelligence , vol.72 , pp. 173-215
    • Beer, R.D.1
  • 6
    • 0003645589 scopus 로고
    • The Behavior Language: User's Guide
    • MIT AI Laboratory
    • R. A. Brooks, "The Behavior Language: User's Guide," Technical Report AIM-1227, MIT AI Laboratory 1990.
    • (1990) Technical Report , vol.AIM-1227
    • Brooks, R.A.1
  • 9
    • 0030674885 scopus 로고    scopus 로고
    • Cooperative Mobile Robotics: Antecedents and Directions
    • Y. U. Cao, A. S. Fukunaga, and A. B. Kahng, "Cooperative Mobile Robotics: Antecedents and Directions," Autonomous Robots, vol. 4, pp. 1-23, 1997.
    • (1997) Autonomous Robots , vol.4 , pp. 1-23
    • Cao, Y.U.1    Fukunaga, A.S.2    Kahng, A.B.3
  • 11
    • 0000804982 scopus 로고
    • Estimating the Current Mean of a Normal Distribution which is Subjected to Changes in Time
    • H. Chernoff and S. Zacks, "Estimating the Current Mean of a Normal Distribution which is Subjected to Changes in Time," Annals of Mathematical Statistics, vol. 35, no. 3, pp. 999-1018, 1964.
    • (1964) Annals of Mathematical Statistics , vol.35 , Issue.3 , pp. 999-1018
    • Chernoff, H.1    Zacks, S.2
  • 13
    • 0026998041 scopus 로고
    • Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach
    • W. Swartout (ed.), San Jose, CA
    • L. Chrisman, "Reinforcement Learning with Perceptual Aliasing: The Perceptual Distinctions Approach," in W. Swartout (ed.), Proceedings of the 10th National Conference on Artificial Intelligence, San Jose, CA, pp. 183-188, 1992.
    • (1992) Proceedings of the 10th National Conference on Artificial Intelligence , pp. 183-188
    • Chrisman, L.1
  • 20
    • 0346450316 scopus 로고    scopus 로고
    • Automated Robot Behavior Recognition Applied to Robotic Soccer
    • Snowbird, Utah
    • K. Han and M. Veloso, "Automated Robot Behavior Recognition Applied to Robotic Soccer," in Robotics Research: the Ninth International Symposium, Snowbird, Utah, pp. 249-256, 2000.
    • (2000) Robotics Research: The Ninth International Symposium , pp. 249-256
    • Han, K.1    Veloso, M.2
  • 22
    • 0032073263 scopus 로고    scopus 로고
    • Planning and Acting in Partially Observable Stochastic Domains
    • L. P. Kaelbling, M. L. Littman, and A. R. Cassandra, "Planning and Acting in Partially Observable Stochastic Domains," Artificial Intelligence, vol. 101, no. 1-2, pp. 99-134, 1998.
    • (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 26
    • 0001300222 scopus 로고
    • Sequential Changepoint Detection in Quality Control and Dynamical Systems
    • T. L. Lai, "Sequential Changepoint Detection in Quality Control and Dynamical Systems," Journal of the Royal Statistical Society, Series B (Methodological), vol. 57, no. 4, pp. 613-658, 1995.
    • (1995) Journal of the Royal Statistical Society, Series B (Methodological) , vol.57 , Issue.4 , pp. 613-658
    • Lai, T.L.1
  • 30
    • 0029537980 scopus 로고
    • Issues and Approaches in the Design of Collective Autonomous Agents
    • M. J. Matarić, "Issues and Approaches in the Design of Collective Autonomous Agents," Robotics and Autonomous Systems, vol. 16, no. 2-4, pp. 321-331, 1995.
    • (1995) Robotics and Autonomous Systems , vol.16 , Issue.2-4 , pp. 321-331
    • Matarić, M.J.1
  • 31
    • 0031504223 scopus 로고    scopus 로고
    • Behavior-Based Control: Examples from Navigation, Learning, and Group Behavior
    • M. J. Matarić, "Behavior-Based Control: Examples from Navigation, Learning, and Group Behavior," Journal of Experimental and Theoretical Artificial Intelligence, vol. 9, no. 2-3, pp. 323-336, 1997.
    • (1997) Journal of Experimental and Theoretical Artificial Intelligence , vol.9 , Issue.2-3 , pp. 323-336
    • Matarić, M.J.1
  • 33
    • 0032117054 scopus 로고    scopus 로고
    • Learning from History for Behavior-Based Mobile Robots in Nonstationary Conditions
    • F. Michaud and M. J. Matarić, "Learning from History for Behavior-Based Mobile Robots in Nonstationary Conditions," Autonomous Robots, vol. 5, no. 3-4, pp. 335-354, 1998.
    • (1998) Autonomous Robots , vol.5 , Issue.3-4 , pp. 335-354
    • Michaud, F.1    Matarić, M.J.2
  • 34
  • 36
    • 84947375150 scopus 로고
    • A Normal Approximation for Binomial, F, Beta, and Other Common, Related Tail Probabilities, 1
    • D. B. Peizer and J. W. Pratt, "A Normal Approximation for Binomial, F, Beta, and Other Common, Related Tail Probabilities, 1," Journal of the American Statistical Association, vol. 63, no. 324, pp. 1416-1456, 1968.
    • (1968) Journal of the American Statistical Association , vol.63 , Issue.324 , pp. 1416-1456
    • Peizer, D.B.1    Pratt, J.W.2
  • 39
    • 0024610919 scopus 로고
    • A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition
    • L. R. Rabiner, "A Tutorial on Hidden Markov Models and Selected Applications in Speech Recognition," Proceedings of the IEEE, vol. 77, no. 2, pp. 257-285, 1989.
    • (1989) Proceedings of the IEEE , vol.77 , Issue.2 , pp. 257-285
    • Rabiner, L.R.1
  • 43
    • 0346450313 scopus 로고
    • What the Dynamics of Adaptive Behavior and Cognition Might Look Like in Agent-Environment Interaction Systems
    • Mt. Verita, Switzerland
    • T. Smithers, "What the Dynamics of Adaptive Behavior and Cognition Might Look Like in Agent-Environment Interaction Systems," in Practice and Future of Autonomous Agents, Mt. Verita, Switzerland, 1995.
    • (1995) Practice and Future of Autonomous Agents
    • Smithers, T.1
  • 44
    • 0002297358 scopus 로고
    • Hidden Markov Model Induction by Bayesian Model Merging
    • S. J. Hanson, J. D. Cowan, and C. L. Giles (eds.)
    • A. Stolcke and S. Omohundro, "Hidden Markov Model Induction by Bayesian Model Merging," in S. J. Hanson, J. D. Cowan, and C. L. Giles (eds.), Advances in Neural Information Processing Systems, vol. 5. pp. 11-18, 1993.
    • (1993) Advances in Neural Information Processing Systems , vol.5 , pp. 11-18
    • Stolcke, A.1    Omohundro, S.2
  • 45
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning
    • R. S. Sutton, D. Precup, and S. Singh, "Between MDPs and Semi-MDPs: A Framework for Temporal Abstraction in Reinforcement Learning," Artificial Intelligence, vol. 112, pp. 181-211, 1999.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 47
    • 0029250080 scopus 로고
    • Reinforcement Learning of Non-Markov Decision Processes
    • S. D. Whitehead and L.-J. Lin, "Reinforcement Learning of Non-Markov Decision Processes," Artificial Intelligence, vol. 73, no. 1-2, pp. 271-306, 1995.
    • (1995) Artificial Intelligence , vol.73 , Issue.1-2 , pp. 271-306
    • Whitehead, S.D.1    Lin, L.-J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.