메뉴 건너뛰기




Volumn 27, Issue 2, 2009, Pages 93-103

A Bayesian exploration-exploitation approach for optimal online sensing and planning with a visually guided mobile robot

Author keywords

Active learning; Active SLAM; Active vision; Attention and gaze planning; Bayesian optimization; Dynamic sensor networks; Model predictive control; Online path planning; Policy search; Reinforcement learning; Sequential experimental design

Indexed keywords

ACTIVE LEARNING; ACTIVE SLAM; ACTIVE VISION; ATTENTION AND GAZE PLANNING; BAYESIAN OPTIMIZATION; DYNAMIC SENSOR NETWORKS; ONLINE PATH PLANNING; POLICY SEARCH; SEQUENTIAL EXPERIMENTAL DESIGN;

EID: 70349325516     PISSN: 09295593     EISSN: None     Source Type: Journal    
DOI: 10.1007/s10514-009-9130-2     Document Type: Article
Times cited : (223)

References (55)
  • 2
    • 0035482203 scopus 로고    scopus 로고
    • Emergent synthesis
    • DOI 10.1016/S0954-1810(01)00028-0, PII S0954181001000280
    • J. Baxter P. L. Bartlett 2001 Infinite-horizon policy-gradient estimation Journal of Artificial Intelligence Research 15 4 319 350 0994.68119 10.1016/S0954-1810(01)00028-0 1884081 (Pubitemid 33076651)
    • (2001) Artificial Intelligence in Engineering , vol.15 , Issue.4 , pp. 319-320
    • Ueda, K.1
  • 3
    • 0003555229 scopus 로고    scopus 로고
    • Navigation and tracking applications PhD thesis, Linköping University.
    • Bergman, N. (1999). Recursive Bayesian estimation: navigation and tracking applications. PhD thesis, Linköping University.
    • (1999) Recursive Bayesian Estimation
    • Bergman, N.1
  • 6
    • 44649152265 scopus 로고    scopus 로고
    • Observability analysis and active control for airborne SLAM
    • DOI 10.1109/TAES.2008.4517003
    • M. Bryson S. Sukkarieh 2008 Observability analysis and active control for airborne SLAM IEEE Transaction on Aerospace Electronic Systems 44 1 261 280 10.1109/TAES.2008.4517003 (Pubitemid 351772586)
    • (2008) IEEE Transactions on Aerospace and Electronic Systems , vol.44 , Issue.1 , pp. 261-280
    • Bryson, M.1    Sukkarieh, S.2
  • 7
    • 84972528615 scopus 로고
    • Bayesian experimental design: A review
    • 0955.62617 10.1214/ss/1177009939 1390519
    • K. Chaloner I. Verdinelli 1995 Bayesian experimental design: a review Journal of Statistical Science 10 273 304 0955.62617 10.1214/ss/1177009939 1390519
    • (1995) Journal of Statistical Science , vol.10 , pp. 273-304
    • Chaloner, K.1    Verdinelli, I.2
  • 8
    • 33745178432 scopus 로고    scopus 로고
    • Simultaneous localization and mapping: Part I
    • DOI 10.1109/MRA.2006.1638022
    • H. Durrant-Whyte T. Bailey 2006 Simultaneous localisation and mapping (SLAM): part I the essential algorithms Robotics and Automation Magazine 13 99 110 10.1109/MRA.2006.1638022 (Pubitemid 43898863)
    • (2006) IEEE Robotics and Automation Magazine , vol.13 , Issue.2 , pp. 99-108
    • Durrant-Whyte, H.1    Bailey, T.2
  • 9
    • 70349304137 scopus 로고    scopus 로고
    • Finkel D.
    • Center for Research in Scientific Computation, North Carolina State University
    • Finkel, D. (2003). DIRECT optimization algorithm user guide. Center for Research in Scientific Computation, North Carolina State University.
    • (2003) DIRECT Optimization Algorithm User Guide
  • 10
    • 0005621612 scopus 로고    scopus 로고
    • PhD thesis, Department of Mathematics, North Carolina State University, Raleigh, North Carolina
    • Gablonsky, J. (2001). Modification of the DIRECT algorithm. PhD thesis, Department of Mathematics, North Carolina State University, Raleigh, North Carolina.
    • (2001) Modification of the DIRECT Algorithm
    • Gablonsky, J.1
  • 13
    • 70349330652 scopus 로고    scopus 로고
    • A novel method for learning policies from variable constraint data
    • (Special issue on Robot Learning, Part B) (this issue)
    • Howard, M., Klanke, S., Gienger, M., Goerick, C., & Vijayakumar, S. (2009). A novel method for learning policies from variable constraint data. Autonomous Robots, 27 (Special issue on Robot Learning, Part B) (this issue).
    • (2009) Autonomous Robots , vol.27
    • Howard, M.1    Klanke, S.2    Gienger, M.3    Goerick, C.4    Vijayakumar, S.5
  • 14
    • 0035577808 scopus 로고    scopus 로고
    • A Taxonomy of Global Optimization Methods Based on Response Surfaces
    • DOI 10.1023/A:1012771025575
    • D. Jones 2001 A taxonomy of global optimization methods based on response surfaces Journal of Global Optimization 21 345 383 01845262 10.1023/A:1012771025575 (Pubitemid 33672374)
    • (2001) Journal of Global Optimization , vol.21 , Issue.4 , pp. 345-383
    • Jones, D.R.1
  • 15
    • 0027678534 scopus 로고
    • Lipschitzian optimization without the Lipschitz constant
    • 0796.49032 10.1007/BF00941892 1246501
    • D. Jones C. Perttunen B. Stuckman 1993 Lipschitzian optimization without the Lipschitz constant Journal of Optimization Theory and Applications 79 1 157 181 0796.49032 10.1007/BF00941892 1246501
    • (1993) Journal of Optimization Theory and Applications , vol.79 , Issue.1 , pp. 157-181
    • Jones, D.1    Perttunen, C.2    Stuckman, B.3
  • 16
    • 0000561424 scopus 로고    scopus 로고
    • Efficient global optimization of expensive black-box functions
    • 0917.90270 10.1023/A:1008306431147 1673460
    • D. Jones M. Schonlau W. Welch 1998 Efficient global optimization of expensive black-box functions Journal of Global Optimization 13 4 455 492 0917.90270 10.1023/A:1008306431147 1673460
    • (1998) Journal of Global Optimization , vol.13 , Issue.4 , pp. 455-492
    • Jones, D.1    Schonlau, M.2    Welch, W.3
  • 17
    • 85054124781 scopus 로고    scopus 로고
    • Marker tracking and hmd calibration for a video-based augmentedreality conferencing system
    • on augmented reality 1999
    • Kato, H., & Billinghurst, M. (1999). Marker tracking and hmd calibration for a video-based augmentedreality conferencing system. In Proc. of the 2nd IEEE and ACM int. work. on augmented reality (pp. 85-94) 1999.
    • (1999) Proc. of the 2nd IEEE and ACM Int. Work , pp. 85-94
    • Kato, H.1    Billinghurst, M.2
  • 18
    • 38649114040 scopus 로고    scopus 로고
    • Trajectory optimization using reinforcement learning for map exploration
    • DOI 10.1177/0278364907087426
    • T. Kollar N. Roy 2008 Trajectory optimization using reinforcement learning for map exploration International Journal of Robotics Research 27 2 175 197 10.1177/0278364907087426 (Pubitemid 351169713)
    • (2008) International Journal of Robotics Research , vol.27 , Issue.2 , pp. 175-196
    • Kollar, T.1    Roy, N.2
  • 19
    • 4043069840 scopus 로고    scopus 로고
    • On actor-critic algorithms
    • 1049.93095 10.1137/S0363012901385691 2044789
    • V. Konda J. Tsitsiklis 2003 On actor-critic algorithms SIAM Journal on Control and Optimization 42 4 1143 1166 1049.93095 10.1137/S0363012901385691 2044789
    • (2003) SIAM Journal on Control and Optimization , vol.42 , Issue.4 , pp. 1143-1166
    • Konda, V.1    Tsitsiklis, J.2
  • 21
    • 84998710865 scopus 로고
    • A new method of locating the maximum of an arbitrary multipeak curve in the presence of noise
    • H. Kushner 1964 A new method of locating the maximum of an arbitrary multipeak curve in the presence of noise Journal of Basic Engineering 86 97 106
    • (1964) Journal of Basic Engineering , vol.86 , pp. 97-106
    • Kushner, H.1
  • 23
    • 70349318390 scopus 로고    scopus 로고
    • PhD thesis, Dept. of Computer Science, University of Alberta
    • Lizotte, D. (2008). Practical Bayesian optimization. PhD thesis, Dept. of Computer Science, University of Alberta.
    • (2008) Practical Bayesian Optimization
    • Lizotte, D.1
  • 25
    • 0001977314 scopus 로고    scopus 로고
    • Bayesian algorithms for one-dimensional global optimization
    • 0871.90087 10.1023/A:1008294716304 1432103
    • M. Locatelli 1997 Bayesian algorithms for one-dimensional global optimization Journal of Global Optimization 10 57 76 0871.90087 10.1023/A:1008294716304 1432103
    • (1997) Journal of Global Optimization , vol.10 , pp. 57-76
    • Locatelli, M.1
  • 28
    • 70349337643 scopus 로고    scopus 로고
    • Analysis of particle methods for simultaneous robot localization and mapping and a new algorithm: Marginal-SLAM
    • on robotics automation, 2006
    • Martinez-Cantin, R., de Freitas, N., & Castellanos, J. (2006). Analysis of particle methods for simultaneous robot localization and mapping and a new algorithm: Marginal-SLAM. In Proc. of the IEEE int. conf. on robotics & automation, 2006.
    • (2006) Proc. of the IEEE Int. Conf
    • Martinez-Cantin, R.1    De Freitas, N.2    Castellanos, J.3
  • 33
    • 0342813049 scopus 로고
    • The application of Bayesian methods for seeking the extremum
    • L. Dixon G. Szego (eds). Elsevier Amsterdam
    • Mockus, J., Tiesis, V., & Zilinskas, A. (1978). The application of Bayesian methods for seeking the extremum. In L. Dixon & G. Szego (Eds.), Towards global optimisation (Vol. 2, pp. 117-129). Amsterdam: Elsevier.
    • (1978) Towards Global Optimisation , pp. 117-129
    • Mockus, J.1    Tiesis, V.2    Zilinskas, A.3
  • 37
    • 40649106649 scopus 로고    scopus 로고
    • Natural actor critic
    • 10.1016/j.neucom.2007.11.026
    • J. Peters S. Schaal 2008 Natural actor critic Neurocomputing 71 7-9 1180 1190 10.1016/j.neucom.2007.11.026
    • (2008) Neurocomputing , vol.71 , Issue.79 , pp. 1180-1190
    • Peters, J.1    Schaal, S.2
  • 38
    • 44949241322 scopus 로고    scopus 로고
    • Reinforcement learning of motor skills with policy gradients
    • 10.1016/j.neunet.2008.02.003
    • J. Peters S. Schaal 2008 Reinforcement learning of motor skills with policy gradients Neural Networks 21 4 682 697 10.1016/j.neunet.2008.02.003
    • (2008) Neural Networks , vol.21 , Issue.4 , pp. 682-697
    • Peters, J.1    Schaal, S.2
  • 40
    • 67650996818 scopus 로고    scopus 로고
    • Reinforcement learning for robot soccer
    • 10.1007/s10514-009-9120-4 (Special issue on Robot Learning, Part A)
    • M. Riedmiller T. Gabel R. Hafner S. Lange 2009 Reinforcement learning for robot soccer Autonomous Robots 27 1 55 73 10.1007/s10514-009-9120-4 (Special issue on Robot Learning, Part A)
    • (2009) Autonomous Robots , vol.27 , Issue.1 , pp. 55-73
    • Riedmiller, M.1    Gabel, T.2    Hafner, R.3    Lange, S.4
  • 42
    • 0347131360 scopus 로고    scopus 로고
    • N. Flournoy, W. Rosenberger, W. Wong (Eds.) New developments and applications in experimental design Institute of Mathematical Statistics
    • Schonlau, M., Welch, W., & Jones, D. (1998). Global versus local search in constrained optimization of computer models. In N. Flournoy, W. Rosenberger, W. Wong (Eds.) New developments and applications in experimental design (Vol. 34, pp. 11-25). Institute of Mathematical Statistics.
    • (1998) Global Versus Local Search in Constrained Optimization of Computer Models , vol.34 , pp. 11-25
    • Schonlau, M.1    Welch, W.2    Jones, D.3
  • 47
    • 0015658957 scopus 로고
    • The optimal control of partially observable Markov processes over a finite horizon
    • 0275.93059 10.1287/opre.21.5.1071
    • R. Smallwood E. Sondik 1973 The optimal control of partially observable Markov processes over a finite horizon Operations Research 21 1071 1088 0275.93059 10.1287/opre.21.5.1071
    • (1973) Operations Research , vol.21 , pp. 1071-1088
    • Smallwood, R.1    Sondik, E.2
  • 48
    • 84959547465 scopus 로고    scopus 로고
    • Information gain-based exploration using Rao-Blackwellized particle filters
    • Cambridge, USA, 2005
    • Stachniss, C., Grisetti, G., & Burgard, W. (2005). Information gain-based exploration using Rao-Blackwellized particle filters. In Proc. of robotics: science and systems, Cambridge, USA, 2005.
    • (2005) Proc. of Robotics: Science and Systems
    • Stachniss, C.1    Grisetti, G.2    Burgard, W.3
  • 49
    • 70349310295 scopus 로고    scopus 로고
    • Finding and transferring policies using stored behaviors
    • (Special issue on Robot Learning, Part B) (this issue)
    • Stolle, M., & Atkeson, C. (2009). Finding and transferring policies using stored behaviors. Autonomous Robots, 27 (Special issue on Robot Learning, Part B) (this issue).
    • (2009) Autonomous Robots , vol.27
    • Stolle, M.1    Atkeson, C.2
  • 50
    • 0032653584 scopus 로고    scopus 로고
    • Optimal observer trajectory in bearings-only tracking for manoeuvering sources
    • 10.1049/ip-rsn:19990262
    • O. Tremois J. Le Cadre 1999 Optimal observer trajectory in bearings-only tracking for manoeuvering sources IEE Proceeding Radar, Sonar Navigation 146 1 31 39 10.1049/ip-rsn:19990262
    • (1999) IEE Proceeding Radar, Sonar Navigation , vol.146 , Issue.1 , pp. 31-39
    • Tremois, O.1    Le Cadre, J.2
  • 51
    • 70349310279 scopus 로고    scopus 로고
    • On the convergence of the expected improvement algorithm
    • Vazquez, E., & Bect, J. (2008). On the convergence of the expected improvement algorithm. arXivorg arXiv:0712.3744v2 [stat.CO], http://arxiv.org/abs/0712.3744v2.
    • (2008) ArXivorg
    • Vazquez, E.1    Bect, J.2
  • 53
    • 70349327392 scopus 로고    scopus 로고
    • Learning model-free robot control using a Monte Carlo em algorithm
    • (Special issue on Robot Learning, Part B) (this issue)
    • Vlassis, N., Toussaint, G. K. M., & Piperidis, S. (2009). Learning model-free robot control using a Monte Carlo em algorithm. Autonomous Robots, 27 (Special issue on Robot Learning, Part B) (this issue).
    • (2009) Autonomous Robots , vol.27
    • Vlassis, N.1    Toussaint, G.K.M.2    Piperidis, S.3
  • 54
    • 0000337576 scopus 로고
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • 0772.68076
    • R. Williams 1992 Simple statistical gradient-following algorithms for connectionist reinforcement learning Machine Learning 8 3 229 256 0772.68076
    • (1992) Machine Learning , vol.8 , Issue.3 , pp. 229-256
    • Williams, R.1
  • 55
    • 0036783418 scopus 로고    scopus 로고
    • Global optimization based on a statistical model and simplicial partitioning
    • DOI 10.1016/S0898-1221(02)00206-7, PII S0898122102002067, Global Optimization, Control, and Games, IV
    • A. Zilinskas J. Zilinskas 2002 Global optimization based on a statistical model and simplicial partitioning Computers and Mathematics with Applications 44 957 967 1047.90036 10.1016/S0898-1221(02)00206-7 1927740 (Pubitemid 34979858)
    • (2002) Computers and Mathematics with Applications , vol.44 , Issue.7 , pp. 957-967
    • Ilinskas, A.1    Ilinskas, J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.