



Volume 17, Issue 1, 2012, Pages 86-97

Robust quantum-inspired reinforcement learning for robot navigation

Author keywords

Probabilistic action selection; quantum amplitude amplification; quantum-inspired reinforcement learning (QiRL); robot navigation

Indexed keywords

ACTION SELECTION; AUTONOMOUS MOBILE ROBOT; INITIAL STATE; LEARNING RATES; MARKOVIAN; NAVIGATION CONTROLS; QUANTUM AMPLITUDE AMPLIFICATION; QUANTUM MEASUREMENT; QUANTUM-INSPIRED REINFORCEMENT LEARNING (QIRL); REINFORCEMENT STRATEGIES; ROBOT NAVIGATION; SIMULATED EXPERIMENTS; STATE TRANSITIONS

EID: 84855964266     PISSN: 10834435     EISSN: None     Source Type: Journal    
DOI: 10.1109/TMECH.2010.2090896     Document Type: Article
Times cited: 80

References (46)
  • 3
    • 33745780982
    • Quantum robot: Structure, algorithms and applications
    • D. Y. Dong, C. L. Chen, C. B. Zhang, and Z. H. Chen, "Quantum robot: Structure, algorithms and applications," Robotica, vol. 24, pp. 513-521, 2006.
    • (2006) Robotica , vol.24 , pp. 513-521
    • Dong, D.Y.1    Chen, C.L.2    Zhang, C.B.3    Chen, Z.H.4
  • 4
    • 0029272121
    • On quantum neural computing
    • S. Kak, "On quantum neural computing," Inf. Sci., vol. 83, pp. 143-160, 1995.
    • (1995) Inf. Sci. , vol.83 , pp. 143-160
    • Kak, S.1
  • 5
    • 0034300183
    • Quantum artificial neural network architectures and components
    • A. Narayanan and T. Menneer, "Quantum artificial neural network architectures and components," Inf. Sci., vol. 128, pp. 231-255, 2000.
    • (2000) Inf. Sci. , vol.128 , pp. 231-255
    • Narayanan, A.1    Menneer, T.2
  • 6
    • 0036685590
    • Parallelization of a fuzzy control algorithm using quantum computation
    • DOI 10.1109/TFUZZ.2002.800690, PII 1011092002800690
    • G. G. Rigatos and S. G. Tzafestas, "Parallelization of a fuzzy control algorithm using quantum computation," IEEE Trans. Fuzzy Syst., vol. 10, no. 4, pp. 451-460, Aug. 2002. (Pubitemid 34950050)
    • (2002) IEEE Transactions on Fuzzy Systems , vol.10 , Issue.4 , pp. 451-460
    • Rigatos, G.G.1    Tzafestas, S.G.2
  • 8
    • 0036945847
    • Quantum-inspired evolutionary algorithm for a class of combinatorial optimization
    • Dec.
    • K. H. Han and J. H. Kim, "Quantum-inspired evolutionary algorithm for a class of combinatorial optimization," IEEE Trans. Evol. Comput., vol. 6, no. 6, pp. 580-593, Dec. 2002.
    • (2002) IEEE Trans. Evol. Comput. , vol.6 , Issue.6 , pp. 580-593
    • Han, K.H.1    Kim, J.H.2
  • 10
    • 49049097102
    • Incoherent control of quantum systems with wavefunction controllable subspaces via quantum reinforcement learning
    • Aug.
    • D. Dong, C. Chen, T. J. Tarn, A. Pechen, and H. Rabitz, "Incoherent control of quantum systems with wavefunction controllable subspaces via quantum reinforcement learning," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 38, no. 4, pp. 957-962, Aug. 2008.
    • (2008) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.38 , Issue.4 , pp. 957-962
    • Dong, D.1    Chen, C.2    Tarn, T.J.3    Pechen, A.4    Rabitz, H.5
  • 11
    • 54849403345
    • Incoherent control of locally controllable quantum systems
    • D. Dong, C. Zhang, H. Rabitz, A. Pechen, and T. J. Tarn, "Incoherent control of locally controllable quantum systems," J. Chem. Phys., vol. 129, no. 15, pp. 154103-1-154103-10, 2008.
    • (2008) J. Chem. Phys. , vol.129 , Issue.15 , pp. 154103-1-154103-10
    • Dong, D.1    Zhang, C.2    Rabitz, H.3    Pechen, A.4    Tarn, T.J.5
  • 13
    • 37749016469
    • Reinforcement strategy using quantum amplitude amplification for robot learning
    • Zhangjiajie, China
    • D. Dong, C. Chen, and H. Li, "Reinforcement strategy using quantum amplitude amplification for robot learning," in Proc. 26th Chinese Control Conf., Zhangjiajie, China, 2007, vol. 6, pp. 571-575.
    • (2007) Proc. 26th Chinese Control Conf. , vol.6 , pp. 571-575
    • Dong, D.1    Chen, C.2    Li, H.3
  • 14
    • 79551551978
    • Quantum-inspired reinforcement learning for decision-making of Markovian state transition
    • Hangzhou, China, Nov. 15-16
    • D. Dong and C. Chen, "Quantum-inspired reinforcement learning for decision-making of Markovian state transition," presented at the 2010 Int. Conf. Intell. Syst. Knowl. Eng., Hangzhou, China, Nov. 15-16, 2010.
    • (2010) Presented at the 2010 Int. Conf. Intell. Syst. Knowl. Eng.
    • Dong, D.1    Chen, C.2
  • 15
    • 46349107239
    • Hybrid control for robot navigation: A hierarchical Q-learning algorithm
    • Jun.
    • C. Chen, H. Li, and D. Dong, "Hybrid control for robot navigation: A hierarchical Q-learning algorithm," IEEE Robot. Autom. Mag., vol. 15, no. 2, pp. 37-47, Jun. 2008.
    • (2008) IEEE Robot. Autom. Mag. , vol.15 , Issue.2 , pp. 37-47
    • Chen, C.1    Li, H.2    Dong, D.3
  • 16
    • 67651177970
    • Autonomous mobile robot navigation using passive RFID in indoor environment
    • Jul.
    • S. Park and S. Hashimoto, "Autonomous mobile robot navigation using passive RFID in indoor environment," IEEE Trans. Ind. Electron., vol. 56, no. 7, pp. 2366-2373, Jul. 2009.
    • (2009) IEEE Trans. Ind. Electron. , vol.56 , Issue.7 , pp. 2366-2373
    • Park, S.1    Hashimoto, S.2
  • 17
    • 61649087246
    • Behavioral control through evolutionary neurocontrollers for autonomous mobile robot navigation
    • J. A. Fernandez-Leon, G. G. Acosta, and M. A. Mayosky, "Behavioral control through evolutionary neurocontrollers for autonomous mobile robot navigation," Robot. Auton. Syst., vol. 57, no. 4, pp. 411-419, 2009.
    • (2009) Robot. Auton. Syst. , vol.57 , Issue.4 , pp. 411-419
    • Fernandez-Leon, J.A.1    Acosta, G.G.2    Mayosky, M.A.3
  • 18
    • 58149122860
    • The use of aerial images and GPS for mobile robot waypoint navigation
    • Dec.
    • S. Shair, J. H. Chandler, V. J. Gonzalez-Villela et al., "The use of aerial images and GPS for mobile robot waypoint navigation," IEEE/ASME Trans. Mechatronics, vol. 13, no. 6, pp. 692-699, Dec. 2008.
    • (2008) IEEE/ASME Trans. Mechatronics , vol.13 , Issue.6 , pp. 692-699
    • Shair, S.1    Chandler, J.H.2    Gonzalez-Villela, V.J.3
  • 19
    • 67349184465
    • The sensor-based random graph method for cooperative robot exploration
    • Apr.
    • A. Franchi, L. Freda, G. Oriolo et al., "The sensor-based random graph method for cooperative robot exploration," IEEE/ASME Trans. Mechatronics, vol. 14, no. 2, pp. 163-175, Apr. 2009.
    • (2009) IEEE/ASME Trans. Mechatronics , vol.14 , Issue.2 , pp. 163-175
    • Franchi, A.1    Freda, L.2    Oriolo, G.3
  • 20
    • 77649323815
    • Grey system based reactive navigation of mobile robots using reinforcement learning
    • C. Chen and D. Dong, "Grey system based reactive navigation of mobile robots using reinforcement learning," Int. J. Innov. Comput., Inf. Control, vol. 6, no. 2, pp. 789-800, 2010.
    • (2010) Int. J. Innov. Comput., Inf. Control , vol.6 , Issue.2 , pp. 789-800
    • Chen, C.1    Dong, D.2
  • 21
    • 54749139321
    • Optimal path planning for mobile robot navigation
    • Aug.
    • G. E. Jan, K. Y. Chang, and I. Parberry, "Optimal path planning for mobile robot navigation," IEEE/ASME Trans. Mechatronics, vol. 13, no. 4, pp. 451-460, Aug. 2008.
    • (2008) IEEE/ASME Trans. Mechatronics , vol.13 , Issue.4 , pp. 451-460
    • Jan, G.E.1    Chang, K.Y.2    Parberry, I.3
  • 23
    • 39449084814
    • Robot navigation in very cluttered environments by preference-based fuzzy behaviors
    • DOI 10.1016/j.robot.2007.07.006, PII S0921889007000978
    • M. F. Selekwa, D. D. Dunlap, D. Shi, and E. G. Collins, "Robot navigation in very cluttered environments by preference-based fuzzy behaviors," Robot. Auton. Syst., vol. 56, no. 3, pp. 231-246, 2008. (Pubitemid 351273574)
    • (2008) Robotics and Autonomous Systems , vol.56 , Issue.3 , pp. 231-246
    • Selekwa, M.F.1    Dunlap, D.D.2    Shi, D.3    Collins Jr., E.G.4
  • 24
    • 0033750229
    • Navigation of mobile robot: Open questions
    • M. A. Salichs and L. Moreno, "Navigation of mobile robot: Open questions," Robotica, vol. 18, pp. 227-234, 2000.
    • (2000) Robotica , vol.18 , pp. 227-234
    • Salichs, M.A.1    Moreno, L.2
  • 25
    • 34548040193
    • Autonomous and fast robot learning through motivation
    • DOI 10.1016/j.robot.2007.05.005, PII S0921889007000668
    • M. Rodriguez, R. Iglesias, C. V. Regueiro, J. Correa, and S. Barro, "Autonomous and fast robot learning through motivation," Robot. Auton. Syst., vol. 55, pp. 735-740, 2007. (Pubitemid 47285710)
    • (2007) Robotics and Autonomous Systems , vol.55 , Issue.9 , pp. 735-740
    • Rodriguez, M.1    Iglesias, R.2    Regueiro, C.V.3    Correa, J.4    Barro, S.5
  • 26
    • 67650957592
    • Learning to search: Functional gradient techniques for imitation learning
    • N. D. Ratliff, D. Silver, and J. A. Bagnell, "Learning to search: Functional gradient techniques for imitation learning," Auton. Robots, vol. 27, pp. 25-53, 2009.
    • (2009) Auton. Robots , vol.27 , pp. 25-53
    • Ratliff, N.D.1    Silver, D.2    Bagnell, J.A.3
  • 27
    • 0742289960
    • A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control
    • T. Kondo and K. Ito, "A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control," Robot. Auton. Syst., vol. 46, pp. 111-124, 2004.
    • (2004) Robot. Auton. Syst. , vol.46 , pp. 111-124
    • Kondo, T.1    Ito, K.2
  • 29
    • 33847202724
    • Learning to predict by the methods of temporal differences
    • R. Sutton, "Learning to predict by the methods of temporal differences," Mach. Learn., vol. 3, pp. 9-44, 1988.
    • (1988) Mach. Learn. , vol.3 , pp. 9-44
    • Sutton, R.1
  • 30
  • 32
    • 77950596388
    • Sequential Q-learning with Kalman filtering for multirobot cooperative transportation
    • Apr.
    • Y. Wang and C. W. de Silva, "Sequential Q-learning with Kalman filtering for multirobot cooperative transportation," IEEE/ASME Trans. Mechatronics, vol. 15, no. 2, pp. 261-268, Apr. 2010.
    • (2010) IEEE/ASME Trans. Mechatronics , vol.15 , Issue.2 , pp. 261-268
    • Wang, Y.1    De Silva, C.W.2
  • 33
    • 0002278788
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • T. G. Dietterich, "Hierarchical reinforcement learning with the MAXQ value function decomposition," J. Artif. Intell. Res., vol. 13, pp. 227-303, 2000. (Pubitemid 33682087)
    • (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 34
    • 2942574444
    • Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning
    • Jun.
    • M. J. Er and C. Deng, "Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 34, no. 3, pp. 1478-1489, Jun. 2004.
    • (2004) IEEE Trans. Syst., Man, Cybern. B, Cybern. , vol.34 , Issue.3 , pp. 1478-1489
    • Er, M.J.1    Deng, C.2
  • 35
    • 33646714634
    • Evolutionary function approximation for reinforcement learning
    • S. Whiteson and P. Stone, "Evolutionary function approximation for reinforcement learning," J. Mach. Learn. Res., vol. 7, pp. 877-917, 2006. (Pubitemid 43736560)
    • (2006) Journal of Machine Learning Research , vol.7 , pp. 877-917
    • Whiteson, S.1    Stone, P.2
  • 36
    • 27844582247
    • A novel approach to multiagent reinforcement learning: Utilizing OLAP mining in the learning process
    • DOI 10.1109/TSMCC.2004.843188
    • M. Kaya and R. Alhajj, "A novel approach to multiagent reinforcement learning: Utilizing OLAP mining in the learning process," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 35, no. 4, pp. 582-590, Nov. 2005. (Pubitemid 41638177)
    • (2005) IEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews , vol.35 , Issue.4 , pp. 582-590
    • Kaya, M.1    Alhajj, R.2
  • 37
    • 0036465263
    • Fuzzy reinforcement learning control for compliance tasks of robotic manipulators
    • DOI 10.1109/3477.979965, PII S1083441902004521
    • S. G. Tzafestas and G. G. Rigatos, "Fuzzy reinforcement learning control for compliance tasks of robotic manipulators," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 32, no. 1, pp. 107-113, Feb. 2002. (Pubitemid 34228677)
    • (2002) IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics , vol.32 , Issue.1 , pp. 107-113
    • Tzafestas, S.G.1    Rigatos, G.G.2
  • 38
    • 4243573416
    • Arbitrary phases in quantum amplitude amplification
    • P. Høyer, "Arbitrary phases in quantum amplitude amplification," Phys. Rev. A, vol. 62, pp. 052304-1-052304-5, 2000.
    • (2000) Phys. Rev. A , vol.62 , pp. 052304-1-052304-5
    • Høyer, P.1
  • 39
    • 0030661550
    • An exact quantum polynomial-time algorithm for Simon's problem
    • Los Alamitos, CA
    • G. Brassard and P. Høyer, "An exact quantum polynomial-time algorithm for Simon's problem," in Proc. 5th Israeli Symp. Theory Comput. Syst., Los Alamitos, CA, 1997, pp. 12-23.
    • (1997) Proc. 5th Israeli Symp. Theory Comput. Syst. , pp. 12-23
    • Brassard, G.1    Høyer, P.2
  • 40
    • 4243807288
    • Quantum mechanics helps in searching for a needle in a haystack
    • L. K. Grover, "Quantum mechanics helps in searching for a needle in a haystack," Phys. Rev. Lett., vol. 79, pp. 325-327, 1997.
    • (1997) Phys. Rev. Lett. , vol.79 , pp. 325-327
    • Grover, L.K.1
  • 41
    • 0035435437
    • Grover algorithm with zero theoretical failure rate
    • G. L. Long, "Grover algorithm with zero theoretical failure rate," Phys. Rev. A, vol. 64, pp. 022307-1-022307-4, 2001.
    • (2001) Phys. Rev. A , vol.64 , pp. 022307-1-022307-4
    • Long, G.L.1
  • 42
    • 72149089621
    • Sliding mode control of quantum systems
    • D. Dong and I. R. Petersen, "Sliding mode control of quantum systems," New J. Phys., vol. 11, pp. 105033-1-105033-18, 2009.
    • (2009) New J. Phys. , vol.11 , pp. 105033-1-105033-18
    • Dong, D.1    Petersen, I.R.2
  • 43
    • 4243643113
    • Quantum computers can search rapidly by using almost any transformation
    • L. K. Grover, "Quantum computers can search rapidly by using almost any transformation," Phys. Rev. Lett., vol. 80, pp. 4329-4332, 1998. (Pubitemid 128621914)
    • (1998) Physical Review Letters , vol.80 , Issue.19 , pp. 4329-4332
    • Grover, L.K.1
  • 45
    • 3943106166
    • Evolutionary path planning for autonomous underwater vehicles in a variable ocean
    • Apr.
    • A. Alvarez, A. Caiti, and R. Onken, "Evolutionary path planning for autonomous underwater vehicles in a variable ocean," IEEE J. Ocean. Eng., vol. 29, no. 2, pp. 418-429, Apr. 2004.
    • (2004) IEEE J. Ocean. Eng. , vol.29 , Issue.2 , pp. 418-429
    • Alvarez, A.1    Caiti, A.2    Onken, R.3
  • 46
    • 0031998630
    • Learning metric-topological maps for indoor mobile robot navigation
    • PII S0004370297000787
    • S. Thrun, "Learning metric-topological maps for indoor mobile robot navigation," Artif. Intell., vol. 99, pp. 21-71, 1998. (Pubitemid 128378368)
    • (1998) Artificial Intelligence , vol.99 , Issue.1 , pp. 21-71
    • Thrun, S.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.