메뉴 건너뛰기




Volumn 38, Issue 5, 2008, Pages 1207-1220

Quantum reinforcement learning

Author keywords

Collapse; Grover iteration; Probability amplitude; Quantum reinforcement learning (QRL); State superposition

Indexed keywords

ARTIFICIAL INTELLIGENCE; CHLORINE COMPOUNDS; COMPUTATIONAL LINGUISTICS; LEARNING SYSTEMS; PROBABILITY; QUANTUM COMPUTERS; QUANTUM THEORY; RANDOM PROCESSES; REINFORCEMENT; REINFORCEMENT LEARNING; RISK ASSESSMENT;

EID: 49049104480     PISSN: 10834419     EISSN: None     Source Type: Journal    
DOI: 10.1109/TSMCB.2008.925743     Document Type: Article
Times cited : (340)

References (53)
  • 4
    • 33847202724 scopus 로고
    • Learning to predict by the methods of temporal difference
    • Aug
    • R. Sutton, "Learning to predict by the methods of temporal difference," Mach. Learn., vol. 3, no. 1, pp. 9-44, Aug. 1988.
    • (1988) Mach. Learn , vol.3 , Issue.1 , pp. 9-44
    • Sutton, R.1
  • 5
    • 34249833101 scopus 로고
    • Q-learning
    • C. Watkins and P. Dayan, "Q-learning," Mach. Learn., vol. 8, no. 3/4, pp. 279-292, 1992.
    • (1992) Mach. Learn , vol.8 , Issue.3-4 , pp. 279-292
    • Watkins, C.1    Dayan, P.2
  • 6
    • 24644466803 scopus 로고    scopus 로고
    • A fuzzy reinforcement learning approach to power control in wireless transmitters
    • Aug
    • D. Vengerov, N. Bambos, and H. Berenji, "A fuzzy reinforcement learning approach to power control in wireless transmitters," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 35, no. 4, pp. 768-778, Aug. 2005.
    • (2005) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.35 , Issue.4 , pp. 768-778
    • Vengerov, D.1    Bambos, N.2    Berenji, H.3
  • 7
    • 0029277469 scopus 로고
    • A sensor-based navigation for a mobile robot using fuzzy logic and reinforcement learning
    • Mar
    • H. R. Beom and H. S. Cho, "A sensor-based navigation for a mobile robot using fuzzy logic and reinforcement learning," IEEE Trans. Syst., Man, Cybern., vol. 25, no. 3, pp. 464-477, Mar. 1995.
    • (1995) IEEE Trans. Syst., Man, Cybern , vol.25 , Issue.3 , pp. 464-477
    • Beom, H.R.1    Cho, H.S.2
  • 8
    • 0031207586 scopus 로고    scopus 로고
    • Harmonic functions and collision probabilities
    • Aug
    • C. I. Connolly, "Harmonic functions and collision probabilities," Int. J. Rob. Res., vol. 16, no. 4, pp. 497-507, Aug. 1997.
    • (1997) Int. J. Rob. Res , vol.16 , Issue.4 , pp. 497-507
    • Connolly, C.I.1
  • 10
    • 0742289960 scopus 로고    scopus 로고
    • A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control
    • Feb
    • T. Kondo and K. Ito, "A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control," Robot. Auton. Syst., vol. 46, no. 2, pp. 111-124, Feb. 2004.
    • (2004) Robot. Auton. Syst , vol.46 , Issue.2 , pp. 111-124
    • Kondo, T.1    Ito, K.2
  • 11
    • 0031215211 scopus 로고    scopus 로고
    • HQ-learning
    • M. Wiering and J. Schmidhuber, "HQ-learning," Adapt. Behav., vol. 6, no. 2, pp. 219-246, 1997.
    • (1997) Adapt. Behav , vol.6 , Issue.2 , pp. 219-246
    • Wiering, M.1    Schmidhuber, J.2
  • 12
    • 0141988716 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • Oct
    • A. G. Barto and S. Mahanevan, "Recent advances in hierarchical reinforcement learning," Discret. Event Dyn. Syst.: Theory Appl., vol. 13, no. 4, pp. 41-77, Oct. 2003.
    • (2003) Discret. Event Dyn. Syst.: Theory Appl , vol.13 , Issue.4 , pp. 41-77
    • Barto, A.G.1    Mahanevan, S.2
  • 13
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Aug
    • R. Sutton, D. Precup, and S. Singh, "Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning," Artif. Intell., vol. 112, no. 1, pp. 181-211, Aug. 1999.
    • (1999) Artif. Intell , vol.112 , Issue.1 , pp. 181-211
    • Sutton, R.1    Precup, D.2    Singh, S.3
  • 14
    • 0002278788 scopus 로고    scopus 로고
    • Hierarchical reinforcement learning with the MAXQ value function decomposition
    • T. G. Dietterich, "Hierarchical reinforcement learning with the MAXQ value function decomposition," J. Artif. Intell. Res., vol. 13, pp. 227-303, 2000.
    • (2000) J. Artif. Intell. Res , vol.13 , pp. 227-303
    • Dietterich, T.G.1
  • 15
    • 0013498457 scopus 로고    scopus 로고
    • Hierarchical learning and planning in partially observable Markov decision processes,
    • Ph.D. dissertation, Michigan State Univ, East Lansing, MI
    • G. Theocharous, "Hierarchical learning and planning in partially observable Markov decision processes," Ph.D. dissertation, Michigan State Univ., East Lansing, MI, 2002.
    • (2002)
    • Theocharous, G.1
  • 16
    • 0036790898 scopus 로고    scopus 로고
    • Applications of the self-organising map to reinforcement learning
    • Oct
    • A. J. Smith, "Applications of the self-organising map to reinforcement learning,"Neural Netw., vol. 15, no. 8/9, pp. 1107-1124, Oct. 2002.
    • (2002) Neural Netw , vol.15 , Issue.8-9 , pp. 1107-1124
    • Smith, A.J.1
  • 18
    • 0036465263 scopus 로고    scopus 로고
    • Fuzzy reinforcement learning control for compliance tasks of robotic manipulators
    • Feb
    • S. G. Tzafestas and G. G. Rigatos, "Fuzzy reinforcement learning control for compliance tasks of robotic manipulators," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 32, no. 1, pp. 107-113, Feb. 2002.
    • (2002) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.32 , Issue.1 , pp. 107-113
    • Tzafestas, S.G.1    Rigatos, G.G.2
  • 19
    • 2942574444 scopus 로고    scopus 로고
    • Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning
    • Jun
    • M. J. Er and C. Deng, "Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 34, no. 3, pp. 1478-1489, Jun. 2004.
    • (2004) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.34 , Issue.3 , pp. 1478-1489
    • Er, M.J.1    Deng, C.2
  • 20
    • 46349107239 scopus 로고    scopus 로고
    • Hybrid control for Robot Navigation: A hierarchical Q-learning algorithm
    • Jun
    • C. L. Chen, H. X. Li, and D. Y. Dong, "Hybrid control for Robot Navigation: A hierarchical Q-learning algorithm," IEEE Robot. Autom. Mag., vol. 15, no. 2, pp. 37-47, Jun. 2008.
    • (2008) IEEE Robot. Autom. Mag , vol.15 , Issue.2 , pp. 37-47
    • Chen, C.L.1    Li, H.X.2    Dong, D.Y.3
  • 21
    • 33646714634 scopus 로고    scopus 로고
    • Evolutionary function approximation for reinforcement learning
    • Dec
    • S. Whiteson and P. Stone, "Evolutionary function approximation for reinforcement learning," J. Mach. Learn. Res., vol. 7, pp. 877-917, Dec. 2006.
    • (2006) J. Mach. Learn. Res , vol.7 , pp. 877-917
    • Whiteson, S.1    Stone, P.2
  • 22
    • 27844582247 scopus 로고    scopus 로고
    • A novel approach to multiagent reinforcement learning: Utilizing OLAP mining in the learning process
    • Nov
    • M. Kaya and R. Alhajj, "A novel approach to multiagent reinforcement learning: Utilizing OLAP mining in the learning process," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 35, no. 4, pp. 582-590, Nov. 2005.
    • (2005) IEEE Trans. Syst., Man, Cybern. C, Appl. Rev , vol.35 , Issue.4 , pp. 582-590
    • Kaya, M.1    Alhajj, R.2
  • 23
    • 85115374351 scopus 로고
    • Algorithms for quantum computation: Discrete logarithms and factoring
    • P. W. Shor, "Algorithms for quantum computation: Discrete logarithms and factoring," in Proc. 35th Annu. Symp. Found. Comput. Sci., 1994, pp. 124-134.
    • (1994) Proc. 35th Annu. Symp. Found. Comput. Sci , pp. 124-134
    • Shor, P.W.1
  • 24
    • 0030520263 scopus 로고    scopus 로고
    • Quantum computation and Shor's factoring algorithm
    • A. Ekert and R. Jozsa, "Quantum computation and Shor's factoring algorithm," Rev. Mod. Phys., vol. 68, no. 3, pp. 733-753, 1996.
    • (1996) Rev. Mod. Phys , vol.68 , Issue.3 , pp. 733-753
    • Ekert, A.1    Jozsa, R.2
  • 25
    • 0029701737 scopus 로고    scopus 로고
    • A fast quantum mechanical algorithm for database search
    • L. K. Grover, "A fast quantum mechanical algorithm for database search," in Proc. 28th Annu. ACM Symp. Theory Comput., 1996, pp. 212-219.
    • (1996) Proc. 28th Annu. ACM Symp. Theory Comput , pp. 212-219
    • Grover, L.K.1
  • 26
    • 4243807288 scopus 로고    scopus 로고
    • Quantum mechanics helps in searching for a needle in a haystack
    • Jul
    • L. K. Grover, "Quantum mechanics helps in searching for a needle in a haystack," Phys. Rev. Lett., vol. 79, no. 2, pp. 325-327, Jul. 1997.
    • (1997) Phys. Rev. Lett , vol.79 , Issue.2 , pp. 325-327
    • Grover, L.K.1
  • 27
    • 0035924370 scopus 로고    scopus 로고
    • Experimental realization of Shor's quantum factoring algorithm using nuclear magnetic resonance
    • Dec
    • L. M. K. Vandersypen, M. Steffen, G. Breyta, C. S. Yannoni, M. H. Sherwood, and I. L. Chuang, "Experimental realization of Shor's quantum factoring algorithm using nuclear magnetic resonance," Nature, vol. 414, no. 6866, pp. 883-887, Dec. 2001.
    • (2001) Nature , vol.414 , Issue.6866 , pp. 883-887
    • Vandersypen, L.M.K.1    Steffen, M.2    Breyta, G.3    Yannoni, C.S.4    Sherwood, M.H.5    Chuang, I.L.6
  • 28
    • 11744326454 scopus 로고    scopus 로고
    • Experimental implementation of fast quantum searching
    • Apr
    • I. L. Chuang, N. Gershenfeld, and M. Kubinec, "Experimental implementation of fast quantum searching," Phys. Rev. Lett., vol. 80, no. 15, pp. 3408-3411, Apr. 1998.
    • (1998) Phys. Rev. Lett , vol.80 , Issue.15 , pp. 3408-3411
    • Chuang, I.L.1    Gershenfeld, N.2    Kubinec, M.3
  • 29
    • 0032502816 scopus 로고    scopus 로고
    • Fast searches with nuclear magnetic resonance computers
    • Apr
    • J. A. Jones, "Fast searches with nuclear magnetic resonance computers," Science, vol. 280, no. 5361, p. 229, Apr. 1998.
    • (1998) Science , vol.280 , Issue.5361 , pp. 229
    • Jones, J.A.1
  • 30
    • 0032575114 scopus 로고    scopus 로고
    • Implementation of a quantum search algorithm on a quantum computer
    • May
    • J. A. Jones, M. Mosca, and R. H. Hansen, "Implementation of a quantum search algorithm on a quantum computer," Nature, vol. 393, no. 6683, pp. 344-346, May 1998.
    • (1998) Nature , vol.393 , Issue.6683 , pp. 344-346
    • Jones, J.A.1    Mosca, M.2    Hansen, R.H.3
  • 31
    • 0034652062 scopus 로고    scopus 로고
    • Grover's search algorithm: An optical approach
    • Feb
    • P. G. Kwiat, J. R. Mitchell, P. D. D. Schwindt, and A. G. White, "Grover's search algorithm: An optical approach," J. Mod. Opt., vol. 47, no. 2/3, pp. 257-266, Feb. 2000.
    • (2000) J. Mod. Opt , vol.47 , Issue.2-3 , pp. 257-266
    • Kwiat, P.G.1    Mitchell, J.R.2    Schwindt, P.D.D.3    White, A.G.4
  • 32
    • 0035859923 scopus 로고    scopus 로고
    • Quantum optical implementation of Grover's algorithm
    • Aug
    • M. O. Scully and M. S. Zubairy, "Quantum optical implementation of Grover's algorithm," Proc. Nat. Acad. Sci. USA, vol. 98, no. 17, pp. 9490-9493, Aug. 2001.
    • (2001) Proc. Nat. Acad. Sci. USA , vol.98 , Issue.17 , pp. 9490-9493
    • Scully, M.O.1    Zubairy, M.S.2
  • 33
    • 0033889971 scopus 로고    scopus 로고
    • Quantum associative memory
    • May
    • D. Ventura and T. Martinez, "Quantum associative memory," Inf. Sci. vol. 124, no. 1, pp. 273-296, May 2000.
    • (2000) Inf. Sci , vol.124 , Issue.1 , pp. 273-296
    • Ventura, D.1    Martinez, T.2
  • 34
    • 0034300183 scopus 로고    scopus 로고
    • Quantum artificial neural network architectures and components
    • Oct
    • A. Narayanan and T. Menneer, "Quantum artificial neural network architectures and components," Inf. Sci., vol. 128, no. 3/4, pp. 231-255, Oct. 2000.
    • (2000) Inf. Sci , vol.128 , Issue.3-4 , pp. 231-255
    • Narayanan, A.1    Menneer, T.2
  • 35
    • 0029272121 scopus 로고
    • On quantum neural computing
    • Mar
    • S. Kak, "On quantum neural computing," Inf. Sci., vol. 83, no. 3, pp. 143-160, Mar. 1995.
    • (1995) Inf. Sci , vol.83 , Issue.3 , pp. 143-160
    • Kak, S.1
  • 36
    • 22244451025 scopus 로고    scopus 로고
    • Qubit neural network and its learning efficiency
    • Jul
    • N. Kouda, N. Matsui, H. Nishimura, and F. Peper, "Qubit neural network and its learning efficiency," Neural Comput. Appl., vol. 14, no. 2, pp. 114-121, Jul. 2005.
    • (2005) Neural Comput. Appl , vol.14 , Issue.2 , pp. 114-121
    • Kouda, N.1    Matsui, N.2    Nishimura, H.3    Peper, F.4
  • 38
    • 0036685590 scopus 로고    scopus 로고
    • Parallelization of a fuzzy control algorithm using quantum computation
    • Aug
    • G. G. Rigatos and S. G. Tzafestas, "Parallelization of a fuzzy control algorithm using quantum computation," IEEE Trans. Fuzzy Syst., vol. 10, no. 4, pp. 451-460, Aug. 2002.
    • (2002) IEEE Trans. Fuzzy Syst , vol.10 , Issue.4 , pp. 451-460
    • Rigatos, G.G.1    Tzafestas, S.G.2
  • 39
    • 25444455386 scopus 로고    scopus 로고
    • Quantum genetic algorithm method in self-consistent electronic structure calculations of a quantum dot with many electrons
    • M. Sahin, U. Atav, and M. Tomak, "Quantum genetic algorithm method in self-consistent electronic structure calculations of a quantum dot with many electrons," Int. J. Mod. Phys. C, vol. 16, no. 9, pp. 1379-1393, 2005.
    • (2005) Int. J. Mod. Phys. C , vol.16 , Issue.9 , pp. 1379-1393
    • Sahin, M.1    Atav, U.2    Tomak, M.3
  • 40
    • 0034301165 scopus 로고    scopus 로고
    • Quantum optimization
    • Oct
    • T. Hogg and D. Portnov, "Quantum optimization," Inf. Sci., vol. 128, no. 3, pp. 181-197, Oct. 2000.
    • (2000) Inf. Sci , vol.128 , Issue.3 , pp. 181-197
    • Hogg, T.1    Portnov, D.2
  • 41
    • 28444450523 scopus 로고    scopus 로고
    • Quantum search in stochastic planning
    • S. Naguleswaran and L. B. White, "Quantum search in stochastic planning," Proc. SPIE, vol. 5846, pp. 34-45, 2005.
    • (2005) Proc. SPIE , vol.5846 , pp. 34-45
    • Naguleswaran, S.1    White, L.B.2
  • 43
    • 52349091986 scopus 로고    scopus 로고
    • J. Preskill, Course information for Physics 229:[I#Advanced Mathematical Methods of Physics - Quantum Information and Computation Pasadena, CA: California Inst. Technol., 1998. [Online]. Available: http://www.theory, caltech.edu/people/preskill/ph229/
    • J. Preskill, "Course information for Physics 229:[I#Advanced Mathematical Methods of Physics - Quantum Information and Computation Pasadena, CA: California Inst. Technol., 1998. [Online]. Available: http://www.theory, caltech.edu/people/preskill/ph229/
  • 45
    • 33846175677 scopus 로고    scopus 로고
    • Quantum computation for action selection using reinforcement learning
    • C.L. Chen, D. Y. Dong, and Z. H. Chen, "Quantum computation for action selection using reinforcement learning," Int. J. Quantum Inf., vol. 4, no. 6, pp. 1071-1083, 2006.
    • (2006) Int. J. Quantum Inf , vol.4 , Issue.6 , pp. 1071-1083
    • Chen, C.L.1    Dong, D.Y.2    Chen, Z.H.3
  • 47
    • 14344266002 scopus 로고    scopus 로고
    • Learning rates for Q-learning
    • Dec
    • E. Even-Dar and Y. Mansour, "Learning rates for Q-learning," J. Mach. Learn. Res., vol. 5, pp. 1-25, Dec. 2003.
    • (2003) J. Mach. Learn. Res , vol.5 , pp. 1-25
    • Even-Dar, E.1    Mansour, Y.2
  • 49
    • 33646406807 scopus 로고    scopus 로고
    • Multi-armed bandit algorithms and empirical evaluation
    • J. Vermorel and M. Mohri, "Multi-armed bandit algorithms and empirical evaluation," in Proc. ECML, 2005, vol. 3720, pp. 437-448.
    • (2005) Proc. ECML , vol.3720 , pp. 437-448
    • Vermorel, J.1    Mohri, M.2
  • 50
    • 4844223639 scopus 로고    scopus 로고
    • A new Q-learning algorithm based on the metropolis criterion
    • Oct
    • M. Guo, Y. Liu, and J. Malec, "A new Q-learning algorithm based on the metropolis criterion," IEEE Trans. Syst., Man, Cybern. B, Cybern. vol. 34, no. 5, pp. 2140-2143, Oct. 2004.
    • (2004) IEEE Trans. Syst., Man, Cybern. B, Cybern , vol.34 , Issue.5 , pp. 2140-2143
    • Guo, M.1    Liu, Y.2    Malec, J.3
  • 51
    • 33745610133 scopus 로고    scopus 로고
    • Quantum mechanics helps in learning for more intelligent robots
    • Jul
    • D. Y. Dong, C. L. Chen, Z. H. Chen, and C. B. Zhang, "Quantum mechanics helps in learning for more intelligent robots," Chin. Phys. Lett. vol. 23, no. 7, pp. 1691-1694, Jul. 2006.
    • (2006) Chin. Phys. Lett , vol.23 , Issue.7 , pp. 1691-1694
    • Dong, D.Y.1    Chen, C.L.2    Chen, Z.H.3    Zhang, C.B.4
  • 52
    • 33745780982 scopus 로고    scopus 로고
    • Quantum robot: Structure, algorithms and applications
    • Jul
    • D. Y. Dong, C. L. Chen, C. B. Zhang, and Z. H. Chen, "Quantum robot: Structure, algorithms and applications," Robotica, vol. 24, no. 4, pp. 513-521, Jul. 2006.
    • (2006) Robotica , vol.24 , Issue.4 , pp. 513-521
    • Dong, D.Y.1    Chen, C.L.2    Zhang, C.B.3    Chen, Z.H.4
  • 53
    • 0000532930 scopus 로고    scopus 로고
    • Quantum robots and environments
    • Aug
    • P. Benioff, "Quantum robots and environments," Phys. Rev. A, Gen. Phys., vol. 58, no. 2, pp. 893-904, Aug. 1998.
    • (1998) Phys. Rev. A, Gen. Phys , vol.58 , Issue.2 , pp. 893-904
    • Benioff, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.