-
2
-
-
0029679044
-
Reinforcement learning: A survey
-
L. P. Kaelbling, M. L. Littman, and A. W. Moore, "Reinforcement learning: A survey," J. Artif. Intell. Res., vol. 4, pp. 237-287, 1996.
-
(1996)
J. Artif. Intell. Res
, vol.4
, pp. 237-287
-
-
Kaelbling, L.P.1
Littman, M.L.2
Moore, A.W.3
-
4
-
-
33847202724
-
Learning to predict by the methods of temporal difference
-
Aug
-
R. Sutton, "Learning to predict by the methods of temporal difference," Mach. Learn., vol. 3, no. 1, pp. 9-44, Aug. 1988.
-
(1988)
Mach. Learn
, vol.3
, Issue.1
, pp. 9-44
-
-
Sutton, R.1
-
5
-
-
34249833101
-
Q-learning
-
C. Watkins and P. Dayan, "Q-learning," Mach. Learn., vol. 8, no. 3/4, pp. 279-292, 1992.
-
(1992)
Mach. Learn
, vol.8
, Issue.3-4
, pp. 279-292
-
-
Watkins, C.1
Dayan, P.2
-
6
-
-
24644466803
-
A fuzzy reinforcement learning approach to power control in wireless transmitters
-
Aug
-
D. Vengerov, N. Bambos, and H. Berenji, "A fuzzy reinforcement learning approach to power control in wireless transmitters," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 35, no. 4, pp. 768-778, Aug. 2005.
-
(2005)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.35
, Issue.4
, pp. 768-778
-
-
Vengerov, D.1
Bambos, N.2
Berenji, H.3
-
7
-
-
0029277469
-
A sensor-based navigation for a mobile robot using fuzzy logic and reinforcement learning
-
Mar
-
H. R. Beom and H. S. Cho, "A sensor-based navigation for a mobile robot using fuzzy logic and reinforcement learning," IEEE Trans. Syst., Man, Cybern., vol. 25, no. 3, pp. 464-477, Mar. 1995.
-
(1995)
IEEE Trans. Syst., Man, Cybern
, vol.25
, Issue.3
, pp. 464-477
-
-
Beom, H.R.1
Cho, H.S.2
-
8
-
-
0031207586
-
Harmonic functions and collision probabilities
-
Aug
-
C. I. Connolly, "Harmonic functions and collision probabilities," Int. J. Rob. Res., vol. 16, no. 4, pp. 497-507, Aug. 1997.
-
(1997)
Int. J. Rob. Res
, vol.16
, Issue.4
, pp. 497-507
-
-
Connolly, C.I.1
-
10
-
-
0742289960
-
A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control
-
Feb
-
T. Kondo and K. Ito, "A reinforcement learning with evolutionary state recruitment strategy for autonomous mobile robots control," Robot. Auton. Syst., vol. 46, no. 2, pp. 111-124, Feb. 2004.
-
(2004)
Robot. Auton. Syst
, vol.46
, Issue.2
, pp. 111-124
-
-
Kondo, T.1
Ito, K.2
-
11
-
-
0031215211
-
HQ-learning
-
M. Wiering and J. Schmidhuber, "HQ-learning," Adapt. Behav., vol. 6, no. 2, pp. 219-246, 1997.
-
(1997)
Adapt. Behav
, vol.6
, Issue.2
, pp. 219-246
-
-
Wiering, M.1
Schmidhuber, J.2
-
12
-
-
0141988716
-
Recent advances in hierarchical reinforcement learning
-
Oct
-
A. G. Barto and S. Mahanevan, "Recent advances in hierarchical reinforcement learning," Discret. Event Dyn. Syst.: Theory Appl., vol. 13, no. 4, pp. 41-77, Oct. 2003.
-
(2003)
Discret. Event Dyn. Syst.: Theory Appl
, vol.13
, Issue.4
, pp. 41-77
-
-
Barto, A.G.1
Mahanevan, S.2
-
13
-
-
0033170372
-
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
-
Aug
-
R. Sutton, D. Precup, and S. Singh, "Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning," Artif. Intell., vol. 112, no. 1, pp. 181-211, Aug. 1999.
-
(1999)
Artif. Intell
, vol.112
, Issue.1
, pp. 181-211
-
-
Sutton, R.1
Precup, D.2
Singh, S.3
-
14
-
-
0002278788
-
Hierarchical reinforcement learning with the MAXQ value function decomposition
-
T. G. Dietterich, "Hierarchical reinforcement learning with the MAXQ value function decomposition," J. Artif. Intell. Res., vol. 13, pp. 227-303, 2000.
-
(2000)
J. Artif. Intell. Res
, vol.13
, pp. 227-303
-
-
Dietterich, T.G.1
-
15
-
-
0013498457
-
Hierarchical learning and planning in partially observable Markov decision processes,
-
Ph.D. dissertation, Michigan State Univ, East Lansing, MI
-
G. Theocharous, "Hierarchical learning and planning in partially observable Markov decision processes," Ph.D. dissertation, Michigan State Univ., East Lansing, MI, 2002.
-
(2002)
-
-
Theocharous, G.1
-
16
-
-
0036790898
-
Applications of the self-organising map to reinforcement learning
-
Oct
-
A. J. Smith, "Applications of the self-organising map to reinforcement learning,"Neural Netw., vol. 15, no. 8/9, pp. 1107-1124, Oct. 2002.
-
(2002)
Neural Netw
, vol.15
, Issue.8-9
, pp. 1107-1124
-
-
Smith, A.J.1
-
18
-
-
0036465263
-
Fuzzy reinforcement learning control for compliance tasks of robotic manipulators
-
Feb
-
S. G. Tzafestas and G. G. Rigatos, "Fuzzy reinforcement learning control for compliance tasks of robotic manipulators," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 32, no. 1, pp. 107-113, Feb. 2002.
-
(2002)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.32
, Issue.1
, pp. 107-113
-
-
Tzafestas, S.G.1
Rigatos, G.G.2
-
19
-
-
2942574444
-
Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning
-
Jun
-
M. J. Er and C. Deng, "Online tuning of fuzzy inference systems using dynamic fuzzy Q-learning," IEEE Trans. Syst., Man, Cybern. B, Cybern., vol. 34, no. 3, pp. 1478-1489, Jun. 2004.
-
(2004)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.34
, Issue.3
, pp. 1478-1489
-
-
Er, M.J.1
Deng, C.2
-
20
-
-
46349107239
-
Hybrid control for Robot Navigation: A hierarchical Q-learning algorithm
-
Jun
-
C. L. Chen, H. X. Li, and D. Y. Dong, "Hybrid control for Robot Navigation: A hierarchical Q-learning algorithm," IEEE Robot. Autom. Mag., vol. 15, no. 2, pp. 37-47, Jun. 2008.
-
(2008)
IEEE Robot. Autom. Mag
, vol.15
, Issue.2
, pp. 37-47
-
-
Chen, C.L.1
Li, H.X.2
Dong, D.Y.3
-
21
-
-
33646714634
-
Evolutionary function approximation for reinforcement learning
-
Dec
-
S. Whiteson and P. Stone, "Evolutionary function approximation for reinforcement learning," J. Mach. Learn. Res., vol. 7, pp. 877-917, Dec. 2006.
-
(2006)
J. Mach. Learn. Res
, vol.7
, pp. 877-917
-
-
Whiteson, S.1
Stone, P.2
-
22
-
-
27844582247
-
A novel approach to multiagent reinforcement learning: Utilizing OLAP mining in the learning process
-
Nov
-
M. Kaya and R. Alhajj, "A novel approach to multiagent reinforcement learning: Utilizing OLAP mining in the learning process," IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 35, no. 4, pp. 582-590, Nov. 2005.
-
(2005)
IEEE Trans. Syst., Man, Cybern. C, Appl. Rev
, vol.35
, Issue.4
, pp. 582-590
-
-
Kaya, M.1
Alhajj, R.2
-
23
-
-
85115374351
-
Algorithms for quantum computation: Discrete logarithms and factoring
-
P. W. Shor, "Algorithms for quantum computation: Discrete logarithms and factoring," in Proc. 35th Annu. Symp. Found. Comput. Sci., 1994, pp. 124-134.
-
(1994)
Proc. 35th Annu. Symp. Found. Comput. Sci
, pp. 124-134
-
-
Shor, P.W.1
-
24
-
-
0030520263
-
Quantum computation and Shor's factoring algorithm
-
A. Ekert and R. Jozsa, "Quantum computation and Shor's factoring algorithm," Rev. Mod. Phys., vol. 68, no. 3, pp. 733-753, 1996.
-
(1996)
Rev. Mod. Phys
, vol.68
, Issue.3
, pp. 733-753
-
-
Ekert, A.1
Jozsa, R.2
-
25
-
-
0029701737
-
A fast quantum mechanical algorithm for database search
-
L. K. Grover, "A fast quantum mechanical algorithm for database search," in Proc. 28th Annu. ACM Symp. Theory Comput., 1996, pp. 212-219.
-
(1996)
Proc. 28th Annu. ACM Symp. Theory Comput
, pp. 212-219
-
-
Grover, L.K.1
-
26
-
-
4243807288
-
Quantum mechanics helps in searching for a needle in a haystack
-
Jul
-
L. K. Grover, "Quantum mechanics helps in searching for a needle in a haystack," Phys. Rev. Lett., vol. 79, no. 2, pp. 325-327, Jul. 1997.
-
(1997)
Phys. Rev. Lett
, vol.79
, Issue.2
, pp. 325-327
-
-
Grover, L.K.1
-
27
-
-
0035924370
-
Experimental realization of Shor's quantum factoring algorithm using nuclear magnetic resonance
-
Dec
-
L. M. K. Vandersypen, M. Steffen, G. Breyta, C. S. Yannoni, M. H. Sherwood, and I. L. Chuang, "Experimental realization of Shor's quantum factoring algorithm using nuclear magnetic resonance," Nature, vol. 414, no. 6866, pp. 883-887, Dec. 2001.
-
(2001)
Nature
, vol.414
, Issue.6866
, pp. 883-887
-
-
Vandersypen, L.M.K.1
Steffen, M.2
Breyta, G.3
Yannoni, C.S.4
Sherwood, M.H.5
Chuang, I.L.6
-
28
-
-
11744326454
-
Experimental implementation of fast quantum searching
-
Apr
-
I. L. Chuang, N. Gershenfeld, and M. Kubinec, "Experimental implementation of fast quantum searching," Phys. Rev. Lett., vol. 80, no. 15, pp. 3408-3411, Apr. 1998.
-
(1998)
Phys. Rev. Lett
, vol.80
, Issue.15
, pp. 3408-3411
-
-
Chuang, I.L.1
Gershenfeld, N.2
Kubinec, M.3
-
29
-
-
0032502816
-
Fast searches with nuclear magnetic resonance computers
-
Apr
-
J. A. Jones, "Fast searches with nuclear magnetic resonance computers," Science, vol. 280, no. 5361, p. 229, Apr. 1998.
-
(1998)
Science
, vol.280
, Issue.5361
, pp. 229
-
-
Jones, J.A.1
-
30
-
-
0032575114
-
Implementation of a quantum search algorithm on a quantum computer
-
May
-
J. A. Jones, M. Mosca, and R. H. Hansen, "Implementation of a quantum search algorithm on a quantum computer," Nature, vol. 393, no. 6683, pp. 344-346, May 1998.
-
(1998)
Nature
, vol.393
, Issue.6683
, pp. 344-346
-
-
Jones, J.A.1
Mosca, M.2
Hansen, R.H.3
-
31
-
-
0034652062
-
Grover's search algorithm: An optical approach
-
Feb
-
P. G. Kwiat, J. R. Mitchell, P. D. D. Schwindt, and A. G. White, "Grover's search algorithm: An optical approach," J. Mod. Opt., vol. 47, no. 2/3, pp. 257-266, Feb. 2000.
-
(2000)
J. Mod. Opt
, vol.47
, Issue.2-3
, pp. 257-266
-
-
Kwiat, P.G.1
Mitchell, J.R.2
Schwindt, P.D.D.3
White, A.G.4
-
32
-
-
0035859923
-
Quantum optical implementation of Grover's algorithm
-
Aug
-
M. O. Scully and M. S. Zubairy, "Quantum optical implementation of Grover's algorithm," Proc. Nat. Acad. Sci. USA, vol. 98, no. 17, pp. 9490-9493, Aug. 2001.
-
(2001)
Proc. Nat. Acad. Sci. USA
, vol.98
, Issue.17
, pp. 9490-9493
-
-
Scully, M.O.1
Zubairy, M.S.2
-
33
-
-
0033889971
-
Quantum associative memory
-
May
-
D. Ventura and T. Martinez, "Quantum associative memory," Inf. Sci. vol. 124, no. 1, pp. 273-296, May 2000.
-
(2000)
Inf. Sci
, vol.124
, Issue.1
, pp. 273-296
-
-
Ventura, D.1
Martinez, T.2
-
34
-
-
0034300183
-
Quantum artificial neural network architectures and components
-
Oct
-
A. Narayanan and T. Menneer, "Quantum artificial neural network architectures and components," Inf. Sci., vol. 128, no. 3/4, pp. 231-255, Oct. 2000.
-
(2000)
Inf. Sci
, vol.128
, Issue.3-4
, pp. 231-255
-
-
Narayanan, A.1
Menneer, T.2
-
35
-
-
0029272121
-
On quantum neural computing
-
Mar
-
S. Kak, "On quantum neural computing," Inf. Sci., vol. 83, no. 3, pp. 143-160, Mar. 1995.
-
(1995)
Inf. Sci
, vol.83
, Issue.3
, pp. 143-160
-
-
Kak, S.1
-
36
-
-
22244451025
-
Qubit neural network and its learning efficiency
-
Jul
-
N. Kouda, N. Matsui, H. Nishimura, and F. Peper, "Qubit neural network and its learning efficiency," Neural Comput. Appl., vol. 14, no. 2, pp. 114-121, Jul. 2005.
-
(2005)
Neural Comput. Appl
, vol.14
, Issue.2
, pp. 114-121
-
-
Kouda, N.1
Matsui, N.2
Nishimura, H.3
Peper, F.4
-
37
-
-
0034301322
-
Simulations of quantum neural networks
-
Oct
-
E. C. Behrman, L. R. Nash, J. E. Steck, V. G. Chandrashekar, and S. R. Skinner, "Simulations of quantum neural networks," Inf. Sci., vol. 128, no. 3, pp. 257-269, Oct. 2000.
-
(2000)
Inf. Sci
, vol.128
, Issue.3
, pp. 257-269
-
-
Behrman, E.C.1
Nash, L.R.2
Steck, J.E.3
Chandrashekar, V.G.4
Skinner, S.R.5
-
38
-
-
0036685590
-
Parallelization of a fuzzy control algorithm using quantum computation
-
Aug
-
G. G. Rigatos and S. G. Tzafestas, "Parallelization of a fuzzy control algorithm using quantum computation," IEEE Trans. Fuzzy Syst., vol. 10, no. 4, pp. 451-460, Aug. 2002.
-
(2002)
IEEE Trans. Fuzzy Syst
, vol.10
, Issue.4
, pp. 451-460
-
-
Rigatos, G.G.1
Tzafestas, S.G.2
-
39
-
-
25444455386
-
Quantum genetic algorithm method in self-consistent electronic structure calculations of a quantum dot with many electrons
-
M. Sahin, U. Atav, and M. Tomak, "Quantum genetic algorithm method in self-consistent electronic structure calculations of a quantum dot with many electrons," Int. J. Mod. Phys. C, vol. 16, no. 9, pp. 1379-1393, 2005.
-
(2005)
Int. J. Mod. Phys. C
, vol.16
, Issue.9
, pp. 1379-1393
-
-
Sahin, M.1
Atav, U.2
Tomak, M.3
-
40
-
-
0034301165
-
Quantum optimization
-
Oct
-
T. Hogg and D. Portnov, "Quantum optimization," Inf. Sci., vol. 128, no. 3, pp. 181-197, Oct. 2000.
-
(2000)
Inf. Sci
, vol.128
, Issue.3
, pp. 181-197
-
-
Hogg, T.1
Portnov, D.2
-
41
-
-
28444450523
-
Quantum search in stochastic planning
-
S. Naguleswaran and L. B. White, "Quantum search in stochastic planning," Proc. SPIE, vol. 5846, pp. 34-45, 2005.
-
(2005)
Proc. SPIE
, vol.5846
, pp. 34-45
-
-
Naguleswaran, S.1
White, L.B.2
-
42
-
-
26844446858
-
Quantum reinforcement learning
-
D. Y. Dong, C. L. Chen, and Z. H. Chen, "Quantum reinforcement learning," in Proc. 1st Int. Conf. Natural Comput., 2005, vol. 3611, pp. 686-689.
-
(2005)
Proc. 1st Int. Conf. Natural Comput
, vol.3611
, pp. 686-689
-
-
Dong, D.Y.1
Chen, C.L.2
Chen, Z.H.3
-
43
-
-
52349091986
-
-
J. Preskill, Course information for Physics 229:[I#Advanced Mathematical Methods of Physics - Quantum Information and Computation Pasadena, CA: California Inst. Technol., 1998. [Online]. Available: http://www.theory, caltech.edu/people/preskill/ph229/
-
J. Preskill, "Course information for Physics 229:[I#Advanced Mathematical Methods of Physics - Quantum Information and Computation Pasadena, CA: California Inst. Technol., 1998. [Online]. Available: http://www.theory, caltech.edu/people/preskill/ph229/
-
-
-
-
45
-
-
33846175677
-
Quantum computation for action selection using reinforcement learning
-
C.L. Chen, D. Y. Dong, and Z. H. Chen, "Quantum computation for action selection using reinforcement learning," Int. J. Quantum Inf., vol. 4, no. 6, pp. 1071-1083, 2006.
-
(2006)
Int. J. Quantum Inf
, vol.4
, Issue.6
, pp. 1071-1083
-
-
Chen, C.L.1
Dong, D.Y.2
Chen, Z.H.3
-
46
-
-
0032338939
-
Tight bounds on quantum searching
-
Jun
-
M. Boyer, G. Brassard, and P. Høyer, "Tight bounds on quantum searching," Fortschritte Der Physik - Progress of Physics, vol. 46, no. 45, pp. 493-506, Jun. 1998.
-
(1998)
Fortschritte Der Physik - Progress of Physics
, vol.46
, Issue.45
, pp. 493-506
-
-
Boyer, M.1
Brassard, G.2
Høyer, P.3
-
47
-
-
14344266002
-
Learning rates for Q-learning
-
Dec
-
E. Even-Dar and Y. Mansour, "Learning rates for Q-learning," J. Mach. Learn. Res., vol. 5, pp. 1-25, Dec. 2003.
-
(2003)
J. Mach. Learn. Res
, vol.5
, pp. 1-25
-
-
Even-Dar, E.1
Mansour, Y.2
-
48
-
-
33846183002
-
Emergent robot differentiation for distributed multi-robot task allocation
-
T. S. Dahl, M. J. Mataric, and G. S. Sukhatme, "Emergent robot differentiation for distributed multi-robot task allocation," in Proc. 7th Int. Symp. Distrib. Auton. Robotic Syst., 2004, pp. 191-200.
-
(2004)
Proc. 7th Int. Symp. Distrib. Auton. Robotic Syst
, pp. 191-200
-
-
Dahl, T.S.1
Mataric, M.J.2
Sukhatme, G.S.3
-
49
-
-
33646406807
-
Multi-armed bandit algorithms and empirical evaluation
-
J. Vermorel and M. Mohri, "Multi-armed bandit algorithms and empirical evaluation," in Proc. ECML, 2005, vol. 3720, pp. 437-448.
-
(2005)
Proc. ECML
, vol.3720
, pp. 437-448
-
-
Vermorel, J.1
Mohri, M.2
-
50
-
-
4844223639
-
A new Q-learning algorithm based on the metropolis criterion
-
Oct
-
M. Guo, Y. Liu, and J. Malec, "A new Q-learning algorithm based on the metropolis criterion," IEEE Trans. Syst., Man, Cybern. B, Cybern. vol. 34, no. 5, pp. 2140-2143, Oct. 2004.
-
(2004)
IEEE Trans. Syst., Man, Cybern. B, Cybern
, vol.34
, Issue.5
, pp. 2140-2143
-
-
Guo, M.1
Liu, Y.2
Malec, J.3
-
51
-
-
33745610133
-
Quantum mechanics helps in learning for more intelligent robots
-
Jul
-
D. Y. Dong, C. L. Chen, Z. H. Chen, and C. B. Zhang, "Quantum mechanics helps in learning for more intelligent robots," Chin. Phys. Lett. vol. 23, no. 7, pp. 1691-1694, Jul. 2006.
-
(2006)
Chin. Phys. Lett
, vol.23
, Issue.7
, pp. 1691-1694
-
-
Dong, D.Y.1
Chen, C.L.2
Chen, Z.H.3
Zhang, C.B.4
-
52
-
-
33745780982
-
Quantum robot: Structure, algorithms and applications
-
Jul
-
D. Y. Dong, C. L. Chen, C. B. Zhang, and Z. H. Chen, "Quantum robot: Structure, algorithms and applications," Robotica, vol. 24, no. 4, pp. 513-521, Jul. 2006.
-
(2006)
Robotica
, vol.24
, Issue.4
, pp. 513-521
-
-
Dong, D.Y.1
Chen, C.L.2
Zhang, C.B.3
Chen, Z.H.4
-
53
-
-
0000532930
-
Quantum robots and environments
-
Aug
-
P. Benioff, "Quantum robots and environments," Phys. Rev. A, Gen. Phys., vol. 58, no. 2, pp. 893-904, Aug. 1998.
-
(1998)
Phys. Rev. A, Gen. Phys
, vol.58
, Issue.2
, pp. 893-904
-
-
Benioff, P.1
|