-
1
-
-
0037225359
-
Stochastic approximation for nonexpansive maps: Application to Q-learning algorithms
-
J. ABOUNADI, D. P. BERTSEKAS, AND V. BORKAR, Stochastic approximation for nonexpansive maps: Application to Q-learning algorithms, SIAM J. Control Optim., 41 (2002), pp. 1-22.
-
(2002)
SIAM J. Control Optim.
, vol.41
, pp. 1-22
-
-
Abounadi, J.1
Bertsekas, D.P.2
Borkar, V.3
-
2
-
-
33244493215
-
On the ergodicity properties of some Markov chain Monte Carlo algorithms
-
to appear
-
C. ANDRIEU AND E. MOULINES, On the ergodicity properties of some Markov chain Monte Carlo algorithms, Ann. Appl. Probab., to appear.
-
Ann. Appl. Probab.
-
-
Andrieu, C.1
Moulines, E.2
-
3
-
-
2442578536
-
Controlled MCMC for optimal sampling
-
Université de Paris Dauphine, Paris
-
C. ANDRIEU AND C. P. ROBERT, Controlled MCMC for Optimal Sampling, Cahiers du Cérémade 0125, Université de Paris Dauphine, Paris, 2001.
-
(2001)
Cahiers du Cérémade
, vol.125
-
-
Andrieu, C.1
Robert, C.P.2
-
4
-
-
33244485998
-
-
Ph.D. thesis, The Institute for Systems Research, University of Maryland, College Park, MD
-
J. BARTUSEK, Stochastic Approximation and Optimization of Markov Chains, Ph.D. thesis, The Institute for Systems Research, University of Maryland, College Park, MD, 2000.
-
(2000)
Stochastic Approximation and Optimization of Markov Chains
-
-
Bartusek, J.1
-
5
-
-
0003778897
-
-
Springer-Verlag, New York
-
A. BENVENISTE, M. MÉTIVIER, AND P. PRIOURET, Adaptive Algorithms and Stochastic Approximations, Springer-Verlag, New York, 1990.
-
(1990)
Adaptive Algorithms and Stochastic Approximations
-
-
Benveniste, A.1
Métivier, M.2
Priouret, P.3
-
6
-
-
0346736747
-
Stability of annealing schemes and related processes
-
V. BORKAR, Stability of annealing schemes and related processes, Systems Control Lett., 41 (2000), pp. 325-331.
-
(2000)
Systems Control Lett.
, vol.41
, pp. 325-331
-
-
Borkar, V.1
-
7
-
-
0033876515
-
The o.d.e. method for convergence of stochastic approximation and reinforcement learning
-
V. S. BORKAR AND S. P. MEYN, The o.d.e. method for convergence of stochastic approximation and reinforcement learning, SIAM J. Control Optim., 38 (2000), pp. 447-469.
-
(2000)
SIAM J. Control Optim.
, vol.38
, pp. 447-469
-
-
Borkar, V.S.1
Meyn, S.P.2
-
8
-
-
0036334497
-
Rate of convergence for constrained stochastic approximation algorithms
-
R. BUCHE AND H. J. KUSHNER, Rate of convergence for constrained stochastic approximation algorithms, SIAM J. Control Optim., 40 (2001), pp. 1011-1041.
-
(2001)
SIAM J. Control Optim.
, vol.40
, pp. 1011-1041
-
-
Buche, R.1
Kushner, H.J.2
-
9
-
-
0003936132
-
Stochastic approximation with state-dependent noise
-
H. CHEN, Stochastic approximation with state-dependent noise, Sci. China Ser. E, 43 (2000), pp. 531-541.
-
(2000)
Sci. China Ser. E
, vol.43
, pp. 531-541
-
-
Chen, H.1
-
10
-
-
38249037320
-
Convergence and robustness of the Robbins-Monro algorithm truncated at randomly varying bounds
-
H. CHEN, L. GUO, AND A. GAO, Convergence and robustness of the Robbins-Monro algorithm truncated at randomly varying bounds, Stochastic Process. Appl., 27 (1988), pp. 217-231.
-
(1988)
Stochastic Process. Appl.
, vol.27
, pp. 217-231
-
-
Chen, H.1
Guo, L.2
Gao, A.3
-
11
-
-
0011595015
-
Stochastic approximation procedures with randomly varying truncations
-
H. CHEN AND Y.-M. ZHU, Stochastic approximation procedures with randomly varying truncations, Sci. Sinica, 29 (1986), pp. 914-926.
-
(1986)
Sci. Sinica
, vol.29
, pp. 914-926
-
-
Chen, H.1
Zhu, Y.-M.2
-
12
-
-
33244493888
-
Stochastic approximation and its applications
-
Kluwer Academic Publishers, Dordrecht
-
H.-F. CHEN, Stochastic Approximation and Its Applications, Nonconvex Optimization and Its Applications 64, Kluwer Academic Publishers, Dordrecht, 2002.
-
(2002)
Nonconvex Optimization and Its Applications
, vol.64
-
-
Chen, H.-F.1
-
13
-
-
33244496564
-
Stochastic approximation with decreasing gain: Convergence and asymptotic theory
-
Université de Rennes, Rennes, France
-
B. DELYON, Stochastic Approximation with Decreasing Gain: Convergence and Asymptotic Theory, Tech. report, Université de Rennes, Rennes, France, 2000.
-
(2000)
Tech. Report
-
-
Delyon, B.1
-
14
-
-
0033243858
-
Convergence of a stochastic approximation version of the EM algorithm
-
B. DELYON, M. LAVIELLE, AND E. MOULINES, Convergence of a stochastic approximation version of the EM algorithm, Ann. Stat., 27 (1999), pp. 94-128.
-
(1999)
Ann. Stat.
, vol.27
, pp. 94-128
-
-
Delyon, B.1
Lavielle, M.2
Moulines, E.3
-
15
-
-
14544282666
-
Quantitative bounds on convergence of time-inhomogeneous Markov chains
-
R. DOUC, E. MOULINES, AND J. ROSENTHAL, Quantitative bounds on convergence of time-inhomogeneous Markov chains, Ann. Appl. Probab., 14 (2004), pp. 1643-1665.
-
(2004)
Ann. Appl. Probab.
, vol.14
, pp. 1643-1665
-
-
Douc, R.1
Moulines, E.2
Rosenthal, J.3
-
16
-
-
14544306466
-
Random Iterative Systems
-
Springer-Verlag, Berlin
-
M. DUFLO, Random Iterative Systems, Appl. Math. 34, Springer-Verlag, Berlin, 1997.
-
(1997)
Appl. Math.
, vol.34
-
-
Duflo, M.1
-
17
-
-
0026923443
-
Rate of convergence of recursive estimators
-
L. GERENCSÉR AND S. S. WILSON, Rate of convergence of recursive estimators, SIAM J. Control Optim., 30 (1992), pp. 1200-1227.
-
(1992)
SIAM J. Control Optim.
, vol.30
, pp. 1200-1227
-
-
Gerencsér, L.1
Wilson, S.S.2
-
18
-
-
0030522182
-
A Liapounov bound for solutions of the Poisson equation
-
P. W. GLYNN AND S. P. MEYN, A Liapounov bound for solutions of the Poisson equation, Ann. Probab., 24 (1996), pp. 916-931.
-
(1996)
Ann. Probab.
, vol.24
, pp. 916-931
-
-
Glynn, P.W.1
Meyn, S.P.2
-
19
-
-
0038563932
-
An adaptive Metropolis algorithm
-
H. HAARIO, E. SAKSMAN, AND J. TAMMINEN, An adaptive Metropolis algorithm, Bernoulli, 7 (2001), pp. 223-242.
-
(2001)
Bernoulli
, vol.7
, pp. 223-242
-
-
Haario, H.1
Saksman, E.2
Tamminen, J.3
-
21
-
-
0001562199
-
Geometric ergodicity of Metropolis algorithms
-
S. JARNER AND E. HANSEN, Geometric ergodicity of Metropolis algorithms, Stochastic Process. Appl., 85 (2000), pp. 341-361.
-
(2000)
Stochastic Process. Appl.
, vol.85
, pp. 341-361
-
-
Jarner, S.1
Hansen, E.2
-
23
-
-
0345532155
-
Stochastic approximation algorithms and applications
-
Springer-Verlag, New York
-
H. KUSHNER AND G. YIN, Stochastic Approximation Algorithms and Applications, Appl. Math. 35, Springer-Verlag, New York, 1997.
-
(1997)
Appl. Math.
, vol.35
-
-
Kushner, H.1
Yin, G.2
-
24
-
-
5744249209
-
Equation of state calculations by fast computing machines
-
N. METROPOLIS, A. ROSENBLUTH, M. ROSENBLUTH, A. TELLER, AND E. TELLER, Equation of state calculations by fast computing machines, J. Chem. Phys., 21 (1953), pp. 1087-1092.
-
(1953)
J. Chem. Phys.
, vol.21
, pp. 1087-1092
-
-
Metropolis, N.1
Rosenbluth, A.2
Rosenbluth, M.3
Teller, A.4
Teller, E.5
-
26
-
-
0039367797
-
On the Poisson equation in the potential theory of a single kernel
-
E. NUMMELIN, On the Poisson equation in the potential theory of a single kernel, Math. Scand., 68 (1991), pp. 59-82.
-
(1991)
Math. Scand.
, vol.68
, pp. 59-82
-
-
Nummelin, E.1
-
27
-
-
33746388444
-
Geometric convergence and central limit theorem for multidimensional Hastings and Metropolis algorithms
-
G. ROBERTS AND R. TWEEDIE, Geometric convergence and central limit theorem for multidimensional Hastings and Metropolis algorithms, Biometrika, 83 (1996), pp. 95-110.
-
(1996)
Biometrika
, vol.83
, pp. 95-110
-
-
Roberts, G.1
Tweedie, R.2
-
28
-
-
0001514831
-
Bounds on regeneration times and convergence rates for Markov chains
-
G. ROBERTS AND R. TWEEDIE, Bounds on regeneration times and convergence rates for Markov chains, Stochastic Process. Appl., 80 (1999), pp. 211-229.
-
(1999)
Stochastic Process. Appl.
, vol.80
, pp. 211-229
-
-
Roberts, G.1
Tweedie, R.2
-
29
-
-
84923618271
-
Minorization conditions and convergence rates for Markov chain Monte Carlo
-
J. ROSENTHAL, Minorization conditions and convergence rates for Markov chain Monte Carlo, J. Amer. Statist. Assoc., 90 (1995), pp. 558-566.
-
(1995)
J. Amer. Statist. Assoc.
, vol.90
, pp. 558-566
-
-
Rosenthal, J.1
-
30
-
-
0031221985
-
Stochastic gradient with random truncations
-
V. TADIC, Stochastic gradient with random truncations, European J. Oper. Res., 101 (1997), pp. 261-284.
-
(1997)
European J. Oper. Res.
, vol.101
, pp. 261-284
-
-
Tadic, V.1
-
31
-
-
0142162920
-
Stochastic approximations with random truncations, state dependent noise and discontinuous dynamics
-
V. TADIC, Stochastic approximations with random truncations, state dependent noise and discontinuous dynamics, Stochastics Stochastics Rep., 64 (1998), pp. 283-326.
-
(1998)
Stochastics Stochastics Rep.
, vol.64
, pp. 283-326
-
-
Tadic, V.1