Volume 44, Issue 1, 2006, Pages 283-312

Stability of stochastic approximation under verifiable conditions

Author keywords

Adaptive Markov chain Monte Carlo; Randomly varying truncation; State dependent noise; Stochastic approximation

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; SET THEORY; SYSTEM STABILITY

EID: 33244461073     PISSN: 03630129     EISSN: None     Source Type: Journal    
DOI: 10.1137/S0363012902417267     Document Type: Article
Times cited: 188

References (31)
  • 1
    • 0037225359
    • Stochastic approximation for nonexpansive maps: Application to Q-learning algorithms
    • J. ABOUNADI, D. P. BERTSEKAS, AND V. BORKAR, Stochastic approximation for nonexpansive maps: Application to Q-learning algorithms, SIAM J. Control Optim., 41 (2002), pp. 1-22.
    • (2002) SIAM J. Control Optim. , vol.41 , pp. 1-22
    • Abounadi, J.1    Bertsekas, D.P.2    Borkar, V.3
  • 2
    • 33244493215
    • On the ergodicity properties of some Markov chain Monte Carlo algorithms
    • to appear
    • C. ANDRIEU AND E. MOULINES, On the ergodicity properties of some Markov chain Monte Carlo algorithms, Ann. Appl. Probab., to appear.
    • Ann. Appl. Probab.
    • Andrieu, C.1    Moulines, E.2
  • 3
    • 2442578536
    • Controlled MCMC for optimal sampling
    • Université de Paris Dauphine, Paris
    • C. ANDRIEU AND C. P. ROBERT, Controlled MCMC for Optimal Sampling, Cahiers du Cérémade 0125, Université de Paris Dauphine, Paris, 2001.
    • (2001) Cahiers du Cérémade , vol.125
    • Andrieu, C.1    Robert, C.P.2
  • 6
    • 0346736747
    • Stability of annealing schemes and related processes
    • V. BORKAR, Stability of annealing schemes and related processes, Systems Control Lett., 41 (2000), pp. 325-331.
    • (2000) Systems Control Lett. , vol.41 , pp. 325-331
    • Borkar, V.1
  • 7
    • 0033876515
    • The o.d.e. method for convergence of stochastic approximation and reinforcement learning
    • V. S. BORKAR AND S. P. MEYN, The o.d.e. method for convergence of stochastic approximation and reinforcement learning, SIAM J. Control Optim., 38 (2000), pp. 447-469.
    • (2000) SIAM J. Control Optim. , vol.38 , pp. 447-469
    • Borkar, V.S.1    Meyn, S.P.2
  • 8
    • 0036334497
    • Rate of convergence for constrained stochastic approximation algorithms
    • R. BUCHE AND H. J. KUSHNER, Rate of convergence for constrained stochastic approximation algorithms, SIAM J. Control Optim., 40 (2001), pp. 1011-1041.
    • (2001) SIAM J. Control Optim. , vol.40 , pp. 1011-1041
    • Buche, R.1    Kushner, H.J.2
  • 9
    • 0003936132
    • Stochastic approximation with state-dependent noise
    • H. CHEN, Stochastic approximation with state-dependent noise, Sci. China Ser. E, 43 (2000), pp. 531-541.
    • (2000) Sci. China Ser. E , vol.43 , pp. 531-541
    • Chen, H.1
  • 10
    • 38249037320
    • Convergence and robustness of the Robbins-Monro algorithm truncated at randomly varying bounds
    • H. CHEN, L. GUO, AND A. GAO, Convergence and robustness of the Robbins-Monro algorithm truncated at randomly varying bounds, Stochastic Process. Appl., 27 (1988), pp. 217-231.
    • (1988) Stochastic Process. Appl. , vol.27 , pp. 217-231
    • Chen, H.1    Guo, L.2    Gao, A.3
  • 11
    • 0011595015
    • Stochastic approximation procedures with randomly varying truncations
    • H. CHEN AND Y.-M. ZHU, Stochastic approximation procedures with randomly varying truncations, Sci. Sinica, 29 (1986), pp. 914-926.
    • (1986) Sci. Sinica , vol.29 , pp. 914-926
    • Chen, H.1    Zhu, Y.-M.2
  • 12
    • 33244493888
    • Stochastic approximation and its applications
    • Kluwer Academic Publishers, Dordrecht
    • H.-F. CHEN, Stochastic Approximation and Its Applications, Nonconvex Optimization and Its Applications 64, Kluwer Academic Publishers, Dordrecht, 2002.
    • (2002) Nonconvex Optimization and Its Applications , vol.64
    • Chen, H.-F.1
  • 13
    • 33244496564
    • Stochastic approximation with decreasing gain: Convergence and asymptotic theory
    • Université de Rennes, Rennes, France
    • B. DELYON, Stochastic Approximation with Decreasing Gain: Convergence and Asymptotic Theory, Tech. report, Université de Rennes, Rennes, France, 2000.
    • (2000) Tech. Report
    • Delyon, B.1
  • 14
    • 0033243858
    • Convergence of a stochastic approximation version of the EM algorithm
    • B. DELYON, M. LAVIELLE, AND E. MOULINES, Convergence of a stochastic approximation version of the EM algorithm, Ann. Stat., 27 (1999), pp. 94-128.
    • (1999) Ann. Stat. , vol.27 , pp. 94-128
    • Delyon, B.1    Lavielle, M.2    Moulines, E.3
  • 15
    • 14544282666
    • Quantitative bounds on convergence of time-inhomogeneous Markov chains
    • R. DOUC, E. MOULINES, AND J. ROSENTHAL, Quantitative bounds on convergence of time-inhomogeneous Markov chains, Ann. Appl. Probab., 14 (2004), pp. 1643-1665.
    • (2004) Ann. Appl. Probab. , vol.14 , pp. 1643-1665
    • Douc, R.1    Moulines, E.2    Rosenthal, J.3
  • 16
    • 14544306466
    • Random Iterative Systems
    • Springer-Verlag, Berlin
    • M. DUFLO, Random Iterative Systems, Appl. Math. 34, Springer-Verlag, Berlin, 1997.
    • (1997) Appl. Math. , vol.34
    • Duflo, M.1
  • 17
    • 0026923443
    • Rate of convergence of recursive estimators
    • L. GERENCSÉR AND S. S. WILSON, Rate of convergence of recursive estimators, SIAM J. Control Optim., 30 (1992), pp. 1200-1227.
    • (1992) SIAM J. Control Optim. , vol.30 , pp. 1200-1227
    • Gerencsér, L.1    Wilson, S.S.2
  • 18
    • 0030522182
    • A Liapounov bound for solutions of the Poisson equation
    • P. W. GLYNN AND S. P. MEYN, A Liapounov bound for solutions of the Poisson equation, Ann. Probab., 24 (1996), pp. 916-931.
    • (1996) Ann. Probab. , vol.24 , pp. 916-931
    • Glynn, P.W.1    Meyn, S.P.2
  • 19
    • 0038563932
    • An adaptive Metropolis algorithm
    • H. HAARIO, E. SAKSMAN, AND J. TAMMINEN, An adaptive Metropolis algorithm, Bernoulli, 7 (2001), pp. 223-242.
    • (2001) Bernoulli , vol.7 , pp. 223-242
    • Haario, H.1    Saksman, E.2    Tamminen, J.3
  • 21
    • 0001562199
    • Geometric ergodicity of Metropolis algorithms
    • S. JARNER AND E. HANSEN, Geometric ergodicity of Metropolis algorithms, Stochastic Process. Appl., 85 (2000), pp. 341-361.
    • (2000) Stochastic Process. Appl. , vol.85 , pp. 341-361
    • Jarner, S.1    Hansen, E.2
  • 23
    • 0345532155
    • Stochastic approximation algorithms and applications
    • Springer-Verlag, New York
    • H. KUSHNER AND G. YIN, Stochastic Approximation Algorithms and Applications, Appl. Math. 35, Springer-Verlag, New York, 1997.
    • (1997) Appl. Math. , vol.35
    • Kushner, H.1    Yin, G.2
  • 26
    • 0039367797
    • On the Poisson equation in the potential theory of a single kernel
    • E. NUMMELIN, On the Poisson equation in the potential theory of a single kernel, Math. Scand., 68 (1991), pp. 59-82.
    • (1991) Math. Scand. , vol.68 , pp. 59-82
    • Nummelin, E.1
  • 27
    • 33746388444
    • Geometric convergence and central limit theorem for multidimensional Hastings and Metropolis algorithms
    • G. ROBERTS AND R. TWEEDIE, Geometric convergence and central limit theorem for multidimensional Hastings and Metropolis algorithms, Biometrika, 83 (1996), pp. 95-110.
    • (1996) Biometrika , vol.83 , pp. 95-110
    • Roberts, G.1    Tweedie, R.2
  • 28
    • 0001514831
    • Bounds on regeneration times and convergence rates for Markov chains
    • G. ROBERTS AND R. TWEEDIE, Bounds on regeneration times and convergence rates for Markov chains, Stochastic Process. Appl., 80 (1999), pp. 211-229.
    • (1999) Stochastic Process. Appl. , vol.80 , pp. 211-229
    • Roberts, G.1    Tweedie, R.2
  • 29
    • 84923618271
    • Minorization conditions and convergence rates for Markov chain Monte Carlo
    • J. ROSENTHAL, Minorization conditions and convergence rates for Markov chain Monte Carlo, J. Amer. Statist. Assoc., 90 (1995), pp. 558-566.
    • (1995) J. Amer. Statist. Assoc. , vol.90 , pp. 558-566
    • Rosenthal, J.1
  • 30
    • 0031221985
    • Stochastic gradient with random truncations
    • V. TADIC, Stochastic gradient with random truncations, European J. Oper. Res., 101 (1997), pp. 261-284.
    • (1997) European J. Oper. Res. , vol.101 , pp. 261-284
    • Tadic, V.1
  • 31
    • 0142162920
    • Stochastic approximations with random truncations, state dependent noise and discontinuous dynamics
    • V. TADIC, Stochastic approximations with random truncations, state dependent noise and discontinuous dynamics, Stochastics Stochastics Rep., 64 (1998), pp. 283-326.
    • (1998) Stochastics Stochastics Rep. , vol.64 , pp. 283-326
    • Tadic, V.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.