메뉴 건너뛰기




Volumn 2, Issue 1, 2010, Pages 87-96

Stochastic approximation: A survey

Author keywords

[No Author keywords available]

Indexed keywords

A-STABILITY; ASYMPTOTIC PROPERTIES; BASIC IDEA; CONTINUOUS TIME; CONVERGENCE WITH PROBABILITY ONE; HIGH-DIMENSIONAL PROBLEMS; MULTIPLE TIME SCALE; NOISE PROCESS; RATE OF CONVERGENCE; RECURSIVE ALGORITHMS; STATE-DEPENDENT NOISE; STOCHASTIC APPROXIMATIONS; TYPE ARGUMENTS; WEAK CONVERGENCE;

EID: 78651592863     PISSN: 19395108     EISSN: 19390068     Source Type: Journal    
DOI: 10.1002/wics.57     Document Type: Review
Times cited : (53)

References (74)
  • 1
    • 0000016172 scopus 로고
    • A stochastic approximation method
    • Robbins H, Monro S. A stochastic approximation method. Ann Math Stat 1951, 22:400-407.
    • (1951) Ann Math Stat , vol.22 , pp. 400-407
    • Robbins, H.1    Monro, S.2
  • 3
    • 0000792515 scopus 로고
    • Multidimensional stochastic approximation
    • Blum JR. Multidimensional stochastic approximation. Ann Math Stat 1954, 9:737-744.
    • (1954) Ann Math Stat , vol.9 , pp. 737-744
    • Blum, J.R.1
  • 6
    • 33846701878 scopus 로고
    • On the Kiefer-Wolfowitz approximation method
    • Dupač V. On the Kiefer-Wolfowitz approximation method. Casopis Pest Mat 1957, 82:47-75.
    • (1957) Casopis Pest Mat , vol.82 , pp. 47-75
    • Dupač, V.1
  • 8
    • 0007298844 scopus 로고
    • On stochastic approximation
    • Gladyshev EG. On stochastic approximation. Theory Probab Appl 1965, 10:275-278.
    • (1965) Theory Probab Appl , vol.10 , pp. 275-278
    • Gladyshev, E.G.1
  • 9
    • 0002686402 scopus 로고
    • A convergence theorem for some nonnegative almost supermartingales and some applications
    • In: Rustagi JS, ed., New York: Academic Press
    • Robbins H, Siegmund D. A convergence theorem for some nonnegative almost supermartingales and some applications. In: Rustagi JS, ed. Optimizing Methods in Statistics. New York: Academic Press; 1971, 233-257.
    • (1971) Optimizing Methods in Statistics , pp. 233-257
    • Robbins, H.1    Siegmund, D.2
  • 10
    • 78651536107 scopus 로고
    • A continuous Kiefer-Wolfowitz procedure for random processes
    • Sakrison DJ. A continuous Kiefer-Wolfowitz procedure for random processes. Ann Math Stat 1964, 35:590-599.
    • (1964) Ann Math Stat , vol.35 , pp. 590-599
    • Sakrison, D.J.1
  • 11
    • 0000431134 scopus 로고
    • Asymptotic distribution of stochastic approximation processes
    • Sacks J. Asymptotic distribution of stochastic approximation processes. Ann Math Stat 1958, 29:373-405.
    • (1958) Ann Math Stat , vol.29 , pp. 373-405
    • Sacks, J.1
  • 13
    • 0001079593 scopus 로고
    • Stochastic estimation of the maximum of a regression function
    • Kiefer J, Wolfowitz J. Stochastic estimation of the maximum of a regression function. Ann Math Stat 1952, 23:462-466.
    • (1952) Ann Math Stat , vol.23 , pp. 462-466
    • Kiefer, J.1    Wolfowitz, J.2
  • 16
    • 0038026196 scopus 로고    scopus 로고
    • Stochastic approximation
    • Lai TL. Stochastic approximation. Ann Stat 2003, 31:391-406.
    • (2003) Ann Stat , vol.31 , pp. 391-406
    • Lai, T.L.1
  • 17
    • 0009167146 scopus 로고
    • Multidimensional stochastic approximation
    • In: Krishnaiah PR, ed., New York: Academic Press
    • Schmetterer L. Multidimensional stochastic approximation. In: Krishnaiah PR, ed. Multivariate Analysis II. New York: Academic Press; 1969, 443-460.
    • (1969) Multivariate Analysis II , pp. 443-460
    • Schmetterer, L.1
  • 18
    • 0001846920 scopus 로고
    • Stochastic approximation
    • In: Ghosh BK, Sen PK, ed., New York: Marcel Dekker
    • Ruppert D. Stochastic approximation. In: Ghosh BK, Sen PK, ed. Handbook in Sequential Analysis. New York: Marcel Dekker; 1991, 503-529.
    • (1991) Handbook in Sequential Analysis , pp. 503-529
    • Ruppert, D.1
  • 19
    • 0003361256 scopus 로고
    • Stochastic approximation
    • In: Rustagi JS, ed., New York: Academic Press;
    • Fabian V. Stochastic approximation. In: Rustagi JS, ed. Optimizing Methods in Statistics. New York: Academic Press; 1971.
    • (1971) Optimizing Methods in Statistics
    • Fabian, V.1
  • 23
    • 0033876515 scopus 로고    scopus 로고
    • The O.D.E. method for convergence of stochastic approximation and reinforcement learning
    • Borkar VS, Meyn SP. The o.d.e. method for convergence of stochastic approximation and reinforcement learning. SIAM J Control Optim 2000, 38:447-469.
    • (2000) SIAM J Control Optim , vol.38 , pp. 447-469
    • Borkar, V.S.1    Meyn, S.P.2
  • 24
    • 0017526570 scopus 로고
    • Analysis of recursive stochastic algorithms
    • Ljung L. Analysis of recursive stochastic algorithms. IEEE Trans Automat Control 1977, 22:551-575.
    • (1977) IEEE Trans Automat Control , vol.22 , pp. 551-575
    • Ljung, L.1
  • 26
    • 0021190437 scopus 로고
    • An invariant measure approach to the convergence of stochastic approximations with state dependent noise
    • Kushner HJ, Shwartz A. An invariant measure approach to the convergence of stochastic approximations with state dependent noise. SIAM J Control Optim 1984, 22:13-27.
    • (1984) SIAM J Control Optim , vol.22 , pp. 13-27
    • Kushner, H.J.1    Shwartz, A.2
  • 28
    • 0011595015 scopus 로고
    • Stochastic approximation procedure with randomly varying truncations
    • Chen H-F, Zhu YM. Stochastic approximation procedure with randomly varying truncations. Sci Sin Ser., A 1986, 29:914-926.
    • (1986) Sci Sin Ser., A , vol.29 , pp. 914-926
    • Chen, H.-F.1    Zhu, Y.M.2
  • 29
    • 0001055484 scopus 로고    scopus 로고
    • Equivalent necessary and sufficient conditions on noise sequences for stochastic approximation algorithms
    • Wang IJ, Chong EKP, Kulkarni SR. Equivalent necessary and sufficient conditions on noise sequences for stochastic approximation algorithms. Adv Appl Probab 1996, 28:784-801.
    • (1996) Adv Appl Probab , vol.28 , pp. 784-801
    • Wang, I.J.1    Chong, E.K.P.2    Kulkarni, S.R.3
  • 30
    • 0030104967 scopus 로고    scopus 로고
    • A dynamical systems approach to stochastic approximations
    • Benaïm M. A dynamical systems approach to stochastic approximations. SIAM J Control Optim 1996, 34: 437-472.
    • (1996) SIAM J Control Optim , vol.34 , pp. 437-472
    • Benaïm, M.1
  • 32
    • 0001793657 scopus 로고    scopus 로고
    • Séminaire de Probabilités
    • Berlin and New York: Springer
    • Benaïm M. Dynamics of stochastic approximation algorithms. Séminaire de Probabilités, vol.XXXIII, Lecture Notes in Mathematics, 1709. Berlin and New York: Springer; 1999, 1-68.
    • (1709) Lecture Notes in Mathematics , vol.33 , pp. 1-68
    • Benaïm, M.1
  • 35
    • 0024731334 scopus 로고
    • Stochastic approximation and large deviations: upper bounds and w.p.1 convergence
    • Dupuis P, Kushner HJ. Stochastic approximation and large deviations: upper bounds and w.p.1 convergence. SIAM J Control Optim 1989, 27:1108-1135.
    • (1989) SIAM J Control Optim , vol.27 , pp. 1108-1135
    • Dupuis, P.1    Kushner, H.J.2
  • 39
    • 0019079883 scopus 로고
    • Diffusion approximations to output processes of nonlinear systems with wide-band inputs, with applications
    • Kushner HJ. Diffusion approximations to output processes of nonlinear systems with wide-band inputs, with applications. IEEE Trans Inf Theory 1980, 26: 715-725.
    • (1980) IEEE Trans Inf Theory , vol.26 , pp. 715-725
    • Kushner, H.J.1
  • 40
    • 0032186926 scopus 로고    scopus 로고
    • Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation
    • Sadegh P, Spall JC. Optimal random perturbations for stochastic approximation using a simultaneous perturbation gradient approximation. IEEE Trans Autom Control 1998, 43:1480-1484.
    • (1998) IEEE Trans Autom Control , vol.43 , pp. 1480-1484
    • Sadegh, P.1    Spall, J.C.2
  • 41
    • 0026839090 scopus 로고
    • Multivariate stochastic approximation using a simultaneous perturbation gradient approximation
    • Spall JC. Multivariate stochastic approximation using a simultaneous perturbation gradient approximation. IEEE Trans Autom Control 1992, 37:331-341.
    • (1992) IEEE Trans Autom Control , vol.37 , pp. 331-341
    • Spall, J.C.1
  • 42
    • 0034290982 scopus 로고    scopus 로고
    • Adaptive stochastic approximation by the simultaneous perturbation method
    • Spall JC. Adaptive stochastic approximation by the simultaneous perturbation method. IEEE Trans Autom Control 2000, 45:1839-1853.
    • (2000) IEEE Trans Autom Control , vol.45 , pp. 1839-1853
    • Spall, J.C.1
  • 43
    • 0010013212 scopus 로고
    • Asymptotical study of parameter tracking algorithms
    • Delyon B, Juditsky A. Asymptotical study of parameter tracking algorithms. SIAM J Control Optim 1995, 33:323-345.
    • (1995) SIAM J Control Optim , vol.33 , pp. 323-345
    • Delyon, B.1    Juditsky, A.2
  • 44
    • 0024701601 scopus 로고
    • Frequency domain tracking characteristics of adaptive algorithms
    • Gunnarsson S, Ljung L. Frequency domain tracking characteristics of adaptive algorithms. IEEE Trans Acoust Speech Signal Process 1989, 37:1072-1089.
    • (1989) IEEE Trans Acoust Speech Signal Process , vol.37 , pp. 1072-1089
    • Gunnarsson, S.1    Ljung, L.2
  • 45
    • 0028500612 scopus 로고
    • Stability of recursive tracking algorithms
    • Guo L. Stability of recursive tracking algorithms. SIAM J Control Optim 1994, 32:1195-1225.
    • (1994) SIAM J Control Optim , vol.32 , pp. 1195-1225
    • Guo, L.1
  • 46
    • 0029359928 scopus 로고
    • Performance analysis of general tracking algorithms
    • Guo L, Ljung L. Performance analysis of general tracking algorithms. IEEE Trans Autom Control 1995, AC- 40:1388-1402.
    • (1995) IEEE Trans Autom Control , vol.AC40 , pp. 1388-1402
    • Guo, L.1    Ljung, L.2
  • 47
    • 0039026936 scopus 로고
    • Tracking performance analyses of the forgetting factor RLS algorithm
    • Tucson, Arizona, pages, New York. IEEE
    • Guo L, Ljung L, Priouret P. Tracking performance analyses of the forgetting factor RLS algorithm. In Proceedings of the 31st Conference on Decision and Control, Tucson, Arizona, pages 688-693, New York, 1992. IEEE.
    • (1992) Proceedings of the 31st Conference on Decision and Control , pp. 688-693
    • Guo, L.1    Ljung, L.2    Priouret, P.3
  • 53
    • 0029359844 scopus 로고
    • Analysis of adaptive step size SA algorithms for parameter tracking
    • Kushner HJ, Yang J. Analysis of adaptive step size SA algorithms for parameter tracking. IEEE Trans Autom Control 1995, 40:1403-1410.
    • (1995) IEEE Trans Autom Control , vol.40 , pp. 1403-1410
    • Kushner, H.J.1    Yang, J.2
  • 54
    • 28644436822 scopus 로고    scopus 로고
    • Adaptive optimization of least squares tracking algorithms: With applications to adaptive antennas arrays for randomly time-varying mobile communications systems
    • Buche R, Kushner HJ. Adaptive optimization of least squares tracking algorithms: with applications to adaptive antennas arrays for randomly time-varying mobile communications systems. IEEE Trans Autom Control 2005, 50:1749-1760.
    • (2005) IEEE Trans Autom Control , vol.50 , pp. 1749-1760
    • Buche, R.1    Kushner, H.J.2
  • 55
    • 0019031661 scopus 로고
    • Robust identification of a nonminimum phase system: Blind adjustment of a linear equalizer in data communications
    • Benveniste A, GoursatM, Ruget G. Robust identification of a nonminimum phase system: blind adjustment of a linear equalizer in data communications. IEEE Trans Autom Control 1980, AC-25:385-399.
    • (1980) IEEE Trans Autom Control , vol.AC-25 , pp. 385-399
    • Benveniste, A.1    Goursat, M.2    Ruget, G.3
  • 56
    • 0027590682 scopus 로고
    • A stochastic estimation algorithm with observation averaging
    • Juditsky A. A stochastic estimation algorithm with observation averaging. IEEE Trans Autom Control 1993, 38:794-798.
    • (1993) IEEE Trans Autom Control , vol.38 , pp. 794-798
    • Juditsky, A.1
  • 57
    • 0026899240 scopus 로고
    • Acceleration of stochastic approximation by averaging
    • Polyak BT, Juditsky AB. Acceleration of stochastic approximation by averaging. SIAM J Control Optim 1992, 30:838-855.
    • (1992) SIAM J Control Optim , vol.30 , pp. 838-855
    • Polyak, B.T.1    Juditsky, A.B.2
  • 58
    • 0002700406 scopus 로고
    • Asymptotically efficient stochastic approximation
    • Chen H-F. Asymptotically efficient stochastic approximation. Stochastics Stochastic Rep 1993, 45:1-16.
    • (1993) Stochastics Stochastic Rep , vol.45 , pp. 1-16
    • Chen, H.-F.1
  • 59
    • 0001197466 scopus 로고
    • Stochastic optimization with averaging of trajectories
    • Delyon B, Juditsky A. Stochastic optimization with averaging of trajectories. Stochastics Stochastic Rep 1992, 39:107-118.
    • (1992) Stochastics Stochastic Rep , vol.39 , pp. 107-118
    • Delyon, B.1    Juditsky, A.2
  • 60
    • 0027627789 scopus 로고
    • Stochastic approximation with averaging of the iterates: Optimal asymptotic rates of convergence for general processes
    • Kushner HJ, Yang J. Stochastic approximation with averaging of the iterates: optimal asymptotic rates of convergence for general processes. SIAM J Control Optim 1993, 31:1045-1062.
    • (1993) SIAM J Control Optim , vol.31 , pp. 1045-1062
    • Kushner, H.J.1    Yang, J.2
  • 61
    • 33747128208 scopus 로고
    • Stochastic approximation with averaging and feedback: Faster convergence
    • In: Goodwin GC, Aström K, Kumar PR, eds., The IMA Series. Berlin and New York: Springer-Verlag
    • Kushner HJ, Yang J. Stochastic approximation with averaging and feedback: faster convergence. In: Goodwin GC, Aström K, Kumar PR, eds. IMA Volumes in Mathematics and Applications: Adaptive Control, Filtering and Signal Processing, Volume 74, The IMA Series. Berlin and New York: Springer-Verlag; 1995, 205-228.
    • (1995) IMA Volumes in Mathematics and Applications: Adaptive Control, Filtering and Signal Processing , vol.74 , pp. 205-228
    • Kushner, H.J.1    Yang, J.2
  • 62
    • 0029221295 scopus 로고
    • Stochastic approximation with averaging and feedback: Rapidly onvergent "on line" algorithms
    • KushnerHJ, Yang J. Stochastic approximation with averaging and feedback: rapidly onvergent "on line" algorithms.IEEE Trans Autom Control 1995, 40:24-34.
    • (1995) IEEE Trans Autom Control , vol.40 , pp. 24-34
    • Kushner, H.J.1    Yang, J.2
  • 63
    • 78651565692 scopus 로고    scopus 로고
    • Averaging with feedback in Gaussian schemes in stochastic approximation
    • Le Breton A. Averaging with feedback in Gaussian schemes in stochastic approximation. Math Methods Stat 1997, 6:313-331.
    • (1997) Math Methods Stat , vol.6 , pp. 313-331
    • Le Breton, A.1
  • 64
    • 0003254109 scopus 로고
    • On extensions of Polyak's averaging approach to stochastic approximation
    • Yin G. On extensions of Polyak's averaging approach to stochastic approximation. Stochastics Stochastic Rep 1991, 36:245-264.
    • (1991) Stochastics Stochastic Rep , vol.36 , pp. 245-264
    • Yin, G.1
  • 65
    • 4243094684 scopus 로고
    • Stochastic approximation via averaging: Polyak's approach revisited
    • In: Pflug , Dieter U, eds., Berlin and New York: Springer-Verlag
    • Yin G. Stochastic approximation via averaging: Polyak's approach revisited. In: Pflug , Dieter U, eds. Lecture Notes in Economics and Mathematical Systems 374. Berlin and New York: Springer-Verlag; 1992, 119-134.
    • (1992) Lecture Notes in Economics and Mathematical Systems , vol.374 , pp. 119-134
    • Yin, G.1
  • 66
    • 20244382281 scopus 로고
    • Rates of convergence for sequential montecarlo optimization methods
    • Kushner HJ. Rates of convergence for sequential montecarlo optimization methods. SIAM J Control Optim 1978, 16:150-168.
    • (1978) SIAM J Control Optim , vol.16 , pp. 150-168
    • Kushner, H.J.1
  • 67
    • 0018518852 scopus 로고
    • Rates of convergence for stochasticapproximation type algorithms
    • Kushner HJ, Huang H. Rates of convergence for stochasticapproximation type algorithms. SIAM J Control Optim 1979, 17:607-617.
    • (1979) SIAM J Control Optim , vol.17 , pp. 607-617
    • Kushner, H.J.1    Huang, H.2
  • 68
    • 0036334497 scopus 로고    scopus 로고
    • Rate of convergence for constrained stochastic approximation algorithms
    • Buche R, Kushner HJ. Rate of convergence for constrained stochastic approximation algorithms. SIAM J Control Optim 2001, 40:1011-1041.
    • (2001) SIAM J Control Optim , vol.40 , pp. 1011-1041
    • Buche, R.1    Kushner, H.J.2
  • 69
    • 0032657551 scopus 로고    scopus 로고
    • Convergence rate of moments in stochastic approximation with simultaneous perturbation gradient approximation and resetting
    • Gerencsér L. Convergence rate of moments in stochastic approximation with simultaneous perturbation gradient approximation and resetting. IEEE Trans Autom Control 1999, 44:894-905.
    • (1999) IEEE Trans Autom Control , vol.44 , pp. 894-905
    • Gerencsér, L.1
  • 70
    • 0034453883 scopus 로고    scopus 로고
    • Law of the iterated logarithm for constant-gain linear stochastic gradient algorithm
    • Joslin JA, Heunis AJ. Law of the iterated logarithm for constant-gain linear stochastic gradient algorithm. SIAM J Control Optim 2000, 39:533-570.
    • (2000) SIAM J Control Optim , vol.39 , pp. 533-570
    • Joslin, J.A.1    Heunis, A.J.2
  • 71
    • 0031103181 scopus 로고    scopus 로고
    • Strong diffusion approximations for recursive stochastic algorithms
    • Pezeshki-Esfanahani H, Heunis AJ. Strong diffusion approximations for recursive stochastic algorithms. IEEE Trans Inform Theory 1997, 43:312-323.
    • (1997) IEEE Trans Inform Theory , vol.43 , pp. 312-323
    • Pezeshki-Esfanahani, H.1    Heunis, A.J.2
  • 72
    • 0030105021 scopus 로고    scopus 로고
    • Stochastic approximation algorithms for systems over an infinite horizon
    • Kushner HJ, Vázquez-Abad FJ. Stochastic approximation algorithms for systems over an infinite horizon. SIAM J Control Optim 1996, 34:712-756.
    • (1996) SIAM J Control Optim , vol.34 , pp. 712-756
    • Kushner, H.J.1    Vázquez-Abad, F.J.2
  • 73
    • 0022131183 scopus 로고
    • Stochastic approximation via large deviations: Asymptotic properties
    • Dupuis P, Kushner HJ. Stochastic approximation via large deviations: asymptotic properties. SIAM J Control Optim 1985, 23:675-696.
    • (1985) SIAM J Control Optim , vol.23 , pp. 675-696
    • Dupuis, P.1    Kushner, H.J.2
  • 74
    • 0001472112 scopus 로고
    • Asymptotic behavior of constrained stochastic approximations via the theory of large deviations
    • Dupuis P, Kushner HJ. Asymptotic behavior of constrained stochastic approximations via the theory of large deviations. Probab Theory Related Fields 1987, 75:223-244.
    • (1987) Probab Theory Related Fields , vol.75 , pp. 223-244
    • Dupuis, P.1    Kushner, H.J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.