메뉴 건너뛰기




Volumn 5, Issue , 2003, Pages 4426-4431

Asymptotic Properties of Two Time-Scale Stochastic Approximation Algorithms with Constant Step Sizes

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; APPROXIMATION THEORY; ASYMPTOTIC STABILITY; MARKOV PROCESSES; PROBLEM SOLVING; SPURIOUS SIGNAL NOISE;

EID: 0142231039     PISSN: 07431619     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (14)

References (15)
  • 1
  • 4
    • 0031076413 scopus 로고    scopus 로고
    • Stochastic approximation with two time scales
    • V. S. Borkar, "Stochastic approximation with two time scales," Systems and Control Letters, vol. 29, pp. 291-294, 1997.
    • (1997) Systems and Control Letters , vol.29 , pp. 291-294
    • Borkar, V.S.1
  • 5
    • 0003976087 scopus 로고    scopus 로고
    • Optimal multilevel feedback policies for ABR flow control using two timescale SPSA
    • Institute of systems Research, University of Maryland
    • S. Bhatnagar, M. C. Fu, and S. I. Marcus, "Optimal multilevel feedback policies for ABR flow control using two timescale SPSA," Technical Report TR 99-18, Institute of systems Research, University of Maryland, 1999.
    • (1999) Technical Report TR 99-18
    • Bhatnagar, S.1    Fu, M.C.2    Marcus, S.I.3
  • 6
    • 0005358214 scopus 로고    scopus 로고
    • Randomized difference two-timescale simultaneous perturbation stochastic approximation algorithms for simulation optimization of hidden Markov models
    • Institute of systems Research, University of Maryland
    • S. Bhatnagar, M. C. Fu, S. I. Marcus, and S. Bhatnagar, "Randomized difference two-timescale simultaneous perturbation stochastic approximation algorithms for simulation optimization of hidden Markov models," Technical Report TR 2000-13, Institute of systems Research, University of Maryland, 2000.
    • (2000) Technical Report TR 2000-13
    • Bhatnagar, S.1    Fu, M.C.2    Marcus, S.I.3    Bhatnagar, S.4
  • 8
    • 0343893613 scopus 로고    scopus 로고
    • Actor-critic like learning algorithms for Markov decision processes
    • V. R. Konda and V. S. Borkar, "Actor-critic like learning algorithms for Markov decision processes," SIAM Journal on control and Optimization, vol. 38, pp. 94-123, 1999.
    • (1999) SIAM Journal on Control and Optimization , vol.38 , pp. 94-123
    • Konda, V.R.1    Borkar, V.S.2
  • 9
    • 0042758707 scopus 로고    scopus 로고
    • PhD Thesis, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology
    • V. R. Konda, Actor-Critic Algorithms, PhD Thesis, Department of Electrical Engineering and Computer Science, Massachusetts Institute of Technology, 2002.
    • (2002) Actor-Critic Algorithms
    • Konda, V.R.1
  • 14
    • 0142162920 scopus 로고    scopus 로고
    • Stochastic Approximation with Random Truncations, State Dependent Noise and Discontinuous Dynamics
    • V. B. Tadić, "Stochastic Approximation with Random Truncations, State Dependent Noise and Discontinuous Dynamics," Stochastics and Stochastics Reports, vol. 64, 283-325, 1998.
    • (1998) Stochastics and Stochastics Reports , vol.64 , pp. 283-325
    • Tadić, V.B.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.