메뉴 건너뛰기




Volumn 52, Issue 1, 2014, Pages 1-31

Convergence of the relative value iteration for the ergodic control problem of nondegenerate diffusions under near-monotone costs

Author keywords

Controlled diffusions; Ergodic control; Hamilton jacobi bellman equation; Parabolic cauchy problem; Relative value iteration

Indexed keywords

CAUCHY PROBLEMS; CONTROLLED DIFFUSION; ERGODIC CONTROL; HAMILTON JACOBI BELLMAN EQUATION; RELATIVE VALUE ITERATIONS;

EID: 84897899776     PISSN: 03630129     EISSN: None     Source Type: Journal    
DOI: 10.1137/130912918     Document Type: Article
Times cited : (13)

References (23)
  • 1
    • 0036287773 scopus 로고    scopus 로고
    • Learning algorithms for Markov decision processes with average cost
    • J. ABOUNADI, D. P. BERTSEKAS, AND V. S. BORKAR, Learning algorithms for Markov decision processes with average cost, SIAM J. Control Optim., 40(2001), pp. 681-698.
    • (2001) SIAM J. Control Optim. , vol.40 , pp. 681-698
    • Abounadi, J.1    Bertsekas, D.P.2    Borkar, V.S.3
  • 2
    • 84897861590 scopus 로고    scopus 로고
    • On the policy iteration algorithm for nondegenerate controlled diffusions under the ergodic criterion
    • D. Hernández-Hernández and J. A. Minjárez-Sosa, eds., Systems Control Found. Appl., Birkhäuser Boston, Boston, MA
    • A. ARAPOSTATHIS, On the policy iteration algorithm for nondegenerate controlled diffusions under the ergodic criterion, in Optimization, Control, and Applications of Stochastic Systems, D. Hernández-Hernández and J. A. Minjárez-Sosa, eds., Systems Control Found. Appl., Birkhäuser Boston, Boston, MA, 2012, pp. 1-12.
    • (2012) Optimization, Control, and Applications of Stochastic Systems , pp. 1-12
    • Arapostathis, A.1
  • 3
    • 84866419009 scopus 로고    scopus 로고
    • A relative value iteration algorithm for nondegenerate controlled diffusions
    • A. ARAPOSTATHIS AND V. S. BORKAR, A relative value iteration algorithm for nondegenerate controlled diffusions, SIAM J. Control Optim., 50(2012), pp. 1886-1902.
    • (2012) SIAM J. Control Optim. , vol.50 , pp. 1886-1902
    • Arapostathis, A.1    Borkar, V.S.2
  • 4
    • 84902333764 scopus 로고    scopus 로고
    • Ergodic control of diffusion processes
    • Cambridge University Press, Cambridge, UK
    • A. ARAPOSTATHIS, V. S. BORKAR, AND M. K. GHOSH, Ergodic Control of Diffusion Processes, Encyclopedia Math. Appl. 143, Cambridge University Press, Cambridge, UK, 2011.
    • (2011) Encyclopedia Math. Appl. , pp. 143
    • Arapostathis, A.1    Borkar, V.S.2    Ghosh, M.K.3
  • 5
    • 85044324976 scopus 로고    scopus 로고
    • Relative value iteration for stochastic differential games
    • V. Krivan and G. Zaccour, eds., Ann. Internat. Soc. Dynam. Games 13, Birkhäuser Boston, Boston, MA
    • A. ARAPOSTATHIS, V. S. BORKAR, AND K. S. KUMAR, Relative value iteration for stochastic differential games, in Advances in Dynamical Games: Theory, Applications and Numerical Methods, V. Krivan and G. Zaccour, eds., Ann. Internat. Soc. Dynam. Games 13, Birkhäuser Boston, Boston, MA, 2013.
    • (2013) Advances in Dynamical Games: Theory, Applications and Numerical Methods
    • Arapostathis, A.1    Borkar, V.S.2    Kumar, K.S.3
  • 6
    • 3242720761 scopus 로고    scopus 로고
    • On regularity of transition probabilities and invariant measures of singular diffusions under minimal conditions
    • V. I. BOGACHEV, N. V. KRYLOV, AND M. RÖCKNER, On regularity of transition probabilities and invariant measures of singular diffusions under minimal conditions, Comm. Partial Differential Equations, 26(2001), pp. 2037-2080.
    • (2001) Comm. Partial Differential Equations , vol.26 , pp. 2037-2080
    • Bogachev, V.I.1    Krylov, N.V.2    Röckner, M.3
  • 7
    • 0033245832 scopus 로고    scopus 로고
    • Value iteration and optimization of multiclass queueing networks
    • R.-R. CHEN AND S. MEYN, Value iteration and optimization of multiclass queueing networks, Queueing Systems Theory Appl., 32(1999), pp. 65-97.
    • (1999) Queueing Systems Theory Appl. , vol.32 , pp. 65-97
    • Chen, R.-R.1    Meyn, S.2
  • 8
    • 0042971774 scopus 로고    scopus 로고
    • Uniqueness of solutions of the Cauchy problem for parabolic equations degenerating at infinity
    • S. EIDELMAN, S. KAMIN, AND F. PORPER, Uniqueness of solutions of the Cauchy problem for parabolic equations degenerating at infinity, Asymptot. Anal., 22(2000), pp. 349-358.
    • (2000) Asymptot. Anal. , vol.22 , pp. 349-358
    • Eidelman, S.1    Kamin, S.2    Porper, F.3
  • 9
    • 0039140304 scopus 로고
    • Harnack inequalities for solutions of general second order parabolic equations and estimates of their Hölder constants
    • M. GRUBER, Harnack inequalities for solutions of general second order parabolic equations and estimates of their Hölder constants, Math. Z., 185(1984), pp. 23-43.
    • (1984) Math. Z. , vol.185 , pp. 23-43
    • Gruber, M.1
  • 10
    • 0040361064 scopus 로고    scopus 로고
    • Existence of strong solutions for Ito's stochastic equations via approximations
    • I. GYÖNGY AND N. KRYLOV, Existence of strong solutions for Ito's stochastic equations via approximations, Probab. Theory Related Fields, 105(1996), pp. 143-158.
    • (1996) Probab. Theory Related Fields , vol.105 , pp. 143-158
    • Gyöngy, I.1    Krylov, N.2
  • 11
    • 0000073325 scopus 로고
    • Ergodic properties of recurrent diffusion processes and stabilization of the solution of the Cauchy problem for parabolic equations
    • R. Z. HASMINSKIǏ, Ergodic properties of recurrent diffusion processes and stabilization of the solution of the Cauchy problem for parabolic equations, Theory Probab. Appl., 5(1960), pp. 179-196.
    • (1960) Theory Probab. Appl. , vol.5 , pp. 179-196
    • Hasminskiǐ, R.Z.1
  • 12
    • 84857980321 scopus 로고    scopus 로고
    • Large time asymptotic problems for optimal stochastic control with superlinear cost
    • N. ICHIHARA, Large time asymptotic problems for optimal stochastic control with superlinear cost, Stochastic Process. Appl., 122(2012), pp. 1248-1275.
    • (2012) Stochastic Process. Appl. , vol.122 , pp. 1248-1275
    • Ichihara, N.1
  • 13
    • 84876099623 scopus 로고    scopus 로고
    • Large time behavior of solutions of Hamilton-Jacobi-Bellman equations with quadratic nonlinearity in gradients
    • N. ICHIHARA AND S.-J. SHEU, Large time behavior of solutions of Hamilton-Jacobi-Bellman equations with quadratic nonlinearity in gradients, SIAM J. Math. Anal., 45(2013), pp. 279-306.
    • (2013) SIAM J. Math. Anal. , vol.45 , pp. 279-306
    • Ichihara, N.1    Sheu, S.-J.2
  • 14
    • 0007282196 scopus 로고
    • Controlled diffusion processes
    • Springer-Verlag, New York
    • N. V. KRYLOV, Controlled Diffusion Processes, Appl. Math. 14, Springer-Verlag, New York, 1980.
    • (1980) Appl. Math. , pp. 14
    • Krylov, N.V.1
  • 15
    • 77952293458 scopus 로고    scopus 로고
    • Lectures on elliptic and parabolic equations in sobolev spaces
    • American Mathematical Society, Providence, RI
    • N. V. KRYLOV, Lectures on Elliptic and Parabolic Equations in Sobolev Spaces, Grad. Stud. Math. 96, American Mathematical Society, Providence, RI, 2008.
    • (2008) Grad. Stud. Math. , pp. 96
    • Krylov, N.V.1
  • 16
    • 0001156676 scopus 로고
    • Linear and quasi-linear equations of parabolic type
    • American Mathematical Society, Providence, RI
    • O. A. LADYŽENSKAJA, V. A. SOLONNIKOV, AND N. N. URAL'CEVA, Linear and quasi-linear equations of parabolic type, Transl. Math. Monogr. 23, American Mathematical Society, Providence, RI, 1967.
    • (1967) Transl. Math. Monogr. , pp. 23
    • Ladyženskaja, O.A.1    Solonnikov, V.A.2    Ural'Ceva, N.N.3
  • 17
    • 79953289767 scopus 로고    scopus 로고
    • Polynomial bounds in the ergodic theorem for one-dimensional diffusions and integrability of hitting times
    • E. LÖCHERBACH, D. LOUKIANOVA, AND O. LOUKIANOV, Polynomial bounds in the ergodic theorem for one-dimensional diffusions and integrability of hitting times, Ann. Inst. H. Poincare Probab. Statist., 47(2011), pp. 425-449.
    • (2011) Ann. Inst. H. Poincare Probab. Statist. , vol.47 , pp. 425-449
    • Löcherbach, E.1    Loukianova, D.2    Loukianov, O.3
  • 19
    • 0031344030 scopus 로고    scopus 로고
    • The policy iteration algorithm for average reward Markov decision processes with general state space
    • S. P. MEYN, The policy iteration algorithm for average reward Markov decision processes with general state space, IEEE Trans. Automat. Control, 42(1997), pp. 1663-1680.
    • (1997) IEEE Trans. Automat. Control , vol.42 , pp. 1663-1680
    • Meyn, S.P.1
  • 20
    • 0001340188 scopus 로고
    • Stability of Markovian processes. III. Foster-Lyapunov criteria for continuous-time processes
    • S. P. MEYN AND R. L. TWEEDIE, Stability of Markovian processes. III. Foster-Lyapunov criteria for continuous-time processes, Adv. in Appl. Probab., 25(1993), pp. 518-548.
    • (1993) Adv. in Appl. Probab. , vol.25 , pp. 518-548
    • Meyn, S.P.1    Tweedie, R.L.2
  • 21
    • 84863939018 scopus 로고    scopus 로고
    • Downside risk minimization via a large deviations approach
    • H. NAGAI, Downside risk minimization via a large deviations approach, Ann. Appl. Probab., 22(2012), pp. 608-669.
    • (2012) Ann. Appl. Probab. , vol.22 , pp. 608-669
    • Nagai, H.1
  • 22
    • 85129541797 scopus 로고    scopus 로고
    • 1: Existence, uniqueness and associated Markov processes
    • 1: Existence, uniqueness and associated Markov processes, Ann. Scuola Norm. Sup. Pisa Cl. Sci. (4), 28(1999), pp. 99-140.
    • (1999) Ann. Scuola Norm. Sup. Pisa Cl. Sci. , vol.28 , Issue.4 , pp. 99-140
    • Stannat, W.1
  • 23
    • 0003122592 scopus 로고
    • Dynamic programming, Markov chains, and the method of successive approximations
    • D. J. WHITE, Dynamic programming, Markov chains, and the method of successive approximations, J. Math. Anal. Appl., 6(1963), pp. 373-376.
    • (1963) J. Math. Anal. Appl. , vol.6 , pp. 373-376
    • White, D.J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.