메뉴 건너뛰기




Volumn 13, Issue , 2008, Pages 341-373

A penalized bandit algorithm

Author keywords

Convergence rate; Learning automata; Penalization; Stochastic approximation; Two armed bandit algorithm

Indexed keywords


EID: 41849113048     PISSN: None     EISSN: 10836489     Source Type: Journal    
DOI: 10.1214/EJP.v13-489     Document Type: Article
Times cited : (14)

References (23)
  • 1
    • 14544291553 scopus 로고    scopus 로고
    • ASYMPTOTICS IN RANDOMIZED URN MODELS
    • Z.-D. BAI, F. HU (2005), ASYMPTOTICS IN RANDOMIZED URN MODELS, Annals of Applied Probability, 15, 914-940.
    • (2005) Annals of Applied Probability , vol.15 , pp. 914-940
    • Bai, Z.-D.1    Hu, F.2
  • 2
    • 0001793657 scopus 로고    scopus 로고
    • DYNAMICS OF STOCHASTIC APPROXIMATION ALGORITHMS, Séminaire de Proba-bilités XXXIII
    • J. Azema, M. Emery, M. Ledoux, M. Yor eds, n01709
    • M. BENAIM (1999), DYNAMICS OF STOCHASTIC APPROXIMATION ALGORITHMS, Séminaire de Proba-bilités XXXIII, J. Azema, M. Emery, M. Ledoux, M. Yor eds., Lecture Notes in Mathematics n01709, pp.1-68.
    • (1999) Lecture Notes in Mathematics , pp. 1-68
    • Benaim, M.1
  • 3
    • 0040967666 scopus 로고
    • APPROXIMATION GAUSSIENNE D'ALGORITHMES STOCHASTIQUES A DYNAMIQUE MARKOVIENNE
    • C. BOUTON (1988), APPROXIMATION GAUSSIENNE D'ALGORITHMES STOCHASTIQUES A DYNAMIQUE MARKOVIENNE, Ann. Inst. Henri Poincaré, Probab. Stat., 24(1), 131-155.
    • (1988) Ann. Inst. Henri Poincaré, Probab. Stat. , vol.24 , Issue.1 , pp. 131-155
    • Bouton, C.1
  • 4
    • 0001451318 scopus 로고
    • A SHARPER FORM OF THE BOREL-CANTELLI LEMMA AND THE STRONG LAW
    • L. DUBINS AND D. FREEDMAN (1965), A SHARPER FORM OF THE BOREL-CANTELLI LEMMA AND THE STRONG LAW, Ann. Of Math. Stat., 36, 800-807.
    • (1965) Ann. Of Math. Stat. , vol.36 , pp. 800-807
    • Dubins, L.1    Freedman, D.2
  • 5
    • 0001668150 scopus 로고    scopus 로고
    • TRANSFORM ANALYSIS AND ASSET PRICING FOR AFFINE JUMP-DIFFUSIONS
    • D. DUFFIE, J. PAN, K. SINGLETON (2000), TRANSFORM ANALYSIS AND ASSET PRICING FOR AFFINE JUMP-DIFFUSIONS, Econometrica, 68, 1343-1376.
    • (2000) Econometrica , vol.68 , pp. 1343-1376
    • Duffie, D.1    Pan, J.2    Singleton, K.3
  • 7
    • 85037903205 scopus 로고    scopus 로고
    • ALGORITHMES STOCHASTIQUES, COLL
    • Springer-Verlag, Berlin
    • M. DUFLO (1996), ALGORITHMES STOCHASTIQUES, COLL. Mathématiques & Applications, 23, Springer-Verlag, Berlin, 319p.
    • (1996) Mathématiques & Applications , vol.23 , pp. 319
    • Duflo, M.1
  • 8
    • 29344435643 scopus 로고    scopus 로고
    • LIMIT THEOREMS FOR STOCHASTIC PROCESSES, 2nd edition
    • Springer-Verlag, Berlin
    • J. JACOD, A.N. SHIRYAEV (2003), LIMIT THEOREMS FOR STOCHASTIC PROCESSES, 2nd edition, Fundamental Principles of Mathematical Sciences, 28, Springer-Verlag, Berlin, 661p.
    • (2003) Fundamental Principles of Mathematical Sciences , vol.28 , pp. 661
    • Jacod, J.1    Shiryaev, A.N.2
  • 10
    • 0003452601 scopus 로고
    • STOCHASTIC APPROXIMATION FOR CONSTRAINED AND UNCONSTRAINED SYSTEMS
    • Springer-Verlag, New York
    • H.J. KUSHNER, D.S. CLARK (1978), STOCHASTIC APPROXIMATION FOR CONSTRAINED AND UNCONSTRAINED SYSTEMS, Applied Math. Science Series, 26, Springer-Verlag, New York.
    • (1978) Applied Math. Science Series , pp. 26
    • Kushner, H.J.1    Clark, D.S.2
  • 12
    • 84939377645 scopus 로고
    • UNIV. OKLAHOMA, SCHOOL OF ELECTRICAL ENGINEERING AND COMPUTING SCIENCE, TECHN. REPORT EECS
    • S. LAKSHMIVARAHAN (1979), E-OPTIMAL LEARNING ALGORITHMS-NON-ABSORBING BARRIER TYPE, UNIV. OKLAHOMA, SCHOOL OF ELECTRICAL ENGINEERING AND COMPUTING SCIENCE, TECHN. REPORT EECS 7901.
    • (1979) E-OPTIMAL LEARNING ALGORITHMS-NON-ABSORBING BARRIER TYPE , pp. 7901
    • Lakshmivarahan, S.1
  • 15
    • 26844528216 scopus 로고    scopus 로고
    • WHEN CAN THE TWO-ARMED BANDIT ALGORITHM BE TRUSTED?
    • D. LAMBERTON, G. PAGES, P. TARRES (2004), WHEN CAN THE TWO-ARMED BANDIT ALGORITHM BE TRUSTED?, Annals of Applied Probability, 14(3), 1424-1454.
    • (2004) Annals of Applied Probability , vol.14 , Issue.3 , pp. 1424-1454
    • Lamberton, D.1    Pages, G.2    Tarres, P.3
  • 19
    • 24144477156 scopus 로고
    • On linear Models with Two Absorbing Barriers
    • M.F. NORMAN (1968), On linear Models with Two Absorbing Barriers, J. Of Mathematical Psychlogy, 5, 225-241.
    • (1968) J. Of Mathematical Psychlogy , vol.5 , pp. 225-241
    • Norman, M.F.1
  • 20
    • 0001000786 scopus 로고
    • Non-convergence to unstable points in urn models and stochastic approximations
    • n0 2
    • R. PEMANTLE (1990), Non-convergence to unstable points in urn models and stochastic approximations, Annals of Probability, 18, n0 2, 698-712.
    • (1990) Annals of Probability , vol.18 , pp. 698-712
    • Pemantle, R.1
  • 21
    • 0014580386 scopus 로고
    • Use of Stochastic Automata for Parameter SelfOptimization with Multi-Modal Perfomance Criteria
    • I.J. SHAPIRO, K.S. NARENDRA (1969), Use of Stochastic Automata for Parameter SelfOptimization with Multi-Modal Perfomance Criteria, IEEE Trans. Syst. Sci. And Cybern., SSC-5, 352-360.
    • (1969) IEEE Trans. Syst. Sci. And Cybern. , vol.SSC-5 , pp. 352-360
    • Shapiro, I.J.1    Narendra, K.S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.