메뉴 건너뛰기




Volumn 26, Issue 3, 2008, Pages 603-623

How fast is the bandit?

Author keywords

Asset allocation; Learning automata; Stochastic approximation; Two armed bandit algorithm

Indexed keywords


EID: 45749127104     PISSN: 07362994     EISSN: 15329356     Source Type: Journal    
DOI: 10.1080/07362990802007202     Document Type: Article
Times cited : (6)

References (20)
  • 1
    • 14544291553 scopus 로고    scopus 로고
    • Asymptotics in randomized urn models
    • Bai, Z.-D., and Hu, F. 2005. Asymptotics in randomized urn models. Annals of Applied Probability, 15:914-940.
    • (2005) Annals of Applied Probability , vol.15 , pp. 914-940
    • Bai, Z.-D.1    Hu, F.2
  • 2
    • 0040967666 scopus 로고
    • Approximation gaussienne d'algorithmes stochastiques à dynamique markovienne.
    • Bouton, C. 1988. Approximation gaussienne d'algorithmes stochastiques à dynamique markovienne. Ann. Inst. Henri Poincaré, Probab. Stat. 24(1):131-155.
    • (1988) Ann. Inst. Henri Poincaré, Probab. Stat , vol.24 , Issue.1 , pp. 131-155
    • Bouton, C.1
  • 4
    • 45749146890 scopus 로고    scopus 로고
    • Duflo, M. 1997. Random Iterative Models, trans. by Stephen S. Wilson; rev. by M. Duflo. Applications of Mathematics. Springer-Verlag, New York, Berlin, 34.
    • Duflo, M. 1997. Random Iterative Models, trans. by Stephen S. Wilson; rev. by M. Duflo. Applications of Mathematics. Springer-Verlag, New York, Berlin, vol. 34.
  • 5
    • 84861255173 scopus 로고    scopus 로고
    • Decreasing step stochastic algorithms: A.s. behavior of weighted empirical measures
    • Fort, J.C., and Pagès, G. 2002. Decreasing step stochastic algorithms: a.s. behavior of weighted empirical measures. Monte Carlo Methods and Applications 8(3):237-270.
    • (2002) Monte Carlo Methods and Applications , vol.8 , Issue.3 , pp. 237-270
    • Fort, J.C.1    Pagès, G.2
  • 8
    • 26844528216 scopus 로고    scopus 로고
    • When can the two-armed bandit algorithm be trusted?
    • Lamberton, D., Pagès, G., and Tarrès, P. 2004. When can the two-armed bandit algorithm be trusted?. Annals of Applied Probability, 14(3):1424-1454.
    • (2004) Annals of Applied Probability , vol.14 , Issue.3 , pp. 1424-1454
    • Lamberton, D.1    Pagès, G.2    Tarrès, P.3
  • 10
    • 0026723360 scopus 로고
    • Convergence of stochastic approximation procedures in the case of a regression equation with several roots, (transl. from)
    • Lazarev, V.A. 1992. Convergence of stochastic approximation procedures in the case of a regression equation with several roots, (transl. from) Problemy Pederachi Informatsii, vol. 28(1).
    • (1992) Problemy Pederachi Informatsii , vol.28 , Issue.1
    • Lazarev, V.A.1
  • 11
    • 0016082525 scopus 로고    scopus 로고
    • Narendra, K.S., and Thathachar, M.A.L. 1974. Learning Automata-A Survey. IEEE Trans. Systems, Man., Cybernetics S.M.C-4, pp. 323-334.
    • Narendra, K.S., and Thathachar, M.A.L. 1974. Learning Automata-A Survey. IEEE Trans. Systems, Man., Cybernetics S.M.C-4, pp. 323-334.
  • 14
    • 24144477156 scopus 로고
    • On linear models with two absorbing barriers
    • Norman, M.F. 1968. On linear models with two absorbing barriers. Journal of Mathematical Psychology, 5:225-241.
    • (1968) Journal of Mathematical Psychology , vol.5 , pp. 225-241
    • Norman, M.F.1
  • 15
    • 0032363716 scopus 로고    scopus 로고
    • Weak convergence rates for stochastic approximation with application to multiple targets and simulated annealing
    • Pelletier, M. 1998. Weak convergence rates for stochastic approximation with application to multiple targets and simulated annealing. Annals of Applied Probability 8(1):10-44.
    • (1998) Annals of Applied Probability , vol.8 , Issue.1 , pp. 10-44
    • Pelletier, M.1
  • 16
    • 0001000786 scopus 로고
    • Nonconvergence to unstable points in urn models and stochastic approximations
    • Pemantle, R. 1990. Nonconvergence to unstable points in urn models and stochastic approximations. Annals of Probability 18(2):698-712.
    • (1990) Annals of Probability , vol.18 , Issue.2 , pp. 698-712
    • Pemantle, R.1
  • 17
    • 0014580386 scopus 로고
    • Use of stochastic automata for parameter self-optimization with multi-modal perfomance criteria
    • Shapiro, I.J., and Narendra, K.S. 1969. Use of stochastic automata for parameter self-optimization with multi-modal perfomance criteria. IEEE Trans. Syst. Sci. Cybern. SSC-5:352-360.
    • (1969) IEEE Trans. Syst. Sci. Cybern , vol.SSC-5 , pp. 352-360
    • Shapiro, I.J.1    Narendra, K.S.2
  • 18
    • 0034650017 scopus 로고    scopus 로고
    • Pièges répulsifs.
    • Série I
    • Tarrès, P. 2000. Pièges répulsifs. C.R.A.S. Acad. Sc. de Paris, Série I 330:125-130.
    • (2000) C.R.A.S. Acad. Sc. de Paris , vol.330 , pp. 125-130
    • Tarrès, P.1
  • 19
    • 45749085567 scopus 로고    scopus 로고
    • Tarrès, P. 2001. Algorithmes Stochastiques et Marches Aléatoires Renforcées. Thèse de l'ENS Cachan (France).
    • Tarrès, P. 2001. Algorithmes Stochastiques et Marches Aléatoires Renforcées. Thèse de l'ENS Cachan (France).
  • 20
    • 45749090392 scopus 로고    scopus 로고
    • Tarrès, P., and Vandekerkhove, P. 2006. On the ergodic two-armed bandit algorithm. University Paris-Est Marne-la-Vallée.
    • Tarrès, P., and Vandekerkhove, P. 2006. On the ergodic two-armed bandit algorithm. University Paris-Est Marne-la-Vallée.


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.