Volume 6079 LNCS, 2010, Pages 69-80

Tug-of-war model for multi-armed bandit problem

Author keywords

Amoeba-based computing; Bio-inspired computation; Multi-armed bandit problem; Reinforcement learning

Indexed keywords

ACCURACY RATE; AMOEBA-BASED COMPUTING; BIO-INSPIRED COMPUTATION; CONSERVATION LAW; ENVIRONMENTAL INFORMATION; GREEDY ALGORITHMS; MULTI-ARMED BANDIT PROBLEM; NONLOCAL CORRELATIONS; OPTIMAL STRATEGIES; PARALLEL SEARCH; PHYSARUM; SEARCH AGENTS; SLIME MOLD; VOLUME INCREMENT

EID: 79956331440     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-13523-1_10     Document Type: Conference Paper
Times cited: 18

References (21)
  • 1
    • 0343278984
    • Maze-solving by an amoeboid organism
    • Nakagaki, T., Yamada, H., Toth, A.: Maze-solving by an amoeboid organism. Nature 407, 470 (2000).
    • (2000) Nature , vol.407 , pp. 470
    • Nakagaki, T.1    Yamada, H.2    Toth, A.3
  • 2
    • 33645168074
    • Physarum solver: A biologically inspired method of road-network navigation
    • Tero, A., Kobayashi, R., Nakagaki, T.: Physarum solver: A biologically inspired method of road-network navigation. Physica A 363, 115-119 (2006).
    • (2006) Physica A , vol.363 , pp. 115-119
    • Tero, A.1    Kobayashi, R.2    Nakagaki, T.3
  • 5
    • 34548387441
    • Amoeba-based neurocomputing with chaotic dynamics
    • DOI 10.1145/1284621.1284651
    • Aono, M., Hara, M., Aihara, K.: Amoeba-based neurocomputing with chaotic dynamics. Communications of the ACM 50(9), 69-72 (2007). (Pubitemid 47366815)
    • (2007) Communications of the ACM , vol.50 , Issue.9 , pp. 69-72
    • Aono, M.1    Hara, M.2    Aihara, K.3
  • 6
    • 37349011483
    • Spontaneous deadlock breaking on amoeba-based neurocomputer
    • DOI 10.1016/j.biosystems.2007.08.004, PII S0303264707001244
    • Aono, M., Hara, M.: Spontaneous deadlock breaking on amoeba-based neurocomputer. BioSystems 91, 83-93 (2008). (Pubitemid 350297567)
    • (2008) BioSystems , vol.91 , Issue.1 , pp. 83-93
    • Aono, M.1    Hara, M.2
  • 7
    • 70349568825
    • Amoeba-based chaotic neurocomputing: Combinatorial optimization by coupled biological oscillators
    • Aono, M., Hirata, Y., Hara, M., Aihara, K.: Amoeba-based chaotic neurocomputing: Combinatorial optimization by coupled biological oscillators. New Generation Computing 27, 129-157 (2009).
    • (2009) New Generation Computing , vol.27 , pp. 129-157
    • Aono, M.1    Hirata, Y.2    Hara, M.3    Aihara, K.4
  • 8
    • 70350516626
    • Resource-competing oscillator network as a model of amoeba-based neurocomputer
    • Calude, C.S., Costa, J.F., Dershowitz, N., Freire, E., Rozenberg, G. (eds.), LNCS, Springer, Heidelberg
    • Aono, M., Hirata, Y., Hara, M., Aihara, K.: Resource-competing oscillator network as a model of amoeba-based neurocomputer. In: Calude, C.S., Costa, J.F., Dershowitz, N., Freire, E., Rozenberg, G. (eds.) UC 2009. LNCS, vol. 5715, pp. 56-69. Springer, Heidelberg (2009).
    • (2009) UC 2009 , vol.5715 , pp. 56-69
    • Aono, M.1    Hirata, Y.2    Hara, M.3    Aihara, K.4
  • 9
    • 77953609691
    • Tug-of-war model for two-bandit problem
    • Calude, C.S., Costa, J.F., Dershowitz, N., Freire, E., Rozenberg, G. (eds.), LNCS, Springer, Heidelberg
    • Kim, S.-J., Aono, M., Hara, M.: Tug-of-war model for two-bandit problem. In: Calude, C.S., Costa, J.F., Dershowitz, N., Freire, E., Rozenberg, G. (eds.) UC 2009. LNCS, vol. 5715, p. 289. Springer, Heidelberg (2009).
    • (2009) UC 2009 , vol.5715 , p. 289
    • Kim, S.-J.1    Aono, M.2    Hara, M.3
  • 10
    • 77953609815
    • Tug-of-war model for the two-bandit problem: Nonlocally-correlated parallel exploration via resource conservation
    • to appear
    • Kim, S.-J., Aono, M., Hara, M.: Tug-of-war model for the two-bandit problem: Nonlocally-correlated parallel exploration via resource conservation. BioSystems (to appear).
    • BioSystems
    • Kim, S.-J.1    Aono, M.2    Hara, M.3
  • 11
    • 84966203785
    • Some aspects of the sequential design of experiments
    • Robbins, H.: Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc. 58, 527-536 (1952).
    • (1952) Bull. Amer. Math. Soc. , vol.58 , pp. 527-536
    • Robbins, H.1
  • 12
    • 0001395850
    • On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
    • Thompson, W.: On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25, 285-294 (1933).
    • (1933) Biometrika , vol.25 , pp. 285-294
    • Thompson, W.1
  • 13
    • 0002955623
    • A dynamic allocation index for the sequential design of experiments
    • Gans, J. (ed.), North Holland, Amsterdam
    • Gittins, J., Jones, D.: A dynamic allocation index for the sequential design of experiments. In: Gans, J. (ed.) Progress in Statistics, pp. 241-266. North Holland, Amsterdam (1974).
    • (1974) Progress in Statistics , pp. 241-266
    • Gittins, J.1    Jones, D.2
  • 14
    • 0000169010
    • Bandit processes and dynamic allocation indices
    • Gittins, J.: Bandit processes and dynamic allocation indices. J. R. Stat. Soc. B 41, 148-177 (1979).
    • (1979) J. R. Stat. Soc. B , vol.41 , pp. 148-177
    • Gittins, J.1
  • 15
    • 0002899547
    • Asymptotically efficient adaptive allocation rules
    • Lai, T., Robbins, H.: Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6, 4-22 (1985).
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.1    Robbins, H.2
  • 16
    • 0000616723
    • Sample mean based index policies with O(log n) regret for the multiarmed bandit problem
    • Agrawal, R.: Sample mean based index policies with O(log n) regret for the multiarmed bandit problem. Adv. Appl. Prob. 27, 1054-1078 (1995).
    • (1995) Adv. Appl. Prob. , vol.27 , pp. 1054-1078
    • Agrawal, R.1
  • 17
    • 0036568025
    • Finite-time analysis of the multiarmed bandit problem
    • DOI 10.1023/A:1013689704352, Computational Learning Theory
    • Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning 47, 235-256 (2002). (Pubitemid 34126111)
    • (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 18
    • 33646406807
    • Multi-armed bandit algorithms and empirical evaluation
    • Gama, J., Camacho, R., Brazdil, P.B., Jorge, A.M., Torgo, L., et al. (eds.), LNCS (LNAI), Springer, Heidelberg
    • Vermorel, J., Mohri, M.: Multi-armed bandit algorithms and empirical evaluation. In: Gama, J., Camacho, R., Brazdil, P.B., Jorge, A.M., Torgo, L., et al. (eds.) ECML 2005. LNCS (LNAI), vol. 3720, pp. 437-448. Springer, Heidelberg (2005).
    • (2005) ECML 2005 , vol.3720 , pp. 437-448
    • Vermorel, J.1    Mohri, M.2
  • 20
    • 33745223257
    • Cortical substrates for exploratory decisions in humans
    • Daw, N., O'Doherty, J., Dayan, P., Seymour, B., Dolan, R.: Cortical substrates for exploratory decisions in humans. Nature 441, 876-879 (2006).
    • (2006) Nature , vol.441 , pp. 876-879
    • Daw, N.1    O'Doherty, J.2    Dayan, P.3    Seymour, B.4    Dolan, R.5
  • 21


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.