Volume 101, Issue 1, 2010, Pages 29-36

Tug-of-war model for the two-bandit problem: Nonlocally-correlated parallel exploration via resource conservation

Author keywords

Amoeba-based computing; Bio-inspired computing; Multi-armed bandit problem; Reinforcement learning

Indexed keywords

ACCURACY ASSESSMENT; ALGORITHM; DECISION MAKING; EFFICIENCY MEASUREMENT; MICROBIAL ACTIVITY; MODEL; NUMERICAL MODEL; RESOURCE ALLOCATION; THEORETICAL STUDY

EID: 77953609815     PISSN: 03032647     EISSN: None     Source Type: Journal    
DOI: 10.1016/j.biosystems.2010.04.002     Document Type: Article
Times cited: 69

References (21)
  • 1
    • 0000616723
    • Agrawal R. Sample mean based index policies with O(log n) regret for the multi-armed bandit problem. Advances in Applied Probability 1995, 27:1054-1078.
  • 2
    • 34548387441
    • Aono M., Hara M., Aihara K. Amoeba-based neurocomputing with chaotic dynamics. Communications of the ACM 2007, 50(9):69-72.
  • 3
    • 37349011483
    • Aono M., Hara M. Spontaneous deadlock breaking on amoeba-based neurocomputer. BioSystems 2008, 91:83-93.
  • 4
    • 70349568825
    • Aono M., Hirata Y., Hara M., Aihara K. Amoeba-based chaotic neurocomputing: combinatorial optimization by coupled biological oscillators. New Generation Computing 2009, 27:129-157.
  • 6
    • 0036568025
    • Auer P., Cesa-Bianchi N., Fischer P. Finite-time analysis of the multiarmed bandit problem. Machine Learning 2002, 47:235-256.
  • 7
    • 34250348767
    • Cohen J., McClure S., Yu A. Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philosophical Transactions of the Royal Society B 2007, 362(1481):933-942.
  • 8
    • 33745223257
    • Daw N., O'Doherty J., Dayan P., Seymour B., Dolan R. Cortical substrates for exploratory decisions in humans. Nature 2006, 441:876-879.
  • 9
    • 0002955623
    • Gittins J., Jones D. A dynamic allocation index for the sequential design of experiments. In: J. Gani (Ed.), Progress in Statistics. North-Holland, 1974, pp. 241-266.
  • 13
    • 0002899547
    • Lai T., Robbins H. Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 1985, 6:4-22.
  • 14
    • 0343278984
    • Nakagaki T., Yamada H., Toth A. Maze-solving by an amoeboid organism. Nature 2000, 407:470.
  • 19
    • 33645168074
    • Tero A., Kobayashi R., Nakagaki T. Physarum solver: a biologically inspired method of road-network navigation. Physica A 2006, 363:115-119.
  • 20
    • 0001395850
    • Thompson W. On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 1933, 25:285-294.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.