SCOPUS 정보 검색 플랫폼

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volumn 6079 LNCS, Issue , 2010, Pages 69-80

Tug-of-war model for multi-armed bandit problem

(3) Kim, Song Ju a,b Aono, Masashi a,b Hara, Masahiko a,b

a Flucto Order Functions Research Team (South Korea)

b Hanyang University (South Korea)

Author keywords

Amoeba based computing; Bio inspired computation; Multi armed bandit problem; Reinforcement learning

Indexed keywords

ACCURACY RATE; AMOEBA-BASED COMPUTING; BIO-INSPIRED COMPUTATION; CONSERVATION LAW; ENVIRONMENTAL INFORMATION; GREEDY ALGORITHMS; MULTI-ARMED BANDIT PROBLEM; NONLOCAL CORRELATIONS; OPTIMAL STRATEGIES; PARALLEL SEARCH; PHYSARUM; SEARCH AGENTS; SLIME MOLD; VOLUME INCREMENT;

ALGORITHMS; DECISION MAKING;

STATISTICS;

EID: 79956331440 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-13523-1_10 Document Type: Conference Paper

Times cited : (18)

References (21)

1
- 0343278984
- Maze-solving by an amoeboid organism
- Nakagaki, T., Yamada, H., Toth, A.: Maze-solving by an amoeboid organism. Nature 407, 470 (2000).
- (2000) Nature , vol.407 , pp. 470
- Nakagaki, T.¹ Yamada, H.² Toth, A.³

2
- 33645168074
- Physarum solver: A biologically inspired method of road-network navigation
- Tero, A., Kobayashi, R., Nakagaki, T.: Physarum solver: A biologically inspired method of road-network navigation. Physica A 363, 115-119 (2006).
- (2006) Physica A , vol.363 , pp. 115-119
- Tero, A.¹ Kobayashi, R.² Nakagaki, T.³

3
- 34547862006
- Minimum-risk path finding by an adaptive amoebal network
- Nakagaki, T., Iima, M., Ueda, T., Nishiura, Y., Saigusa, T., Tero, A., Kobayashi, R., Showalter, K.: Minimum-risk path finding by an adaptive amoebal network. Phys. Rev. Lett. 99, 068104 (2007).
- (2007) Phys. Rev. Lett. , vol.99 , pp. 068104
- Nakagaki, T.¹ Iima, M.² Ueda, T.³ Nishiura, Y.⁴ Saigusa, T.⁵ Tero, A.⁶ Kobayashi, R.⁷ Showalter, K.⁸

4
- 40749112315
- Amoebae anticipate periodic events
- Saigusa, T., Tero, A., Nakagaki, T., Kuramoto, Y.: Amoebae anticipate periodic events. Phys. Rev. Lett. 100, 018101 (2008).
- (2008) Phys. Rev. Lett. , vol.100 , pp. 018101
- Saigusa, T.¹ Tero, A.² Nakagaki, T.³ Kuramoto, Y.⁴

5
- 34548387441
- Amoeba-based neurocomputing with chaotic dynamics
- DOI 10.1145/1284621.1284651
- Aono, M., Hara, M., Aihara, K.: Amoeba-based neurocomputing with chaotic dynamics. Communications of the ACM 50(9), 69-72 (2007). (Pubitemid 47366815)
- (2007) Communications of the ACM , vol.50 , Issue.9 , pp. 69-72
- Aono, M.¹ Hara, M.² Aihara, K.³

6
- 37349011483
- Spontaneous deadlock breaking on amoeba-based neurocomputer
- DOI 10.1016/j.biosystems.2007.08.004, PII S0303264707001244
- Aono, M., Hara, M.: Spontaneous deadlock breaking on amoeba-based neurocomputer. BioSystems 91, 83-93 (2008). (Pubitemid 350297567)
- (2008) BioSystems , vol.91 , Issue.1 , pp. 83-93
- Aono, M.¹ Hara, M.²

7
- 70349568825
- Amoeba-based chaotic neurocomputing: Combinatorial optimization by coupled biological oscillators
- Aono, M., Hirata, Y., Hara, M., Aihara, K.: Amoeba-based chaotic neurocomputing: Combinatorial optimization by coupled biological oscillators. New Generation Computing 27, 129-157 (2009).
- (2009) New Generation Computing , vol.27 , pp. 129-157
- Aono, M.¹ Hirata, Y.² Hara, M.³ Aihara, K.⁴

8
- 70350516626
- Resource-competing oscillator network as a model of amoeba-based neurocomputer
- Calude, C.S. Costa, J.F. Dershowitz, N. Freire, E. Rozenberg, G. (eds.), LNCS, Springer, Heidelberg
- Aono, M., Hirata, Y., Hara, M., Aihara, K.: Resource-competing oscillator network as a model of amoeba-based neurocomputer. In: Calude, C.S., Costa, J.F., Dershowitz, N., Freire, E., Rozenberg, G. (eds.) UC 2009. LNCS, vol. 5715, pp. 56-69. Springer, Heidelberg (2009).
- (2009) UC 2009 , vol.5715 , pp. 56-69
- Aono, M.¹ Hirata, Y.² Hara, M.³ Aihara, K.⁴

9
- 77953609691
- Tug-of-war model for two-bandit problem
- Calude, C.S. Costa, J.F. Dershowitz, N. Freire, E. Rozenberg, G. (eds.), LNCS, Springer, Heidelberg
- Kim, S.-J., Aono, M., Hara, M.: Tug-of-war model for two-bandit problem. In: Calude, C.S., Costa, J.F., Dershowitz, N., Freire, E., Rozenberg, G. (eds.) UC 2009. LNCS, vol. 5715, p. 289. Springer, Heidelberg (2009).
- (2009) UC 2009 , vol.5715 , pp. 289
- Kim, S.-J.¹ Aono, M.² Hara, M.³

10
- 77953609815
- Tug-of-war model for the two-bandit problem: Nonlocally-correlated parallel exploration via resource conservation
- to appear
- Kim, S.-J., Aono, M., Hara, M.: Tug-of-war model for the two-bandit problem: Nonlocally-correlated parallel exploration via resource conservation. BioSystems (to appear).
- BioSystems
- Kim, S.-J.¹ Aono, M.² Hara, M.³

11
- 84966203785
- Some aspects of the sequential design of experiments
- Robbins, H.: Some aspects of the sequential design of experiments. Bull. Amer. Math. Soc. 58, 527-536 (1952).
- (1952) Bull. Amer. Math. Soc. , vol.58 , pp. 527-536
- Robbins, H.¹

12
- 0001395850
- On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
- Thompson, W.: On the likelihood that one unknown probability exceeds another in view of the evidence of two samples. Biometrika 25, 285-294 (1933).
- (1933) Biometrika , vol.25 , pp. 285-294
- Thompson, W.¹

13
- 0002955623
- A dynamic allocation index for the sequential design of experiments
- Gans, J. (ed.), North Holland, Amsterdam
- Gittins, J., Jones, D.: A dynamic allocation index for the sequential design of experiments. In: Gans, J. (ed.) Progress in Statistics, pp. 241-266. North Holland, Amsterdam (1974).
- (1974) Progress in Statistics , pp. 241-266
- Gittins, J.¹ Jones, D.²

14
- 0000169010
- Bandit processes and dynamic allocation indices
- Gittins, J.: Bandit processes and dynamic allocation indices. J. R. Stat. Soc. B 41, 148-177 (1979).
- (1979) J. R. Stat. Soc. B , vol.41 , pp. 148-177
- Gittins, J.¹

15
- 0002899547
- Asymptotically efficient adaptive allocation rules
- Lai, T., Robbins, H.: Asymptotically efficient adaptive allocation rules. Advances in Applied Mathematics 6, 4-22 (1985).
- (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
- Lai, T.¹ Robbins, H.²

16
- 0000616723
- Sample mean based index policies with O(log n) regret for the multiarmed bandit problem
- Agrawal, R.: Sample mean based index policies with O(log n) regret for the multiarmed bandit problem. Adv. Appl. Prob. 27, 1054-1078 (1995).
- (1995) Adv. Appl. Prob. , vol.27 , pp. 1054-1078
- Agrawal, R.¹

17
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- DOI 10.1023/A:1013689704352, Computational Learning Theory
- Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Machine Learning 47, 235-256 (2002). (Pubitemid 34126111)
- (2002) Machine Learning , vol.47 , Issue.2-3 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

18
- 33646406807
- Multi-armed bandit algorithms and empirical evaluation
- Gama, J. Camacho, R. Brazdil, P.B. Jorge, A.M. Torgo, L. et al. (eds.), LNCS (LNAI), Springer, Heidelberg
- Vermorel, J., Mohri, M.: Multi-armed bandit algorithms and empirical evaluation. In: Gama, J., Camacho, R., Brazdil, P.B., Jorge, A.M., Torgo, L., et al. (eds.) ECML 2005. LNCS (LNAI), vol. 3720, pp. 437-448. Springer, Heidelberg (2005).
- (2005) ECML 2005 , vol.3720 , pp. 437-448
- Vermorel, J.¹ Mohri, M.²

19
- 0004102479
- MIT Press, Cambridge
- Sutton, R., Barto, A.: Reinforcement learning: An introduction. MIT Press, Cambridge (1998).
- (1998) Reinforcement Learning: an Introduction
- Sutton, R.¹ Barto, A.²

20
- 33745223257
- Cortical substrates for exploratory decisions in humans
- Daw, N., O'Doherty, J., Dayan, P., Seymour, B., Dolan, R.: Cortical substrates for exploratory decisions in humans. Nature 441, 876-879 (2006).
- (2006) Nature , vol.441 , pp. 876-879
- Daw, N.¹ O'Doherty, J.² Dayan, P.³ Seymour, B.⁴ Dolan, R.⁵

21
- 34250348767
- Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration
- DOI 10.1098/rstb.2007.2098
- Cohen, J., McClure, S., Yu, A.: Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Phil. Trans. R. Soc. B 362 (1481), 933-942 (2007). (Pubitemid 47056820)
- (2007) Philosophical Transactions of the Royal Society B: Biological Sciences , vol.362 , Issue.1481 , pp. 933-942
- Cohen, J.D.¹ McClure, S.M.² Yu, A.J.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.