SCOPUS 정보 검색 플랫폼

International Journal of Intelligent Computing and Cybernetics

Volumn 3, Issue 2, 2010, Pages 207-234

Solving two-armed Bernoulli bandit problems using a Bayesian learning automaton

(1) Granmo, Ole Christoffer a

a UNIVERSITY OF AGDER (Norway)

Author keywords

Automata theory; Learning processes; Programming and algorithm theory; Stochastic processes

Indexed keywords

BANDIT PROBLEMS; BAYESIAN; BAYESIAN APPROACHES; BAYESIAN COMPUTATION; BAYESIAN LEARNING; BAYESIAN METHODOLOGY; BAYESIAN METHODS; BAYESIAN PERSPECTIVE; BERNOULLI; BETA DISTRIBUTIONS; CLASSICAL OPTIMIZATION; CONJUGATE PRIOR; DESIGN/METHODOLOGY/APPROACH; DISTRIBUTED APPLICATIONS; LEARNING PROCESS; OPTIMAL DECISION MAKING; PROGRAMMING AND ALGORITHM THEORY; RANDOM SAMPLING; STOCHASTIC PROCESS; STOCHASTIC PROCESSES;

AUTOMATA THEORY; COMPUTATIONAL COMPLEXITY; DECISION MAKING; LEARNING ALGORITHMS; OPTIMIZATION; RANDOM PROCESSES; ROBOTS; STOCHASTIC SYSTEMS; TRANSLATION (LANGUAGES);

BAYESIAN NETWORKS;

EID: 78549244167 PISSN: 1756378X EISSN: 17563798 Source Type: Journal
DOI: 10.1108/17563781011049179 Document Type: Article

Times cited : (103)

References (35)

1
- 0036896733
- Generalized pursuit learning schemes: New families of continuous and discretized learning automata
- Agache, M. and Oommen, B.J. (2002), "Generalized pursuit learning schemes: new families of continuous and discretized learning automata" in IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, Vol. 32, No. 6, pp. 738-49.
- (2002) IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics , vol.32 , Issue.6 , pp. 738-749
- Agache, M.¹ Oommen, B.J.²

2
- 38149013086
- Tuning bandit algorithms in stochastic environments
- Springer, Berlin
- Audibert, J.-Y., Munos, R. and Szepesvarri, C. (2007), "Tuning bandit algorithms in stochastic environments" in Proceedings of the 18th International Conference of Algorithmic Learning Theory, Sendai, Japan, Springer, Berlin, pp. 150-65.
- (2007) Proceedings of the 18th International Conference of Algorithmic Learning Theory, Sendai, Japan , pp. 150-165
- Audibert, J.-Y.¹ Munos, R.² Szepesvarri, C.³

3
- 0036568025
- Finite-time analysis of the multiarmed bandit problem
- Auer, P., Cesa-Bianchi, N. and Fischer, P. (2002), "Finite-time analysis of the multiarmed bandit problem" in Machine Learning, Vol. 47, pp. 235-56.
- (2002) Machine Learning , vol.47 , pp. 235-256
- Auer, P.¹ Cesa-Bianchi, N.² Fischer, P.³

4
- 0034316108
- On the value of learning for Bernoulli bandits with unknown parameters
- Bhulai, S. and Koole, G. (2000), "On the value of learning for Bernoulli bandits with unknown parameters" in IEEE Transactions on Automatic Control, Vol. 45, No. 11, pp. 2135-40.
- (2000) IEEE Transactions on Automatic Control , vol.45 , Issue.11 , pp. 2135-2140
- Bhulai, S.¹ Koole, G.²

5
- 33748692398
- Routing without regret: On convergence to Nash equilibria of regret-minimizing algorithms in routing games
- ACM, New York, NY
- Blum, A., Even-Dar, E. and Ligett, K. (2006), "Routing without regret: on convergence to Nash equilibria of regret-minimizing algorithms in routing games" in Proceedings of the Twenty-Fifth Annual ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing (PODC 2006), ACM, New York, NY, pp. 45-52.
- (2006) Proceedings of the Twenty-Fifth Annual ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing (PODC 2006) , pp. 45-52
- Blum, A.¹ Even-Dar, E.² Ligett, K.³

6
- 0030674885
- Cooperative mobile robotics: Antecedents and directions
- Cao, Y.U., Fukunaga, A.S. and Kahng, A. (1997), "Cooperative mobile robotics: antecedents and directions" in Autonomous Robots, Vol. 4, No. 1, pp. 7-27.
- (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 7-27
- Cao, Y.U.¹ Fukunaga, A.S.² Kahng, A.³

7
- 12844287705
- QoS support in wireless sensor networks: A survey
- paper presented at The 2004 International Conference on Wireless Networks (ICWN 20 04), Las Vegas, NV
- Chen, D. and Varshney, P.K. (2004), "QoS support in wireless sensor networks: a survey", paper presented at The 2004 International Conference on Wireless Networks (ICWN 20 04), Las Vegas, NV.
- (2004)
- Chen, D.¹ Varshney, P.K.²

8
- 33749818313
- Nearly optimal exploration-exploitation decision thresholds
- Lecture Notes in Computer Science, Springer, Berlin
- Dimitrakakis, C. (2006), "Nearly optimal exploration-exploitation decision thresholds" in Proceedings of the 16th International Conference on Artificial Neural Networks (ICANN 2006), Athens, Greece, Springer, Berlin, p. 850, Lecture Notes in Computer Science.
- (2006) Proceedings of the 16th International Conference on Artificial Neural Networks (ICANN 2006), Athens, Greece , pp. 850
- Dimitrakakis, C.¹

9
- 0003922190
- Wiley, New York, NY, 2nd ed
- Duda, R., Hart, P. and Stork, D. (2000), Pattern Classification, 2nd ed., Wiley, New York, NY.
- (2000) Pattern Classification
- Duda, R.¹ Hart, P.² Stork, D.³

10
- 77949664565
- Exploration exploitation in go: UCT for Monte-Carlo go
- NIPS
- Gelly, S. and Wang, Y. (2006), "Exploration exploitation in go: UCT for Monte-Carlo go", Proceedings of NIPS-2006.
- (2006) Proceedings of NIPS-2006
- Gelly, S.¹ Wang, Y.²

11
- 60649101399
- Solving the satisfiability problem using finite learning automata
- Granmo, O.-C. and Bouhmala, N. (2007), "Solving the satisfiability problem using finite learning automata" in International Journal of Computer Science and Applications, Vol. 4, No. 3, pp. 15-29.
- (2007) International Journal of Computer Science and Applications , vol.4 , Issue.3 , pp. 15-29
- Granmo, O.-C.¹ Bouhmala, N.²

12
- 33847613520
- Learning automata-based solutions to the nonlinear fractional knapsack problem with applications to optimal resource allocation
- Granmo, O.-C., Oommen, B.J., Myrer, S.A. and Olsen, M.G. (2007), "Learning automata-based solutions to the nonlinear fractional knapsack problem with applications to optimal resource allocation" in IEEE Transactions on Systems, Man, and Cybernetics, Part B, Vol. 37, No. 1, pp. 166-75.
- (2007) IEEE Transactions on Systems, Man, and Cybernetics, Part B , vol.37 , Issue.1 , pp. 166-175
- Granmo, O.-C.¹ Oommen, B.J.² Myrer, S.A.³ Olsen, M.G.⁴

13
- 0037715076
- QoS control for sensor networks
- Iyer, R. and Kleinrock, L. (2003), "QoS control for sensor networks" in IEEE International Conference on Communications, Vol. 1, pp. 517-21.
- (2003) IEEE International Conference on Communications , vol.1 , pp. 517-521
- Iyer, R.¹ Kleinrock, L.²

14
- 0004280606
- PhD thesis, Stanford University, Stanford, CA
- Kaelbling, L.P. (1993), Learning in Embedded Systems, Stanford University, Stanford, CA, PhD thesis.
- (1993) Learning in Embedded Systems
- Kaelbling, L.P.¹

15
- 33750293964
- Bandit based Monte-Carlo planning
- Kocsis, L. and Szepesvari, C. (2006), "Bandit based Monte-Carlo planning" in Proceedings of the 17th European Conference on Machine Learning (ECML 2006), Springer, Berlin, pp. 282-93.
- (2006) Proceedings of the 17th European Conference on Machine Learning (ECML 2006), Springer, Berlin , pp. 282-293
- Kocsis, L.¹ Szepesvari, C.²

16
- 0003650765
- Springer, Berlin
- Lakshmivarahan, S. (1981), Learning Algorithms Theory and Applications, Springer, Berlin.
- (1981) Learning Algorithms Theory and Applications
- Lakshmivarahan, S.¹

17
- 0003492473
- Discrete estimator algorithms: A mathematical model of computer learning
- Master's thesis, Department of Mathematics and Statistics, Carleton University, Ottawa
- Lanctôt, J.K. (1989), "Discrete estimator algorithms: a mathematical model of computer learning", Department of Mathematics and Statistics, Carleton University, Ottawa, Master's thesis.
- (1989)
- Lanctôt, J.K.¹

18
- 0026943998
- Discretized estimator learning automata
- Lanctôt, J.K. and Oommen, B.J. (1992), "Discretized estimator learning automata" in IEEE Transactions on Systems, Man, and Cybernetics, Vol. SMC-22, No. 6, pp. 1473-83.
- (1992) IEEE Transactions on Systems, Man, and Cybernetics , vol.SMC-22 , Issue.6 , pp. 1473-1483
- Lanctôt, J.K.¹ Oommen, B.J.²

19
- 34547875703
- Routing bandwidth guaranteed paths in MPLS traffic engineering: A multiple race track learning approach
- Misra, S., Oommen, B.J. and Granmo, O.-C. (2007), "Routing bandwidth guaranteed paths in MPLS traffic engineering: a multiple race track learning approach" in IEEE Transactions on Computers, Vol. 56, No. 7, pp. 959-76.
- (2007) IEEE Transactions on Computers , vol.56 , Issue.7 , pp. 959-976
- Misra, S.¹ Oommen, B.J.² Granmo, O.-C.³

20
- 0004255908
- McGraw-Hill, New York, NY
- Mitchell, T.M. (1997), Machine Learning, McGraw-Hill, New York, NY.
- (1997) Machine Learning
- Mitchell, T.M.¹

21
- 0003988124
- Pergamon Press, Oxford
- Najim, K. and Poznyak, A.S. (1994), Learning Automata: Theory and Applications, Pergamon Press, Oxford.
- (1994) Learning Automata: Theory and Applications
- Najim, K.¹ Poznyak, A.S.²

22
- 0003891507
- Prentice-Hall, Englewood Cliffs, NJ
- Narendra, K.S. and Thathachar, M.A.L. (1989), Learning Automata: An Introduction, Prentice-Hall, Englewood Cliffs, NJ.
- (1989) Learning Automata: An Introduction
- Narendra, K.S.¹ Thathachar, M.A.L.²

23
- 0036894188
- Learning automata: Theory, paradigms and applications
- Obaidat, M.S., Papadimitriou, G.I. and Pomportsis, A.S. (2002), "Learning automata: theory, paradigms and applications" in IEEE Transactions on Systems Man and Cybernetics, Vol. SMC-32, pp. 706-9.
- (2002) IEEE Transactions on Systems Man and Cybernetics , vol.SMC-32 , pp. 706-709
- Obaidat, M.S.¹ Papadimitriou, G.I.² Pomportsis, A.S.³

24
- 0035359274
- Continuous and discretized pursuit learning schemes: Various algorithms and their comparison
- Oommen, B.J. and Agache, M. (2001), "Continuous and discretized pursuit learning schemes: various algorithms and their comparison" in IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, Vol. 31, pp. 277-87.
- (2001) IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics , vol.31 , pp. 277-287
- Oommen, B.J.¹ Agache, M.²

25
- 0025452159
- Discretized pursuit learning automata
- Oommen, B.J. and Lanctôt, J.K. (1990), "Discretized pursuit learning automata" in IEEE Transactions on Systems, Man, and Cybernetics, Vol. SMC-20, No. 4, pp. 931-8.
- (1990) IEEE Transactions on Systems, Man, and Cybernetics , vol.SMC-20 , Issue.4 , pp. 931-938
- Oommen, B.J.¹ Lanctôt, J.K.²

26
- 0003730403
- Springer, Berlin
- Poznyak, A.S. and Najim, K. (1997), Learning Automata and Stochastic Optimization, Springer, Berlin.
- (1997) Learning Automata and Stochastic Optimization
- Poznyak, A.S.¹ Najim, K.²

27
- 33748017961
- PhD thesis, Dept of Electrical Engineering, Indian Institute of Science, Bangalore
- Sastry, P.S. (1985), Systems of Learning Automata: Estimator Algorithms Applications, Dept of Electrical Engineering, Indian Institute of Science, Bangalore, PhD thesis.
- (1985) Systems of Learning Automata: Estimator Algorithms Applications
- Sastry, P.S.¹

28
- 0004102479
- MIT Press, Cambridge, MA
- Sutton, R.S. and Barto, A.G. (1998), Reinforcement Learning: An Introduction, MIT Press, Cambridge, MA.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

29
- 2942609194
- Kluwer Academic Publishers, Dordrecht
- Thathachar, M.A.L. and Sastry, P.S. (2004), Networks of Learning Automata: Techniques for Online Stochastic Optimization, Kluwer Academic Publishers, Dordrecht.
- (2004) Networks of Learning Automata: Techniques for Online Stochastic Optimization
- Thathachar, M.A.L.¹ Sastry, P.S.²

30
- 0001395850
- On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
- Thompson, W.R. (1933), "On the likelihood that one unknown probability exceeds another in view of the evidence of two samples" in Biometrika, Vol. 25, pp. 285-94.
- (1933) Biometrika , vol.25 , pp. 285-294
- Thompson, W.R.¹

31
- 0004162272
- Academic Press, New York, NY
- Tsetlin, M.L. (1973), Automaton Theory and Modeling of Biological Systems, Academic Press, New York, NY.
- (1973) Automaton Theory and Modeling of Biological Systems
- Tsetlin, M.L.¹

32
- 0030122488
- Using finite state automata to produce self-optimization and self-control
- Tung, B. and Kleinrock, L. (1996), "Using finite state automata to produce self-optimization and self-control" in IEEE Transactions on Parallel and Distributed Systems, Vol. 7, No. 4, pp. 47-61.
- (1996) IEEE Transactions on Parallel and Distributed Systems , vol.7 , Issue.4 , pp. 47-61
- Tung, B.¹ Kleinrock, L.²

33
- 33646406807
- Multi-armed bandit algorithms and empirical evaluation
- Springer, Heidelberg
- Vermorel, J. and Mohri, M. (2005), "Multi-armed bandit algorithms and empirical evaluation" in Proceedings of the 16th European Conference on Machine Learning (ECML 2005), Porto, Portugal, Springer, Heidelberg, pp. 437-48.
- (2005) Proceedings of the 16th European Conference on Machine Learning (ECML 2005), Porto, Portugal , pp. 437-448
- Vermorel, J.¹ Mohri, M.²

34
- 31844436266
- Bayesian sparse sampling for on-line reward optimization
- Wang, T., Lizotte, D., Bowling, M. and Scuurmans, D. (2005), "Bayesian sparse sampling for on-line reward optimization" in Proceedings of the 22nd International Conference on Machine Learning, Bonn, Germany, pp. 956-63.
- (2005) Proceedings of the 22nd International Conference on Machine Learning, Bonn, Germany , pp. 956-963
- Wang, T.¹ Lizotte, D.² Bowling, M.³ Scuurmans, D.⁴

35
- 0008954974
- Exploration and inference in learning from reinforcement
- PhD thesis, University of Edinburgh
- Wyatt, J. (1997), "Exploration and inference in learning from reinforcement", University of Edinburgh, Edinburgh, PhD thesis.
- (1997)
- Wyatt, J.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.