메뉴 건너뛰기




Volumn 3, Issue 2, 2010, Pages 207-234

Solving two-armed Bernoulli bandit problems using a Bayesian learning automaton

Author keywords

Automata theory; Learning processes; Programming and algorithm theory; Stochastic processes

Indexed keywords

BANDIT PROBLEMS; BAYESIAN; BAYESIAN APPROACHES; BAYESIAN COMPUTATION; BAYESIAN LEARNING; BAYESIAN METHODOLOGY; BAYESIAN METHODS; BAYESIAN PERSPECTIVE; BERNOULLI; BETA DISTRIBUTIONS; CLASSICAL OPTIMIZATION; CONJUGATE PRIOR; DESIGN/METHODOLOGY/APPROACH; DISTRIBUTED APPLICATIONS; LEARNING PROCESS; OPTIMAL DECISION MAKING; PROGRAMMING AND ALGORITHM THEORY; RANDOM SAMPLING; STOCHASTIC PROCESS; STOCHASTIC PROCESSES;

EID: 78549244167     PISSN: 1756378X     EISSN: 17563798     Source Type: Journal    
DOI: 10.1108/17563781011049179     Document Type: Article
Times cited : (103)

References (35)
  • 1
    • 0036896733 scopus 로고    scopus 로고
    • Generalized pursuit learning schemes: New families of continuous and discretized learning automata
    • Agache, M. and Oommen, B.J. (2002), "Generalized pursuit learning schemes: new families of continuous and discretized learning automata" in IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, Vol. 32, No. 6, pp. 738-49.
    • (2002) IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics , vol.32 , Issue.6 , pp. 738-749
    • Agache, M.1    Oommen, B.J.2
  • 3
    • 0036568025 scopus 로고    scopus 로고
    • Finite-time analysis of the multiarmed bandit problem
    • Auer, P., Cesa-Bianchi, N. and Fischer, P. (2002), "Finite-time analysis of the multiarmed bandit problem" in Machine Learning, Vol. 47, pp. 235-56.
    • (2002) Machine Learning , vol.47 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 4
    • 0034316108 scopus 로고    scopus 로고
    • On the value of learning for Bernoulli bandits with unknown parameters
    • Bhulai, S. and Koole, G. (2000), "On the value of learning for Bernoulli bandits with unknown parameters" in IEEE Transactions on Automatic Control, Vol. 45, No. 11, pp. 2135-40.
    • (2000) IEEE Transactions on Automatic Control , vol.45 , Issue.11 , pp. 2135-2140
    • Bhulai, S.1    Koole, G.2
  • 6
    • 0030674885 scopus 로고    scopus 로고
    • Cooperative mobile robotics: Antecedents and directions
    • Cao, Y.U., Fukunaga, A.S. and Kahng, A. (1997), "Cooperative mobile robotics: antecedents and directions" in Autonomous Robots, Vol. 4, No. 1, pp. 7-27.
    • (1997) Autonomous Robots , vol.4 , Issue.1 , pp. 7-27
    • Cao, Y.U.1    Fukunaga, A.S.2    Kahng, A.3
  • 7
    • 12844287705 scopus 로고    scopus 로고
    • QoS support in wireless sensor networks: A survey
    • paper presented at The 2004 International Conference on Wireless Networks (ICWN 20 04), Las Vegas, NV
    • Chen, D. and Varshney, P.K. (2004), "QoS support in wireless sensor networks: a survey", paper presented at The 2004 International Conference on Wireless Networks (ICWN 20 04), Las Vegas, NV.
    • (2004)
    • Chen, D.1    Varshney, P.K.2
  • 10
    • 77949664565 scopus 로고    scopus 로고
    • Exploration exploitation in go: UCT for Monte-Carlo go
    • NIPS
    • Gelly, S. and Wang, Y. (2006), "Exploration exploitation in go: UCT for Monte-Carlo go", Proceedings of NIPS-2006.
    • (2006) Proceedings of NIPS-2006
    • Gelly, S.1    Wang, Y.2
  • 12
    • 33847613520 scopus 로고    scopus 로고
    • Learning automata-based solutions to the nonlinear fractional knapsack problem with applications to optimal resource allocation
    • Granmo, O.-C., Oommen, B.J., Myrer, S.A. and Olsen, M.G. (2007), "Learning automata-based solutions to the nonlinear fractional knapsack problem with applications to optimal resource allocation" in IEEE Transactions on Systems, Man, and Cybernetics, Part B, Vol. 37, No. 1, pp. 166-75.
    • (2007) IEEE Transactions on Systems, Man, and Cybernetics, Part B , vol.37 , Issue.1 , pp. 166-175
    • Granmo, O.-C.1    Oommen, B.J.2    Myrer, S.A.3    Olsen, M.G.4
  • 17
    • 0003492473 scopus 로고
    • Discrete estimator algorithms: A mathematical model of computer learning
    • Master's thesis, Department of Mathematics and Statistics, Carleton University, Ottawa
    • Lanctôt, J.K. (1989), "Discrete estimator algorithms: a mathematical model of computer learning", Department of Mathematics and Statistics, Carleton University, Ottawa, Master's thesis.
    • (1989)
    • Lanctôt, J.K.1
  • 19
    • 34547875703 scopus 로고    scopus 로고
    • Routing bandwidth guaranteed paths in MPLS traffic engineering: A multiple race track learning approach
    • Misra, S., Oommen, B.J. and Granmo, O.-C. (2007), "Routing bandwidth guaranteed paths in MPLS traffic engineering: a multiple race track learning approach" in IEEE Transactions on Computers, Vol. 56, No. 7, pp. 959-76.
    • (2007) IEEE Transactions on Computers , vol.56 , Issue.7 , pp. 959-976
    • Misra, S.1    Oommen, B.J.2    Granmo, O.-C.3
  • 30
    • 0001395850 scopus 로고
    • On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
    • Thompson, W.R. (1933), "On the likelihood that one unknown probability exceeds another in view of the evidence of two samples" in Biometrika, Vol. 25, pp. 285-94.
    • (1933) Biometrika , vol.25 , pp. 285-294
    • Thompson, W.R.1
  • 32
    • 0030122488 scopus 로고    scopus 로고
    • Using finite state automata to produce self-optimization and self-control
    • Tung, B. and Kleinrock, L. (1996), "Using finite state automata to produce self-optimization and self-control" in IEEE Transactions on Parallel and Distributed Systems, Vol. 7, No. 4, pp. 47-61.
    • (1996) IEEE Transactions on Parallel and Distributed Systems , vol.7 , Issue.4 , pp. 47-61
    • Tung, B.1    Kleinrock, L.2
  • 35
    • 0008954974 scopus 로고    scopus 로고
    • Exploration and inference in learning from reinforcement
    • PhD thesis, University of Edinburgh
    • Wyatt, J. (1997), "Exploration and inference in learning from reinforcement", University of Edinburgh, Edinburgh, PhD thesis.
    • (1997)
    • Wyatt, J.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.