-
1
-
-
0036896733
-
Generalized pursuit learning schemes: New families of continuous and discretized learning automata
-
Agache, M. and Oommen, B.J. (2002), "Generalized pursuit learning schemes: new families of continuous and discretized learning automata" in IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, Vol. 32, No. 6, pp. 738-49.
-
(2002)
IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics
, vol.32
, Issue.6
, pp. 738-749
-
-
Agache, M.1
Oommen, B.J.2
-
2
-
-
38149013086
-
Tuning bandit algorithms in stochastic environments
-
Springer, Berlin
-
Audibert, J.-Y., Munos, R. and Szepesvari, C. (2007), "Tuning bandit algorithms in stochastic environments" in Proceedings of the 18th International Conference on Algorithmic Learning Theory, Sendai, Japan, Springer, Berlin, pp. 150-65.
-
(2007)
Proceedings of the 18th International Conference on Algorithmic Learning Theory, Sendai, Japan
, pp. 150-165
-
-
Audibert, J.-Y.1
Munos, R.2
Szepesvari, C.3
-
3
-
-
0036568025
-
Finite-time analysis of the multiarmed bandit problem
-
Auer, P., Cesa-Bianchi, N. and Fischer, P. (2002), "Finite-time analysis of the multiarmed bandit problem" in Machine Learning, Vol. 47, pp. 235-56.
-
(2002)
Machine Learning
, vol.47
, pp. 235-256
-
-
Auer, P.1
Cesa-Bianchi, N.2
Fischer, P.3
-
4
-
-
0034316108
-
On the value of learning for Bernoulli bandits with unknown parameters
-
Bhulai, S. and Koole, G. (2000), "On the value of learning for Bernoulli bandits with unknown parameters" in IEEE Transactions on Automatic Control, Vol. 45, No. 11, pp. 2135-40.
-
(2000)
IEEE Transactions on Automatic Control
, vol.45
, Issue.11
, pp. 2135-2140
-
-
Bhulai, S.1
Koole, G.2
-
5
-
-
33748692398
-
Routing without regret: On convergence to Nash equilibria of regret-minimizing algorithms in routing games
-
ACM, New York, NY
-
Blum, A., Even-Dar, E. and Ligett, K. (2006), "Routing without regret: on convergence to Nash equilibria of regret-minimizing algorithms in routing games" in Proceedings of the Twenty-Fifth Annual ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing (PODC 2006), ACM, New York, NY, pp. 45-52.
-
(2006)
Proceedings of the Twenty-Fifth Annual ACM SIGACT-SIGOPS Symposium on Principles of Distributed Computing (PODC 2006)
, pp. 45-52
-
-
Blum, A.1
Even-Dar, E.2
Ligett, K.3
-
6
-
-
0030674885
-
Cooperative mobile robotics: Antecedents and directions
-
Cao, Y.U., Fukunaga, A.S. and Kahng, A. (1997), "Cooperative mobile robotics: antecedents and directions" in Autonomous Robots, Vol. 4, No. 1, pp. 7-27.
-
(1997)
Autonomous Robots
, vol.4
, Issue.1
, pp. 7-27
-
-
Cao, Y.U.1
Fukunaga, A.S.2
Kahng, A.3
-
7
-
-
12844287705
-
QoS support in wireless sensor networks: A survey
-
paper presented at The 2004 International Conference on Wireless Networks (ICWN 2004), Las Vegas, NV
-
Chen, D. and Varshney, P.K. (2004), "QoS support in wireless sensor networks: a survey", paper presented at The 2004 International Conference on Wireless Networks (ICWN 2004), Las Vegas, NV.
-
(2004)
-
-
Chen, D.1
Varshney, P.K.2
-
8
-
-
33749818313
-
Nearly optimal exploration-exploitation decision thresholds
-
Lecture Notes in Computer Science, Springer, Berlin
-
Dimitrakakis, C. (2006), "Nearly optimal exploration-exploitation decision thresholds" in Proceedings of the 16th International Conference on Artificial Neural Networks (ICANN 2006), Athens, Greece, Springer, Berlin, p. 850, Lecture Notes in Computer Science.
-
(2006)
Proceedings of the 16th International Conference on Artificial Neural Networks (ICANN 2006), Athens, Greece
, p. 850
-
-
Dimitrakakis, C.1
-
9
-
-
0003922190
-
-
Wiley, New York, NY, 2nd ed
-
Duda, R., Hart, P. and Stork, D. (2000), Pattern Classification, 2nd ed., Wiley, New York, NY.
-
(2000)
Pattern Classification
-
-
Duda, R.1
Hart, P.2
Stork, D.3
-
10
-
-
77949664565
-
Exploration exploitation in go: UCT for Monte-Carlo go
-
NIPS
-
Gelly, S. and Wang, Y. (2006), "Exploration exploitation in go: UCT for Monte-Carlo go", Proceedings of NIPS-2006.
-
(2006)
Proceedings of NIPS-2006
-
-
Gelly, S.1
Wang, Y.2
-
11
-
-
60649101399
-
Solving the satisfiability problem using finite learning automata
-
Granmo, O.-C. and Bouhmala, N. (2007), "Solving the satisfiability problem using finite learning automata" in International Journal of Computer Science and Applications, Vol. 4, No. 3, pp. 15-29.
-
(2007)
International Journal of Computer Science and Applications
, vol.4
, Issue.3
, pp. 15-29
-
-
Granmo, O.-C.1
Bouhmala, N.2
-
12
-
-
33847613520
-
Learning automata-based solutions to the nonlinear fractional knapsack problem with applications to optimal resource allocation
-
Granmo, O.-C., Oommen, B.J., Myrer, S.A. and Olsen, M.G. (2007), "Learning automata-based solutions to the nonlinear fractional knapsack problem with applications to optimal resource allocation" in IEEE Transactions on Systems, Man, and Cybernetics, Part B, Vol. 37, No. 1, pp. 166-75.
-
(2007)
IEEE Transactions on Systems, Man, and Cybernetics, Part B
, vol.37
, Issue.1
, pp. 166-175
-
-
Granmo, O.-C.1
Oommen, B.J.2
Myrer, S.A.3
Olsen, M.G.4
-
14
-
-
0004280606
-
-
PhD thesis, Stanford University, Stanford, CA
-
Kaelbling, L.P. (1993), Learning in Embedded Systems, Stanford University, Stanford, CA, PhD thesis.
-
(1993)
Learning in Embedded Systems
-
-
Kaelbling, L.P.1
-
15
-
-
33750293964
-
Bandit based Monte-Carlo planning
-
Kocsis, L. and Szepesvari, C. (2006), "Bandit based Monte-Carlo planning" in Proceedings of the 17th European Conference on Machine Learning (ECML 2006), Springer, Berlin, pp. 282-93.
-
(2006)
Proceedings of the 17th European Conference on Machine Learning (ECML 2006), Springer, Berlin
, pp. 282-293
-
-
Kocsis, L.1
Szepesvari, C.2
-
17
-
-
0003492473
-
Discrete estimator algorithms: A mathematical model of computer learning
-
Master's thesis, Department of Mathematics and Statistics, Carleton University, Ottawa
-
Lanctôt, J.K. (1989), "Discrete estimator algorithms: a mathematical model of computer learning", Department of Mathematics and Statistics, Carleton University, Ottawa, Master's thesis.
-
(1989)
-
-
Lanctôt, J.K.1
-
18
-
-
0026943998
-
Discretized estimator learning automata
-
Lanctôt, J.K. and Oommen, B.J. (1992), "Discretized estimator learning automata" in IEEE Transactions on Systems, Man, and Cybernetics, Vol. SMC-22, No. 6, pp. 1473-83.
-
(1992)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.SMC-22
, Issue.6
, pp. 1473-1483
-
-
Lanctôt, J.K.1
Oommen, B.J.2
-
19
-
-
34547875703
-
Routing bandwidth guaranteed paths in MPLS traffic engineering: A multiple race track learning approach
-
Misra, S., Oommen, B.J. and Granmo, O.-C. (2007), "Routing bandwidth guaranteed paths in MPLS traffic engineering: a multiple race track learning approach" in IEEE Transactions on Computers, Vol. 56, No. 7, pp. 959-76.
-
(2007)
IEEE Transactions on Computers
, vol.56
, Issue.7
, pp. 959-976
-
-
Misra, S.1
Oommen, B.J.2
Granmo, O.-C.3
-
22
-
-
0003891507
-
-
Prentice-Hall, Englewood Cliffs, NJ
-
Narendra, K.S. and Thathachar, M.A.L. (1989), Learning Automata: An Introduction, Prentice-Hall, Englewood Cliffs, NJ.
-
(1989)
Learning Automata: An Introduction
-
-
Narendra, K.S.1
Thathachar, M.A.L.2
-
23
-
-
0036894188
-
Learning automata: Theory, paradigms and applications
-
Obaidat, M.S., Papadimitriou, G.I. and Pomportsis, A.S. (2002), "Learning automata: theory, paradigms and applications" in IEEE Transactions on Systems Man and Cybernetics, Vol. SMC-32, pp. 706-9.
-
(2002)
IEEE Transactions on Systems Man and Cybernetics
, vol.SMC-32
, pp. 706-709
-
-
Obaidat, M.S.1
Papadimitriou, G.I.2
Pomportsis, A.S.3
-
24
-
-
0035359274
-
Continuous and discretized pursuit learning schemes: Various algorithms and their comparison
-
Oommen, B.J. and Agache, M. (2001), "Continuous and discretized pursuit learning schemes: various algorithms and their comparison" in IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics, Vol. 31, pp. 277-87.
-
(2001)
IEEE Transactions on Systems, Man, and Cybernetics-Part B: Cybernetics
, vol.31
, pp. 277-287
-
-
Oommen, B.J.1
Agache, M.2
-
25
-
-
0025452159
-
Discretized pursuit learning automata
-
Oommen, B.J. and Lanctôt, J.K. (1990), "Discretized pursuit learning automata" in IEEE Transactions on Systems, Man, and Cybernetics, Vol. SMC-20, No. 4, pp. 931-8.
-
(1990)
IEEE Transactions on Systems, Man, and Cybernetics
, vol.SMC-20
, Issue.4
, pp. 931-938
-
-
Oommen, B.J.1
Lanctôt, J.K.2
-
27
-
-
33748017961
-
-
PhD thesis, Dept of Electrical Engineering, Indian Institute of Science, Bangalore
-
Sastry, P.S. (1985), Systems of Learning Automata: Estimator Algorithms Applications, Dept of Electrical Engineering, Indian Institute of Science, Bangalore, PhD thesis.
-
(1985)
Systems of Learning Automata: Estimator Algorithms Applications
-
-
Sastry, P.S.1
-
28
-
-
0004102479
-
-
MIT Press, Cambridge, MA
-
Sutton, R.S. and Barto, A.G. (1998), Reinforcement Learning: An Introduction, MIT Press, Cambridge, MA.
-
(1998)
Reinforcement Learning: An Introduction
-
-
Sutton, R.S.1
Barto, A.G.2
-
30
-
-
0001395850
-
On the likelihood that one unknown probability exceeds another in view of the evidence of two samples
-
Thompson, W.R. (1933), "On the likelihood that one unknown probability exceeds another in view of the evidence of two samples" in Biometrika, Vol. 25, pp. 285-94.
-
(1933)
Biometrika
, vol.25
, pp. 285-294
-
-
Thompson, W.R.1
-
32
-
-
0030122488
-
Using finite state automata to produce self-optimization and self-control
-
Tung, B. and Kleinrock, L. (1996), "Using finite state automata to produce self-optimization and self-control" in IEEE Transactions on Parallel and Distributed Systems, Vol. 7, No. 4, pp. 47-61.
-
(1996)
IEEE Transactions on Parallel and Distributed Systems
, vol.7
, Issue.4
, pp. 47-61
-
-
Tung, B.1
Kleinrock, L.2
-
33
-
-
33646406807
-
Multi-armed bandit algorithms and empirical evaluation
-
Springer, Heidelberg
-
Vermorel, J. and Mohri, M. (2005), "Multi-armed bandit algorithms and empirical evaluation" in Proceedings of the 16th European Conference on Machine Learning (ECML 2005), Porto, Portugal, Springer, Heidelberg, pp. 437-48.
-
(2005)
Proceedings of the 16th European Conference on Machine Learning (ECML 2005), Porto, Portugal
, pp. 437-448
-
-
Vermorel, J.1
Mohri, M.2
-
34
-
-
31844436266
-
Bayesian sparse sampling for on-line reward optimization
-
Wang, T., Lizotte, D., Bowling, M. and Schuurmans, D. (2005), "Bayesian sparse sampling for on-line reward optimization" in Proceedings of the 22nd International Conference on Machine Learning, Bonn, Germany, pp. 956-63.
-
(2005)
Proceedings of the 22nd International Conference on Machine Learning, Bonn, Germany
, pp. 956-963
-
-
Wang, T.1
Lizotte, D.2
Bowling, M.3
Schuurmans, D.4
-
35
-
-
0008954974
-
Exploration and inference in learning from reinforcement
-
PhD thesis, University of Edinburgh
-
Wyatt, J. (1997), "Exploration and inference in learning from reinforcement", University of Edinburgh, Edinburgh, PhD thesis.
-
(1997)
-
-
Wyatt, J.1