Volume , Issue , 2012, Pages 1548-1556

Approximately optimal adaptive learning in opportunistic spectrum access

Author keywords

Approximate optimality; online learning; opportunistic spectrum access; restless bandits

Indexed keywords

ADAPTIVE LEARNING; ADAPTIVE LEARNING ALGORITHM; AVERAGE REWARD CRITERIA; CHANNEL CONDITIONS; COMPUTATIONALLY EFFICIENT; DISCRETE TIME MARKOV CHAINS; GILBERT-ELLIOT CHANNEL MODEL; INFINITE HORIZONS; MARKOV DECISION PROBLEM; ONLINE LEARNING; OPPORTUNISTIC SPECTRUM ACCESS; OPPORTUNISTIC SPECTRUM ACCESSES (OSA); OPTIMAL POLICIES; OPTIMALITY; POLYNOMIAL COMPLEXITY; RESTLESS BANDITS; TRANSITION PROBABILITIES; TRANSITION STRUCTURES; TWO-STATE; USER ACTIVITY;

EID: 84861588214     PISSN: 0743166X     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/INFCOM.2012.6195522     Document Type: Conference Paper
Times cited : (37)

References (21)
  • 1
    • 0032628612
    • The complexity of optimal queuing network control
    • C. H. Papadimitriou and J. N. Tsitsiklis, "The complexity of optimal queuing network control," Math. Oper. Res., vol. 24, no. 2, pp. 293-305, 1999.
    • (1999) Math. Oper. Res. , vol.24 , Issue.2 , pp. 293-305
    • Papadimitriou, C.H.1    Tsitsiklis, J.N.2
  • 2
    • 0001043843
    • Restless bandits
    • P. Whittle, "Restless bandits," J. Appl. Prob., pp. 301-313, 1988.
    • (1988) J. Appl. Prob. , pp. 301-313
    • Whittle, P.1
  • 3
    • 78650720102
    • Approximation algorithms for restless bandit problems
    • December
    • S. Guha, K. Munagala, and P. Shi, "Approximation algorithms for restless bandit problems," Journal of the ACM, vol. 58, December 2010.
    • (2010) Journal of the ACM , vol.58
    • Guha, S.1    Munagala, K.2    Shi, P.3
  • 5
    • 84966203785
    • Some aspects of the sequential design of experiments
    • H. Robbins, "Some aspects of the sequential design of experiments," Bull. Amer. Math. Soc., vol. 55, pp. 527-535, 1952.
    • (1952) Bull. Amer. Math. Soc. , vol.55 , pp. 527-535
    • Robbins, H.1
  • 6
    • 0002899547
    • Asymptotically efficient adaptive allocation rules
    • T. Lai and H. Robbins, "Asymptotically efficient adaptive allocation rules," Advances in Applied Mathematics, vol. 6, pp. 4-22, 1985.
    • (1985) Advances in Applied Mathematics , vol.6 , pp. 4-22
    • Lai, T.1    Robbins, H.2
  • 7
    • 0000616723
    • Sample mean based index policies with O(log n) regret for the multi-armed bandit problem
    • December
    • R. Agrawal, "Sample mean based index policies with O(log n) regret for the multi-armed bandit problem," Advances in Applied Probability, vol. 27, no. 4, pp. 1054-1078, December 1995.
    • (1995) Advances in Applied Probability , vol.27 , Issue.4 , pp. 1054-1078
    • Agrawal, R.1
  • 8
    • 0023453059
    • Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part I: I.I.D. rewards
    • November
    • V. Anantharam, P. Varaiya, and J. Walrand, "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part I: I.I.D. rewards," IEEE Trans. Automat. Contr., pp. 968-975, November 1987.
    • (1987) IEEE Trans. Automat. Contr. , pp. 968-975
    • Anantharam, V.1    Varaiya, P.2    Walrand, J.3
  • 9
    • 0023450663
    • Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards
    • November
    • V. Anantharam, P. Varaiya, and J. Walrand, "Asymptotically efficient allocation rules for the multiarmed bandit problem with multiple plays-Part II: Markovian rewards," IEEE Trans. Automat. Contr., pp. 977-982, November 1987.
    • (1987) IEEE Trans. Automat. Contr. , pp. 977-982
    • Anantharam, V.1    Varaiya, P.2    Walrand, J.3
  • 10
    • 0036568025
    • Finite-time analysis of the multiarmed bandit problem
    • P. Auer, N. Cesa-Bianchi, and P. Fischer, "Finite-time analysis of the multiarmed bandit problem," Machine Learning, vol. 47, pp. 235-256, 2002.
    • (2002) Machine Learning , vol.47 , pp. 235-256
    • Auer, P.1    Cesa-Bianchi, N.2    Fischer, P.3
  • 15
    • 79953827701
    • Distributed learning in multi-armed bandit with multiple players
    • November
    • K. Liu and Q. Zhao, "Distributed learning in multi-armed bandit with multiple players," IEEE Transactions on Signal Processing, vol. 58, pp. 5667-5681, November 2010.
    • (2010) IEEE Transactions on Signal Processing , vol.58 , pp. 5667-5681
    • Liu, K.1    Zhao, Q.2
  • 17
    • 0031070051
    • Optimal adaptive policies for Markov decision processes
    • A. N. Burnetas and M. N. Katehakis, "Optimal adaptive policies for Markov decision processes," Mathematics of Operations Research, vol. 22, no. 1, pp. 222-255, 1997.
    • (1997) Mathematics of Operations Research , vol.22 , Issue.1 , pp. 222-255
    • Burnetas, A.N.1    Katehakis, M.N.2
  • 18
    • 85162041468
    • Optimistic linear programming gives logarithmic regret for irreducible MDPs
    • A. Tewari and P. Bartlett, "Optimistic linear programming gives logarithmic regret for irreducible MDPs," Advances in Neural Information Processing Systems, vol. 20, pp. 1505-1512, 2008.
    • (2008) Advances in Neural Information Processing Systems , vol.20 , pp. 1505-1512
    • Tewari, A.1    Bartlett, P.2
  • 20
    • 27944497396
    • Sensitivity and convergence of uniformly ergodic Markov chains
    • A. Y. Mitrophanov, "Sensitivity and convergence of uniformly ergodic Markov chains," J. Appl. Prob., vol. 42, pp. 1003-1014, 2005.
    • (2005) J. Appl. Prob. , vol.42 , pp. 1003-1014
    • Mitrophanov, A.Y.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.