메뉴 건너뛰기




Volumn , Issue , 2009, Pages 137-144

The knowledge gradient algorithm for online subset selection

Author keywords

[No Author keywords available]

Indexed keywords

DECISION RULES; EXPERIMENTAL EVIDENCE; GRADIENT ALGORITHM; LOOK-AHEAD; MULTI-ARMED BANDIT PROBLEM; ONLINE LEARNING; SUBSET SELECTION;

EID: 67650505320     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/ADPRL.2009.4927537     Document Type: Conference Paper
Times cited : (24)

References (18)
  • 1
    • 0343441515 scopus 로고    scopus 로고
    • Restless bandits, linear programming relaxations, and a primal-dual index heuristic
    • D. Bertsimas and J. Nino-Mora, "Restless bandits, linear programing relaxations, and a primal-dual index heuristic," Operations Research, Vol. 48, no. 1, 80-90, 2000. (Pubitemid 30930252)
    • (2000) Operations Research , vol.48 , Issue.1 , pp. 80-90
    • Bertsimas, D.1    Nino-Mora, J.2
  • 4
    • 8844229506 scopus 로고
    • 'Q-learning for bandit problems
    • Dept. of Comp. Sci., University of Massachusetts, Amherst, MA
    • M.O. Duff, 'Q-learning for bandit problems, " Technical report, Dept. of Comp. Sci., University of Massachusetts, Amherst, MA, 1995.
    • (1995) Technical report
    • Duff, M.O.1
  • 8
    • 0000511415 scopus 로고
    • Bayesian look ahead one stage sampling allocations for selecting the largest normal mean
    • S. Gupta and K. Miescke, "Bayesian look ahead one stage sampling allocations for selecting the largest normal mean," Statistical Papers, Vol. 35, 169-177, 1994.
    • (1994) Statistical Papers , vol.35 , pp. 169-177
    • Gupta, S.1    Miescke, K.2
  • 9
    • 0030590294 scopus 로고    scopus 로고
    • Bayesian look ahead one-stage sampling allocations for selection of the best population
    • DOI 10.1016/0378-3758(95)00169-7
    • S. Gupta and K. Miescke, "Bayesian look ahead one stage sampling allocations for selection of the best population," Journal of Statistical Planning and Inference, Vol. 54, no. 2, 229-244, 1996. (Pubitemid 126161097)
    • (1996) Journal of Statistical Planning and Inference , vol.54 , Issue.2 , pp. 229-244
    • Gupta, S.S.1    Miescke, K.J.2
  • 10
    • 39549108095 scopus 로고    scopus 로고
    • Ranking inequality: Applications of multivariate subset selection
    • W.C. Horrace, J.T. Marchand and T.M. Smeeding, "Ranking inequality: Applications of multivariate subset selection," Journal of Economic Inequality, Vol. 6, no. 1, 5-32, 2008.
    • (2008) Journal of Economic Inequality , vol.6 , Issue.1 , pp. 5-32
    • Horrace, W.C.1    Marchand, J.T.2    Smeeding, T.M.3
  • 14
    • 34547966991 scopus 로고    scopus 로고
    • Multi-armed bandit problems with dependent arms
    • DOI 10.1145/1273496.1273587, Proceedings, Twenty-Fourth International Conference on Machine Learning, ICML 2007
    • S. Pandey, D. Chakrabarti and D. Agarwal, "Multi-armed bandit problems with dependent arms, " Proceedings of the 24th International Conference on Machine Learning, 721-728. (Pubitemid 47275130)
    • (2007) ACM International Conference Proceeding Series , vol.227 , pp. 721-728
    • Pandey, S.1    Chakrabarti, D.2    Agarwal, D.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.