SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2009, Pages 137-144

The knowledge gradient algorithm for online subset selection

Author keywords

[No Author keywords available]

Indexed keywords

DECISION RULES; EXPERIMENTAL EVIDENCE; GRADIENT ALGORITHM; LOOK-AHEAD; MULTI-ARMED BANDIT PROBLEM; ONLINE LEARNING; SUBSET SELECTION;

DYNAMIC PROGRAMMING; EDUCATION; INTERNET; REINFORCEMENT; REINFORCEMENT LEARNING; SYSTEMS ENGINEERING;

SET THEORY;

EID: 67650505320 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/ADPRL.2009.4927537 Document Type: Conference Paper

Times cited : (24)

References (18)

1
- 0343441515
- Restless bandits, linear programming relaxations, and a primal-dual index heuristic
- D. Bertsimas and J. Nino-Mora, "Restless bandits, linear programing relaxations, and a primal-dual index heuristic," Operations Research, Vol. 48, no. 1, 80-90, 2000. (Pubitemid 30930252)
- (2000) Operations Research , vol.48 , Issue.1 , pp. 80-90
- Bertsimas, D.¹ Nino-Mora, J.²

2
- 67650370716
- New myopic sequential sampling procedures
- S. Chick, J. Branke and C. Schmidt, "New myopic sequential sampling procedures," 2007, submitted for publication.
- (2007) Submitted for Publication
- Chick, S.¹ Branke, J.² Schmidt, C.³

3
- 0003759419
- John Wiley and Sons
- M.H. De Groot, Optimal Statistical Decisions. John Wiley and Sons, 1970.
- (1970) Optimal Statistical Decisions
- De Groot, M.H.¹

6
- 67650368215
- The knowledge gradient policy for correlated normal rewards
- P.I. Frazier, W.B. Powell and S. Dayanik, "The knowledge gradient policy for correlated normal rewards," 2008, submitted for publication.
- (2008) Submitted for Publication
- Frazier, P.I.¹ Powell, W.B.² Dayanik, S.³

7
- 84891584370
- John Wiley and Sons, New York
- J. Gittins, Multi-Armed Bandit Allocation Indices. John Wiley and Sons, New York, 1989.
- (1989) Multi-Armed Bandit Allocation Indices
- Gittins, J.¹

8
- 0000511415
- Bayesian look ahead one stage sampling allocations for selecting the largest normal mean
- S. Gupta and K. Miescke, "Bayesian look ahead one stage sampling allocations for selecting the largest normal mean," Statistical Papers, Vol. 35, 169-177, 1994.
- (1994) Statistical Papers , vol.35 , pp. 169-177
- Gupta, S.¹ Miescke, K.²

10
- 39549108095
- Ranking inequality: Applications of multivariate subset selection
- W.C. Horrace, J.T. Marchand and T.M. Smeeding, "Ranking inequality: Applications of multivariate subset selection," Journal of Economic Inequality, Vol. 6, no. 1, 5-32, 2008.
- (2008) Journal of Economic Inequality , vol.6 , Issue.1 , pp. 5-32
- Horrace, W.C.¹ Marchand, J.T.² Smeeding, T.M.³

11
- 0004280606
- MIT Press, Cambridge, MA
- L.P. Kaelbling, Learning in Embedded Systems. MIT Press, Cambridge, MA, 1993.
- (1993) Learning in Embedded Systems
- Kaelbling, L.P.¹

12
- 0023345261
- MULTI-ARMED BANDIT PROBLEM: DECOMPOSITION AND COMPUTATION.
- M. Katehakis and A.F. Veinott Jr., "The Multi-Armed Bandit Problem: Decomposition And Computation," Math. of OR, Vol. 12, no. 2, pp. 262-268, 1987. (Pubitemid 17603261)
- (1987) Mathematics of Operations Research , vol.12 , Issue.2 , pp. 262-268
- Katehakis Michael, N.¹ Veinott Jr., F.²

13
- 0003971926
- 2nd ed. CRC Press
- A.J. Miller, Subset Selection in Regression, 2nd ed. CRC Press, 2002.
- (2002) Subset Selection in Regression
- Miller, A.J.¹

15
- 47349092417
- John Wiley and Sons, New York
- W.B. Powell, Approximate Dynamic Programming: Solving the curses of dimensionality. John Wiley and Sons, New York, 2007.
- (2007) Approximate Dynamic Programming: Solving the curses of dimensionality
- Powell, W.B.¹

16
- 67650386778
- submitted for publication
- I.O. Ryzhov, W.B. Powell and P.I. Frazier, "The knowledge gradient algorithm for a general class of online learning problems," 2008, submitted for publication.
- (2008) The Knowledge Gradient Algorithm for a General Class of Online Learning Problems
- Ryzhov, I.O.¹ Powell, W.B.² Frazier, P.I.³

17
- 33646406807
- Multi-armed bandit algorithms and empirical evaluation
- J. Vermorel and M. Mohri, "Multi-armed bandit algorithms and empirical evaluation," Proceedings of the 16th European Conference on Machine Learning, 437-448, 2005.
- (2005) Proceedings of the 16th European Conference on Machine Learning , pp. 437-448
- Vermorel, J.¹ Mohri, M.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.