SCOPUS 정보 검색 플랫폼

Statistics in Biosciences

Volumn 6, Issue 2, 2014, Pages 223-243

Q-Learning: Flexible Learning About Useful Utilities

(3) Moodie, Erica E M a Dean, Nema b Sun, Yue Ru c

a MCGILL UNIVERSITY (Canada)

b UNIVERSITY OF GLASGOW (United Kingdom)

c MCGILL UNIVERSITY (Canada)

Author keywords

Adaptive treatment strategies; Discrete data; Dynamic treatment regimes; Generalized additive models; Personalized medicine; Q learning

Indexed keywords

ARTICLE; COMMUNITY PROGRAM; CONFOUNDING VARIABLE; DYNAMICS; LEARNING; OUTCOME ASSESSMENT; PROBABILITY; PROPENSITY SCORE; Q LEARNING; SAMPLE SIZE; SIMULATION;

EID: 84912121111 PISSN: 18671764 EISSN: 18671772 Source Type: Journal
DOI: 10.1007/s12561-013-9103-z Document Type: Article

Times cited : (78)

References (33)

1
- 78650878764
- Dynamic treatment regimes for managing chronic health conditions: A statistical perspective
- Chakraborty B (2011) Dynamic treatment regimes for managing chronic health conditions: A statistical perspective. Am J Publ Health 101(1):40–45
- (2011) Am J Publ Health , vol.101 , Issue.1 , pp. 40-45
- Chakraborty, B.¹

2
- 84912130870
- (submitted)
- Chakraborty B, Laber EB, Zhao Y (2013) Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme (submitted)
- (2013) Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme
- Chakraborty, B.¹ Laber, E.B.² Zhao, Y.³

3
- 84921442020
- (submitted)
- Chakraborty B, Moodie EEM (2013) Estimating optimal dynamic treatment regimes with shared decision rules across stages: An extension of Q-learning (submitted)
- (2013) Estimating optimal dynamic treatment regimes with shared decision rules across stages: An extension of Q-learning
- Chakraborty, B.¹ Moodie, E.E.M.²

4
- 77954420461
- Inference for non-regular parameters in optimal dynamic treatment regimes
- Chakraborty B, Murphy SA, Strecher V (2010) Inference for non-regular parameters in optimal dynamic treatment regimes. Stat Methods Med Res 19(3):317–343
- (2010) Stat Methods Med Res , vol.19 , Issue.3 , pp. 317-343
- Chakraborty, B.¹ Murphy, S.A.² Strecher, V.³

5
- 0037986382
- Background and rationale for the sequenced treatment alternatives to relieve depression (STAR*D) study
- Fava M, Rush AJ, Trivedi MH, Nierenberg AA, Thase ME, Sackeim HA, Quitkin FM, Wisniewski S, Lavori PW, Rosenbaum JF, Kupfer DJ (2003) Background and rationale for the sequenced treatment alternatives to relieve depression (STAR*D) study. Psychiatr Clin North Am 26(2):457–494
- (2003) Psychiatr Clin North Am , vol.26 , Issue.2 , pp. 457-494
- Fava, M.¹ Rush, A.J.² Trivedi, M.H.³ Nierenberg, A.A.⁴ Thase, M.E.⁵ Sackeim, H.A.⁶ Quitkin, F.M.⁷ Wisniewski, S.⁸ Lavori, P.W.⁹ Rosenbaum, J.F.¹⁰ Kupfer, D.J.¹¹

6
- 32044449925
- Generalized cross-validation as a method for choosing a good ridge parameter
- Golub G, Heath M, Wahba G (1979) Generalized cross-validation as a method for choosing a good ridge parameter. Technometrics 21:215–224
- (1979) Technometrics , vol.21 , pp. 215-224
- Golub, G.¹ Heath, M.² Wahba, G.³

7
- 84972488102
- Generalized additive models
- Hastie T, Tibshirani R (1986) Generalized additive models. Stat Sci 1(3):297–318
- (1986) Stat Sci , vol.1 , Issue.3 , pp. 297-318
- Hastie, T.¹ Tibshirani, R.²

8
- 0003598526
- Chapman & Hall, London
- Hastie T, Tibshirani R (1990) Generalized additive models. Chapman & Hall, London
- (1990) Generalized additive models
- Hastie, T.¹ Tibshirani, R.²

9
- 84867329090
- Analysis of multi-stage treatments for recurrent diseases
- Huang X, Ning J (2012) Analysis of multi-stage treatments for recurrent diseases. Stat Med 31:2805–2821
- (2012) Stat Med , vol.31 , pp. 2805-2821
- Huang, X.¹ Ning, J.²

10
- 0001462696
- L, cross-validation and generalized cross-validation: Discrete index set
- L, cross-validation and generalized cross-validation: Discrete index set. Ann Stat 15:958–975
- (1987) Ann Stat , vol.15 , pp. 958-975
- Li, K.C.¹

11
- 84869185228
- Q-learning for estimating optimal dynamic treatment rules from observational data
- Moodie EEM, Chakraborty B, Kramer MS (2012) Q-learning for estimating optimal dynamic treatment rules from observational data. Can J Stat 40:629–645
- (2012) Can J Stat , vol.40 , pp. 629-645
- Moodie, E.E.M.¹ Chakraborty, B.² Kramer, M.S.³

12
- 77949537979
- Estimating optimal dynamic regimes: Correcting bias under the null
- Moodie EEM, Richardson TS (2010) Estimating optimal dynamic regimes: Correcting bias under the null. Scand J Stat 37:126–146
- (2010) Scand J Stat , vol.37 , pp. 126-146
- Moodie, E.E.M.¹ Richardson, T.S.²

13
- 33846260190
- Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders
- Murphy SA, Oslin DW, Rush AJ, Zhu J (2007) Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders. Neuropsychopharmacology 32:257–262
- (2007) Neuropsychopharmacology , vol.32 , pp. 257-262
- Murphy, S.A.¹ Oslin, D.W.² Rush, A.J.³ Zhu, J.⁴

14
- 23244437791
- A generalization error for Q-learning
- Murphy SA (2005) A generalization error for Q-learning. J Mach Learn Res 6:1073–1097
- (2005) J Mach Learn Res , vol.6 , pp. 1073-1097
- Murphy, S.A.¹

15
- 84874401815
- Q-Learning: A data analysis method for constructing adaptive interventions
- Nahum-Shani I, Qian M, Almirall D, Pelham WE, Gnagy B, Fabiano GA, Waxmonsky JG, Yu J, Murphy SA (2012) Q-Learning: A data analysis method for constructing adaptive interventions. Psychol Methods 17:478–494
- (2012) Psychol Methods , vol.17 , pp. 478-494
- Nahum-Shani, I.¹ Qian, M.² Almirall, D.³ Pelham, W.E.⁴ Gnagy, B.⁵ Fabiano, G.A.⁶ Waxmonsky, J.G.⁷ Yu, J.⁸ Murphy, S.A.⁹

16
- 84863304598
- R Foundation for Statistical Computing, Vienna
- R Core Team (2012) R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna. ISBN 3-900051-07-0
- (2012) R: A language and environment for statistical computing
- R Core Team¹

17
- 0033847784
- Marginal structural models and causal inference in epidemiology
- Robins JM, Hernán MA, Brumback B (2000) Marginal structural models and causal inference in epidemiology. Epidemiology 11:550–560
- (2000) Epidemiology , vol.11 , pp. 550-560
- Robins, J.M.¹ Hernán, M.A.² Brumback, B.³

18
- 33845913126
- Optimal structural nested models for optimal sequential decisions
- Lin DY, Heagerty P, (eds), Springer, New York
- Robins JM (2004) Optimal structural nested models for optimal sequential decisions. In: Lin DY, Heagerty P (eds) Proceedings of the second Seattle symposium on biostatistics. Springer, New York, pp 189–326
- (2004) Proceedings of the second Seattle symposium on biostatistics , pp. 189-326
- Robins, J.M.¹

19
- 77951622706
- The central role of the propensity score in observational studies for causal effects
- Rosenbaum PR, Rubin DB (1983) The central role of the propensity score in observational studies for causal effects. Biometrika 70:41–55
- (1983) Biometrika , vol.70 , pp. 41-55
- Rosenbaum, P.R.¹ Rubin, D.B.²

20
- 33845907187
- Estimation of optimal dynamic anticoagulation regimes from observational data: A regret-based approach
- Rosthoj S, Fullwood C, Henderson R, Stewart S (2006) Estimation of optimal dynamic anticoagulation regimes from observational data: A regret-based approach. Stat Med 25:4197–4215
- (2006) Stat Med , vol.25 , pp. 4197-4215
- Rosthoj, S.¹ Fullwood, C.² Henderson, R.³ Stewart, S.⁴

21
- 0034752199
- National institute of mental health clinical antipsychotic trials of intervention effectiveness (CATIE): Alzheimer disease trial methodology
- Schneider LS, Tariot PN, Lyketsos CG, Dagerman KS, Davis KL, Davis S (2001) National institute of mental health clinical antipsychotic trials of intervention effectiveness (CATIE): Alzheimer disease trial methodology. Am J Geriatr Psychiatry 9:346–360
- (2001) Am J Geriatr Psychiatry , vol.9 , pp. 346-360
- Schneider, L.S.¹ Tariot, P.N.² Lyketsos, C.G.³ Dagerman, K.S.⁴ Davis, K.L.⁵ Davis, S.⁶

22
- 84864102291
- Estimating the optimal dynamic antipsychotic treatment regime: Evidence from the sequential-multiple assignment randomized CATIE schizophrenia study
- Shortreed SM, Moodie EEM (2012) Estimating the optimal dynamic antipsychotic treatment regime: Evidence from the sequential-multiple assignment randomized CATIE schizophrenia study. J R Stat Soc, Ser B, Stat Methodol 61:577–599
- (2012) J R Stat Soc, Ser B, Stat Methodol , vol.61 , pp. 577-599
- Shortreed, S.M.¹ Moodie, E.E.M.²

23
- 80054684086
- (submitted)
- Song R, Wang W, Zeng D, Kosorok MR (2013) Penalized Q-learning for dynamic treatment regimes (submitted)
- (2013) Penalized Q-learning for dynamic treatment regimes
- Song, R.¹ Wang, W.² Zeng, D.³ Kosorok, M.R.⁴

24
- 0004102479
- MIT Press, Cambridge
- Sutton RS, Barto AG (1998) Reinforcement learning: An introduction. MIT Press, Cambridge
- (1998) Reinforcement learning: An introduction
- Sutton, R.S.¹ Barto, A.G.²

25
- 0034732660
- Evaluating multiple treatment courses in clinical trials
- Thall PF, Millikan RE, Sung HG (2000) Evaluating multiple treatment courses in clinical trials. Stat Med 30:1011–1128
- (2000) Stat Med , vol.30 , pp. 1011-1128
- Thall, P.F.¹ Millikan, R.E.² Sung, H.G.³

26
- 0036489042
- Selecting therapeutic strategies based on efficacy and death in multicourse clinical trials
- Thall PF, Sung HG, Estey EH (2002) Selecting therapeutic strategies based on efficacy and death in multicourse clinical trials. J Am Stat Assoc 97(457):29–39
- (2002) J Am Stat Assoc , vol.97 , Issue.457 , pp. 29-39
- Thall, P.F.¹ Sung, H.G.² Estey, E.H.³

27
- 84861473763
- Basic Books, New York
- Topol E (2012) Creative destruction of medicine: How the digital revolution and personalized medicine will create better health care. Basic Books, New York
- (2012) Creative destruction of medicine: How the digital revolution and personalized medicine will create better health care
- Topol, E.¹

28
- 4944226585
- Stable and efficient multiple smoothing parameter estimation for generalized additive models
- Wood SN (2004) Stable and efficient multiple smoothing parameter estimation for generalized additive models. J Am Stat Assoc 99(467):673–686
- (2004) J Am Stat Assoc , vol.99 , Issue.467 , pp. 673-686
- Wood, S.N.¹

29
- 85153659511
- Chapman & Hall, London
- Wood SN (2006) Generalized additive models: An introduction with R. Chapman & Hall, London
- (2006) Generalized additive models: An introduction with R
- Wood, S.N.¹

30
- 78650862532
- Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models
- Wood SN (2011) Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J R Stat Soc B 73(1):3–36
- (2011) J R Stat Soc B , vol.73 , Issue.1 , pp. 3-36
- Wood, S.N.¹

31
- 84912109301
- R package version 1.0
- Xin J, Chakraborty B, Laber EB (2012) qLearn: Estimation and inference for Q-learning. R package version 1.0
- (2012) QLearn: Estimation and inference for Q-learning
- Xin, J.¹ Chakraborty, B.² Laber, E.B.³

32
- 70449449564
- Reinforcement learning design for cancer clinical trials
- Zhao Y, Kosorok MR, Zeng D (2009) Reinforcement learning design for cancer clinical trials. Stat Med 28:3294–3315
- (2009) Stat Med , vol.28 , pp. 3294-3315
- Zhao, Y.¹ Kosorok, M.R.² Zeng, D.³

33
- 83655181241
- Reinforcement learning strategies for clinical trials in non-small cell lung cancer
- Zhao Y, Zeng D, Socinski MA, Kosorok MR (2011) Reinforcement learning strategies for clinical trials in non-small cell lung cancer. Biometrics 67(4):1422–1433
- (2011) Biometrics , vol.67 , Issue.4 , pp. 1422-1433
- Zhao, Y.¹ Zeng, D.² Socinski, M.A.³ Kosorok, M.R.⁴

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.