메뉴 건너뛰기




Volumn 25, Issue 3, 2015, Pages 901-920

Penalized Q-learning for dynamic treatment regimens

Author keywords

Dynamic treatment regimen; Individual selection; Multistage; Penalized Q learning; Q learning; Shrinkage; Two stage procedure

Indexed keywords


EID: 84990041926     PISSN: 10170405     EISSN: None     Source Type: Journal    
DOI: 10.5705/ss.2012.364     Document Type: Article
Times cited : (76)

References (25)
  • 1
    • 34548275795 scopus 로고    scopus 로고
    • The dantzig selector: Statistical estimation when p is much larger than n (with discussion)
    • Candes, E. and Tao, T. (2007). The dantzig selector: Statistical estimation when p is much larger than n (with discussion). Ann. Statist. 35, 2313-2404.
    • (2007) Ann. Statist. , vol.35 , pp. 2313-2404
    • Candes, E.1    Tao, T.2
  • 2
    • 77954420461 scopus 로고    scopus 로고
    • Inference for non-regular parameters in optimal dynamic treatment regimes
    • Chakraborty, B., Murphy, S. and Strecher, V. (2010). Inference for non-regular parameters in optimal dynamic treatment regimes. Statist. Meth. Medical Res. 19, 317-343.
    • (2010) Statist. Meth. Medical Res. , vol.19 , pp. 317-343
    • Chakraborty, B.1    Murphy, S.2    Strecher, V.3
  • 3
    • 84901199092 scopus 로고    scopus 로고
    • Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme
    • Chakraborty, B., Laber, E. and Zhao, Y. (2013). Inference for optimal dynamic treatment regimes using an adaptive m-out-of-n bootstrap scheme. Biometrics 69, 714-723.
    • (2013) Biometrics , vol.69 , pp. 714-723
    • Chakraborty, B.1    Laber, E.2    Zhao, Y.3
  • 4
    • 1542784498 scopus 로고    scopus 로고
    • Variable selection via nonconcave penalized likelihood and its oracle properties
    • Fan, J. and Li, R. (2001). Variable selection via nonconcave penalized likelihood and its oracle properties. J. Amer. Statist. Assoc. 96, 1348-1360.
    • (2001) J. Amer. Statist. Assoc. , vol.96 , pp. 1348-1360
    • Fan, J.1    Li, R.2
  • 6
    • 84952149204 scopus 로고
    • A statistical view of some chemometrics regression tools (with discussion)
    • Frank, I. E. and Friedman, J. H. (1993). A statistical view of some chemometrics regression tools (with discussion). Technometrics 35, 109-148.
    • (1993) Technometrics , vol.35 , pp. 109-148
    • Frank, I.E.1    Friedman, J.H.2
  • 7
    • 84990031243 scopus 로고    scopus 로고
    • Impossibility results for nondifferentiable functionals
    • To appear
    • Hirano, K. and Porter, J. R. (To appear). Impossibility results for nondifferentiable functionals. Econometrica.
    • Econometrica
    • Hirano, K.1    Porter, J.R.2
  • 9
    • 0034364917 scopus 로고    scopus 로고
    • A design for testing clinical strategies: Biased adaptive withinsubject randomization
    • Lavori, P.W. and Dawson, A. (2000). A design for testing clinical strategies: Biased adaptive withinsubject randomization. J. Roy. Statist. Soc. A 163, 29-38.
    • (2000) J. Roy. Statist. Soc. A , vol.163 , pp. 29-38
    • Lavori, P.W.1    Dawson, A.2
  • 10
    • 0036192067 scopus 로고    scopus 로고
    • Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials
    • Lunceford, J., Davidian, M. and Tsiatis, A. (2002). Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials. Biometrics 58, 48-57.
    • (2002) Biometrics , vol.58 , pp. 48-57
    • Lunceford, J.1    Davidian, M.2    Tsiatis, A.3
  • 11
    • 69949172005 scopus 로고    scopus 로고
    • Estimating response-maximized decision rules with applications to breastfeeding
    • Moodie, E. E. M., Platt, R. W. and Kramer, M. S. (2009). Estimating response-maximized decision rules with applications to breastfeeding. J. Amer. Statist. Assoc. 104, 155-165.
    • (2009) J. Amer. Statist. Assoc. , vol.104 , pp. 155-165
    • Moodie, E.E.M.1    Platt, R.W.2    Kramer, M.S.3
  • 12
    • 77949537979 scopus 로고    scopus 로고
    • Estimating optimal dynamic regimes: Correcting bias under the null
    • Moodie, E. E. M. and Richardson, T. S. (2010). Estimating optimal dynamic regimes: Correcting bias under the null. Scand. J. Statist. 37, 126-146.
    • (2010) Scand. J. Statist. , vol.37 , pp. 126-146
    • Moodie, E.E.M.1    Richardson, T.S.2
  • 13
    • 0038107066 scopus 로고    scopus 로고
    • Optimal dynamic treatment regimes
    • Murphy, S. (2003). Optimal dynamic treatment regimes. J. Roy. Statist. Soc. Ser. B 65, 331-355.
    • (2003) J. Roy. Statist. Soc. Ser. B , vol.65 , pp. 331-355
    • Murphy, S.1
  • 14
    • 19144362679 scopus 로고    scopus 로고
    • An experimental design for the development of adaptive treatment strategies
    • Murphy, S. (2005). An experimental design for the development of adaptive treatment strategies. Statist. Medicine 24, 1455-1481.
    • (2005) Statist. Medicine , vol.24 , pp. 1455-1481
    • Murphy, S.1
  • 16
    • 0034732660 scopus 로고    scopus 로고
    • Evaluating multiple treatment courses in clinical trials
    • Thall, P., Millikan, R. and Sung, H. (2000). Evaluating multiple treatment courses in clinical trials. Statist. Medicine 19, 1011-1028.
    • (2000) Statist. Medicine , vol.19 , pp. 1011-1028
    • Thall, P.1    Millikan, R.2    Sung, H.3
  • 17
    • 0036489042 scopus 로고    scopus 로고
    • Selecting therapeutic strategies based on efficacy and death in multicourse clinical trials
    • Thall, P., Sung, H. and Estey, E. (2002). Selecting therapeutic strategies based on efficacy and death in multicourse clinical trials. J. Amer. Statist. Assoc. 97, 29-39.
    • (2002) J. Amer. Statist. Assoc. , vol.97 , pp. 29-39
    • Thall, P.1    Sung, H.2    Estey, E.3
  • 18
    • 35648935906 scopus 로고    scopus 로고
    • Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring
    • Thall, P. F.,Wooten, L. H., Logothetis, C. J., Millikan, R. E. and Tannir, N. M. (2007). Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring. Statist. Medicine 26, 4687-4702.
    • (2007) Statist. Medicine , vol.26 , pp. 4687-4702
    • Thall, P.F.1    Wooten, L.H.2    Logothetis, C.J.3    Millikan, R.E.4    Tannir, N.M.5
  • 19
    • 85194972808 scopus 로고    scopus 로고
    • Regression shrinkage and selection via the lasso
    • Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. J. Roy. Statist. Soc. Ser. B 58, 267-288.
    • (1996) J. Roy. Statist. Soc. Ser. B , vol.58 , pp. 267-288
    • Tibshirani, R.1
  • 20
    • 33644973742 scopus 로고    scopus 로고
    • Semiparametric efficient estimation of survival distributions in two-stage randomisation designs in clinical trials with censored data
    • Wahed, A. and Tsiatis, A. (2006). Semiparametric efficient estimation of survival distributions in two-stage randomisation designs in clinical trials with censored data. Biometrika 93, 163-177.
    • (2006) Biometrika , vol.93 , pp. 163-177
    • Wahed, A.1    Tsiatis, A.2
  • 21
    • 1642358199 scopus 로고    scopus 로고
    • Optimal estimator for the survival distribution and related quantities for treatment policies in two-stage randomised designs in clinical trials
    • Wahed, A. S. and Tsiatis, A. A. (2004). Optimal estimator for the survival distribution and related quantities for treatment policies in two-stage randomised designs in clinical trials. Biometrics 60, 124-133.
    • (2004) Biometrics , vol.60 , pp. 124-133
    • Wahed, A.S.1    Tsiatis, A.A.2
  • 22
    • 70449449564 scopus 로고    scopus 로고
    • Reinforcement learning design for cancer clinical trials
    • Zhao, Y., Kosorok, M. R. and Zeng, D. (2009). Reinforcement learning design for cancer clinical trials. Statist. Medicine 28, 3294-3315.
    • (2009) Statist. Medicine , vol.28 , pp. 3294-3315
    • Zhao, Y.1    Kosorok, M.R.2    Zeng, D.3
  • 23
    • 83655181241 scopus 로고    scopus 로고
    • Reinforcement learning strategies for clinical trials in non-small cell lung cancer
    • Zhao, Y., Zeng, D., Socinski, M. and Kosorok, M. (2011). Reinforcement learning strategies for clinical trials in non-small cell lung cancer. Biometrics 67, 1422-1433.
    • (2011) Biometrics , vol.67 , pp. 1422-1433
    • Zhao, Y.1    Zeng, D.2    Socinski, M.3    Kosorok, M.4
  • 24
    • 33846114377 scopus 로고    scopus 로고
    • The adaptive lasso and its oracle properties
    • Zou, H. (2006). The adaptive lasso and its oracle properties. J. Amer. Statist. Assoc. 101, 1418-1429.
    • (2006) J. Amer. Statist. Assoc. , vol.101 , pp. 1418-1429
    • Zou, H.1
  • 25
    • 51049104549 scopus 로고    scopus 로고
    • One-step sparse estimates in nonconcave penalized likelihood models
    • Zou, H. and Li, R. (2008). One-step sparse estimates in nonconcave penalized likelihood models. Ann. Statist. 36, 1509-1533.
    • (2008) Ann. Statist. , vol.36 , pp. 1509-1533
    • Zou, H.1    Li, R.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.