메뉴 건너뛰기




Volumn 40, Issue 1, 2012, Pages 529-560

Q-learning with censored data

Author keywords

Generalization error; Q learning; Reinforcement learning; Survival analysis

Indexed keywords


EID: 84861349230     PISSN: 00905364     EISSN: None     Source Type: Journal    
DOI: 10.1214/12-AOS968     Document Type: Article
Times cited : (129)

References (43)
  • 2
    • 85012688561 scopus 로고
    • Princeton Univ. Press, Princeton, NJ. MR00904777
    • BELLMAN, R. (1957). Dynamic Programming. Princeton Univ. Press, Princeton, NJ. MR00904777
    • (1957) Dynamic Programming
    • Bellman, R.1
  • 3
    • 0031921607 scopus 로고    scopus 로고
    • Feed forward neural networks for the analysis of censored survival data: A partial logistic regression approach
    • BIGANZOLI, E., BORACCHI, P., MARIANI, L. and MARUBINI, E. (1998). Feed forward neural networks for the analysis of censored survival data: A partial logistic regression approach. Stat. Med. 17 1169-1186.
    • (1998) Stat. Med. , vol.17 , pp. 1169-1186
    • Biganzoli, E.1    Boracchi, P.2    Mariani, L.3    Marubini, E.4
  • 4
    • 0033227358 scopus 로고    scopus 로고
    • A Dvoretzky-Kiefer-Wolfowitz type inequality for the Kaplan-Meier estimator
    • MR17257099
    • BITOUZÉ, D., LAURENT, B. and MASSART, P. (1999). A Dvoretzky-Kiefer-Wolfowitz type inequality for the Kaplan-Meier estimator. Ann. Inst. Henri Poincaré Probab. Stat. 35 735-763. MR17257099
    • (1999) Ann. Inst. Henri Poincaré Probab. Stat. , vol.35 , pp. 735-763
    • Bitouzé, D.1    Laurent, B.2    Massart, P.3
  • 5
    • 0035179219 scopus 로고    scopus 로고
    • Causal inference on the difference of the restricted mean lifetime between two groups
    • MR19504188
    • CHEN, P.-Y. and TSIATIS, A. A. (2001). Causal inference on the difference of the restricted mean lifetime between two groups. Biometrics 57 1030-1038. MR19504188
    • (2001) Biometrics , vol.57 , pp. 1030-1038
    • Chen, P.-Y.1    Tsiatis, A.A.2
  • 8
    • 0030925038 scopus 로고    scopus 로고
    • Use of Irwin's restricted mean as an index for comparing survival in different treatment groups - Interpretation and power considerations
    • KARRISON, T. G. (1997). Use of Irwin's restricted mean as an index for comparing survival in different treatment groups - interpretation and power considerations. Control Clin. Trials 18 151- 167.
    • (1997) Control Clin. Trials , vol.18 , pp. 151-167
    • Karrison, T.G.1
  • 12
    • 85047692405 scopus 로고    scopus 로고
    • Dynamic treatment regimes: Practical design considerations
    • LAVORI, P. W. and DAWSON, R. (2004). Dynamic treatment regimes: Practical design considerations. Clin. Trials 1 9-20.
    • (2004) Clin. Trials , vol.1 , pp. 9-20
    • Lavori, P.W.1    Dawson, R.2
  • 13
    • 0036192067 scopus 로고    scopus 로고
    • Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials
    • MR18910422
    • LUNCEFORD, J. K., DAVIDIAN, M. and TSIATIS, A. A. (2002). Estimation of survival distributions of treatment policies in two-stage randomization designs in clinical trials. Biometrics 58 48-57. MR18910422
    • (2002) Biometrics , vol.58 , pp. 48-57
    • Lunceford, J.K.1    Davidian, M.2    Tsiatis, A.A.3
  • 14
    • 77958531196 scopus 로고    scopus 로고
    • Weighted Kaplan-Meier estimators for two-stage treatment regimes
    • MR27569455
    • MIYAHARA, S. and WAHED, A. S. (2010). Weighted Kaplan-Meier estimators for two-stage treatment regimes. Stat. Med. 29 2581-2591. MR27569455
    • (2010) Stat. Med. , vol.29 , pp. 2581-2591
    • Miyahara, S.1    Wahed, A.S.2
  • 15
    • 33947624563 scopus 로고    scopus 로고
    • Demystifying optimal dynamic treatment regimes
    • MR23708033
    • MOODIE, E. E. M., RICHARDSON, T. S. and STEPHENS, D. A. (2007). Demystifying optimal dynamic treatment regimes. Biometrics 63 447-455. MR23708033
    • (2007) Biometrics , vol.63 , pp. 447-455
    • Moodie, E.E.M.1    Richardson, T.S.2    Stephens, D.A.3
  • 16
  • 17
    • 19144362679 scopus 로고    scopus 로고
    • An experimental design for the development of adaptive treatment strategies
    • MR21376511
    • MURPHY, S. A. (2005a). An experimental design for the development of adaptive treatment strategies. Stat. Med. 24 1455-1481. MR21376511
    • (2005) Stat. Med. , vol.24 , pp. 1455-1481
    • Murphy, S.A.1
  • 18
    • 23244437791 scopus 로고    scopus 로고
    • A generalization error for Q-learning
    • (electronic). MR22498499
    • MURPHY, S. A. (2005b). A generalization error for Q-learning. J. Mach. Learn. Res. 6 1073-1097 (electronic). MR22498499
    • (2005) J. Mach. Learn. Res. , vol.6 , pp. 1073-1097
    • Murphy, S.A.1
  • 19
    • 33846260190 scopus 로고    scopus 로고
    • Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders
    • MURPHY, S. A., OSLIN, D. W., RUSH, A. J., ZHU, J. and MCATS (2007). Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders. Neuropsychopharmacology 32 257-262.
    • (2007) Neuropsychopharmacology , vol.32 , pp. 257-262
    • Murphy, S.A.1    Oslin, D.W.2    Rush, A.J.3    Zhu, J.4    Mcats5
  • 20
    • 77950539248 scopus 로고    scopus 로고
    • Dynamic regime marginal structural mean models for estimation of optimal dynamic treatment regimes, Part I: Main content
    • Art. 8. MR26025511
    • ORELLANA, L., ROTNITZKY, A. and ROBINS, J. M. (2010). Dynamic regime marginal structural mean models for estimation of optimal dynamic treatment regimes, Part I: Main content. Int. J. Biostat. 6 Art. 8, 49. MR26025511
    • (2010) Int. J. Biostat. , vol.6 , pp. 49
    • Orellana, L.1    Rotnitzky, A.2    Robins, J.M.3
  • 21
    • 0001415308 scopus 로고    scopus 로고
    • Association, causation, and marginal structural models
    • MR17667766
    • ROBINS, J. M. (1999). Association, causation, and marginal structural models. Synthese 121 151- 179. MR17667766
    • (1999) Synthese , vol.121 , pp. 151-179
    • Robins, J.M.1
  • 22
    • 33845913126 scopus 로고    scopus 로고
    • Optimal structural nested models for optimal sequential decisions
    • (D. Lin and P. J. Heagerty, eds.), Springer, New York. MR21294022
    • ROBINS, J. M. (2004). Optimal structural nested models for optimal sequential decisions. In Proceedings of the Second Seattle Symposium in Biostatistics (D. Lin and P. J. Heagerty, eds.) 189- 326. Springer, New York. MR21294022
    • (2004) Proceedings of the Second Seattle Symposium in Biostatistics , pp. 189-326
    • Robins, J.M.1
  • 23
    • 53849122359 scopus 로고    scopus 로고
    • Estimation and extrapolation of optimal treatment and testing strategies
    • MR25285766
    • ROBINS, J., ORELLANA, L. and ROTNITZKY, A. (2008). Estimation and extrapolation of optimal treatment and testing strategies. Stat. Med. 27 4678-4721. MR25285766
    • (2008) Stat. Med. , vol.27 , pp. 4678-4721
    • Robins, J.1    Orellana, L.2    Rotnitzky, A.3
  • 24
    • 84888862680 scopus 로고
    • Estimation of regression coefficients when some regressors are not always observed
    • MR12947300
    • ROBINS, J. M., ROTNITZKY, A. and ZHAO, L. P. (1994). Estimation of regression coefficients when some regressors are not always observed. J. Amer. Statist. Assoc. 89 846-866. MR12947300
    • (1994) J. Amer. Statist. Assoc. , vol.89 , pp. 846-866
    • Robins, J.M.1    Rotnitzky, A.2    Zhao, L.P.3
  • 25
    • 0035596127 scopus 로고    scopus 로고
    • The Kaplan-Meier estimator as an inverse-probability-ofcensoring weighted average
    • MR19472666
    • SATTEN, G. A. and DATTA, S. (2001). The Kaplan-Meier estimator as an inverse-probability-ofcensoring weighted average. Amer. Statist. 55 207-210. MR19472666
    • (2001) Amer. Statist. , vol.55 , pp. 207-210
    • Satten, G.A.1    Datta, S.2
  • 26
    • 58549086326 scopus 로고    scopus 로고
    • Support vector censored quantile regression under random censoring
    • MR26570577
    • SHIM, J. and HWANG, C. (2009). Support vector censored quantile regression under random censoring. Comput. Statist. Data Anal. 53 912-919. MR26570577
    • (2009) Comput. Statist. Data Anal. , vol.53 , pp. 912-919
    • Shim, J.1    Hwang, C.2
  • 29
    • 42049107120 scopus 로고    scopus 로고
    • Considerations for second-line therapy of nonsmall cell lung cancer
    • STINCHCOMBE, T. E. and SOCINSKI, M. A. (2008). Considerations for second-line therapy of nonsmall cell lung cancer. Oncologist 13 28-36.
    • (2008) Oncologist , vol.13 , pp. 28-36
    • Stinchcombe, T.E.1    Socinski, M.A.2
  • 31
    • 35648935906 scopus 로고    scopus 로고
    • Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring
    • MR24133922
    • THALL, P. F., WOOTEN, L. H., LOGOTHETIS, C. J., MILLIKAN, R. E. and TANNIR, N. M. (2007). Bayesian and frequentist two-stage treatment strategies based on sequential failure times subject to interval censoring. Stat. Med. 26 4687-4702. MR24133922
    • (2007) Stat. Med. , vol.26 , pp. 4687-4702
    • Thall, P.F.1    Wooten, L.H.2    Logothetis, C.J.3    Millikan, R.E.4    Tannir, N.M.5
  • 32
    • 0029752470 scopus 로고    scopus 로고
    • Feature-based methods for large scale dynamic programming
    • TSITSIKLIS, J. N. and VAN ROY, B. (1996). Feature-based methods for large scale dynamic programming. Machine Learning 22 59-94.
    • (1996) Machine Learning , vol.22 , pp. 59-94
    • Tsitsiklis, J.N.1    Van Roy, B.2
  • 33
    • 33847633000 scopus 로고    scopus 로고
    • Causal effect models for realistic individualized treatment and intention to treat rules
    • Art. 3. MR23068411
    • VAN DER LAAN, M. J. and PETERSEN, M. L. (2007). Causal effect models for realistic individualized treatment and intention to treat rules. Int. J. Biostat. 3 Art. 3, 54. MR23068411
    • (2007) Int. J. Biostat. , vol.3 , pp. 54
    • Van Der Laan, M.J.1    Petersen, M.L.2
  • 36
    • 60949104694 scopus 로고    scopus 로고
    • Estimation of survival quantiles in two-stage randomization designs
    • MR24975600
    • WAHED, A. S. (2009). Estimation of survival quantiles in two-stage randomization designs. J. Statist. Plann. Inference 139 2064-2075. MR24975600
    • (2009) J. Statist. Plann. Inference , vol.139 , pp. 2064-2075
    • Wahed, A.S.1
  • 37
    • 33644973742 scopus 로고    scopus 로고
    • Semiparametric efficient estimation of survival distributions in two-stage randomisation designs in clinical trials with censored data
    • MR22777488
    • WAHED, A. S. and TSIATIS, A. A. (2006). Semiparametric efficient estimation of survival distributions in two-stage randomisation designs in clinical trials with censored data. Biometrika 93 163-177. MR22777488
    • (2006) Biometrika , vol.93 , pp. 163-177
    • Wahed, A.S.1    Tsiatis, A.A.2
  • 40
    • 37649015731 scopus 로고    scopus 로고
    • On an exponential bound for the Kaplan-Meier estimator
    • MR23942844
    • WELLNER, J. A. (2007). On an exponential bound for the Kaplan-Meier estimator. Lifetime Data Anal. 13 481-496. MR23942844
    • (2007) Lifetime Data Anal. , vol.13 , pp. 481-496
    • Wellner, J.A.1
  • 41
    • 70449449564 scopus 로고    scopus 로고
    • Reinforcement learning design for cancer clinical trials
    • MR27502777
    • ZHAO, Y., KOSOROK, M. R. and ZENG, D. (2009). Reinforcement learning design for cancer clinical trials. Stat. Med. 28 3294-3315. MR27502777
    • (2009) Stat. Med. , vol.28 , pp. 3294-3315
    • Zhao, Y.1    Kosorok, M.R.2    Zeng, D.3
  • 42
    • 83655181241 scopus 로고    scopus 로고
    • Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer
    • ZHAO, Y., ZENG, D., SOCINSKI, M. A. and KOSOROK, M. R. (2011). Reinforcement learning strategies for clinical trials in nonsmall cell lung cancer. Biometrics 67 1422-1433.
    • (2011) Biometrics , vol.67 , pp. 1422-1433
    • Zhao, Y.1    Zeng, D.2    Socinski, M.A.3    Kosorok, M.R.4
  • 43
    • 0032381820 scopus 로고    scopus 로고
    • Restricted mean life with covariates:Modification and extension of a useful survival analysis method
    • MR16313655
    • ZUCKER, D. M. (1998). Restricted mean life with covariates:Modification and extension of a useful survival analysis method. J. Amer. Statist. Assoc. 93 702-709. MR16313655
    • (1998) J. Amer. Statist. Assoc. , vol.93 , pp. 702-709
    • Zucker, D.M.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.