



Volume 84, Issue 1-2, 2011, Pages 109-136

Informing sequential clinical decision-making through reinforcement learning: An empirical study

Author keywords

Fitted Q iteration; Optimal treatment policies; Policy uncertainty

Indexed keywords

CHRONIC ILLNESS; CLINICAL TRIAL; EMPIRICAL STUDIES; FITTED Q-ITERATION; FUNCTION APPROXIMATION; MISSING DATA; MULTIPLE IMPUTATION; OPTIMAL TREATMENT POLICIES; Q-FUNCTIONS; REINFORCEMENT LEARNING METHOD; TREATMENT POLICIES;

EID: 79958787689     PISSN: 08856125     EISSN: 15730565     Source Type: Journal    
DOI: 10.1007/s10994-010-5229-0     Document Type: Article
Times cited : (188)

References (68)
  • 1
    • 0036661235
    • Schizophrenia trials: Past, present and future
    • C. E. Adams 2002 Schizophrenia trials: past, present and future Epidemiologia e Psichiatria Sociale 11 3 144 151 (Pubitemid 35277840)
    • (2002) Epidemiologia e Psichiatria Sociale , vol.11 , Issue.3 , pp. 144-151
    • Adams, C.E.1
  • 2
    • 0001198502
    • Inconsistency of the bootstrap when a parameter is on the boundary of the parameter space
    • 1748009 1015.62044 10.1111/1468-0262.00114
    • D. W. K. Andrews 2000 Inconsistency of the bootstrap when a parameter is on the boundary of the parameter space Econometrica 68 2 399 405 1748009 1015.62044 10.1111/1468-0262.00114
    • (2000) Econometrica , vol.68 , Issue.2 , pp. 399-405
    • Andrews, D.W.K.1
  • 4
    • 33644861229
    • A guide to drug discovery: Bayesian clinical trials
    • 10.1038/nrd1927
    • D. A. Berry 2006 A guide to drug discovery: Bayesian clinical trials Nature Reviews. Drug Discovery 5 27 36 10.1038/nrd1927
    • (2006) Nature Reviews. Drug Discovery , vol.5 , pp. 27-36
    • Berry, D.A.1
  • 5
    • 67649306739
    • Bayesian clinical trials at the University of Texas M. D. Anderson cancer center
    • 10.1177/1740774509104992
    • S. Biswas D. D. Liu J. J. Lee D. A. Berry 2009 Bayesian clinical trials at the University of Texas M. D. Anderson cancer center Clinical Trials 6 205 216 10.1177/1740774509104992
    • (2009) Clinical Trials , vol.6 , pp. 205-216
    • Biswas, S.1    Liu, D.D.2    Lee, J.J.3    Berry, D.A.4
  • 6
    • 0030211964
    • Bagging predictors
    • L. Breiman 1996 Bagging predictors Machine Learning 24 2 123 140 1425957 0858.68080 (Pubitemid 126724382)
    • (1996) Machine Learning , vol.24 , Issue.2 , pp. 123-140
    • Breiman, L.1
  • 8
    • 34347398256
    • Sensitivity analysis after multiple imputation under missing at random: A weighting approach
    • DOI 10.1177/0962280206075303
    • J. R. Carpenter M. G. Kenward I. R. White 2007 Sensitivity analysis after multiple imputation under missing at random: a weighting approach Statistical Methods in Medical Research 16 3 259 275 2371009 1122.62300 10.1177/0962280206075303 (Pubitemid 47018662)
    • (2007) Statistical Methods in Medical Research , vol.16 , Issue.3 , pp. 259-275
    • Carpenter, J.R.1    Kenward, M.G.2    White, I.R.3
  • 9
    • 7744234104
    • Placebo-free designs for evaluating new mental health treatments: The use of adaptive treatment strategies
    • DOI 10.1002/sim.1920
    • R. Dawson P. W. Lavori 2004 Placebo-free designs for evaluating new mental health treatments: the use of adaptive treatment strategies Statistics in Medicine 23 3249 3262 10.1002/sim.1920 (Pubitemid 39462085)
    • (2004) Statistics in Medicine , vol.23 , Issue.21 , pp. 3249-3262
    • Dawson, R.1    Lavori, P.W.2
  • 12
    • 56449086386
    • Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs
    • A. McCallum S. Roweis (eds). Omnipress New York. 10.1145/1390156.1390189
    • Doshi, F., Pineau, J., & Roy, N. (2008). Reinforcement learning with limited reinforcement: using Bayes risk for active learning in POMDPs. In A. McCallum & S. Roweis (Eds.), Proceedings of the 25th annual international conference on machine learning (ICML 2008) (pp. 256-263). New York: Omnipress.
    • (2008) Proceedings of the 25th Annual International Conference on Machine Learning (ICML 2008) , pp. 256-263
    • Doshi, F.1    Pineau, J.2    Roy, N.3
  • 13
    • 0002344794
    • Bootstrap methods: Another look at the jackknife
    • 515681 0406.62024 10.1214/aos/1176344552
    • B. Efron 1979 Bootstrap methods: another look at the jackknife The Annals of Statistics 7 1 1 26 515681 0406.62024 10.1214/aos/1176344552
    • (1979) The Annals of Statistics , vol.7 , Issue.1 , pp. 1-26
    • Efron, B.1
  • 15
    • 31844451013
    • Reinforcement learning with Gaussian processes
    • DOI 10.1145/1102351.1102377, ICML 2005 - Proceedings of the 22nd International Conference on Machine Learning
    • Y. Engel S. Mannor R. Meir 2005 Reinforcement learning with Gaussian processes L. D. Raedt S. Wrobel (eds) Proceedings of the 22nd international conference on machine learning (ICML 2005) ACM New York 201 208 10.1145/1102351.1102377 (Pubitemid 43183334)
    • (2005) ICML 2005 - Proceedings of the 22nd International Conference on Machine Learning , pp. 201-208
    • Engel, Y.1    Mannor, S.2    Meir, R.3
  • 16
    • 31844451013
    • Reinforcement learning with Gaussian processes
    • DOI 10.1145/1102351.1102377, ICML 2005 - Proceedings of the 22nd International Conference on Machine Learning
    • Engel, Y., Mannor, S., & Meir, R. (2005). Reinforcement learning with Gaussian processes. In L. D. Raedt & S. Wrobel (Eds.), Proceedings of the 22nd international conference on machine learning (ICML 2005) (pp. 201-208). New York: ACM. 10.1145/1102351.1102377. (Pubitemid 43183334)
    • (2005) ICML 2005 - Proceedings of the 22nd International Conference on Machine Learning , pp. 201-208
    • Engel, Y.1    Mannor, S.2    Meir, R.3
  • 19
    • 79958798058
    • MDPs with non-deterministic policies
    • D. Koller D. Schuurmans Y. Bengio L. Bottou (eds). MIT Press Cambridge
    • Fard, M. M., Pineau, J. (2009). MDPs with non-deterministic policies. In D. Koller, D. Schuurmans, Y. Bengio, & L. Bottou (Eds.), Advances in neural information processing systems (pp. 1065-1072). Cambridge: MIT Press.
    • (2009) Advances in Neural Information Processing Systems , pp. 1065-1072
    • Fard, M.M.1    Pineau, J.2
  • 21
    • 15044358532
    • Multiple imputation for model checking: Completed-data plots with missing and latent data
    • DOI 10.1111/j.0006-341X.2005.031010.x
    • A. Gelman I. V. Mechelen G. Verbeke D. F. Heitjan M. Meulders 2005 Multiple imputation for model checking: completed-data plots with missing and latent data Biometrics 61 74 85 2135847 1077.62091 10.1111/j.0006-341X.2005.031010.x (Pubitemid 40380966)
    • (2005) Biometrics , vol.61 , Issue.1 , pp. 74-85
    • Gelman, A.1    Van Mechelen, I.2    Verbeke, G.3    Heitjan, D.F.4    Meulders, M.5
  • 26
    • 0032073263
    • Planning and acting in partially observable stochastic domains
    • PII S000437029800023X
    • L. P. Kaelbling M. L. Littman A. R. Cassandra 1998 Planning and acting in partially observable stochastic domains Artificial Intelligence 101 99 134 1641530 0908.68165 10.1016/S0004-3702(98)00023-X (Pubitemid 128387390)
    • (1998) Artificial Intelligence , vol.101 , Issue.1-2 , pp. 99-134
    • Kaelbling, L.P.1    Littman, M.L.2    Cassandra, A.R.3
  • 28
    • 0023606101
    • The positive and negative syndrome scale (PANSS) for schizophrenia
    • S. R. Kay A. Fiszbein L. A. Opler 1987 The positive and negative syndrome scale (PANSS) for schizophrenia Schizophrenia Bulletin 13 2 261 276 (Pubitemid 18039085)
    • (1987) Schizophrenia Bulletin , vol.13 , Issue.2 , pp. 261-276
    • Kay, S.R.1    Fiszbein, A.2    Opler, L.A.3
  • 30
    • 4644323293
    • Least-squares policy iteration
    • 2125347 10.1162/jmlr.2003.4.6.1107
    • M. G. Lagoudakis R. Parr 2003 Least-squares policy iteration Journal of Machine Learning Research 4 1107 1149 2125347 10.1162/jmlr.2003.4.6.1107
    • (2003) Journal of Machine Learning Research , vol.4 , pp. 1107-1149
    • Lagoudakis, M.G.1    Parr, R.2
  • 35
    • 0019909899
    • A survey of partially observable Markov decision processes
    • 646904 0486.90084 10.1287/mnsc.28.1.1
    • G. Monahan 1982 A survey of partially observable Markov decision processes Management Science 28 1 16 646904 0486.90084 10.1287/mnsc.28.1.1
    • (1982) Management Science , vol.28 , pp. 1-16
    • Monahan, G.1
  • 37
    • 19144362679
    • An experimental design for the development of adaptive treatment strategies
    • DOI 10.1002/sim.2022
    • S. A. Murphy 2005 An experimental design for the development of adaptive treatment strategies Statistics in Medicine 24 1455 1481 2137651 10.1002/sim.2022 (Pubitemid 40716347)
    • (2005) Statistics in Medicine , vol.24 , Issue.10 , pp. 1455-1481
    • Murphy, S.A.1
  • 38
    • 33846260190
    • Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders
    • DOI 10.1038/sj.npp.1301241, PII 1301241
    • S. A. Murphy D. Oslin A. J. Rush 2007 Methodological challenges in constructing effective treatment sequences for chronic psychiatric disorders Neuropsychopharmacology 32 2 257 262 10.1038/sj.npp.1301241 (Pubitemid 46106932)
    • (2007) Neuropsychopharmacology , vol.32 , Issue.2 , pp. 257-262
    • Murphy, S.A.1    Oslin, D.W.2    Rush, A.J.3    Zhu, J.4
  • 39
    • 79952576381
    • NAP The National Academies Press, Panel on Handling Missing Data in Clinical Trials. Committee on National Statistics, Division of Behavioral, Social Sciences and Education
    • NAP (2010). The prevention and treatment of missing data in clinical trials. The National Academies Press, Panel on Handling Missing Data in Clinical Trials. Committee on National Statistics, Division of Behavioral, Social Sciences and Education.
    • (2010) The Prevention and Treatment of Missing Data in Clinical Trials
  • 41
    • 56449092660
    • An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning
    • A. McCallum S. Roweis (eds). Omnipress New York. 10.1145/1390156.1390251
    • Parr, R., Li, L., Taylor, G., Painter-Wakefield, C., & Littman, M. (2008). An analysis of linear models, linear value-function approximation, and feature selection for reinforcement learning. In A. McCallum, & S. Roweis (Eds.), Proceedings of the 25th annual international conference on machine learning (pp. 752-759). New York: Omnipress.
    • (2008) Proceedings of the 25th Annual International Conference on Machine Learning , pp. 752-759
    • Parr, R.1    Li, L.2    Taylor, G.3    Painter-Wakefield, C.4    Littman, M.5
  • 42
    • 34047273906
    • Constructing evidence-based treatment strategies using methods from computer science
    • DOI 10.1016/j.drugalcdep.2007.01.005, PII S0376871607000270
    • Pineau, J., Bellemare, M. G., Rush, A. J., Ghizaru, A., & Murphy, S. A. (2007). Constructing evidence-based treatment strategies using methods from computer science. Drug and Alcohol Dependence, 88(Suppl. 2), S52-S60. (Pubitemid 46546455)
    • (2007) Drug and Alcohol Dependence , vol.88 , Issue.SUPPL. 2
    • Pineau, J.1    Bellemare, M.G.2    Rush, A.J.3    Ghizaru, A.4    Murphy, S.A.5
  • 43
    • 0002121637
    • Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models
    • M. E. Halloran D. Berry (eds). Springer Berlin
    • Robins, J. M., Rotnitzky, A., & Scharfstein, D. (1999). Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models. In M. E. Halloran & D. Berry (Eds.), Statistical models in epidemiology: the environment and clinical trials (pp. 1-92). Berlin: Springer.
    • (1999) Statistical Models in Epidemiology: The Environment and Clinical Trials , pp. 1-92
    • Robins, J.M.1    Rotnitzky, A.2    Scharfstein, D.3
  • 44
    • 0030539070
    • Multiple imputation after 18+ years (with discussion)
    • 0869.62014 10.2307/2291635
    • D. B. Rubin 1996 Multiple imputation after 18+ years (with discussion) Journal of the American Statistical Association 91 473 489 0869.62014 10.2307/2291635
    • (1996) Journal of the American Statistical Association , vol.91 , pp. 473-489
    • Rubin, D.B.1
  • 47
    • 0032960273
    • Multiple imputation: A primer
    • DOI 10.1191/096228099671525676
    • J. L. Schafer 1999 Multiple imputation: a primer Statistical Methods in Medical Research 8 1 3 15 10.1191/096228099671525676 (Pubitemid 29222784)
    • (1999) Statistical Methods in Medical Research , vol.8 , Issue.1 , pp. 3-15
    • Schafer, J.L.1
  • 48
    • 0036017469
    • Computational strategies for multivariate linear mixed models with missing values
    • 1938143 10.1198/106186002760180608
    • J. L. Schafer R. M. Yucel 2002 Computational strategies for multivariate linear mixed models with missing values Journal of Computational and Graphical Statistics 11 421 442 1938143 10.1198/106186002760180608
    • (2002) Journal of Computational and Graphical Statistics , vol.11 , pp. 421-442
    • Schafer, J.L.1    Yucel, R.M.2
  • 49
    • 0442278084
    • Adjusting for nonignorable drop-out using semiparametric nonresponse models
    • 1731478 1072.62644 10.2307/2669923
    • D. O. Scharfstein A. Rotnitzky J. M. Robins 1999 Adjusting for nonignorable drop-out using semiparametric nonresponse models Journal of the American Statistical Association 94 448 1096 1120 1731478 1072.62644 10.2307/2669923
    • (1999) Journal of the American Statistical Association , vol.94 , Issue.448 , pp. 1096-1120
    • Scharfstein, D.O.1    Rotnitzky, A.2    Robins, J.M.3
  • 50
    • 0042304469
    • Bootstrap sample size in nonregular cases
    • 1227529 0820.62037 10.1090/S0002-9939-1994-1227529-8
    • J. Shao 1994 Bootstrap sample size in nonregular cases Proceedings of the American Mathematical Society 122 4 1251 1262 1227529 0820.62037 10.1090/S0002-9939-1994-1227529-8
    • (1994) Proceedings of the American Mathematical Society , vol.122 , Issue.4 , pp. 1251-1262
    • Shao, J.1
  • 52
    • 0015658957
    • The optimal control of partially observable Markov processes over a finite horizon
    • 10.1287/opre.21.5.1071
    • R. D. Smallwood E. J. Sondik 1973 The optimal control of partially observable Markov processes over a finite horizon Operations Research 21 1070 1088 10.1287/opre.21.5.1071
    • (1973) Operations Research , vol.21 , pp. 1070-1088
    • Smallwood, R.D.1    Sondik, E.J.2
  • 54
    • 31844432138
    • A theoretical analysis of model-based interval estimation
    • L. D. Raedt S. Wrobel (eds). ACM New York. 10.1145/1102351.1102459
    • A. L. Strehl M. L. Littman 2005 A theoretical analysis of model-based interval estimation L. D. Raedt S. Wrobel (eds) Proceedings of the 22nd international conference on machine learning (ICML 2005) ACM New York 856 863 10.1145/1102351.1102459
    • (2005) Proceedings of the 22nd International Conference on Machine Learning (ICML 2005) , pp. 856-863
    • Strehl, A.L.1    Littman, M.L.2
  • 55
    • 31844432138
    • A theoretical analysis of model-based interval estimation
    • L. D. Raedt & S. Wrobel (Eds.) New York: ACM. 10.1145/1102351.1102459
    • Strehl, A. L., & Littman, M. L. (2005). A theoretical analysis of model-based interval estimation. In L. D. Raedt & S. Wrobel (Eds.), Proceedings of the 22nd international conference on machine learning (ICML 2005) (pp. 856-863). New York: ACM. 10.1145/1102351.1102459.
    • (2005) Proceedings of the 22nd International Conference on Machine Learning (ICML 2005) , pp. 856-863
    • Strehl, A.L.1    Littman, M.L.2
  • 56
    • 34250700033
    • PAC model-free reinforcement learning
    • DOI 10.1145/1143844.1143955, ACM International Conference Proceeding Series - Proceedings of the 23rd International Conference on Machine Learning, ICML 2006
    • Strehl, A., Li, L., Wiewiora, E., Langford, J., & Littman, M. (2006). PAC model-free reinforcement learning. In W. W. Cohen & A. Moore (Eds.), Proceedings of the 23rd annual international conference on machine learning (ICML 2006) (pp. 881-888). (Pubitemid 46966930)
    • (2006) ACM International Conference Proceeding Series , vol.148 , pp. 881-888
    • Strehl, A.L.1    Lihong, L.2    Wiewiora, E.3    Langford, J.4    Littman, M.L.5
  • 58
    • 0038448311
    • The National Institute of Mental Health Clinical Antipsychotic Trials of Intervention Effectiveness (CATIE) project: Schizophrenia trial design and protocol development
    • T. S. Stroup J. P. McEvoy M. S. Swartz M. J. Byerly I. D. Glick J. M. Canive M. McGee G. M. Simpson M. D. Stevens J. A. Lieberman 2003 The National Institute of Mental Health clinical antipsychotic trials of intervention effectiveness (CATIE) project: schizophrenia trial design and protocol development Schizophrenia Bulletin 29 1 15 31 (Pubitemid 36871054)
    • (2003) Schizophrenia Bulletin , vol.29 , Issue.1 , pp. 15-31
    • Stroup, T.S.1    McEvoy, J.P.2    Swartz, M.S.3    Byerly, M.J.4    Glick, I.D.5    Canive, J.M.6    McGee, M.F.7    Simpson, G.M.8    Stevens, M.C.9    Lieberman, J.A.10
  • 60
    • 0037771503
    • Assessing clinical and functional outcomes in the clinical antipsychotic trials of intervention effectiveness (CATIE) schizophrenia trial
    • M. S. Swartz D. O. Perkins T. S. Stroup J. P. McEvoy J. M. Nieri D. D. Haak 2003 Assessing clinical and functional outcomes in the clinical antipsychotic trials of intervention effectiveness (CATIE) schizophrenia trial Schizophrenia Bulletin 29 1 33 43 (Pubitemid 36871055)
    • (2003) Schizophrenia Bulletin , vol.29 , Issue.1 , pp. 33-43
    • Swartz, M.S.1    Perkins, D.O.2    Stroup, T.S.3    McEvoy, J.P.4    Nieri, J.M.5    Haak, D.C.6
  • 62
    • 33847666453
    • Practical Bayesian adaptive randomisation in clinical trials
    • DOI 10.1016/j.ejca.2007.01.006, PII S095980490700010X
    • P. Thall J. Wathen 2007 Practical Bayesian adaptive randomisation in clinical trials European Journal of Cancer 43 5 859 866 10.1016/j.ejca.2007.01.006 (Pubitemid 46366693)
    • (2007) European Journal of Cancer , vol.43 , Issue.5 , pp. 859-866
    • Thall, P.F.1    Wathen, J.K.2
  • 63
    • 0034732660
    • Evaluating multiple treatment courses in clinical trials
    • DOI 10.1002/(SICI)1097-0258(20000430)19:8<1011::AID-SIM414>3.0.CO;2-M
    • P. F. Thall J. K. Wathan 2000 Covariate-adjusted adaptive randomization in a sarcoma trial with multistate treatments Statistics in Medicine 19 1011 1028 10.1002/(SICI)1097-0258(20000430)19:8<1011::AID-SIM414>3.0.CO;2-M (Pubitemid 30238912)
    • (2000) Statistics in Medicine , vol.19 , Issue.8 , pp. 1011-1028
    • Thall, P.F.1    Millikan, R.E.2    Sung, H.-G.3
  • 64
    • 34347407592
    • Multiple imputation of discrete and continuous data by fully conditional specification
    • DOI 10.1177/0962280206074463
    • S. van Buuren 2007 Multiple imputation of discrete and continuous data by fully conditional specification Statistical Methods in Medical Research 16 3 219 242 2371007 1122.62382 10.1177/0962280206074463 (Pubitemid 47018660)
    • (2007) Statistical Methods in Medical Research , vol.16 , Issue.3 , pp. 219-242
    • Van Buuren, S.1
  • 68
    • 70449449564
    • Reinforcement learning design for cancer clinical trials
    • 10.1002/sim.3720
    • Y. Zhao M. R. Kosorok D. Zeng 2009 Reinforcement learning design for cancer clinical trials Statistics in Medicine 28 3294 3315 10.1002/sim.3720
    • (2009) Statistics in Medicine , vol.28 , pp. 3294-3315
    • Zhao, Y.1    Kosorok, M.R.2    Zeng, D.3


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.