Volume 40, 2011, Pages 1-24

Non-deterministic policies in Markovian decision processes

Author keywords

[No Author keywords available]

Indexed keywords

Action selection; Conventional methods; Decision making process; Decision-making problem; Discrete domains; Human subjects; Markovian decision process; Markovian environment; Markovian process; Medical domains; Near-optimal solutions; Real-world problem; Running time; Stochastic environment; Web navigation

EID: 79956364385     PISSN: None     EISSN: 1076-9757     Source Type: Journal
DOI: 10.1613/jair.3175     Document Type: Article
Times cited: 25

References (22)
  • 6
    • Hauskrecht, M., & Fraser, H. (2000). Planning treatment of ischemic heart disease with partially observable Markov decision processes. Artificial Intelligence in Medicine, 18(3), 221-244. DOI: 10.1016/S0933-3657(99)00042-1
  • 8
    • Karmarkar, N. (1984). A new polynomial-time algorithm for linear programming. Combinatorica, 4(4), 373-395.
  • 9
    • Kearns, M., & Singh, S. (2002). Near-optimal reinforcement learning in polynomial time. Machine Learning, 49.
  • 10
    • Magni, P., Quaglini, S., Marchetti, M., & Barosi, G. (2000). Deciding when to intervene: A Markov decision process approach. International Journal of Medical Informatics, 60(3), 237-253. DOI: 10.1016/S1386-5056(00)00099-X
  • 12
    • Mannor, S., Simester, D., Sun, P., & Tsitsiklis, J. N. (2007). Bias and variance approximation in value function estimates. Management Science, 53(2), 308-322. DOI: 10.1287/mnsc.1060.0614
  • 13
    • MTurk (2010). Amazon Mechanical Turk. http://www.mturk.com/.
  • 14
    • Murphy, S. A. (2005). An experimental design for the development of adaptive treatment strategies. Statistics in Medicine, 24(10), 1455-1481. DOI: 10.1002/sim.2022
  • 15
    • Pineau, J., Bellemare, M. G., Rush, A. J., Ghizaru, A., & Murphy, S. A. (2007). Constructing evidence-based treatment strategies using methods from computer science. Drug and Alcohol Dependence, 88(Suppl. 2), S52-S60. DOI: 10.1016/j.drugalcdep.2007.01.005
  • 17
    • Sato, M., & Kobayashi, S. (2000). Variance-penalized reinforcement learning for risk-averse asset allocation. In Proceedings of the Second International Conference on Intelligent Data Engineering and Automated Learning, Data Mining, Financial Engineering, and Intelligent Agents, Lecture Notes in Computer Science, vol. 1983, pp. 244-249. Springer-Verlag.
  • 19
    • Schools-Wikipedia (2009). 2008/9 Wikipedia selection for schools. http://schools-wikipedia.org/.
  • 21
    • Thapa, D., Jung, I., & Wang, G. (2005). Agent based decision support system using reinforcement learning under emergency circumstances. Lecture Notes in Computer Science, 3610, 888.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.