SCOPUS 정보 검색 플랫폼

Volumn , Issue , 2008, Pages 296-303

Active reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; MACHINE LEARNING; MARKOV PROCESSES; LEARNING SYSTEMS; REINFORCEMENT; ROBOT LEARNING;

ACTIVE REINFORCEMENT; EXPLORATION STRATEGIES; MARKOV DECISION PROCESSES; OPTIMAL POLICIES; PLANNING PROBLEM; TRANSITION PROBABILITIES;

REINFORCEMENT LEARNING; OPTIMIZATION;

ACTIVE REINFORCEMENTS; EXPLORATION STRATEGIES; MARKOV DECISION PROCESS; OPTIMAL POLICIES; PLANNING PROBLEMS; TRANSITION PROBABILITIES;

EID: 56449114181 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1390156.1390194 Document Type: Conference Paper

Times cited : (33)

References (14)

1
- 31844444663
- Exploration and apprenticeship learning in reinforcement learning
- Abbeel, P., & Ng, A. Y. (2005). Exploration and apprenticeship learning in reinforcement learning. ICML.
- (2005) ICML
- Abbeel, P.¹ Ng, A.Y.²

2
- 33749242451
- Using inaccurate models in reinforcement learning
- Abbeel, P., Quigley, M., & Ng, A. (2006). Using inaccurate models in reinforcement learning. ICML.
- (2006) ICML
- Abbeel, P.¹ Quigley, M.² Ng, A.³

3
- 0041965975
- R-MAX- A general polynomial time algorithm for near-optimal reinforcement learning
- Brafman, R. I., & Tennenholtz, M. (2002). R-MAX- A general polynomial time algorithm for near-optimal reinforcement learning. Journal of Machine Learning Research, 3, 213-231.
- (2002) Journal of Machine Learning Research , vol.3 , pp. 213-231
- Brafman, R.I.¹ Tennenholtz, M.²

4
- 0037289322
- From perturbation analysis to markov decision processes and reinforcement learning
- Cao, X. (2003). From perturbation analysis to markov decision processes and reinforcement learning. Discrete Event Dynamic Systems: Theory and Applications, 13, 9-39.
- (2003) Discrete Event Dynamic Systems: Theory and Applications , vol.13 , pp. 9-39
- Cao, X.¹

5
- 22944449970
- Model based bayesian exploration
- Dearden, R., Friedman, N., & Andre, D. (1999). Model based bayesian exploration. Uncertainty in Artificial Intelligence.
- (1999) Uncertainty in Artificial Intelligence
- Dearden, R.¹ Friedman, N.² Andre, D.³

6
- 0003391450
- New York, NY: Springer-Verlag
- Devroye, L. (1986). Non-uniform random variate generation. New York, NY: Springer-Verlag.
- (1986) Non-uniform random variate generation
- Devroye, L.¹

7
- 0034272032
- Bounded-parameter markov decision processes
- Givan, R., Leach, S., & Dean, T. (2000). Bounded-parameter markov decision processes. Artificial Intelligence, 122, 71-109.
- (2000) Artificial Intelligence , vol.122 , pp. 71-109
- Givan, R.¹ Leach, S.² Dean, T.³

8
- 0036832954
- Near optimal reinforcement learning in polynomial time
- Kearns, M., & Singh, S. (2002). Near optimal reinforcement learning in polynomial time. Machine Learning, 49, 209-232.
- (2002) Machine Learning , vol.49 , pp. 209-232
- Kearns, M.¹ Singh, S.²

9
- 0041940559
- Applications of second-order cone programming
- Lobo, M. S., Vandenberghe, L., Boyd, S., & Lebret, H. (1998). Applications of second-order cone programming. Linear Algebra and its Applications, 284, 193-228.
- (1998) Linear Algebra and its Applications , vol.284 , pp. 193-228
- Lobo, M.S.¹ Vandenberghe, L.² Boyd, S.³ Lebret, H.⁴

10
- 0012304345
- Doctoral dissertation, University of Washington
- Madani, O. (2000). Complexity results for infinite-horizon markov decision processes. Doctoral dissertation, University of Washington.
- (2000) Complexity results for infinite-horizon markov decision processes
- Madani, O.¹

11
- 56449109724
- Robustness in markov decision problems with uncertain transition matrices
- Nilim, A., & Ghaoui, L. E. (2003). Robustness in markov decision problems with uncertain transition matrices. NIPS.
- (2003) NIPS
- Nilim, A.¹ Ghaoui, L.E.²

12
- 85102627959
- New York, NY: John Wiley & Sons, Inc
- Puterman, M. (1994). Markov decision processes -discrete stochastic dynamic programming. New York, NY: John Wiley & Sons, Inc.
- (1994) Markov decision processes -discrete stochastic dynamic programming
- Puterman, M.¹

13
- 31844432138
- A theoretical analysis of model-based interval estimation
- Strehl, A. L., & Littman, M. L. (2005). A theoretical analysis of model-based interval estimation. ICML.
- (2005) ICML
- Strehl, A.L.¹ Littman, M.L.²

14
- 0004102479
- Cambridge, MA: MIT Press
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement learning: An introduction
- Sutton, R.S.¹ Barto, A.G.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.