SCOPUS Information Retrieval Platform

Volume 148, 2006, Pages 305-312

Qualitative reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; COMPUTER SIMULATION; MARKOV PROCESSES; PROBABILITY; PROBLEM SOLVING; STOCHASTIC MODELS;

STOCHASTIC DOMINANCE CONSTRAINTS; TRANSITION PROBABILITIES;

REINFORCEMENT LEARNING;

EID: 34250717446 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1143844.1143883 Document Type: Conference Paper

Times cited: 7

References (10)

1
- 14344251217
- Apprenticeship learning via inverse reinforcement learning
- Abbeel, P., & Ng, A. Y. (2004). Apprenticeship learning via inverse reinforcement learning. ICML.
- (2004) ICML
- Abbeel, P.¹ Ng, A.Y.²

2
- 31844444663
- Exploration and apprenticeship learning in reinforcement learning
- Abbeel, P., & Ng, A. Y. (2005). Exploration and apprenticeship learning in reinforcement learning. ICML.
- (2005) ICML
- Abbeel, P.¹ Ng, A.Y.²

3
- 16244402846
- Qualitative MDPs and POMDPs: An order-of-magnitude approximation
- Bonet, B., & Pearl, J. (2002). Qualitative MDPs and POMDPs: An order-of-magnitude approximation. UAI.
- (2002) UAI
- Bonet, B.¹ Pearl, J.²

4
- 34250717373
- Qualitative reinforcement learning (full paper)
- Epshteyn, A., & DeJong, G. (2006). Qualitative reinforcement learning (full paper). http://www.ews.uiuc.edu/~aepshtey/pubs/qual_rl.ps.

5
- 0034272032
- Bounded-parameter Markov decision processes
- Givan, R., Leach, S., & Dean, T. (2000). Bounded-parameter Markov decision processes. Artificial Intelligence, 122, 71-109.
- (2000) Artificial Intelligence , vol.122 , pp. 71-109
- Givan, R.¹ Leach, S.² Dean, T.³

6
- 1942484890
- The influence of reward on the speed of reinforcement learning: An analysis of shaping
- Laud, A., & DeJong, G. (2003). The influence of reward on the speed of reinforcement learning: An analysis of shaping. ICML.
- (2003) ICML
- Laud, A.¹ DeJong, G.²

7
- 0141596576
- Policy invariance under reward transformations: Theory and application to reward shaping
- Ng, A. Y., Harada, D., & Russell, S. J. (1999). Policy invariance under reward transformations: Theory and application to reward shaping. ICML.
- (1999) ICML
- Ng, A.Y.¹ Harada, D.² Russell, S.J.³

8
- 34250706765
- A possibilistic model for qualitative sequential decision problems under uncertainty in partially observable environments
- Sabbadin, R. (1999). A possibilistic model for qualitative sequential decision problems under uncertainty in partially observable environments. UAI.

9
- 0003445519
- Stochastic orders and their applications
- Shaked, M., & Shanthikumar, J. G. (1994). Stochastic orders and their applications. San Diego, CA: Academic Press.
- (1994) Stochastic orders and their applications
- Shaked, M.¹ Shanthikumar, J.G.²

10
- 0004102479
- Reinforcement learning: An introduction
- Sutton, R. S., & Barto, A. G. (1998). Reinforcement learning: An introduction. Cambridge, MA: MIT Press.
- (1998) Reinforcement learning: An introduction
- Sutton, R.S.¹ Barto, A.G.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.