SCOPUS 정보 검색 플랫폼

Volumn 2006, Issue , 2006, Pages 217-224

Dealing with non-stationary environments using context detection

Author keywords

[No Author keywords available]

Indexed keywords

LEARNING ALGORITHMS; MATHEMATICAL MODELS; PROBLEM SOLVING; SIGNAL FILTERING AND PREDICTION;

CONTEXT DETECTION; NON STATIONARY ENVIRONMENTS; PREDICTIONS; REINFORCEMENT LEARNING ALGORITHMS;

LEARNING SYSTEMS;

EID: 33749262176 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (55)

References (9)

1
- 33749245240
- An environment model for nonstationary reinforcement learning
- Choi, S. P. M., Yeung, D.-Y., & Zhang, N. L. (2000). An environment model for nonstationary reinforcement learning. Advances in Neural Information Processing Systems 12 (pp. 994-1000).
- (2000) Advances in Neural Information Processing Systems , vol.12 , pp. 994-1000
- Choi, S.P.M.¹ Yeung, D.-Y.² Zhang, N.L.³

3
- 0036618011
- Multiple model-based reinforcement learning
- Doya, K., Samejima, K., Katagiri, K., & Kawato, M. (2002). Multiple model-based reinforcement learning. Neural Computation, 14, 1347-1369.
- (2002) Neural Computation , vol.14 , pp. 1347-1369
- Doya, K.¹ Samejima, K.² Katagiri, K.³ Kawato, M.⁴

4
- 0029679044
- Reinforcement learning: A survey
- Kaelbling, L. P., Littman, M., & Moore, A. (1996). Reinforcement learning: A survey. Journal of Artificial Intelligence Research, 4, 237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.P.¹ Littman, M.² Moore, A.³

5
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less time
- Moore, A. W., & Atkeson, C. G. (1993). Prioritized sweeping: Reinforcement learning with less data and less time. Machine Learning, 13, 103-130.
- (1993) Machine Learning , vol.13 , pp. 103-130
- Moore, A.W.¹ Atkeson, C.G.²

8
- 0026962175
- Reinforcement learning with a hierarchy of abstract models
- Singh, S. P. (1992). Reinforcement learning with a hierarchy of abstract models. Proceedings of the Tenth National Conference on Artificial Intelligence (AAAI) (pp. 202-207).
- (1992) Proceedings of the Tenth National Conference on Artificial Intelligence (AAAI) , pp. 202-207
- Singh, S.P.¹

9
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Sutton, R. S., Precup, D., & Singh, S. P. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211.
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.P.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.