SCOPUS 정보 검색 플랫폼

Volumn 148, Issue , 2006, Pages 833-840

An intrinsic reward mechanism for efficient exploration

Author keywords

[No Author keywords available]

Indexed keywords

EFFICIENT EXPLORATION; INTRINSIC REWARD MECHANISM; MARKOV DECISION PROCESS; OPTIMAL POLICY;

ALGORITHMS; DECISION MAKING; MARKOV PROCESSES; MATHEMATICAL MODELS;

REINFORCEMENT LEARNING;

EID: 34250703734 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1145/1143844.1143949 Document Type: Conference Paper

Times cited : (71)

References (19)

2
- 33749651693
- Intrinsically motivated learning of hierarchical collections of skills
- Barto, A. G., Singh, S., & Chentanez, N. (2004). Intrinsically motivated learning of hierarchical collections of skills. Proceedings of the Third International Conference on Developmental Learning.
- (2004) Proceedings of the Third International Conference on Developmental Learning
- Barto, A.G.¹ Singh, S.² Chentanez, N.³

3
- 1942450858
- Doctoral dissertation, University of Massassachusetts Amherst
- Duff, M. (2002). Optimal learning: Computational procedures for Bayes-adaptive Markov decision processes. Doctoral dissertation, University of Massassachusetts Amherst.
- (2002) Optimal learning: Computational procedures for Bayes-adaptive Markov decision processes
- Duff, M.¹

4
- 1942421168
- Design for an optimal probe
- Duff, M. (2003). Design for an optimal probe. Proceedings of the Twentieth International Conference on Machine Learning.
- (2003) Proceedings of the Twentieth International Conference on Machine Learning
- Duff, M.¹

5
- 0013465036
- Discovering hierarchy in reinforcement learning with HEXQ
- Hengst, B. (2002). Discovering hierarchy in reinforcement learning with HEXQ. Proceedings of the Nineteenth International Conference on Machine Learning.
- (2002) Proceedings of the Nineteenth International Conference on Machine Learning
- Hengst, B.¹

6
- 1842793184
- Motivational principles for visual know-how development
- Kaplan, F., & Oudeyer, P.-Y. (2003). Motivational principles for visual know-how development. Proceedings of the Third International Workshop on Epigenetic Robotics: Modeling Cognitive Development in Robotic Systems.
- (2003) Proceedings of the Third International Workshop on Epigenetic Robotics: Modeling Cognitive Development in Robotic Systems
- Kaplan, F.¹ Oudeyer, P.-Y.²

7
- 0012257655
- Near-Optimal reinforcement learning in polynomial time
- Kearns, M., & Singh, S. (1998). Near-Optimal reinforcement learning in polynomial time. Proceedings of the Fifteenth International Conference on Machine Learning.
- (1998) Proceedings of the Fifteenth International Conference on Machine Learning
- Kearns, M.¹ Singh, S.²

8
- 14344250635
- Dynamic abstraction in reinforcement learning via clustering
- Mannor, S., Menache, I., Hoze, A., & Klein, U. (2004). Dynamic abstraction in reinforcement learning via clustering. Proceedings of the Twenty-First International Conference on Machine Learning.
- (2004) Proceedings of the Twenty-First International Conference on Machine Learning
- Mannor, S.¹ Menache, I.² Hoze, A.³ Klein, U.⁴

9
- 0013465187
- Automatic discovery of subgoals in reinforcement learning using diverse density
- McGovern, A., & Barto, A. G. (2001). Automatic discovery of subgoals in reinforcement learning using diverse density. Proceedings of the Eighteenth International Conference on Machine Learning.
- (2001) Proceedings of the Eighteenth International Conference on Machine Learning
- McGovern, A.¹ Barto, A.G.²

10
- 0027684215
- Prioritized sweeping: Reinforcement learning with less data and less real time
- Moore, A., & Atkeson, C. G. (1993). Prioritized sweeping: Reinforcement learning with less data and less real time. Machine Learning, 13, 103-130.
- (1993) Machine Learning , vol.13 , pp. 103-130
- Moore, A.¹ Atkeson, C.G.²

11
- 84977063352
- Efficient learning and planning within the dyna framework
- Peng, J., & Williams, R. J. (1993). Efficient learning and planning within the dyna framework. Adaptive Behavior, 2, 437-454.
- (1993) Adaptive Behavior , vol.2 , pp. 437-454
- Peng, J.¹ Williams, R.J.²

12
- 2442467081
- A possibility for implementing curiosity and boredom in model-building neural controllers
- Schmidhuber, J. (1991). A possibility for implementing curiosity and boredom in model-building neural controllers. From Animals to Animate: Proceedings of the First International Conference on Simulation of Adaptive Behavior.
- (1991) From Animals to Animate: Proceedings of the First International Conference on Simulation of Adaptive Behavior
- Schmidhuber, J.¹

13
- 34250764144
- Schmidhuber, J., & Storck, J. (1993). Reinforcement driven information acquisition in nondeterministic environments. Technical report, Fakultat fur Informatik, Technische Universit at Munchen.
- Schmidhuber, J., & Storck, J. (1993). Reinforcement driven information acquisition in nondeterministic environments. Technical report, Fakultat fur Informatik, Technische Universit at Munchen.

14
- 14344261491
- Using relative novelty to identify useful temporal abstractions in reinforcement learning
- Şimşek, Ö., & Barto, A. G. (2004). Using relative novelty to identify useful temporal abstractions in reinforcement learning. Proceedings of the Twenty-First International Conference on Machine Learning.
- (2004) Proceedings of the Twenty-First International Conference on Machine Learning
- Şimşek, O.¹ Barto, A.G.²

15
- 31844447221
- Identifying useful subgoals in reinforcement learning by local graph partitioning
- Şimşek, Ö., Wolfe, A. P., & Barto, A. G. (2005). Identifying useful subgoals in reinforcement learning by local graph partitioning. Proceedings of the Twenty-Second International Conference on Machine Learning.
- (2005) Proceedings of the Twenty-Second International Conference on Machine Learning
- Şimşek, O.¹ Wolfe, A.P.² Barto, A.G.³

16
- 84899031920
- Intrinsically motivated reinforcement learning
- Singh, S., Barto, A. G., & Chentanez, N. (2005). Intrinsically motivated reinforcement learning. Advances in Neurul Information Processing Systems.
- (2005) Advances in Neurul Information Processing Systems
- Singh, S.¹ Barto, A.G.² Chentanez, N.³

17
- 85132026293
- Integrated architectures for learning, planning, and reacting based on approximating dynamic programming
- Sutton, R. S. (1990). Integrated architectures for learning, planning, and reacting based on approximating dynamic programming. Proceedings of the Seventh International Conference on Machine. Learning.
- (1990) Proceedings of the Seventh International Conference on Machine. Learning
- Sutton, R.S.¹

18
- 0033170372
- Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Sutton, R. S., Precup, D., & Singh, S. P. (1999). Between MDPs and Semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112, 181-211.
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.P.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.