SCOPUS 정보 검색 플랫폼

AAAI Workshop - Technical Report

Volumn WS-14-12, Issue , 2014, Pages 19-23

Surprise and curiosity for big data robotics

(3) White, Adam a Modayil, Joseph a Sutton, Richard S a

a UNIVERSITY OF ALBERTA (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

ARTIFICIAL INTELLIGENCE; DECISION MAKING; ROBOT LEARNING; ROBOTS;

DATA DECISION; FUNCTION APPROXIMATION; INTRINSIC MOTIVATION; NON-STATIONARY LEARNING; POLICY LEARNING; REACTIVE BEHAVIOR CONTROL; TEMPORAL DIFFERENCE LEARNING; VALUE FUNCTIONS;

BIG DATA;

EID: 84975055760 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (21)

References (12)

1
- 84967334530
- Berlin: Springer
- Baldassarre, G., Mirolli, M. (Eds.). (2013). Intrinsically motivated learning in natural and artificial systems. Berlin: Springer.
- (2013) Intrinsically Motivated Learning in Natural and Artificial Systems
- Baldassarre, G.¹ Mirolli, M.²

2
- 84864655352
- PhD thesis, University of Alberta
- Maei, H. R. (2011). Gradient Temporal-Difference Learning Algorithms. PhD thesis, University of Alberta.
- (2011) Gradient Temporal-Difference Learning Algorithms
- Maei, H.R.¹

3
- 84866006400
- Multi-timescale nexting in a reinforcement learning robot
- Modayil, J., White, A., Sutton, R. S. (2012). Multi-timescale nexting in a reinforcement learning robot. In From Animals to Animals 12, 299-309.
- (2012) From Animals to Animals 12 , pp. 299-309
- Modayil, J.¹ White, A.² Sutton, R.S.³

4
- 34047267520
- Intrinsic motivation systems for autonomous mental development
- Oudeyer, P. Y., Kaplan, F., Hafner, V. (2007). Intrinsic Motivation Systems for Autonomous Mental Development. In IEEE Transactions on Evolutionary Computation 11, 265-286
- (2007) IEEE Transactions on Evolutionary Computation 11 , pp. 265-286
- Oudeyer, P.Y.¹ Kaplan, F.² Hafner, V.³

5
- 50849094213
- Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot
- Schembri, M., Mirolli, M., Baldassarre, G. (2007). Evolving internal reinforcers for an intrinsically motivated reinforcement-learning robot. In Development and Learning, 282-287.
- (2007) Development and Learning , pp. 282-287
- Schembri, M.¹ Mirolli, M.² Baldassarre, G.³

6
- 2442467081
- A possibility for implementing curiosity and boredom in model-building neural controllers
- Schmidhuber J. (1991). A possibility for implementing curiosity and boredom in model-building neural controllers. In Proceedings of the 1st International Conference on Simulation of Adaptive Behavior, 222-227.
- (1991) Proceedings of the 1st International Conference on Simulation of Adaptive Behavior , pp. 222-227
- Schmidhuber, J.¹

7
- 34250703734
- An intrinsic reward mechanism for efficient exploration
- Simsek, O., Barto, A. G. (2006). An intrinsic reward mechanism for efficient exploration. In Proceedings of the 23rd international conference on Machine learning, 833-840.
- (2006) Proceedings of the 23rd International Conference on Machine Learning , pp. 833-840
- Simsek, O.¹ Barto, A.G.²

8
- 84899031920
- Intrinsically motivated reinforcement learning
- Singh S., Barto, A. G., Chentanez, N. (2005). Intrinsically motivated reinforcement learning. In Advances in Neural Information Processing Systems 17, 1281-1288.
- (2005) Advances in Neural Information Processing Systems , vol.17 , pp. 1281-1288
- Singh, S.¹ Barto, A.G.² Chentanez, N.³

9
- 0004102479
- MIT Press
- Sutton, R. S., Barto, A. G. (1998). Reinforcement Learning: An Introduction. MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

10
- 71149099079
- Fast gradient-descent methods for temporal-difference learning with linear function approximation
- Sutton, R. S., and Maei, H. R., Precup, D., Bhatnagar, S., Silver, D., Szepesvári, Cs., Wiewiora, E. (2009). Fast gradient-descent methods for temporal-difference learning with linear function approximation. In Proceedings of the 26th International Conference on Machine Learning.
- (2009) Proceedings of the 26th International Conference on Machine Learning
- Sutton, R.S.¹ Maei, H.R.² Precup, D.³ Bhatnagar, S.⁴ Silver, D.⁵ Szepesvári, Cs.⁶ Wiewiora, E.⁷

11
- 84899464022
- Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction
- Sutton, R. S., Modayil, J., Delp, M., Degris, T., and Pilarski, P. M., White, A., Precup, D. (2011). Horde: A scalable real-time architecture for learning knowledge from unsupervised sensorimotor interaction. In Proceedings of thelOth International Conference on Autonomous Agents and Multiagent Systems.
- (2011) Proceedings of ThelOth International Conference on Autonomous Agents and Multiagent Systems
- Sutton, R.S.¹ Modayil, J.² Delp, M.³ Degris, T.⁴ Pilarski, P.M.⁵ White, A.⁶ Precup, D.⁷

12
- 84872849054
- Scaling life-long off-policy learning
- White, A., Modayil, J., Sutton, R. S. (2012). Scaling life-long off-policy learning. In Development and Learning and Epigenetic Robotics, 1-6.
- (2012) Development and Learning and Epigenetic Robotics , pp. 1-6
- White, A.¹ Modayil, J.² Sutton, R.S.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.