SCOPUS Information Search Platform

Advances in Neural Information Processing Systems

Volume 2, Issue January, 2014, Pages 990-998

Universal option models

Authors (5): Yao, Hengshuai (a); Szepesvári, Csaba (a); Sutton, Rich (a); Modayil, Joseph (a); Bhatnagar, Shalabh (b)

a UNIVERSITY OF ALBERTA (Canada)

b INDIAN INSTITUTE OF SCIENCE (India)

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACTING; INFORMATION SCIENCE; STOCHASTIC SYSTEMS;

EFFICIENT COMPUTATION; EXPECTED RETURN; LEARNING MODELS; LINEAR FUNCTIONS; REAL-TIME STRATEGY GAMES; REWARD FUNCTION; STOCHASTIC APPROXIMATION ALGORITHMS; VALUE FUNCTIONS;

APPROXIMATION ALGORITHMS;

EID: 84937951926; PISSN: 10495258; EISSN: None; Source Type: Conference Proceeding
DOI: None; Document Type: Conference Paper

Times cited (26)

References (13)

1
- 77955809093
- Autonomous helicopter aerobatics through apprenticeship learning
- Abbeel, P., Coates, A., and Ng, A. Y. (2010). Autonomous helicopter aerobatics through apprenticeship learning. Int. J. Rob. Res., 29(13): 1608-1639.
- (2010) Int. J. Rob. Res. , vol.29 , Issue.13 , pp. 1608-1639
- Abbeel, P.¹ Coates, A.² Ng, A.Y.³

2
- 2442603180
- Monte Carlo matrix inversion and reinforcement learning
- Barto, A. and Duff, M. (1994). Monte Carlo matrix inversion and reinforcement learning. NIPS, pages 687-694.
- (1994) NIPS , pp. 687-694
- Barto, A.¹ Duff, M.²

3
- 0003487482
- Athena
- Bertsekas, D. P. and Tsitsiklis, J. N. (1996). Neuro-dynamic Programming. Athena.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

4
- 0000439891
- On the convergence of stochastic iterative dynamic programming algorithms
- Jaakkola, T., Jordan, M., and Singh, S. (1994). On the convergence of stochastic iterative dynamic programming algorithms. Neural Computation, 6(6): 1185-1201.
- (1994) Neural Computation , vol.6 , Issue.6 , pp. 1185-1201
- Jaakkola, T.¹ Jordan, M.² Singh, S.³

5
- 0042547347
- Algorithms for inverse reinforcement learning
- Ng, A. Y. and Russell, S. J. (2000). Algorithms for inverse reinforcement learning. ICML, pages 663-670.
- (2000) ICML , pp. 663-670
- Ng, A.Y.¹ Russell, S.J.²

6
- 0003780986
- The PageRank citation ranking: Bringing order to the web
- Page, L., Brin, S., Motwani, R., and Winograd, T. (1998). The PageRank citation ranking: Bringing order to the web. Technical report, Stanford University.
- (1998) Technical Report, Stanford University
- Page, L.¹ Brin, S.² Motwani, R.³ Winograd, T.⁴

7
- 0003392384
- PhD thesis, University of Massachusetts, Amherst
- Precup, D. (2000). Temporal Abstraction in Reinforcement Learning. PhD thesis, University of Massachusetts, Amherst.
- (2000) Temporal Abstraction in Reinforcement Learning
- Precup, D.¹

8
- 16244405068
- The intelligent surfer: Probabilistic combination of link and content information in PageRank
- Richardson, M. and Domingos, P. (2002). The intelligent surfer: Probabilistic combination of link and content information in PageRank. NIPS.
- (2002) NIPS
- Richardson, M.¹ Domingos, P.²

9
- 84868298774
- Linear options
- Sorg, J. and Singh, S. (2010). Linear options. AAMAS, pages 31-38.
- (2010) AAMAS , pp. 31-38
- Sorg, J.¹ Singh, S.²

10
- 0004102479
- MIT Press
- Sutton, R. S. and Barto, A. G. (1998). Reinforcement Learning: An Introduction. MIT Press.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

11
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Sutton, R. S., Precup, D., and Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112: 181-211.
- (1999) Artificial Intelligence , vol.112 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

12
- 84937949237
- PhD thesis, Princeton University
- Syed, U. A. (2010). Reinforcement Learning Without Rewards. PhD thesis, Princeton University.
- (2010) Reinforcement Learning Without Rewards
- Syed, U.A.¹

13
- 65449166085
- Arnetminer: Extraction and mining of academic social networks
- Tang, J., Zhang, J., Yao, L., Li, J., Zhang, L., and Su, Z. (2008). Arnetminer: extraction and mining of academic social networks. SIGKDD, pages 990-998.
- (2008) SIGKDD , pp. 990-998
- Tang, J.¹ Zhang, J.² Yao, L.³ Li, J.⁴ Zhang, L.⁵ Su, Z.⁶

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.