메뉴 건너뛰기




Volumn 2, Issue January, 2014, Pages 990-998

Universal option models

Author keywords

[No Author keywords available]

Indexed keywords

ABSTRACTING; INFORMATION SCIENCE; STOCHASTIC SYSTEMS;

EID: 84937951926     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (26)

References (13)
  • 1
    • 77955809093 scopus 로고    scopus 로고
    • Autonomous helicopter aerobatics through apprenticeship learning
    • Abbeel, P., Coates, A., and Ng, A. Y. (2010). Autonomous helicopter aerobatics through apprenticeship learning. Int. J. Rob. Res., 29(13): 1608-1639.
    • (2010) Int. J. Rob. Res. , vol.29 , Issue.13 , pp. 1608-1639
    • Abbeel, P.1    Coates, A.2    Ng, A.Y.3
  • 2
    • 2442603180 scopus 로고
    • Monte Carlo matrix inversion and reinforcement learning
    • Barto, A. and Duff, M. (1994). Monte carlo matrix inversion and reinforcement learning. NIPS, pages 687-694.
    • (1994) NIPS , pp. 687-694
    • Barto, A.1    Duff, M.2
  • 4
    • 0000439891 scopus 로고
    • On the convergence of stochastic iterative dynamic programming algorithms
    • Jaakkola, T., Jordan, M., and Singh, S. (1994). On the convergence of stochastic iterative dynamic programming algorithms. Neural Computation, 6(6): 1185-1201.
    • (1994) Neural Computation , vol.6 , Issue.6 , pp. 1185-1201
    • Jaakkola, T.1    Jordan, M.2    Singh, S.3
  • 5
    • 0042547347 scopus 로고    scopus 로고
    • Algorithms for inverse reinforcement learning
    • Ng, A. Y. and Russell, S. J. (2000). Algorithms for inverse reinforcement learning. ICML, pages 663-670.
    • (2000) ICML , pp. 663-670
    • Ng, A.Y.1    Russell, S.J.2
  • 8
    • 16244405068 scopus 로고    scopus 로고
    • The intelligent surfer: Probabilistic combination of link and content information in PageRank
    • Richardson, M. and Domingos, P. (2002). The intelligent surfer: Probabilistic combination of link and content information in PageRank. NIPS.
    • (2002) NIPS
    • Richardson, M.1    Domingos, P.2
  • 9
    • 84868298774 scopus 로고    scopus 로고
    • Linear options
    • Sorg, J. and Singh, S. (2010). Linear options. AAMAS, pages 31-38.
    • (2010) AAMAS , pp. 31-38
    • Sorg, J.1    Singh, S.2
  • 11
    • 0033170372 scopus 로고    scopus 로고
    • Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
    • Sutton, R. S., Precup, D., and Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112: 181-211.
    • (1999) Artificial Intelligence , vol.112 , pp. 181-211
    • Sutton, R.S.1    Precup, D.2    Singh, S.3
  • 13
    • 65449166085 scopus 로고    scopus 로고
    • Arnetminer: Extraction and mining of academic social networks
    • Tang, J., Zhang, J., Yao, L., Li, J., Zhang, L., and Su, Z. (2008). Arnetminer: extraction and mining of academic social networks. SIGKDD, pages 990-998.
    • (2008) SIGKDD , pp. 990-998
    • Tang, J.1    Zhang, J.2    Yao, L.3    Li, J.4    Zhang, L.5    Su, Z.6


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.