SCOPUS Information Search Platform

Volume WS-07-02, 2007, Pages 52-56

Hierarchical strategy learning with hybrid representations

Yoon, Sungwook (a); Kambhampati, Subbarao (a)

(a) Arizona State University (United States)

Author keywords

[No Author keywords available]

Indexed keywords

HIERARCHICAL REPRESENTATION; NOVICE LEARNERS; PLANNING ALGORITHMS; STRATEGIC LEVELS; STRATEGY LEARNING; SUPERIOR PERFORMANCE; TECHNICAL REPORTS; VALUE FUNCTIONS;

DECISION MAKING; LEARNING ALGORITHMS; PROBLEM SOLVING;

KNOWLEDGE REPRESENTATION;

EID: 51849088509 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (6)

References (19)

1
- 14344251217
- Apprenticeship learning via inverse reinforcement learning
- Abbeel, P., and Ng, A. Y. 2004. Apprenticeship learning via inverse reinforcement learning. In Proc. ICML.
- (2004) Proc. ICML
- Abbeel, P.¹ Ng, A.Y.²

2
- 0013465186
- Andre, D., and Russell, S. 2000. Programmable reinforcement learning agents.
- (2000) Programmable reinforcement learning agents
- Andre, D.¹ Russell, S.²

3
- 0024103809
- Prism: An algorithm for inducing modular rules
- Cendrowska, J. 1987. Prism: An algorithm for inducing modular rules. International Journal of Man-Machine Studies 27(4):349-370.
- (1987) International Journal of Man-Machine Studies , vol.27 , Issue.4 , pp. 349-370
- Cendrowska, J.¹

4
- 84870778933
- Chawla, N. V.; Bowyer, K. W.; Hall, L. O.; and Kegelmeyer, W. P. 2002. Smote: Synthetic minority oversampling technique.
- (2002) Smote: Synthetic minority oversampling technique
- Chawla, N.V.¹ Bowyer, K.W.² Hall, L.O.³ Kegelmeyer, W.P.⁴

5
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- Dietterich, T. G. 2000. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research 13:227-303.
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.¹

6
- 26444572401
- Complexity results for HTN planning
- Erol, K.; Hendler, J. A.; and Nau, D. S. 1996. Complexity results for HTN planning. Annals of Mathematics and Artificial Intelligence 18(1):69-93.
- (1996) Annals of Mathematics and Artificial Intelligence , vol.18 , Issue.1 , pp. 69-93
- Erol, K.¹ Hendler, J.A.² Nau, D.S.³

7
- 22944468731
- Approximate policy iteration with a policy language bias
- Fern, A.; Yoon, S.; and Givan, R. 2003. Approximate policy iteration with a policy language bias. In Proceedings of the 16th Conference on Advances in Neural Information Processing.
- (2003) Proceedings of the 16th Conference on Advances in Neural Information Processing
- Fern, A.¹ Yoon, S.² Givan, R.³

8
- 17444432872
- Camel: Learning method preconditions for HTN planning
- Ilghami, O.; Nau, D. S.; and Munoz-Avila, H. 2002. Camel: Learning method preconditions for HTN planning. In AIPS02.
- (2002) AIPS02
- Ilghami, O.¹ Nau, D.S.² Munoz-Avila, H.³

9
- 33750586671
- Solving factored MDPs with hybrid state and action variables
- Kveton, B.; Hauskrecht, M.; and Guestrin, C. 2006. Solving factored MDPs with hybrid state and action variables. Journal of Artificial Intelligence Research 27:153-201.
- (2006) Journal of Artificial Intelligence Research , vol.27 , pp. 153-201
- Kveton, B.¹ Hauskrecht, M.² Guestrin, C.³

10
- 33845772427
- Maloof, M. 2003. Learning when data sets are imbalanced and when costs are unequal and unknown.
- (2003) Learning when data sets are imbalanced and when costs are unequal and unknown
- Maloof, M.¹

11
- 0027574520
- Taxonomic syntax for first-order inference
- McAllester, D., and Givan, R. 1993. Taxonomic syntax for first-order inference. Journal of the ACM 40:246-283.
- (1993) Journal of the ACM , vol.40 , pp. 246-283
- McAllester, D.¹ Givan, R.²

12
- 14344250066
- Learning to fly by combining reinforcement learning with behavioural cloning
- Morales, E., and Sammut, C. 2004. Learning to fly by combining reinforcement learning with behavioural cloning. In ICML.
- (2004) ICML
- Morales, E.¹ Sammut, C.²

13
- 84880665976
- Shop: Simple hierarchical ordered planner
- Nau, D.; Cao, Y.; Lotem, A.; and Munoz-Avila, H. 1999. Shop: Simple hierarchical ordered planner. In Proceedings of the International Joint Conference on Artificial Intelligence, 968-973.
- (1999) Proceedings of the International Joint Conference on Artificial Intelligence , pp. 968-973
- Nau, D.¹ Cao, Y.² Lotem, A.³ Munoz-Avila, H.⁴

14
- 0001070375
- Reinforcement learning with hierarchies of machines
- Parr, R., and Russell, S. 1997. Reinforcement learning with hierarchies of machines. In Jordan, M. I.; Kearns, M. J.; and Solla, S. A., eds., Advances in Neural Information Processing Systems, volume 10. The MIT Press.
- (1997) Advances in Neural Information Processing Systems , vol.10
- Parr, R.¹ Russell, S.²

15
- 29144443664
- Minority report in fraud detection: Classification of skewed data
- Phua, C.; Alahakoon, D.; and Lee, V. 2004. Minority report in fraud detection: Classification of skewed data. SIGKDD Explor. Newsl. 6(1):50-59.
- (2004) SIGKDD Explor. Newsl. , vol.6 , Issue.1 , pp. 50-59
- Phua, C.¹ Alahakoon, D.² Lee, V.³

16
- 84899003140
- Multi-time models for temporally abstract planning
- Precup, D., and Sutton, R. S. 1998. Multi-time models for temporally abstract planning. In Jordan, M. I.; Kearns, M. J.; and Solla, S. A., eds., Advances in Neural Information Processing Systems, volume 10. The MIT Press.
- (1998) Advances in Neural Information Processing Systems , vol.10
- Precup, D.¹ Sutton, R.S.²

17
- 13444310066
- Inductive policy selection for first-order MDPs
- Yoon, S.; Fern, A.; and Givan, R. 2002. Inductive policy selection for first-order MDPs. In Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence.
- (2002) Proceedings of the Eighteenth Conference on Uncertainty in Artificial Intelligence
- Yoon, S.¹ Fern, A.² Givan, R.³

18
- 29344443330
- Learning measures of progress for planning domains
- Yoon, S.; Fern, A.; and Givan, R. 2005. Learning measures of progress for planning domains. In Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence.
- (2005) Proceedings of the Twentieth Conference on Uncertainty in Artificial Intelligence
- Yoon, S.¹ Fern, A.² Givan, R.³

19
- 9444296873
- Younes, H.; Musliner, D.; and Simmons, R. 2003. A framework for planning in continuous-time stochastic domains.
- (2003) A framework for planning in continuous-time stochastic domains
- Younes, H.¹ Musliner, D.² Simmons, R.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.