SCOPUS 정보 검색 플랫폼

IJCAI International Joint Conference on Artificial Intelligence

Volumn , Issue , 2011, Pages 1243-1248

Automatic state abstraction from demonstration

(4) Cobo, Luis C a Zang, Peng b Isbell Jr , Charles L b Thomaz, Andrea L b

a Georgia Institute of Technology (United States)

b Georgia Institute of Technology (United States)

Author keywords

[No Author keywords available]

Indexed keywords

COMPLEX TASK; LEARNING FROM DEMONSTRATION; REINFORCEMENT LEARNING METHOD; STATE ABSTRACTION; TRAINING EXAMPLE;

ABSTRACTING; ARTIFICIAL INTELLIGENCE; REINFORCEMENT LEARNING;

DEMONSTRATIONS;

EID: 84881076324 PISSN: 10450823 EISSN: None Source Type: Conference Proceeding
DOI: 10.5591/978-1-57735-516-8/IJCAI11-211 Document Type: Conference Paper

Times cited : (31)

References (17)

1
- 14344251217
- Apprenticeship learning via inverse reinforcement learning
- ACM
- Pieter Abbeel and Andrew Y. Ng. Apprenticeship learning via inverse reinforcement learning. In International Conference on Machine Learning, page 1. ACM, 2004.
- (2004) International Conference on Machine Learning , pp. 1
- Abbeel, P.¹ Ng, A.Y.²

2
- 27144521144
- Correcting and improving imitation models of humans for robosoccer agents
- R. Aler, O. Garcia, and JM Valls. Correcting and improving imitation models of humans for robosoccer agents. In IEEE Congress on Evolutionary Computation, volume 3, 2005.
- (2005) IEEE Congress on Evolutionary Computation , vol.3
- Aler, R.¹ Garcia, O.² Valls, J.M.³

3
- 63149159130
- A survey of robot learning from demonstration
- May
- Brenna D. Argall, Sonia Chernova, Manuela Veloso, and Brett Browning. A survey of robot learning from demonstration. Robotics and Autonomous Systems, 57(5):469-483, May 2009.
- (2009) Robotics and Autonomous Systems , vol.57 , Issue.5 , pp. 469-483
- Argall, B.D.¹ Chernova, S.² Veloso, M.³ Browning, B.⁴

4
- 58149180961
- Learning classifiers from only positive and unlabeled data
- ACM
- C. Elkan and K. Noto. Learning classifiers from only positive and unlabeled data. In International Conference on Knowledge Discovery and Data Mining, pages 213-220. ACM, 2008.
- (2008) International Conference on Knowledge Discovery and Data Mining , pp. 213-220
- Elkan, C.¹ Noto, K.²

5
- 70049096468
- Regularized policy iteration
- A.M. Farahmand, M. Ghavamzadeh, C. Szepesvari, and S. Mannor. Regularized policy iteration. Advances in Neural Information Processing Systems, 21:441-448, 2009.
- (2009) Advances in Neural Information Processing Systems , vol.21 , pp. 441-448
- Farahmand, A.M.¹ Ghavamzadeh, M.² Szepesvari, C.³ Mannor, S.⁴

6
- 0004060921
- PhD thesis
- M.A. Hall. Correlation-based feature selection for machine learning. PhD thesis, 1999.
- (1999) Correlation-based Feature Selection for Machine Learning
- Hall, M.A.¹

7
- 84861670983
- State Abstraction Discovery from Irrelevant State Variables
- Nicholas K Jong and Peter Stone. State Abstraction Discovery from Irrelevant State Variables. International Joint Conference on Artificial Intelligence, pages 752-757, 2005.
- (2005) International Joint Conference on Artificial Intelligence , pp. 752-757
- Jong, N.K.¹ Stone, P.²

8
- 33749263205
- Automatic basis function construction for approximate dynamic programming and reinforcement learning
- ACM
- P.W. Keller, S. Mannor, and D. Precup. Automatic basis function construction for approximate dynamic programming and reinforcement learning. In International Conference on Machine learning, pages 449-456. ACM, 2006.
- (2006) International Conference on Machine Learning , pp. 449-456
- Keller, P.W.¹ Mannor, S.² Precup, D.³

9
- 71149121683
- Regularization and feature selection in least-squares temporal difference learning
- ACM
- J.Z. Kolter and A.Y. Ng. Regularization and feature selection in least-squares temporal difference learning. In International Conference on Machine Learning, pages 521-528. ACM, 2009.
- (2009) International Conference on Machine Learning , pp. 521-528
- Kolter, J.Z.¹ Ng, A.Y.²

10
- 21844450933
- Learning from positive and unlabeled examples
- Springer
- F. Letouzey, F. Denis, and R. Gilleron. Learning from positive and unlabeled examples. In Algorithmic Learning Theory, pages 71-85. Springer, 2009.
- (2009) Algorithmic Learning Theory , pp. 71-85
- Letouzey, F.¹ Denis, F.² Gilleron, R.³

11
- 84881074926
- Automatic induction of maxq hierarchies
- S. Ray N. Mehta, M. Wynkoop, P. Tadepalli, and T. Dietterich. Automatic induction of maxq hierarchies. In Proceedings of the Hierarchical Organization of Behavior Workshop. 21st Conference on Neural Information Processing Systems, 2007.
- Proceedings of the Hierarchical Organization of Behavior Workshop. 21st Conference on Neural Information Processing Systems, 2007
- Ray, S.¹ Mehta, N.² Wynkoop, M.³ Tadepalli, P.⁴ Dietterich, T.⁵

12
- 84898980684
- Autonomous helicopter flight via reinforcement learning
- A.Y. Ng, H.J. Kim, M.I. Jordan, S. Sastry, and S. Ballianda. Autonomous helicopter flight via reinforcement learning. Advances in Neural Information Processing Systems, 16, 2004.
- (2004) Advances in Neural Information Processing Systems , pp. 16
- Ng, A.Y.¹ Kim, H.J.² Jordan, M.I.³ Sastry, S.⁴ Ballianda, S.⁵

13
- 34547982545
- Analyzing feature generation for value-function approximation
- ACM
- R. Parr, C. Painter-Wakefield, L. Li, and M. Littman. Analyzing feature generation for value-function approximation. In International Conference on Machine learning, pages 737-744. ACM, 2007.
- (2007) International Conference on Machine Learning , pp. 737-744
- Parr, R.¹ Painter-Wakefield, C.² Li, L.³ Littman, M.⁴

14
- 2142812536
- Learning without state-estimation in partially observable Markovian decision processes
- S.P. Singh, Tommi Jaakkola, and M.I. Jordan. Learning without state-estimation in partially observable Markovian decision processes. In International Conference on Machine Learning, pages 284-292, 1994.
- (1994) International Conference on Machine Learning , pp. 284-292
- Singh, S.P.¹ Jaakkola, T.² Jordan, M.I.³

15
- 0036058423
- Effective reinforcement learning for mobile robots
- W.D. Smart and L.P. Kaelbling. Effective reinforcement learning for mobile robots. In IEEE International Conference on Robotics and Automation, volume 4, pages 3404-3410, 2002.
- (2002) IEEE International Conference on Robotics and Automation , vol.4 , pp. 3404-3410
- Smart, W.D.¹ Kaelbling, L.P.²

16
- 71149102986
- Discovering options from example trajectories
- ACM New York, NY, USA
- P. Zang, P. Zhou, D. Minnen, and C.L. Isbell. Discovering options from example trajectories. In Proceedings of the 26th Annual International Conference on Machine Learning. ACM New York, NY, USA, 2009.
- (2009) Proceedings of the 26th Annual International Conference on Machine Learning
- Zang, P.¹ Zhou, P.² Minnen, D.³ Isbell, C.L.⁴

17
- 33947681316
- MLKNN: A lazy learning approach to multi-label learning
- M.L. Zhang and Z.H. Zhou. MLKNN: A lazy learning approach to multi-label learning. Pattern Recognition, 40(7):2038-2048, 2007.
- (2007) Pattern Recognition , vol.40 , Issue.7 , pp. 2038-2048
- Zhang, M.L.¹ Zhou, Z.H.²

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.