SCOPUS Information Search Platform

Journal of China Universities of Posts and Telecommunications

Volume 21, Issue 5, 2014, Pages 94-104

Autonomic discovery of subgoals in hierarchical reinforcement learning

(3) Xiao, Ding a; Li, Yi Tong a; Shi, Chuan a

a BEIJING UNIVERSITY OF POSTS AND TELECOMMUNICATIONS (China)

Author keywords

Hierarchical reinforcement learning; Option; Q learning; Subgoal; UDV

Indexed keywords

SOCIAL NETWORKING (ONLINE);

HIERARCHICAL REINFORCEMENT LEARNING; OPTION; Q-LEARNING; SUBGOALS; UDV;

REINFORCEMENT LEARNING;

EID: 84926486227 PISSN: 10058885 EISSN: 22105123 Source Type: Journal
DOI: 10.1016/S1005-8885(14)60337-X Document Type: Article

Times cited: (9)

References (21)

1
- 85153965130
- Reinforcement learning with soft state aggregation
- Nov 28-Dec 1, 1994, Denver, CO, USA. Cambridge, MA USA: MIT Press
- Singh S P, Jaakkola T, Jordan M I. Reinforcement learning with soft state aggregation. Advances in Neural Information Processing Systems 7: Proceedings of the Neural Information Processing Systems Conference (NIPS'94), Nov 28-Dec 1, 1994, Denver, CO, USA. Cambridge, MA USA: MIT Press, 1995: 361-368
- (1995) Advances in Neural Information Processing Systems 7: Proceedings of the Neural Information Processing Systems Conference (NIPS'94) , pp. 361-368
- Singh, S.P.¹ Jaakkola, T.² Jordan, M.I.³

2
- 0031143730
- An analysis of temporal-difference learning with function approximation
- Tsitsiklis J N, Van Roy B. An analysis of temporal-difference learning with function approximation. IEEE Transactions on Automatic Control, 1997, 42(5): 674-690
- (1997) IEEE Transactions on Automatic Control , vol.42 , Issue.5 , pp. 674-690
- Tsitsiklis, J.N.¹ Van Roy, B.²

3
- 0002278788
- Hierarchical reinforcement learning with the MAXQ value function decomposition
- Dietterich T G. Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 2000, 13: 227-303
- (2000) Journal of Artificial Intelligence Research , vol.13 , pp. 227-303
- Dietterich, T.G.¹

4
- 0003989214
- PhD Thesis. Berkeley, CA USA: University of California, Berkeley
- Parr R. Hierarchical control and learning for Markov decision processes. PhD Thesis. Berkeley, CA USA: University of California, Berkeley, 1998
- (1998) Hierarchical Control and Learning for Markov Decision Processes
- Parr, R.¹

5
- 31844447221
- Identifying useful subgoals in reinforcement learning by local graph partitioning
- Aug 7-10, 2005, Bonn, Germany. New York, NY, USA: ACM
- Simsek Ö, Wolfe P A, Barto A G. Identifying useful subgoals in reinforcement learning by local graph partitioning. Proceedings of the 22nd International Conference on Machine Learning (ICML'05), Aug 7-10, 2005, Bonn, Germany. New York, NY, USA: ACM, 2005: 816-823
- (2005) Proceedings of the 22nd International Conference on Machine Learning (ICML'05) , pp. 816-823
- Simsek, O.¹ Wolfe, P.A.² Barto, A.G.³

6
- 0033170372
- Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
- Sutton R S, Precup D, Singh S. Between MDPs and semi-MDPs: a framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 1999, 112(1/2): 181-211
- (1999) Artificial Intelligence , vol.112 , Issue.1-2 , pp. 181-211
- Sutton, R.S.¹ Precup, D.² Singh, S.³

7
- 0004782095
- Learning hierarchical control structure for multiple tasks and changing environments
- Aug 17-21, 1998, Zurich, Switzerland. Cambridge, MA, USA: MIT Press
- Digney B L. Learning hierarchical control structure for multiple tasks and changing environments. From Animals to Animats 5: Proceedings of the 5th International Conference on Simulation of Adaptive Behavior (SAB'98). Aug 17-21, 1998, Zurich, Switzerland. Cambridge, MA, USA: MIT Press, 1998: 321-330
- (1998) From Animals to Animats 5: Proceedings of the 5th International Conference on Simulation of Adaptive Behavior (SAB'98) , pp. 321-330
- Digney, B.L.¹

8
- 0013465187
- Automatic discovery of subgoals in reinforcement learning using diverse density
- Jun 28-Jul 1, Williamstown, MA, USA. San Francisco, CA, USA: Morgan Kaufmann
- McGovern A, Barto A G. Automatic discovery of subgoals in reinforcement learning using diverse density. Proceedings of the 18th International Conference on Machine Learning (ICML'01), Jun 28-Jul 1, Williamstown, MA, USA. San Francisco, CA, USA: Morgan Kaufmann, 2001: 361-368
- (2001) Proceedings of the 18th International Conference on Machine Learning (ICML'01) , pp. 361-368
- McGovern, A.¹ Barto, A.G.²

9
- 84912073624
- Learning options in reinforcement learning
- Aug 2-4, Kananaskis, Canada. Berlin, Germany: Springer
- Stolle M, Precup D. Learning options in reinforcement learning. Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation (SARA'02), Aug 2-4, Kananaskis, Canada. Berlin, Germany: Springer, 2002: 212-223
- (2002) Proceedings of the 5th International Symposium on Abstraction, Reformulation and Approximation (SARA'02) , pp. 212-223
- Stolle, M.¹ Precup, D.²

10
- 29344448283
- Autonomous subgoal discovery and hierarchical abstraction for reinforcement learning using Monte Carlo method
- Jul 9-13, 2005, Pittsburgh, PA, USA. Cambridge, MA, USA: MIT Press
- Asadi M, Huber M. Autonomous subgoal discovery and hierarchical abstraction for reinforcement learning using Monte Carlo method. Proceedings of the 20th National Conference on Artificial Intelligence and the 17th Innovative Applications of Artificial Intelligence Conference (AAAI'05), Jul 9-13, 2005, Pittsburgh, PA, USA. Cambridge, MA, USA: MIT Press, 2005: 1588-1589
- (2005) Proceedings of the 20th National Conference on Artificial Intelligence and the 17th Innovative Applications of Artificial Intelligence Conference (AAAI'05) , pp. 1588-1589
- Asadi, M.¹ Huber, M.²

11
- 29344435556
- Subgoal discovery for hierarchical reinforcement learning using learnt policies
- May 12-14, 2003, St Augustine, FL, USA
- Goel S, Huber M. Subgoal discovery for hierarchical reinforcement learning using learnt policies. Proceedings of the 16th International Florida Artificial Intelligence Research Society Conference (FLAIRS'03), May 12-14, 2003, St Augustine, FL, USA. 2003: 346-350
- (2003) Proceedings of the 16th International Florida Artificial Intelligence Research Society Conference (FLAIRS'03) , pp. 346-350
- Goel, S.¹ Huber, M.²

12
- 14344250635
- Dynamic abstraction in reinforcement learning via clustering
- Jul 4-8, 2004, Banff, Canada. San Francisco, CA, USA: Morgan Kaufmann
- Mannor S, Menache I, Hoze I, et al. Dynamic abstraction in reinforcement learning via clustering. Proceedings of the 21st International Conference on Machine Learning (ICML'04), Jul 4-8, 2004, Banff, Canada. San Francisco, CA, USA: Morgan Kaufmann, 2004: 560-567
- (2004) Proceedings of the 21st International Conference on Machine Learning (ICML'04) , pp. 560-567
- Mannor, S.¹ Menache, I.² Hoze, I.³

13
- 84945250000
- Q-cut - dynamic discovery of subgoals in reinforcement learning
- Aug 19-23, 2002, Helsinki, Finland. Berlin, Germany: Springer
- Menache I, Mannor S, Shimkin N. Q-cut - dynamic discovery of subgoals in reinforcement learning. Proceedings of the 13th European Conference on Machine Learning (ECML'02), Aug 19-23, 2002, Helsinki, Finland. Berlin, Germany: Springer, 2002: 295-306
- (2002) Proceedings of the 13th European Conference on Machine Learning (ECML'02) , pp. 295-306
- Menache, I.¹ Mannor, S.² Shimkin, N.³

14
- 33750954561
- Automatic option generation in hierarchical reinforcement learning via immune clustering
- Jan 19-21, 2006, Harbin, China. Piscataway, NJ, USA: IEEE
- Jing S, Gu G C, Liu H B. Automatic option generation in hierarchical reinforcement learning via immune clustering. Proceedings of the 1st International Symposium on Systems and Control in Aerospace and Astronautics (SSCAA'06), Jan 19-21, 2006, Harbin, China. Piscataway, NJ, USA: IEEE, 2006: 4p
- (2006) Proceedings of the 1st International Symposium on Systems and Control in Aerospace and Astronautics (SSCAA'06)
- Jing, S.¹ Gu, G.C.² Liu, H.B.³

15
- 78651097494
- Skill characterization based on betweenness
- Dec 8-11, 2008, Vancouver, Canada. Cambridge, MA, USA: MIT Press
- Simsek Ö, Barto A G. Skill characterization based on betweenness. Advances in Neural Information Processing Systems 21: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems (NIPS'09), Dec 8-11, 2008, Vancouver, Canada. Cambridge, MA, USA: MIT Press, 2009: 1497-1504
- (2009) Advances in Neural Information Processing Systems 21: Proceedings of the 22nd Annual Conference on Neural Information Processing Systems (NIPS'09) , pp. 1497-1504
- Simsek, O.¹ Barto, A.G.²

16
- 84886423138
- Subgoal discovery in reinforcement learning using local graph clustering
- Entezari N, Shiri M E, Moradi P. Subgoal discovery in reinforcement learning using local graph clustering. International Journal of Future Generation Communication and Networking, 2011, 4(3): 13-23
- (2011) International Journal of Future Generation Communication and Networking , vol.4 , Issue.3 , pp. 13-23
- Entezari, N.¹ Shiri, M.E.² Moradi, P.³

17
- 77958563254
- PUMA: Planning under uncertainty with macro-actions
- Jul 11-15, 2010, Atlanta, GA, USA. Cambridge, MA, USA: MIT Press
- He R J, Brunskill E, Roy N. PUMA: Planning under uncertainty with macro-actions. Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI'10), Jul 11-15, 2010, Atlanta, GA, USA. Cambridge, MA, USA: MIT Press, 2010: 1089-1096
- (2010) Proceedings of the 24th AAAI Conference on Artificial Intelligence (AAAI'10) , pp. 1089-1096
- He, R.J.¹ Brunskill, E.² Roy, N.³

18
- 78751681641
- Efficient skill learning using abstraction selection
- Jul 11-17, 2009, Pasadena, CA, USA
- Konidaris G, Barto A. Efficient skill learning using abstraction selection. Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI'09), Jul 11-17, 2009, Pasadena, CA, USA. 2009: 1107-1113
- (2009) Proceedings of the 21st International Joint Conference on Artificial Intelligence (IJCAI'09) , pp. 1107-1113
- Konidaris, G.¹ Barto, A.²

19
- 33745937922
- K-cluster subgoal discovery algorithm for option
- (in Chinese)
- Wang B N, Gao Y, Chen Z Q, et al. K-cluster subgoal discovery algorithm for option. Journal of Computer Research and Development, 2006, 42(5): 851-855 (in Chinese)
- (2006) Journal of Computer Research and Development , vol.42 , Issue.5 , pp. 851-855
- Wang, B.N.¹ Gao, Y.² Chen, Z.Q.³

20
- 0004102479
- Cambridge, MA, USA: MIT Press
- Sutton R S, Barto A G. Reinforcement learning: An introduction. Cambridge, MA, USA: MIT Press, 1998
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.S.¹ Barto, A.G.²

21
- 0003392384
- PhD Thesis. Amherst, MA, USA: University of Massachusetts
- Precup D. Temporal abstraction in reinforcement learning. PhD Thesis. Amherst, MA, USA: University of Massachusetts, 2000
- (2000) Temporal Abstraction in Reinforcement Learning
- Precup, D.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.