Machine Learning, Volume 73, Issue 3, 2008, Pages 289-312

Transfer in variable-reward hierarchical reinforcement learning

Author keywords

Average reward learning; Hierarchical reinforcement learning; Multi-criteria learning; Transfer learning

Indexed keywords

EDUCATION; LEARNING SYSTEMS; PROBABILITY DENSITY FUNCTION; PROBLEM SOLVING; REINFORCEMENT; REINFORCEMENT LEARNING; SILVER;

EID: 55149090494     PISSN: 08856125     EISSN: 15730565     Source Type: Journal
DOI: 10.1007/s10994-008-5061-y     Document Type: Article
Times cited: 78

References (24)
  • 1. Abbeel, P., & Ng, A. (2004). Apprenticeship learning via inverse reinforcement learning. In Proceedings of the ICML.
  • 3. Dietterich, T. (2000). Hierarchical reinforcement learning with the MAXQ value function decomposition. Journal of Artificial Intelligence Research, 13, 227-303.
  • 4. Feinberg, E., & Schwartz, A. (1995). Constrained Markov decision models with weighted discounted rewards. Mathematics of Operations Research, 20(2), 302-320.
  • 8. Kaelbling, L., Littman, M., & Cassandra, A. (1998). Planning and acting in partially observable stochastic domains. Artificial Intelligence.
  • 12. Natarajan, S., & Tadepalli, P. (2005). Dynamic preferences in multi-criteria reinforcement learning. In Proceedings of the ICML.
  • 13. Parr, R. (1998). Flexible decomposition algorithms for weakly coupled Markov decision problems. In UAI.
  • 16. Russell, S., & Zimdars, A. (2003). Q-decomposition for reinforcement learning agents. In Proceedings of ICML-03.
  • 18. Seri, S., & Tadepalli, P. (2002). Model-based hierarchical average reward reinforcement learning. In Proceedings of the ICML (pp. 562-569).
  • 19. Sutton, R., Precup, D., & Singh, S. (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1-2), 181-211.
  • 20. Tadepalli, P., & Ok, D. (1998). Model-based average reward reinforcement learning. Artificial Intelligence, 100, 177-224.
  • 24. White, D. (1982). Multi-objective infinite-horizon discounted Markov decision processes. Journal of Mathematical Analysis and Applications, 89, 639-647.


* This information was extracted by KISTI through analysis of Elsevier's SCOPUS database.