|
Volumn 2, Issue , 2010, Pages 709-714
|
Optimal policy switching algorithms for reinforcement learning
|
Author keywords
Markov Decision Processes; Policy gradient; Reinforcement learning; Temporal abstraction
|
Indexed keywords
GRADIENT METHODS;
LEARNING ALGORITHMS;
MARKOV PROCESSES;
MULTI AGENT SYSTEMS;
OPTIMIZATION;
REINFORCEMENT LEARNING;
GRADIENT BASED ALGORITHM;
LONG-TERM RETURNS;
MARKOV DECISION PROCESSES;
POLICY GRADIENT;
SEQUENTIAL DECISION MAKING;
SWITCHING ALGORITHMS;
TEMPORAL ABSTRACTION;
TERMINATION CONDITION;
AUTONOMOUS AGENTS;
|
EID: 80053022338
PISSN: 15488403
EISSN: 15582914
Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper |
Times cited : (34)
|
References (14)
|