메뉴 건너뛰기




Volumn , Issue , 2004, Pages 631-636

Advice generation from observed execution: Abstract Markov decision process learning

Author keywords

[No Author keywords available]

Indexed keywords

ADVICE GENERATION; DOMAINS; EXECUTABLE ADVICE; MARKOV DECISION PROCESS (MDP);

EID: 9444229271     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (2)

References (17)
  • 1
    • 0037288370 scopus 로고    scopus 로고
    • Recent advances in hierarchical reinforcement learning
    • Barto, A., and Mahadevan, S. 2003. Recent advances in hierarchical reinforcement learning. Discrete-Event Systems Journal 13:41-77.
    • (2003) Discrete-Event Systems Journal , vol.13 , pp. 41-77
    • Barto, A.1    Mahadevan, S.2
  • 2
    • 0027694889 scopus 로고
    • A bayesian model of plan recognition
    • Charniak, E., and Goldman, R. 1993. A Bayesian model of plan recognition. Artificial Intelligence 64(1):53-79.
    • (1993) Artificial Intelligence , vol.64 , Issue.1 , pp. 53-79
    • Charniak, E.1    Goldman, R.2
  • 3
    • 0030697013 scopus 로고    scopus 로고
    • Abstraction and approximate decision theoretic planning
    • Dearden, R., and Boutilier, C. 1997. Abstraction and approximate decision theoretic planning. Artificial Intelligence 89(1):219-283.
    • (1997) Artificial Intelligence , vol.89 , Issue.1 , pp. 219-283
    • Dearden, R.1    Boutilier, C.2
  • 4
    • 13444302959 scopus 로고    scopus 로고
    • Fault tolerant planning: Toward probabilistic uncertainty models in symbolic non-deterministic planning
    • Jensen, R. M.; Veloso, M. M.; and Bryant, R. E. 2004. Fault Tolerant Planning: Toward Probabilistic Uncertainty Models in Symbolic Non-Deterministic Planning. In ICAPS04.
    • (2004) ICAPS04
    • Jensen, R.M.1    Veloso, M.M.2    Bryant, R.E.3
  • 5
    • 0003318787 scopus 로고
    • A formal theory of plan recognition and its implementation
    • Allen, J. F.; Kautz, H. A.; Pelavin, R. N.; and Tenenberg, J. D., eds. Los Altos, CA: Morgan Kaufmann. chapter 2
    • Kautz, H. A. 1991. A Formal theory of plan recognition and its implementation. In Allen, J. F.; Kautz, H. A.; Pelavin, R. N.; and Tenenberg, J. D., eds., Reasoning About Plans. Los Altos, CA: Morgan Kaufmann. chapter 2.
    • (1991) Reasoning about Plans
    • Kautz, H.A.1
  • 7
    • 9444221186 scopus 로고    scopus 로고
    • The champion UT Austin Villa 2003 simulator online coach team
    • Polani, D.; Browning, B.; Bonarini, A.; and Yoshida, K., eds. Berlin: Springer Verlag. (to appear)
    • Kuhlmann, G.; Stone, P.; and Lallinger, J. 2004. The champion UT Austin Villa 2003 simulator online coach team. In Polani, D.; Browning, B.; Bonarini, A.; and Yoshida, K., eds., RoboCup-2003: Robot Soccer World Cup VII. Berlin: Springer Verlag. (to appear).
    • (2004) RoboCup-2003: Robot Soccer World Cup VII
    • Kuhlmann, G.1    Stone, P.2    Lallinger, J.3
  • 8
    • 0029732210 scopus 로고    scopus 로고
    • Creating advice-taking reinforcement learners
    • Maclin, R., and Shavlik, J. W. 1996. Creating advice-taking reinforcement learners. Machine Learning 22:251-282.
    • (1996) Machine Learning , vol.22 , pp. 251-282
    • Maclin, R.1    Shavlik, J.W.2
  • 10
    • 84949961009 scopus 로고    scopus 로고
    • Automated advice-giving strategies for scientific inquiry
    • Paolucci, M.; Suthers, D. D.; and Weiner, A. 1996. Automated advice-giving strategies for scientific inquiry. In ITS-96, 372-381.
    • (1996) ITS-96 , pp. 372-381
    • Paolucci, M.1    Suthers, D.D.2    Weiner, A.3
  • 12
    • 9444243045 scopus 로고    scopus 로고
    • Automated assistant to aid humans in understanding team behaviors
    • Raines, T.; Tambe, M.; and Marsella, S. 2000. Automated assistant to aid humans in understanding team behaviors. In Agents-2000.
    • (2000) Agents-2000
    • Raines, T.1    Tambe, M.2    Marsella, S.3
  • 13
    • 0010221077 scopus 로고    scopus 로고
    • An empirical study of coaching
    • Asama, H.; Arai, T.; Fukuda, T.; and Hasegawa, T., eds. Springer-Verlag. 215-224
    • Riley, P.; Veloso, M.; and Kaminka, G. 2002. An empirical study of coaching. In Asama, H.; Arai, T.; Fukuda, T.; and Hasegawa, T., eds., Distributed Autonomous Robotic Systems 5. Springer-Verlag. 215-224.
    • (2002) Distributed Autonomous Robotic Systems , vol.5
    • Riley, P.1    Veloso, M.2    Kaminka, G.3
  • 14
    • 0022059617 scopus 로고
    • Iterative aggregation-deaggregation procedures for discounted semi-Markov reward processes
    • Schweitzer, P. L.; Puterman, M. L.; and Kindle, K. W. 1985. Iterative aggregation-deaggregation procedures for discounted semi-Markov reward processes. Operations Research 33:589-605.
    • (1985) Operations Research , vol.33 , pp. 589-605
    • Schweitzer, P.L.1    Puterman, M.L.2    Kindle, K.W.3
  • 15
  • 16
    • 9444294799 scopus 로고    scopus 로고
    • TTree: Tree-based state generalization with temporally abstract actions
    • Uther, W., and Veloso, M. 2002. TTree: Tree-based state generalization with temporally abstract actions. In Proceedings of SARA-2002.
    • (2002) Proceedings of SARA-2002
    • Uther, W.1    Veloso, M.2
  • 17
    • 9444220031 scopus 로고    scopus 로고
    • Using online learning to analyze the opponent behavior
    • Polani, D.; Bonarini, A.; Browning, B.; and Yoshida, K., eds. Berlin: Springer Verlag. (to appear)
    • Visser, U., and Weland, H.-G. 2004. Using online learning to analyze the opponent behavior. In Polani, D.; Bonarini, A.; Browning, B.; and Yoshida, K., eds., RoboCup-2003: The Sixth RoboCup Competitions and Conferences. Berlin: Springer Verlag. (to appear).
    • (2004) RoboCup-2003: the Sixth RoboCup Competitions and Conferences
    • Visser, U.1    Weland, H.-G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.