메뉴 건너뛰기




Volumn , Issue , 2009, Pages 1746-1753

ReTrASE: Integrating paradigms for approximate probabilistic planning

Author keywords

[No Author keywords available]

Indexed keywords

COMPUTATION THEORY; FUNCTIONS;

EID: 78650599203     PISSN: 10450823     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (21)

References (27)
  • 1
    • 13444288352 scopus 로고    scopus 로고
    • Decision-theoretic military operations planning
    • D. Aberdeen, S. Thiebaux, and L. Zhang. Decision-theoretic military operations planning. In ICAPS'04, 2004.
    • (2004) ICAPS'04
    • Aberdeen, D.1    Thiebaux, S.2    Zhang, L.3
  • 2
    • 0029210635 scopus 로고
    • Learning to act using real-time dynamic programming
    • A. Barto, S. Bradtke, and S. Singh. Learning to act using real-time dynamic programming. Artificial Intelligence, 72:81-138, 1995.
    • (1995) Artificial Intelligence , vol.72 , pp. 81-138
    • Barto, A.1    Bradtke, S.2    Singh, S.3
  • 5
    • 9444233135 scopus 로고    scopus 로고
    • Labeled RTDP: Improving the convergence of real-time dynamic programming
    • B. Bonet and H. Geffner. Labeled RTDP: Improving the convergence of real-time dynamic programming. In ICAPS'03, pages 12-21, 2003.
    • (2003) ICAPS'03 , pp. 12-21
    • Bonet, B.1    Geffner, H.2
  • 8
    • 33750713477 scopus 로고    scopus 로고
    • Symbolic heuristic search for factored Markov decision processes
    • Z. Feng and E. Hansen. Symbolic heuristic search for factored Markov decision processes. In AAAI'02, 2002.
    • (2002) AAAI'02
    • Feng, Z.1    Hansen, E.2
  • 10
    • 84880694195 scopus 로고
    • Stable function approximation in dynamic programming
    • Geoff Gordon. Stable function approximation in dynamic programming. In ICML, pages 261-268, 1995.
    • (1995) ICML , pp. 261-268
    • Gordon, G.1
  • 11
    • 44449170889 scopus 로고    scopus 로고
    • Exploiting first-order regression in inductive policy selection
    • C. Gretton and S. Thiebaux. Exploiting first-order regression in inductive policy selection. In UAI'04, 2004.
    • (2004) UAI'04
    • Gretton, C.1    Thiebaux, S.2
  • 12
    • 84880803349 scopus 로고    scopus 로고
    • Generalizing plans to new environments in relational MDPs
    • Acapulco, Mexico
    • C. Guestrin, D. Koller, C. Gearhart, and N. Kanodia. Generalizing plans to new environments in relational MDPs. In IJCAI'03, Acapulco, Mexico, 2003.
    • (2003) IJCAI'03
    • Guestrin, C.1    Koller, D.2    Gearhart, C.3    Kanodia, N.4
  • 14
    • 0002956570 scopus 로고    scopus 로고
    • SPUDD: Stochastic planning using decision diagrams
    • J. Hoey, R. St-Aubin, A. Hu, and C. Boutilier. SPUDD: Stochastic planning using decision diagrams. In UAI'99, pages 279-288, 1999.
    • (1999) UAI'99 , pp. 279-288
    • Hoey, J.1    St-Aubin, R.2    Hu, A.3    Boutilier, C.4
  • 15
    • 0036377352 scopus 로고    scopus 로고
    • The FF planning system: Fast plan generation through heuristic search
    • J. Hoffman and B. Nebel. The FF planning system: Fast plan generation through heuristic search. Journal of Artificial Intelligence Research, 14:253-302, 2001.
    • (2001) Journal of Artificial Intelligence Research , vol.14 , pp. 253-302
    • Hoffman, J.1    Nebel, B.2
  • 16
    • 33749263205 scopus 로고    scopus 로고
    • Automatic basis function construction for approximate dynamic programming and reinforcement learning
    • Philipp Keller, Shie Mannor, and Doine Precup. Automatic basis function construction for approximate dynamic programming and reinforcement learning. In ICML'06, pages 449-456, 2006.
    • (2006) ICML'06 , pp. 449-456
    • Keller, P.1    Mannor, S.2    Precup, D.3
  • 20
    • 84880762931 scopus 로고    scopus 로고
    • Planning with continuous resources in stochastic domains
    • Mausam, E. Benazara, R. Brafman, N. Meuleau, and E. Hansen. Planning with continuous resources in stochastic domains. In IJCAI'05, page 1244, 2005.
    • (2005) IJCAI'05 , pp. 1244
    • Mausam1    Benazara, E.2    Brafman, R.3    Meuleau, N.4    Hansen, E.5
  • 21
    • 84899031196 scopus 로고    scopus 로고
    • Exponential family PCA for belief compression in POMDPs
    • MIT Press
    • Nicholas Roy and Geoffrey Gordon. Exponential family PCA for belief compression in POMDPs. In NIPS'02, pages 1043-1049. MIT Press, 2003.
    • (2003) NIPS'02 , pp. 1043-1049
    • Roy, N.1    Gordon, G.2
  • 22
    • 77957878103 scopus 로고    scopus 로고
    • Practical linear value-approximation techniques for first-order MDPs
    • S. Sanner and C. Boutilier. Practical linear value-approximation techniques for first-order MDPs. In UAI'06, 2006.
    • (2006) UAI'06
    • Sanner, S.1    Boutilier, C.2
  • 23
    • 26944499565 scopus 로고    scopus 로고
    • APRICODD: Approximate policy construction using decision diagrams
    • R. St-Aubin, J. Hoey, and C. Boutilier. APRICODD: Approximate policy construction using decision diagrams. In NIPS'00, 2000.
    • (2000) NIPS'00
    • St-Aubin, R.1    Hoey, J.2    Boutilier, C.3
  • 25
    • 58349118462 scopus 로고    scopus 로고
    • FF-Replan: A baseline for probabilistic planning
    • Sungwook Yoon, Alan Fern, and Robert Givan. FF-Replan: A baseline for probabilistic planning. In ICAPS'07, pages 352-359, 2007.
    • (2007) ICAPS'07 , pp. 352-359
    • Yoon, S.1    Fern, A.2    Givan, R.3
  • 26
    • 77957947989 scopus 로고    scopus 로고
    • Probabilistic planning via determinization in hindsight
    • Sungwook Yoon, Alan Fern, Subbarao Kambhampati, and Robert Givan. Probabilistic planning via determinization in hindsight. In AAAI'08, 2008.
    • (2008) AAAI'08
    • Yoon, S.1    Fern, A.2    Kambhampati, S.3    Givan, R.4
  • 27
    • 13444256700 scopus 로고    scopus 로고
    • Policy generation for continuous-time stochastic domains with concurrency
    • H. L. S. Younes and R. G. Simmons. Policy generation for continuous-time stochastic domains with concurrency. In ICAPS'04, page 325, 2004.
    • (2004) ICAPS'04 , pp. 325
    • Younes, H.L.S.1    Simmons, R.G.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.