Volume , Issue , 2000, Pages 1022-1028

Policy search via density estimation

Author keywords

[No Author keywords available]

Indexed keywords

MARKOV PROCESSES

EID: 84898967780     PISSN: 10495258     EISSN: None     Source Type: Conference Proceeding
DOI: None     Document Type: Conference Paper
Times cited: 23

References (9)
  • 1
    • L. Baird and A.W. Moore. Gradient descent for general reinforcement learning. In NIPS 11, 1999.
  • 3
    • X. Boyen and D. Koller. Tractable inference for complex stochastic processes. In Proc. UAI, pages 33-42, 1998.
  • 5
    • D. Koller and R. Fratkina. Using learning for approximation in stochastic processes. In Proc. ICML, pages 287-295, 1998.
  • 6
    • N. Meuleau, L. Peshkin, K.-E. Kim, and L.P. Kaelbling. Learning finite-state controllers for partially observable environments. In Proc. UAI 15, 1999.
  • 7
    • J. Randløv and P. Alstrøm. Learning to drive a bicycle using reinforcement learning and shaping. In Proc. ICML, 1998.
  • 8
    • J.K. Williams and S. Singh. Experiments with an algorithm which learns stochastic memoryless policies for POMDPs. In NIPS 11, 1999.
  • 9
    • R.J. Williams. Simple statistical gradient-following algorithms for connectionist reinforcement learning. Machine Learning, 8:229-256, 1992.


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.