SCOPUS 정보 검색 플랫폼

Volumn 2, Issue , 2012, Pages 1133-1141

A Bayesian approach for policy learning from trajectory preference queries

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN APPROACHES; BENCH-MARK PROBLEMS; LEARNING CONTROL; POLICY LEARNING; PREFERENCE QUERIES; QUERY SELECTION; QUERYING PROCESS; RANDOM SELECTION;

BAYESIAN NETWORKS;

TRAJECTORIES;

EID: 84877732154 PISSN: 10495258 EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (186)

References (17)

2
- 0037262814
- An introduction to mcmc for machine learning
- Christophe Andrieu, Nando de Freitas, Arnaud Doucet, and Michael I. Jordan. An introduction to mcmc for machine learning. Machine Learning, 50(1-2):5-43, 2003.
- (2003) Machine Learning , vol.50 , Issue.1-2 , pp. 5-43
- Andrieu, C.¹ De Freitas, N.² Doucet, A.³ Jordan, M.I.⁴

4
- 0000626524
- Expected information as expected utility
- J M Bernardo. Expected information as expected utility. Annals of Statistics, 7(3):686-690, 1979.
- (1979) Annals of Statistics , vol.7 , Issue.3 , pp. 686-690
- Bernardo, J.M.¹

7
- 84889281816
- Wiley-Interscience, New York, NY, USA
- Thomas M. Cover and Joy A. Thomas. Elements of information theory. Wiley-Interscience, New York, NY, USA, 1991.
- (1991) Elements of Information Theory
- Cover, T.M.¹ Thomas, J.A.²

8
- 4243137056
- Hybrid monte carlo
- Simon Duane, A. D. Kennedy, Brian J. Pendleton, and Duncan Roweth. Hybrid monte carlo. Physics Letters B, 195(2):216-222, 1987.
- (1987) Physics Letters B , vol.195 , Issue.2 , pp. 216-222
- Duane, S.¹ Kennedy, A.D.² Pendleton, B.J.³ Roweth, D.⁴

9
- 0031209604
- Selective sampling using the query by committee algorithm
- Yoav Freund, H. Sebastian Seung, Eli Shamir, and Naftali Tishby. Selective sampling using the query by committee algorithm. Machine Learning, 28(2-3):133-168, 1997.
- (1997) Machine Learning , vol.28 , Issue.2-3 , pp. 133-168
- Freund, Y.¹ Seung, H.S.² Shamir, E.³ Tishby, N.⁴

10
- 4644323293
- Least-squares policy iteration
- Michail G. Lagoudakis, Ronald Parr, and L. Bartlett. Least-squares policy iteration. Journal of Machine Learning Research, 4, 2003.
- (2003) Journal of Machine Learning Research , vol.4
- Lagoudakis, M.G.¹ Parr, R.² Bartlett, L.³

11
- 0001249987
- On a measure of the information provided by an experiment
- D. V. Lindley. On a Measure of the Information Provided by an Experiment. The Annals of Mathematical Statistics, 27(4):986-1005, 1956.
- (1956) The Annals of Mathematical Statistics , vol.27 , Issue.4 , pp. 986-1005
- Lindley, D.V.¹

12
- 0042547347
- Algorithms for inverse reinforcement learning
- Andrew Y. Ng and Stuart J. Russell. Algorithms for inverse reinforcement learning. In ICML, pages 663-670, 2000.
- (2000) ICML , pp. 663-670
- Ng, A.Y.¹ Russell, S.J.²

13
- 27344432348
- Accelerating reinforcement learning through implicit imitation
- Bob Price and Craig Boutilier. Accelerating reinforcement learning through implicit imitation. J. Artif. Intell. Res. (JAIR), 19:569-629, 2003.
- (2003) J. Artif. Intell. Res. (JAIR) , vol.19 , pp. 569-629
- Price, B.¹ Boutilier, C.²

14
- 1642401055
- Learning to drive a bicycle using reinforcement learning and shaping
- Jette Randløv and Preben Alstrøm. Learning to drive a bicycle using reinforcement learning and shaping. In ICML, pages 463-471, 1998.
- (1998) ICML , pp. 463-471
- Randløv, J.¹ Alstrøm, P.²

15
- 84898995067
- Learning from demonstration
- Stefan Schaal. Learning from demonstration. In NIPS, pages 1040-1046, 1996.
- (1996) NIPS , pp. 1040-1046
- Schaal, S.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.