SCOPUS Information Retrieval Platform

Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

Volume 6911 LNAI, Issue PART 1, 2011, Pages 12-27

Preference-based policy learning

(3) Akrour, Riad ᵃ; Schoenauer, Marc ᵃ; Sebag, Michele ᵃ

ᵃ CNRS (France)

Author keywords

[No Author keywords available]

Indexed keywords

EXPERIMENTAL VALIDATIONS; INVERSE-OPTIMAL CONTROL; MACHINE-LEARNING; POLICY SEARCH; PREFERENCE-BASED; ROBOT SIMULATORS; SINGLE ROBOTS; MACHINE LEARNING APPROACHES; POLICY LEARNING;

ESTIMATION; LEARNING SYSTEMS; ROBOTICS; DATA MINING; INVERSE PROBLEMS; ROBOTS;

ROBOTS; REINFORCEMENT LEARNING;

EID: 80052395875 PISSN: 03029743 EISSN: 16113349 Source Type: Book Series
DOI: 10.1007/978-3-642-23780-5_11 Document Type: Conference Paper

Times cited : (109)

References (29)

1
- 14344251217
- Apprenticeship Learning via Inverse Reinforcement Learning
- Brodley, C.E. (ed.) ACM, New York
- Abbeel, P., Ng, A.Y.: Apprenticeship Learning via Inverse Reinforcement Learning. In: Brodley, C.E. (ed.) Proc. 21st Intl. Conf. on Machine Learning (ICML 2004). ACM Intl. Conf. Proc. Series, vol. 69, p. 1. ACM, New York (2004)
- (2004) Proc. 21st Intl. Conf. on Machine Learning (ICML 2004). ACM Intl. Conf. Proc. Series , vol.69 , pp. 1
- Abbeel, P.¹ Ng, A.Y.²

2
- 16344383127
- Convergence Results for the (1,λ)-SA-ES using the Theory of φ- Irreducible Markov Chains
- Auger, A.: Convergence Results for the (1,λ)-SA-ES using the Theory of φ- irreducible Markov Chains. Theoretical Computer Science 334(1-3), 35-69 (2005)
- (2005) Theoretical Computer Science , vol.334 , Issue.1-3 , pp. 35-69
- Auger, A.¹

3
- 1942450674
- A Framework for Behavioural Cloning
- Furukawa, K., Michie, D., Muggleton, S. (eds.) Oxford University Press, Oxford
- Bain, M., Sammut, C.: A Framework for Behavioural Cloning. In: Furukawa, K., Michie, D., Muggleton, S. (eds.) Machine Intelligence, vol. 15, pp. 103-129. Oxford University Press, Oxford (1995)
- (1995) Machine Intelligence , vol.15 , pp. 103-129
- Bain, M.¹ Sammut, C.²

4
- 80052420582
- MIT Press, Cambridge
- Bakir, G., Hofmann, T., Schölkopf, B., Smola, A.J., Taskar, B., Vishwanathan, S.V.N.: Machine Learning with Structured Outputs. MIT Press, Cambridge (2006)
- (2006) Machine Learning with Structured Outputs
- Bakir, G.¹ Hofmann, T.² Schölkopf, B.³ Smola, A.J.⁴ Taskar, B.⁵ Vishwanathan, S.V.N.⁶

5
- 80052401394
- Bredeche, N.: http://www.lri.fr/~bredeche/roborobo/
- Bredeche, N.¹

6
- 85161994270
- Active Preference Learning with Discrete Choice Data
- Brochu, E., de Freitas, N., Ghosh, A.: Active Preference Learning with Discrete Choice Data. In: Proc. NIPS 20, pp. 409-416 (2008)
- (2008) Proc. NIPS 20 , pp. 409-416
- Brochu, E.¹ De Freitas, N.² Ghosh, A.³

7
- 34047173490
- On Learning, Representing and Generalizing a Task in a Humanoid Robot
- Calinon, S., Guenter, F., Billard, A.: On Learning, Representing and Generalizing a Task in a Humanoid Robot. IEEE Trans. on Systems, Man and Cybernetics, Special Issue on Robot Learning by Observation, Demonstration and Imitation 37(2), 286-298 (2007)
- (2007) IEEE Trans. on Systems, Man and Cybernetics, Special Issue on Robot Learning by Observation, Demonstration and Imitation , vol.37 , Issue.2 , pp. 286-298
- Calinon, S.¹ Guenter, F.² Billard, A.³

8
- 17444409624
- A Tutorial on the Cross-Entropy Method
- de Boer, P.-T., Kroese, D.P., Mannor, S., Rubinstein, R.Y.: A Tutorial on the Cross-Entropy Method. Annals OR 134(1), 19-67 (2005)
- (2005) Annals OR , vol.134 , Issue.1 , pp. 19-67
- De Boer, P.-T.¹ Kroese, D.P.² Mannor, S.³ Rubinstein, R.Y.⁴

9
- 55249109544
- The Forgetron: A Kernel-Based Perceptron on a Budget
- Dekel, O., Shalev-Shwartz, S., Singer, Y.: The Forgetron: A Kernel-Based Perceptron on a Budget. SIAM J. Comput. 37, 1342-1372 (2008)
- (2008) SIAM J. Comput. , vol.37 , pp. 1342-1372
- Dekel, O.¹ Shalev-Shwartz, S.² Singer, Y.³

10
- 0030649484
- Solving the Multiple-Instance Problem with Axis-Parallel Rectangles
- Dietterich, T.G., Lathrop, R., Lozano-Perez, T.: Solving the Multiple-Instance Problem with Axis-Parallel Rectangles. Artif. Intelligence 89(1-2), 31-71 (1997)
- (1997) Artif. Intelligence , vol.89 , Issue.1-2 , pp. 31-71
- Dietterich, T.G.¹ Lathrop, R.² Lozano-Perez, T.³

11
- 0003472470
- John Wiley and Sons, Menlo Park, CA
- Duda, R.O., Hart, P.E.: Pattern Classification and Scene Analysis. John Wiley and Sons, Menlo Park, CA (1973)
- (1973) Pattern Classification and Scene Analysis
- Duda, R.O.¹ Hart, P.E.²

12
- 31844446804
- A Support Vector Method for Multivariate Performance Measures
- De Raedt, L., Wrobel, S. (eds.) ACM, New York
- Joachims, T.: A Support Vector Method for Multivariate Performance Measures. In: De Raedt, L., Wrobel, S. (eds.) Proc. 22nd ICML. ACM Intl. Conf. Proc. Series, vol. 119, pp. 377-384. ACM, New York (2005)
- (2005) Proc. 22nd ICML. ACM Intl. Conf. Proc. Series , vol.119 , pp. 377-384
- Joachims, T.¹

13
- 33749563073
- Training Linear SVMs in Linear Time
- Eliassi-Rad, T., et al. (eds.) ACM, New York
- Joachims, T.: Training Linear SVMs in Linear Time. In: Eliassi-Rad, T., et al. (eds.) Proc. 12th Intl. Conf. KDDM, pp. 217-226. ACM, New York (2006)
- (2006) Proc. 12th Intl. Conf. KDDM , pp. 217-226
- Joachims, T.¹

14
- 85042697846
- Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
- MIT Press, Cambridge
- Zico Kolter, J., Abbeel, P., Ng, A.Y.: Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion. In: Proc. NIPS 20. MIT Press, Cambridge (2007)
- (2007) Proc. NIPS 20
- Zico Kolter, J.¹ Abbeel, P.² Ng, A.Y.³

15
- 84874610503
- Exploiting Open-Endedness to Solve Problems Through the Search for Novelty
- Lehman, J., Stanley, K.O.: Exploiting Open-Endedness to Solve Problems Through the Search for Novelty. In: Proc. Artificial Life XI, pp. 329-336 (2008)
- (2008) Proc. Artificial Life XI , pp. 329-336
- Lehman, J.¹ Stanley, K.O.²

16
- 84874610503
- Exploiting Open-Endedness to Solve Problems through the Search for Novelty
- MIT Press, Cambridge
- Lehman, J., Stanley, K.O.: Exploiting Open-Endedness to Solve Problems through the Search for Novelty. In: Proc. ALife 2008, MIT Press, Cambridge (2008)
- (2008) Proc. ALife 2008
- Lehman, J.¹ Stanley, K.O.²

17
- 85162484380
- Feature Construction for Inverse Reinforcement Learning
- Levine, S., Popovic, Z., Koltun, V.: Feature Construction for Inverse Reinforcement Learning. In: Proc. NIPS 23, pp. 1342-1350 (2010)
- (2010) Proc. NIPS , vol.23 , pp. 1342-1350
- Levine, S.¹ Popovic, Z.² Koltun, V.³

18
- 78650262449
- Modeling and Optimization of Adaptive Foraging in Swarm Robotic Systems
- Liu, W., Winfield, A.F.T.: Modeling and Optimization of Adaptive Foraging in Swarm Robotic Systems. Intl. J. Robotic Research 29(14), 1743-1760 (2010)
- (2010) Intl. J. Robotic Research , vol.29 , Issue.14 , pp. 1743-1760
- Liu, W.¹ Winfield, A.F.T.²

19
- 0042547347
- Algorithms for Inverse Reinforcement Learning
- Langley, P. (ed.) Morgan Kaufmann, San Francisco
- Ng, A.Y., Russell, S.: Algorithms for Inverse Reinforcement Learning. In: Langley, P. (ed.) Proc. 17th ICML, pp. 663-670. Morgan Kaufmann, San Francisco (2000)
- (2000) Proc. 17th ICML , pp. 663-670
- Ng, A.Y.¹ Russell, S.²

20
- 44949241322
- Reinforcement Learning of Motor Skills with Policy Gradients
- Peters, J., Schaal, S.: Reinforcement Learning of Motor Skills with Policy Gradients. Neural Networks 21(4), 682-697 (2008)
- (2008) Neural Networks , vol.21 , Issue.4 , pp. 682-697
- Peters, J.¹ Schaal, S.²

21
- 84864069017
- Efficient Learning of Sparse Representations with an Energy-Based Model
- Schölkopf, B., Platt, J.C., Hoffman, T. (eds.) MIT Press, Cambridge
- Ranzato, M.-A., Poultney, C.S., Chopra, S., LeCun, Y.: Efficient Learning of Sparse Representations with an Energy-Based Model. In: Schölkopf, B., Platt, J.C., Hoffman, T. (eds.) Proc. NIPS 19, pp. 1137-1144. MIT Press, Cambridge (2006)
- (2006) Proc. NIPS , vol.19 , pp. 1137-1144
- Ranzato, M.-A.¹ Poultney, C.S.² Chopra, S.³ LeCun, Y.⁴

22
- 38649089443
- Robotic Grasping of Novel Objects using Vision
- Saxena, A., Driemeyer, J., Ng, A.Y.: Robotic Grasping of Novel Objects using Vision. Intl. J. Robotics Research (2008)
- (2008) Intl. J. Robotics Research
- Saxena, A.¹ Driemeyer, J.² Ng, A.Y.³

23
- 0003524416
- John Wiley & Sons, New York 2nd edn.
- Schwefel, H.-P.: Numerical Optimization of Computer Models. John Wiley & Sons, New York (1981) 2nd edn. (1995)
- (1981) Numerical Optimization of Computer Models
- Schwefel, H.-P.¹

24
- 77954086905
- Energy-efficient Indoor Search by Swarms of Simulated Flying Robots without Global Information
- Stirling, T.S., Wischmann, S., Floreano, D.: Energy-efficient Indoor Search by Swarms of Simulated Flying Robots without Global Information. Swarm Intelligence 4(2), 117-143 (2010)
- (2010) Swarm Intelligence , vol.4 , Issue.2 , pp. 117-143
- Stirling, T.S.¹ Wischmann, S.² Floreano, D.³

25
- 33749255382
- PAC Model-free Reinforcement Learning
- Airoldi, E.M., Blei, D.M., Fienberg, S.E., Goldenberg, A., Xing, E.P., Zheng, A.X. (eds.) ICML 2006. Springer, Heidelberg
- Strehl, A.L., Li, L., Wiewiora, E., Langford, J., Littman, M.L.: PAC Model-free Reinforcement Learning. In: Airoldi, E.M., Blei, D.M., Fienberg, S.E., Goldenberg, A., Xing, E.P., Zheng, A.X. (eds.) ICML 2006. LNCS, vol. 4503, pp. 881-888. Springer, Heidelberg (2007)
- (2007) LNCS , vol.4503 , pp. 881-888
- Strehl, A.L.¹ Li, L.² Wiewiora, E.³ Langford, J.⁴ Littman, M.L.⁵

26
- 0004102479
- MIT Press, Cambridge
- Sutton, R., Barto, A.: Reinforcement Learning: An Introduction. MIT Press, Cambridge (1998)
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

27
- 85162012324
- A Game-Theoretic Approach to Apprenticeship Learning
- MIT Press, Cambridge
- Syed, U., Schapire, R.: A Game-Theoretic Approach to Apprenticeship Learning. In: Proc. NIPS 21, pp. 1449-1456. MIT Press, Cambridge (2008)
- (2008) Proc. NIPS , vol.21 , pp. 1449-1456
- Syed, U.¹ Schapire, R.²

28
- 70350150649
- Improvements on Learning Tetris with Cross Entropy
- Thiery, C., Scherrer, B.: Improvements on Learning Tetris with Cross Entropy. ICGA Journal 32(1), 23-33 (2009)
- (2009) ICGA Journal , vol.32 , Issue.1 , pp. 23-33
- Thiery, C.¹ Scherrer, B.²

29
- 30944441110
- Cooperative Hole Avoidance in a Swarm-bot
- Trianni, V., Nolfi, S., Dorigo, M.: Cooperative Hole Avoidance in a Swarm-bot. Robotics and Autonomous Systems 54(2), 97-103 (2006)
- (2006) Robotics and Autonomous Systems , vol.54 , Issue.2 , pp. 97-103
- Trianni, V.¹ Nolfi, S.² Dorigo, M.³

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.