Volume 6911 LNAI, Issue PART 1, 2011, Pages 12-27

Preference-based policy learning

Author keywords

[No Author keywords available]

Indexed keywords

EXPERIMENTAL VALIDATIONS; INVERSE-OPTIMAL CONTROL; MACHINE-LEARNING; POLICY SEARCH; PREFERENCE-BASED; ROBOT SIMULATORS; SINGLE ROBOTS; MACHINE LEARNING APPROACHES; POLICY LEARNING;

EID: 80052395875     PISSN: 03029743     EISSN: 16113349     Source Type: Book Series    
DOI: 10.1007/978-3-642-23780-5_11     Document Type: Conference Paper
Times cited: 109

References (29)
  • 2
    • 16344383127
    • Convergence Results for the (1,λ)-SA-ES using the Theory of φ-Irreducible Markov Chains
    • Auger, A.: Convergence Results for the (1,λ)-SA-ES using the Theory of φ-irreducible Markov Chains. Theoretical Computer Science 334(1-3), 35-69 (2005)
    • (2005) Theoretical Computer Science , vol.334 , Issue.1-3 , pp. 35-69
    • Auger, A.1
  • 3
    • 1942450674
    • A Framework for Behavioural Cloning
    • Furukawa, K., Michie, D., Muggleton, S. (eds.) Oxford University Press, Oxford
    • Bain, M., Sammut, C.: A Framework for Behavioural Cloning. In: Furukawa, K., Michie, D., Muggleton, S. (eds.) Machine Intelligence, vol. 15, pp. 103-129. Oxford University Press, Oxford (1995)
    • (1995) Machine Intelligence , vol.15 , pp. 103-129
    • Bain, M.1    Sammut, C.2
  • 5
    • 80052401394
    • Bredeche, N.: http://www.lri.fr/~bredeche/roborobo/
    • Bredeche, N.1
  • 6
    • 85161994270
    • Active Preference Learning with Discrete Choice Data
    • Brochu, E., de Freitas, N., Ghosh, A.: Active Preference Learning with Discrete Choice Data. In: Proc. NIPS 20, pp. 409-416 (2008)
    • (2008) Proc. NIPS 20 , pp. 409-416
    • Brochu, E.1    De Freitas, N.2    Ghosh, A.3
  • 9
    • 55249109544
    • The Forgetron: A Kernel-Based Perceptron on a Budget
    • Dekel, O., Shalev-Shwartz, S., Singer, Y.: The Forgetron: A Kernel-Based Perceptron on a Budget. SIAM J. Comput. 37, 1342-1372 (2008)
    • (2008) SIAM J. Comput. , vol.37 , pp. 1342-1372
    • Dekel, O.1    Shalev-Shwartz, S.2    Singer, Y.3
  • 10
    • 0030649484
    • Solving the Multiple-Instance Problem with Axis-Parallel Rectangles
    • Dietterich, T.G., Lathrop, R., Lozano-Perez, T.: Solving the Multiple-Instance Problem with Axis-Parallel Rectangles. Artif. Intelligence 89(1-2), 31-71 (1997)
    • (1997) Artif. Intelligence , vol.89 , Issue.1-2 , pp. 31-71
    • Dietterich, T.G.1    Lathrop, R.2    Lozano-Perez, T.3
  • 12
    • 31844446804
    • A Support Vector Method for Multivariate Performance Measures
    • De Raedt, L., Wrobel, S. (eds.) ACM, New York
    • Joachims, T.: A Support Vector Method for Multivariate Performance Measures. In: De Raedt, L., Wrobel, S. (eds.) Proc. 22nd ICML. ACM Intl. Conf. Proc. Series, vol. 119, pp. 377-384. ACM, New York (2005)
    • (2005) Proc. 22nd ICML. ACM Intl. Conf. Proc. Series , vol.119 , pp. 377-384
    • Joachims, T.1
  • 13
    • 33749563073
    • Training Linear SVMs in Linear Time
    • Eliassi-Rad, T., et al. (eds.) ACM, New York
    • Joachims, T.: Training Linear SVMs in Linear Time. In: Eliassi-Rad, T., et al. (eds.) Proc. 12th Intl. Conf. KDDM, pp. 217-226. ACM, New York (2006)
    • (2006) Proc. 12th Intl. Conf. KDDM , pp. 217-226
    • Joachims, T.1
  • 14
    • 85042697846
    • Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion
    • MIT Press, Cambridge
    • Zico Kolter, J., Abbeel, P., Ng, A.Y.: Hierarchical Apprenticeship Learning with Application to Quadruped Locomotion. In: Proc. NIPS 20. MIT Press, Cambridge (2007)
    • (2007) Proc. NIPS 20
    • Zico Kolter, J.1    Abbeel, P.2    Ng, A.Y.3
  • 15
    • 84874610503
    • Exploiting Open-Endedness to Solve Problems Through the Search for Novelty
    • Lehman, J., Stanley, K.O.: Exploiting Open-Endedness to Solve Problems Through the Search for Novelty. In: Proc. Artificial Life XI, pp. 329-336 (2008)
    • (2008) Proc. Artificial Life XI , pp. 329-336
    • Lehman, J.1    Stanley, K.O.2
  • 16
    • 84874610503
    • Exploiting Open-Endedness to Solve Problems through the Search for Novelty
    • MIT Press, Cambridge
    • Lehman, J., Stanley, K.O.: Exploiting Open-Endedness to Solve Problems through the Search for Novelty. In: Proc. ALife 2008, MIT Press, Cambridge (2008)
    • (2008) Proc. ALife 2008
    • Lehman, J.1    Stanley, K.O.2
  • 17
    • 85162484380
    • Feature Construction for Inverse Reinforcement Learning
    • Levine, S., Popovic, Z., Koltun, V.: Feature Construction for Inverse Reinforcement Learning. In: Proc. NIPS 23, pp. 1342-1350 (2010)
    • (2010) Proc. NIPS , vol.23 , pp. 1342-1350
    • Levine, S.1    Popovic, Z.2    Koltun, V.3
  • 18
    • 78650262449
    • Modeling and Optimization of Adaptive Foraging in Swarm Robotic Systems
    • Liu, W., Winfield, A.F.T.: Modeling and Optimization of Adaptive Foraging in Swarm Robotic Systems. Intl. J. Robotic Research 29(14), 1743-1760 (2010)
    • (2010) Intl. J. Robotic Research , vol.29 , Issue.14 , pp. 1743-1760
    • Liu, W.1    Winfield, A.F.T.2
  • 19
    • 0042547347
    • Algorithms for Inverse Reinforcement Learning
    • Langley, P. (ed.) Morgan Kaufmann, San Francisco
    • Ng, A.Y., Russell, S.: Algorithms for Inverse Reinforcement Learning. In: Langley, P. (ed.) Proc. 17th ICML, pp. 663-670. Morgan Kaufmann, San Francisco (2000)
    • (2000) Proc. 17th ICML , pp. 663-670
    • Ng, A.Y.1    Russell, S.2
  • 20
    • 44949241322
    • Reinforcement Learning of Motor Skills with Policy Gradients
    • Peters, J., Schaal, S.: Reinforcement Learning of Motor Skills with Policy Gradients. Neural Networks 21(4), 682-697 (2008)
    • (2008) Neural Networks , vol.21 , Issue.4 , pp. 682-697
    • Peters, J.1    Schaal, S.2
  • 21
    • 84864069017
    • Efficient Learning of Sparse Representations with an Energy-Based Model
    • Schölkopf, B., Platt, J.C., Hoffman, T. (eds.) MIT Press, Cambridge
    • Ranzato, M.-A., Poultney, C.S., Chopra, S., LeCun, Y.: Efficient Learning of Sparse Representations with an Energy-Based Model. In: Schölkopf, B., Platt, J.C., Hoffman, T. (eds.) Proc. NIPS 19, pp. 1137-1144. MIT Press, Cambridge (2006)
    • (2006) Proc. NIPS , vol.19 , pp. 1137-1144
    • Ranzato, M.-A.1    Poultney, C.S.2    Chopra, S.3    LeCun, Y.4
  • 24
    • 77954086905
    • Energy-efficient Indoor Search by Swarms of Simulated Flying Robots without Global Information
    • Stirling, T.S., Wischmann, S., Floreano, D.: Energy-efficient Indoor Search by Swarms of Simulated Flying Robots without Global Information. Swarm Intelligence 4(2), 117-143 (2010)
    • (2010) Swarm Intelligence , vol.4 , Issue.2 , pp. 117-143
    • Stirling, T.S.1    Wischmann, S.2    Floreano, D.3
  • 25
    • 33749255382
    • PAC Model-free Reinforcement Learning
    • Airoldi, E.M., Blei, D.M., Fienberg, S.E., Goldenberg, A., Xing, E.P., Zheng, A.X. (eds.) ICML 2006. Springer, Heidelberg
    • Strehl, A.L., Li, L., Wiewiora, E., Langford, J., Littman, M.L.: PAC Model-free Reinforcement Learning. In: Airoldi, E.M., Blei, D.M., Fienberg, S.E., Goldenberg, A., Xing, E.P., Zheng, A.X. (eds.) ICML 2006. LNCS, vol. 4503, pp. 881-888. Springer, Heidelberg (2007)
    • (2007) LNCS , vol.4503 , pp. 881-888
    • Strehl, A.L.1    Li, L.2    Wiewiora, E.3    Langford, J.4    Littman, M.L.5
  • 27
    • 85162012324
    • A Game-Theoretic Approach to Apprenticeship Learning
    • MIT Press, Cambridge
    • Syed, U., Schapire, R.: A Game-Theoretic Approach to Apprenticeship Learning. In: Proc. NIPS 21, pp. 1449-1456. MIT Press, Cambridge (2008)
    • (2008) Proc. NIPS , vol.21 , pp. 1449-1456
    • Syed, U.1    Schapire, R.2
  • 28
    • 70350150649
    • Improvements on Learning Tetris with Cross Entropy
    • Thiery, C., Scherrer, B.: Improvements on Learning Tetris with Cross Entropy. ICGA Journal 32(1), 23-33 (2009)
    • (2009) ICGA Journal , vol.32 , Issue.1 , pp. 23-33
    • Thiery, C.1    Scherrer, B.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.