



Volume , Issue , 2014, Pages 162-171

Universal convexification via risk-aversion

Author keywords

[No Author keywords available]

Indexed keywords

ALGORITHMS; ARTIFICIAL INTELLIGENCE; DYNAMICAL SYSTEMS; LEARNING ALGORITHMS; LEARNING SYSTEMS; MARKOV PROCESSES; STOCHASTIC SYSTEMS;

EID: 84923299173     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (7)

References (20)
  • 7
    • 84923295342
    • The complexities of optimization
    • Sebastian Bubeck. The complexities of optimization, December 2013. URL https://blogs.princeton.edu/imabandit/2013/04/25/orf523-noisy-oracles/.
    • (2013) The Complexities of Optimization
  • 8
    • 79955702502
    • LIBSVM: A library for support vector machines
    • Chih-Chung Chang and Chih-Jen Lin. LIBSVM: A library for support vector machines. ACM Transactions on Intelligent Systems and Technology, 2:27:1-27:27, 2011. Software available at http://www.csie.ntu.edu.tw/~cjlin/libsvm.
    • (2011) ACM Transactions on Intelligent Systems and Technology , vol.2 , pp. 27:1-27:27
    • Chang, C.1    Lin, C.2
  • 10
    • 0020203191
    • Optimal control and nonlinear filtering for nondegenerate diffusion processes
    • W. Fleming and S. Mitter. Optimal control and nonlinear filtering for nondegenerate diffusion processes. Stochastics, 8:226-261, 1982.
    • (1982) Stochastics , vol.8 , pp. 226-261
    • Fleming, W.1    Mitter, S.2
  • 11
    • 28844435646
    • Linear theory for control of nonlinear stochastic systems
    • H.J. Kappen. Linear theory for control of nonlinear stochastic systems. Physical Review Letters, 95(20): 200201, 2005.
    • (2005) Physical Review Letters , vol.95 , Issue.20 , pp. 200201
    • Kappen, H.J.1
  • 12
    • 0032203257
    • Gradient-based learning applied to document recognition
    • Yann LeCun, Léon Bottou, Yoshua Bengio, and Patrick Haffner. Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86 (11):2278-2324, 1998.
    • (1998) Proceedings of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • Lecun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 15
    • 0025536653
    • Generalised graduated non-convexity algorithm for maximum a posteriori image estimation
    • A. Rangarajan. Generalised graduated non-convexity algorithm for maximum a posteriori image estimation. In Proc. ICPR, pages 127-133, 1990.
    • (1990) Proc. ICPR , pp. 127-133
    • Rangarajan, A.1
  • 16
    • 0016094244
    • Optimization of stochastic linear systems with additive measurement and process noise using exponential performance criteria
    • J Speyer, John Deyst, and D Jacobson. Optimization of stochastic linear systems with additive measurement and process noise using exponential performance criteria. Automatic Control, IEEE Transactions on, 19(4):358-366, 1974.
    • (1974) Automatic Control, IEEE Transactions on , vol.19 , Issue.4 , pp. 358-366
    • Speyer, J.1    Deyst, J.2    Jacobson, D.3
  • 18
    • 79551503171
    • A generalized path integral control approach to reinforcement learning
    • Evangelos Theodorou, Jonas Buchli, and Stefan Schaal. A generalized path integral control approach to reinforcement learning. The Journal of Machine Learning Research, 11:3137-3181, 2010.
    • (2010) The Journal of Machine Learning Research , vol.11 , pp. 3137-3181
    • Theodorou, E.1    Buchli, J.2    Schaal, S.3
  • 20
    • 71149083296
    • Robot trajectory optimization using approximate inference
    • M. Toussaint. Robot trajectory optimization using approximate inference. International Conference on Machine Learning, 26:1049-1056, 2009.
    • (2009) International Conference on Machine Learning , vol.26 , pp. 1049-1056
    • Toussaint, M.1


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.