메뉴 건너뛰기




Volumn , Issue , 2012, Pages

Autonomous reinforcement learning on raw visual input data in a real world application

Author keywords

[No Author keywords available]

Indexed keywords

CONTROL POLICY; HIGH-DIMENSIONAL; HUMAN PLAYERS; INPUT DATAS; LEARNING ARCHITECTURES; PROOF OF CONCEPT; REAL-WORLD APPLICATION; VISUAL CONTROL;

EID: 84865083902     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: 10.1109/IJCNN.2012.6252823     Document Type: Conference Paper
Times cited : (231)

References (38)
  • 1
    • 67650996818 scopus 로고    scopus 로고
    • Reinforcement learning for robot soccer
    • M. Riedmiller, T. Gabel, R. Hafner, and S. Lange, "Reinforcement Learning for Robot Soccer," Autonomous Robots, vol. 27, no. 1, pp. 55-74, 2009.
    • (2009) Autonomous Robots , vol.27 , Issue.1 , pp. 55-74
    • Riedmiller, M.1    Gabel, T.2    Hafner, R.3    Lange, S.4
  • 6
    • 84865067679 scopus 로고    scopus 로고
    • Reinforcement learning in feedback control
    • 10.1007/sI0994-011-5235-x. [Online]
    • R. Hafner and M. Riedmiller, "Reinforcement learning in feedback control," Machine Learning, vol. 27, no. 1, pp. 55-74, 2011, 10.1007/sI0994-011-5235-x. [Online]. Available: http://dx.doLorg/IO.1007/s10994- 011-5235-x
    • (2011) Machine Learning , vol.27 , Issue.1 , pp. 55-74
    • Hafner, R.1    Riedmiller, M.2
  • 7
    • 33746600649 scopus 로고    scopus 로고
    • Reducing the dimensionality of data with neural networks
    • G. Hinton and R. Salakhutdinov, "Reducing the Dimensionality of Data with Neural Networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
    • (2006) Science , vol.313 , Issue.5786 , pp. 504-507
    • Hinton, G.1    Salakhutdinov, R.2
  • 16
    • 34547997615 scopus 로고    scopus 로고
    • Learning a nonlinear embedding by preserving class neighbourhood structure
    • R. R. Salakhutdinov and G. E. Hinton, "Learning a nonlinear embedding by preserving class neighbourhood structure," in AI and Statistics, 2007.
    • (2007) AI and Statistics
    • Salakhutdinov, R.R.1    Hinton, G.E.2
  • 17
    • 34948870900 scopus 로고    scopus 로고
    • Unsupervised learning of invariant feature hierarchies with applications to object recognition
    • M. Ranzato, F. 1. Huang, Y.-L. Boureau, and Y. LeCun, "Unsupervised learning of invariant feature hierarchies with applications to object recognition," in Proc. of CVPR '07.,2007.
    • (2007) Proc. of CVPR '07
    • Ranzato, M.1    Huang, F.J.2    Boureau, Y.-L.3    Lecun, Y.4
  • 18
    • 78649669320 scopus 로고    scopus 로고
    • Deep big simple neural nets excel on handwritten digit recognition
    • D. C. Ciresan, U. Meier, L. M. Gambardella, and J. Schmidhuber, "Deep big simple neural nets excel on handwritten digit recognition," Neural Computation, vol. 22, no. 12, pp. 3207-3220, 2010.
    • (2010) Neural Computation , vol.22 , Issue.12 , pp. 3207-3220
    • Ciresan, D.C.1    Meier, U.2    Gambardella, L.M.3    Schmidhuber, J.4
  • 21
    • 84863380535 scopus 로고    scopus 로고
    • Unsupervised feature learning for audio classification using convolutional deep belief networks
    • Y. Bengio, D. Schuurmans, 1. Lafferty, C. K. I. Williams, and A. Culotta, Eds.
    • H. Lee,P. Pham,Y. Largman,a nd A. Ng," Unsupervised feature learning for audio classification using convolutional deep belief networks;' in Advances in Neural Information Processing Systems 22, Y. Bengio, D. Schuurmans, 1. Lafferty, C. K. I. Williams, and A. Culotta, Eds., 2009, pp. 1096-1104.
    • (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 1096-1104
    • Lee, H.1    Pham, P.2    Largman, Y.3    Ng, A.4
  • 24
    • 33646430006 scopus 로고    scopus 로고
    • Extremely randomized trees
    • P. Geurts, D. Ernst, and L. Wehenkel, "Extremely randomized trees," Machine Learning, vol. 63, no. 1, pp. 3-42,2006.
    • (2006) Machine Learning , vol.63 , Issue.1 , pp. 3-42
    • Geurts, P.1    Ernst, D.2    Wehenkel, L.3
  • 31
    • 84873574800 scopus 로고    scopus 로고
    • Batch reinforcement learning
    • M. Wiering and M. van Otterlo, Eds. Springer, in press, in press
    • S. Lange,T. Gabel,a nd M. Riedmiller," Batch Reinforcement Learning," in Reinforcement Learning: State of the Art, M. Wiering and M. van Otterlo, Eds. Springer, in press, 2011, in press.
    • (2011) Reinforcement Learning: State of the Art
    • Lange, S.1    Gabel, T.2    Riedmiller, M.3
  • 32
    • 0019152630 scopus 로고
    • Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position
    • K. Fukushima, "Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position," Biological Cybernetics, vol. 36, no. 4, pp. 193-202, 1980.
    • (1980) Biological Cybernetics , vol.36 , Issue.4 , pp. 193-202
    • Fukushima, K.1
  • 33
    • 0032203257 scopus 로고    scopus 로고
    • Gradient-based learning applied to document recognition
    • Y. LeCun,L. Bottou,Y. Bengio,a nd P. Haffner," Gradient-based learning applied to document recognition," Proc. of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
    • (1998) Proc. of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
    • Lecun, Y.1    Bottou, L.2    Bengio, Y.3    Haffner, P.4
  • 34
    • 84943274699 scopus 로고
    • A direct adaptive method for faster backpropagation learning: The RPROP algorithm
    • M. Riedmiller and H. Braun, "A Direct Adaptive Method for Faster Backpropagation Learning: The RPROP Algorithm," in Proc. of the Int. Con/. on Neural Networks, 1993, pp. 586-591.
    • (1993) Proc. of the Int. Conf. on Neural Networks , pp. 586-591
    • Riedmiller, M.1    Braun, H.2
  • 35
    • 0036832956 scopus 로고    scopus 로고
    • Kernel-based reinforcement learning
    • D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2, pp. 161-178,2002.
    • (2002) Machine Learning , vol.49 , Issue.2 , pp. 161-178
    • Ormoneit, D.1    Sen, S.2
  • 36
    • 0000549293 scopus 로고    scopus 로고
    • Self-organizing maps
    • zweite edition ed., ser. Springer, Heidelberg
    • T. Kohonen, Self-Organizing Maps, zweite edition ed., ser. Springer Series in Information Sciences. Springer, Heidelberg, 1997, vol. 30.
    • (1997) Springer Series in Information Sciences , vol.30
    • Kohonen, T.1
  • 37
    • 84865084454 scopus 로고    scopus 로고
    • Effizient klassifizieren und clustern: lernparadigmen von vektorquantisierern
    • B. Hammer and T. Villmann, "Effizient Klassifizieren und Clustern: Lernparadigmen von Vektorquantisierern," Kiinstliche Intelligenz, vol. 6, no. 3, pp. 5-11, 2006.
    • (2006) Kiinstliche Intelligenz , vol.6 , Issue.3 , pp. 5-11
    • Hammer, B.1    Villmann, T.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.