SCOPUS 정보 검색 플랫폼

Proceedings of the International Joint Conference on Neural Networks

Volumn , Issue , 2012, Pages

Autonomous reinforcement learning on raw visual input data in a real world application

(3) Lange, Sascha a Riedmiller, Martin a Voigtländer, Arne b

a UNIVERSITY OF FREIBURG (Germany)

b Shoogee GmbH and Co KG (Germany)

Author keywords

[No Author keywords available]

Indexed keywords

CONTROL POLICY; HIGH-DIMENSIONAL; HUMAN PLAYERS; INPUT DATAS; LEARNING ARCHITECTURES; PROOF OF CONCEPT; REAL-WORLD APPLICATION; VISUAL CONTROL;

NEURAL NETWORKS; REINFORCEMENT LEARNING; SEMANTICS; VISION;

INPUT OUTPUT PROGRAMS;

EID: 84865083902 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: 10.1109/IJCNN.2012.6252823 Document Type: Conference Paper

Times cited : (231)

References (38)

1
- 67650996818
- Reinforcement learning for robot soccer
- M. Riedmiller, T. Gabel, R. Hafner, and S. Lange, "Reinforcement Learning for Robot Soccer," Autonomous Robots, vol. 27, no. 1, pp. 55-74, 2009.
- (2009) Autonomous Robots , vol.27 , Issue.1 , pp. 55-74
- Riedmiller, M.¹ Gabel, T.² Hafner, R.³ Lange, S.⁴

2
- 79958814014
- Adaptive reactive job-shop scheduling with reinforcement learning agents
- T. Gabel and M. Riedmiller, "Adaptive Reactive Job-Shop Scheduling with Reinforcement Learning Agents," International Journal of Information Technology and Intelligent Computing, vol. 24, no. 4, 2008.
- (2008) International Journal of Information Technology and Intelligent Computing , vol.24 , Issue.4
- Gabel, T.¹ Riedmiller, M.²

3
- 77950832758
- The neuro slot car racer: Reinforcement learning in a real world setting
- T. Kietzmann and M. Riedmiller, "The Neuro Slot Car Racer: Reinforcement Learning in a Real World Setting," in Proc. of the 8th Int. Con! on Machine Learning and Applications, 2009.
- (2009) Proc. of the 8th Int. Con! on Machine Learning and Applications
- Kietzmann, T.¹ Riedmiller, M.²

4
- 21844465127
- Tree-based batch mode reinforcement learning
- D. Ernst, P. Geurts, and L. Wehenkel, "Tree-Based Batch Mode Reinforcement Learning," Journal of Machine Learning Research, vol. 6, no. 1, pp. 503-556, 2006.
- (2006) Journal of Machine Learning Research , vol.6 , Issue.1 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

5
- 84865105064
- Learning to drive in 20 minutes
- Jeju, Korea
- M. Riedmiller, M. Montemerlo, and H. Dahlkamp, "Learning to Drive in 20 Minutes," in Proc. of the FBlT 2007, Jeju, Korea, 2007.
- (2007) Proc. of the FBlT 2007
- Riedmiller, M.¹ Montemerlo, M.² Dahlkamp, H.³

6
- 84865067679
- Reinforcement learning in feedback control
- 10.1007/sI0994-011-5235-x. [Online]
- R. Hafner and M. Riedmiller, "Reinforcement learning in feedback control," Machine Learning, vol. 27, no. 1, pp. 55-74, 2011, 10.1007/sI0994-011-5235-x. [Online]. Available: http://dx.doLorg/IO.1007/s10994- 011-5235-x
- (2011) Machine Learning , vol.27 , Issue.1 , pp. 55-74
- Hafner, R.¹ Riedmiller, M.²

7
- 33746600649
- Reducing the dimensionality of data with neural networks
- G. Hinton and R. Salakhutdinov, "Reducing the Dimensionality of Data with Neural Networks," Science, vol. 313, no. 5786, pp. 504-507, 2006.
- (2006) Science , vol.313 , Issue.5786 , pp. 504-507
- Hinton, G.¹ Salakhutdinov, R.²

8
- 84864073449
- Greedy Layer-wise training of deep networks
- Y. Bengio, P. Lamblin, D. P opovici, H. Larochelle, and Q. Montreal, "Greedy Layer-Wise Training of Deep Networks," in Advances in Neural Information Processing Systems 19, 2007, pp. 153-160.
- (2007) Advances in Neural Information Processing Systems , vol.19 , pp. 153-160
- Bengio, Y.¹ Lamblin, P.² Popovici, D.³ Larochelle, H.⁴ Montreal, Q.⁵

9
- 56449089103
- Extracting and composing robust features with denoising autoencoders
- ACM
- P. Vincent, H. Larochelle, Y. Bengio, and P. Manzagol, "Extracting and composing robust features with denoising autoencoders," in Proceedings of the 25th international conference on Machine learning. ACM,2008, pp. 1096-1103.
- (2008) Proceedings of the 25th International Conference on Machine Learning , pp. 1096-1103
- Vincent, P.¹ Larochelle, H.² Bengio, Y.³ Manzagol, P.⁴

10
- 24644436425
- Learning a similarity metric discriminatively, with application to face verification
- june, vol. 1
- S. Chopra, R. Hadsell, and Y. LeCun, "Learning a similarity metric discriminatively, with application to face verification," in Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on, vol. 1, june 2005, pp. 539 - 546 vol. 1.
- (2005) Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on , vol.1 , pp. 539-546
- Chopra, S.¹ Hadsell, R.² Lecun, Y.³

11
- 71149084945
- Deep learning from temporal coherence in video
- New York, NY, USA: ACM
- H. Mobahi, R. Collobert, and J. Weston, "Deep learning from temporal coherence in video," in Proceedings of the 26th Annual International Conference on Machine Learning, ser. ICML '09. New York, NY, USA: ACM, 2009, pp. 737-744.
- (2009) Proceedings of the 26th Annual International Conference on Machine Learning, Ser. ICML '09 , pp. 737-744
- Mobahi, H.¹ Collobert, R.² Weston, J.³

12
- 78049408551
- Evaluation of pooling operations in convolutional architectures for object recognition
- Berlin,H eidelberg: Springer-Verlag
- D. Scherer, A. Miiller, and S. Behnke, "Evaluation of pooling operations in convolutional architectures for object recognition," in Proceedings of the 20th international conference on Artificial neural networks: Part III, ser. ICANN'IO. Berlin,H eidelberg: Springer-Verlag,2 010,p p. 92-101.
- (2010) Proceedings of the 20th International Conference on Artificial Neural Networks: Part III, Ser. icann'Io , pp. 92-101
- Scherer, D.¹ Miiller, A.² Behnke, S.³

13
- 56449095373
- A unified architecture for natural language processing: Deep neural networks with multitask learning
- New York, NY, USA: ACM
- R. Collobert and J. Weston, "A unified architecture for natural language processing: deep neural networks with multitask learning," in Proceedings of the 25th international conference on Machine learning, ser. ICML '08. New York, NY, USA: ACM, 2008, pp. 160-167.
- (2008) Proceedings of the 25th International Conference on Machine Learning, Ser. ICML '08 , pp. 160-167
- Collobert, R.¹ Weston, J.²

14
- 34547967782
- An empirical evaluation of deep architectures on problems with many factors of variation
- H. Larochelle, D. Erhan, A. Courville, 1. Bergstra, and Y. Bengio, "An empirical evaluation of deep architectures on problems with many factors of variation," in Proc. of the 24th International Conference on Machine Learning, 2007, pp. 473-480.
- (2007) Proc. of the 24th International Conference on Machine Learning , pp. 473-480
- Larochelle, H.¹ Erhan, D.² Courville, A.³ Bergstra, J.⁴ Bengio, Y.⁵

15
- 79961226155
- The difficulty of training deep architectures and the effect of unsupervised pre-training
- D. Erhan, P. Manzagol, Y. Bengio, S. Bengio, and P. Vincent, "The difficulty of training deep architectures and the effect of unsupervised pre-training," in Proc. of The Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS?09), 2009, pp. 153-160.
- (2009) Proc. of the Twelfth International Conference on Artificial Intelligence and Statistics (AISTATS?09) , pp. 153-160
- Erhan, D.¹ Manzagol, P.² Bengio, Y.³ Bengio, S.⁴ Vincent, P.⁵

16
- 34547997615
- Learning a nonlinear embedding by preserving class neighbourhood structure
- R. R. Salakhutdinov and G. E. Hinton, "Learning a nonlinear embedding by preserving class neighbourhood structure," in AI and Statistics, 2007.
- (2007) AI and Statistics
- Salakhutdinov, R.R.¹ Hinton, G.E.²

17
- 34948870900
- Unsupervised learning of invariant feature hierarchies with applications to object recognition
- M. Ranzato, F. 1. Huang, Y.-L. Boureau, and Y. LeCun, "Unsupervised learning of invariant feature hierarchies with applications to object recognition," in Proc. of CVPR '07.,2007.
- (2007) Proc. of CVPR '07
- Ranzato, M.¹ Huang, F.J.² Boureau, Y.-L.³ Lecun, Y.⁴

18
- 78649669320
- Deep big simple neural nets excel on handwritten digit recognition
- D. C. Ciresan, U. Meier, L. M. Gambardella, and J. Schmidhuber, "Deep big simple neural nets excel on handwritten digit recognition," Neural Computation, vol. 22, no. 12, pp. 3207-3220, 2010.
- (2010) Neural Computation , vol.22 , Issue.12 , pp. 3207-3220
- Ciresan, D.C.¹ Meier, U.² Gambardella, L.M.³ Schmidhuber, J.⁴

19
- 69549124128
- Deep belief net learning in a long-range vision system for autonomous off-road driving
- R. Hadsell,A. Erkan,P. Sermanet,M. Scoffier,U. Muller,a nd Y. LeCun, "Deep belief net learning in a long-range vision system for autonomous off-road driving," in IEEElRSJ International Conference on Intelligent Robots and Systems, 2008. IROS 2008, 2008, pp. 628-633.
- (2008) IEEElRSJ International Conference on Intelligent Robots and Systems, 2008. IROS 2008 , pp. 628-633
- Hadsell, R.¹ Erkan, A.² Sermanet, P.³ Scoffier, M.⁴ Muller, U.⁵ Lecun, Y.⁶

20
- 80054740693
- A committee of neural networks for traffic sign classification
- IEEE
- D. Ciresan, U. Meier, 1. Masci, and 1. Schmidhuber, "A committee of neural networks for traffic sign classification," in Neural Networks (IJCNN), The 2011 International Joint Conference on. IEEE, 2011, pp. 1918-1921.
- (2011) Neural Networks (IJCNN), the 2011 International Joint Conference On. , pp. 1918-1921
- Ciresan, D.¹ Meier, U.² Masci, J.³ Schmidhuber, J.⁴

21
- 84863380535
- Unsupervised feature learning for audio classification using convolutional deep belief networks
- Y. Bengio, D. Schuurmans, 1. Lafferty, C. K. I. Williams, and A. Culotta, Eds.
- H. Lee,P. Pham,Y. Largman,a nd A. Ng," Unsupervised feature learning for audio classification using convolutional deep belief networks;' in Advances in Neural Information Processing Systems 22, Y. Bengio, D. Schuurmans, 1. Lafferty, C. K. I. Williams, and A. Culotta, Eds., 2009, pp. 1096-1104.
- (2009) Advances in Neural Information Processing Systems , vol.22 , pp. 1096-1104
- Lee, H.¹ Pham, P.² Largman, Y.³ Ng, A.⁴

22
- 84880694195
- Stable function approximation in dynamic programming
- G. Gordon, "Stable Function Approximation in Dynamic P rogramming," in Proc. of the 12th Int. Con! on Machine Learning (ICML), 1995, pp. 261-268.
- (1995) Proc. of the 12th Int. Con! on Machine Learning (ICML) , pp. 261-268
- Gordon, G.¹

23
- 33750090060
- Reinforcement learning with raw pixels as input states
- D. Ernst, R. Maree, and L. Wehenkel, "Reinforcement learning with raw pixels as input states," in Int. Workshop on Intelligent Computing in Pattern Analysis/Synthesis (IW ICPAS), 2006, pp. 446-454.
- (2006) Int. Workshop on Intelligent Computing in Pattern Analysis/Synthesis (IW ICPAS) , pp. 446-454
- Ernst, D.¹ Maree, R.² Wehenkel, L.³

24
- 33646430006
- Extremely randomized trees
- P. Geurts, D. Ernst, and L. Wehenkel, "Extremely randomized trees," Machine Learning, vol. 63, no. 1, pp. 3-42,2006.
- (2006) Machine Learning , vol.63 , Issue.1 , pp. 3-42
- Geurts, P.¹ Ernst, D.² Wehenkel, L.³

25
- 34249075412
- Closed-loop learning of visual control policies
- S. Jodogne and 1. Piater, "Closed-loop learning of visual control policies," Journal of Artificial Intelligence Research, vol. 28, pp. 349-391, 2007.
- (2007) Journal of Artificial Intelligence Research , vol.28 , pp. 349-391
- Jodogne, S.¹ Piater, J.²

26
- 31844434669
- Interactive learning of mappings from visual percepts to actions
- -, "Interactive learning of mappings from visual percepts to actions," in Proc. of the 22nd international conference on Machine learning, 2005, pp. 393-400.
- (2005) Proc. of the 22nd International Conference on Machine Learning , pp. 393-400
- Jodogne, S.¹ Piater, J.²

27
- 34548094485
- Approximate policy iteration for closed-loop learning of visual tasks
- S. lodogne, C. Briquet, and 1. P iater, "Approximate P olicy Iteration for Closed-Loop Learning of Visual Tasks," in Proc. of the European Conference on Machine Learning, 2006.
- (2006) Proc. of the European Conference on Machine Learning
- Lodogne, S.¹ Briquet, C.² Iater, J.P.³

28
- 79959451979
- Deep auto-encoder neural networks in reinforcement learning
- Barcelona, Spain
- S. Lange and M. Riedmiller, "Deep auto-encoder neural networks in reinforcement learning," in Proc. of the International Joint Conference on Neural Networks (IJCNN 2010), Barcelona, Spain, 2010.
- (2010) Proc. of the International Joint Conference on Neural Networks (IJCNN 2010)
- Lange, S.¹ Riedmiller, M.²

29
- 79959478004
- Dissertation, Universitat Osnabriick
- S. Lange, "Tiefes Reinforcement Lemen auf Basis visueller Wahrnehmungen," Dissertation, Universitat Osnabriick, 2010.
- (2010) Tiefes Reinforcement Lemen Auf Basis Visueller Wahrnehmungen
- Lange, S.¹

30
- 84887003459
- Deep learning of visual control policies
- S. Lange and M. Riedmiller, "Deep learning of visual control policies," in European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN), 2010.
- (2010) European Symposium on Artificial Neural Networks, Computational Intelligence and Machine Learning (ESANN)
- Lange, S.¹ Riedmiller, M.²

31
- 84873574800
- Batch reinforcement learning
- M. Wiering and M. van Otterlo, Eds. Springer, in press, in press
- S. Lange,T. Gabel,a nd M. Riedmiller," Batch Reinforcement Learning," in Reinforcement Learning: State of the Art, M. Wiering and M. van Otterlo, Eds. Springer, in press, 2011, in press.
- (2011) Reinforcement Learning: State of the Art
- Lange, S.¹ Gabel, T.² Riedmiller, M.³

32
- 0019152630
- Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position
- K. Fukushima, "Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position," Biological Cybernetics, vol. 36, no. 4, pp. 193-202, 1980.
- (1980) Biological Cybernetics , vol.36 , Issue.4 , pp. 193-202
- Fukushima, K.¹

33
- 0032203257
- Gradient-based learning applied to document recognition
- Y. LeCun,L. Bottou,Y. Bengio,a nd P. Haffner," Gradient-based learning applied to document recognition," Proc. of the IEEE, vol. 86, no. 11, pp. 2278-2324, 1998.
- (1998) Proc. of the IEEE , vol.86 , Issue.11 , pp. 2278-2324
- Lecun, Y.¹ Bottou, L.² Bengio, Y.³ Haffner, P.⁴

34
- 84943274699
- A direct adaptive method for faster backpropagation learning: The RPROP algorithm
- M. Riedmiller and H. Braun, "A Direct Adaptive Method for Faster Backpropagation Learning: The RPROP Algorithm," in Proc. of the Int. Con/. on Neural Networks, 1993, pp. 586-591.
- (1993) Proc. of the Int. Conf. on Neural Networks , pp. 586-591
- Riedmiller, M.¹ Braun, H.²

35
- 0036832956
- Kernel-based reinforcement learning
- D. Ormoneit and S. Sen, "Kernel-based reinforcement learning," Machine Learning, vol. 49, no. 2, pp. 161-178,2002.
- (2002) Machine Learning , vol.49 , Issue.2 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

36
- 0000549293
- Self-organizing maps
- zweite edition ed., ser. Springer, Heidelberg
- T. Kohonen, Self-Organizing Maps, zweite edition ed., ser. Springer Series in Information Sciences. Springer, Heidelberg, 1997, vol. 30.
- (1997) Springer Series in Information Sciences , vol.30
- Kohonen, T.¹

37
- 84865084454
- Effizient klassifizieren und clustern: lernparadigmen von vektorquantisierern
- B. Hammer and T. Villmann, "Effizient Klassifizieren und Clustern: Lernparadigmen von Vektorquantisierern," Kiinstliche Intelligenz, vol. 6, no. 3, pp. 5-11, 2006.
- (2006) Kiinstliche Intelligenz , vol.6 , Issue.3 , pp. 5-11
- Hammer, B.¹ Villmann, T.²

38
- 0000773486
- A growing neural gas network learns topologies
- B. Fritzke, "A growing neural gas network learns topologies," Advances in Neural Information Processing Systems 7, 1995.
- (1995) Advances in Neural Information Processing Systems , vol.7
- Fritzke, B.¹

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.