메뉴 건너뛰기




Volumn , Issue , 2014, Pages

State Representation Learning in Robotics: Using Prior Knowledge about Physical Interaction

Author keywords

[No Author keywords available]

Indexed keywords

AIR NAVIGATION; REINFORCEMENT LEARNING; ROBOTICS;

EID: 84961612003     PISSN: None     EISSN: 2330765X     Source Type: Conference Proceeding    
DOI: 10.15607/RSS.2014.X.019     Document Type: Conference Paper
Times cited : (50)

References (30)
  • 8
    • 77954141070 scopus 로고    scopus 로고
    • Feature reinforcement learning: Part I: Unstructured MDPs
    • Marcus Hutter. Feature reinforcement learning: Part I: Unstructured MDPs. Journal of Artificial General Intelligence, 1:3–24, 2009.
    • (2009) Journal of Artificial General Intelligence , vol.1 , pp. 3-24
    • Hutter, Marcus1
  • 14
    • 0041654220 scopus 로고
    • Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis
    • Joseph B. Kruskal. Multidimensional scaling by optimizing goodness of fit to a nonmetric hypothesis. Psychometrika, 29(1):1–27, 1964.
    • (1964) Psychometrika , vol.29 , Issue.1 , pp. 1-27
    • Kruskal, Joseph B.1
  • 16
    • 78049417739 scopus 로고    scopus 로고
    • Reinforcement learning on slow features of high-dimensional input streams
    • Robert Legenstein, Niko Wilbert, and Laurenz Wiskott. Reinforcement learning on slow features of high-dimensional input streams. PLoS Computational Biology, 6(8):e1000894, 2010.
    • (2010) PLoS Computational Biology , vol.6 , Issue.8 , pp. e1000894
    • Legenstein, Robert1    Wilbert, Niko2    Wiskott, Laurenz3
  • 19
    • 35748957806 scopus 로고    scopus 로고
    • Proto-value functions: A laplacian framework for learning representation and control in markov decision processes
    • Sridhar Mahadevan and Mauro Maggioni. Proto-value functions: A laplacian framework for learning representation and control in markov decision processes. Journal of Machine Learning Research, 8(10):2169–2231, 2007.
    • (2007) Journal of Machine Learning Research , vol.8 , Issue.10 , pp. 2169-2231
    • Mahadevan, Sridhar1    Maggioni, Mauro2
  • 20
    • 17444414191 scopus 로고    scopus 로고
    • Basis function adaptation in temporal difference reinforcement learning
    • Ishai Menache, Shie Mannor, and Nahum Shimkin. Basis function adaptation in temporal difference reinforcement learning. Annals of Operations Research, 134:215–238, 2005.
    • (2005) Annals of Operations Research , vol.134 , pp. 215-238
    • Menache, Ishai1    Mannor, Shie2    Shimkin, Nahum3
  • 22
    • 33646398129 scopus 로고    scopus 로고
    • Neural fitted Q iteration – first experiences with a data efficient neural reinforcement learning method
    • Martin Riedmiller. Neural fitted Q iteration – first experiences with a data efficient neural reinforcement learning method. In 16th European Conference on Machine Learning (ECML), pages 317–328, 2005.
    • (2005) 16th European Conference on Machine Learning (ECML) , pp. 317-328
    • Riedmiller, Martin1
  • 23
    • 0034704222 scopus 로고    scopus 로고
    • Nonlinear dimensionality reduction by locally linear embedding
    • Sam T. Roweis and Lawrence K. Saul. Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500):2323–2326, 2000.
    • (2000) Science , vol.290 , Issue.5500 , pp. 2323-2326
    • Roweis, Sam T.1    Saul, Lawrence K.2
  • 24
    • 84865801985 scopus 로고    scopus 로고
    • Conversational speech transcription using context-dependent deep neural networks
    • Frank Seide, Gang Li, and Dong Yu. Conversational speech transcription using context-dependent deep neural networks. In Interspeech, pages 437–440, 2011.
    • (2011) Interspeech , pp. 437-440
    • Seide, Frank1    Li, Gang2    Yu, Dong3
  • 25
    • 0023223978 scopus 로고
    • Toward a universal law of generalization for psychological science
    • Roger N. Shepard. Toward a universal law of generalization for psychological science. Science, 237(4820): 1317–1323, 1987.
    • (1987) Science , vol.237 , Issue.4820 , pp. 1317-1323
    • Shepard, Roger N.1
  • 29
    • 0034704229 scopus 로고    scopus 로고
    • A global geometric framework for nonlinear dimensionality reduction
    • Joshua B. Tenenbaum, Vin De Silva, and John C. Langford. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319–2323, 2000.
    • (2000) Science , vol.290 , Issue.5500 , pp. 2319-2323
    • Tenenbaum, Joshua B.1    De Silva, Vin2    Langford, John C.3
  • 30
    • 0036546660 scopus 로고    scopus 로고
    • Slow feature analysis: unsupervised learning of invariances
    • Laurenz Wiskott and Terrence J. Sejnowski. Slow feature analysis: unsupervised learning of invariances. Neural Computation, 14(4):715–770, 2002.
    • (2002) Neural Computation , vol.14 , Issue.4 , pp. 715-770
    • Wiskott, Laurenz1    Sejnowski, Terrence J.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.