메뉴 건너뛰기




Volumn , Issue , 2009, Pages 189-197

Manifold embeddings for model-based reinforcement learning under partial observability

Author keywords

[No Author keywords available]

Indexed keywords

OBSERVABILITY;

EID: 79957616613     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (16)

References (25)
  • 2
    • 84898983672 scopus 로고    scopus 로고
    • Nonparametric representation of policies and value functions: A trajectory-based approach
    • Christopher G. Atkeson and Jun Morimoto. Nonparametric representation of policies and value functions: A trajectory-based approach. In Advances in Neural Information Processing, 2003.
    • (2003) Advances in Neural Information Processing
    • Atkeson, C.G.1    Morimoto, J.2
  • 5
    • 17444388562 scopus 로고    scopus 로고
    • Repetitive low-frequency stimulation reduces epileptiform synchronization in limbic neuronal networks
    • G. D'Arcangelo, G. Panuccio, V. Tancredi, and M. Avoli. Repetitive low-frequency stimulation reduces epileptiform synchronization in limbic neuronal networks. Neurobiology of Disease, 19:119-128, 2005.
    • (2005) Neurobiology of Disease , vol.19 , pp. 119-128
    • D'Arcangelo, G.1    Panuccio, G.2    Tancredi, V.3    Avoli, M.4
  • 8
  • 9
    • 0028958484 scopus 로고
    • Periodic pacing and in vitro epileptic focus
    • K. Jerger and S. Schiff. Periodic pacing and in vitro epileptic focus. Journal of Neurophysiology, 73(2):876-879, 1995.
    • (1995) Journal of Neurophysiology , vol.73 , Issue.2 , pp. 876-879
    • Jerger, K.1    Schiff, S.2
  • 10
    • 60349084848 scopus 로고    scopus 로고
    • Model-based function approximation in reinforcement learning
    • Nicholas K. Jong and Peter Stone. Model-based function approximation in reinforcement learning. In Proceedings of AAMAS, 2007.
    • (2007) Proceedings of AAMAS
    • Jong, N.K.1    Stone, P.2
  • 11
    • 33749263205 scopus 로고    scopus 로고
    • Automatic basis function construction for approximate dynamic programming and reinforcement learning
    • P.W. Keller, S. Mannor, and D. Precup. Automatic basis function construction for approximate dynamic programming and reinforcement learning. In Proceedings of ICML, 2006.
    • (2006) Proceedings of ICML
    • Keller, P.W.1    Mannor, S.2    Precup, D.3
  • 12
    • 41349096005 scopus 로고    scopus 로고
    • False neighbors and false strands: A reliable minimum embedding dimension algorithm
    • M. Kennel and H. Abarbanel. False neighbors and false strands: A reliable minimum embedding dimension algorithm. Physical Review E, 66:026209, 2002.
    • (2002) Physical Review E , vol.66 , pp. 026209
    • Kennel, M.1    Abarbanel, H.2
  • 13
    • 35748957806 scopus 로고    scopus 로고
    • Proto-value functions: A laplacian framework for learning representation and control in Markov decision processes
    • S. Mahadevan and M. Maggioni. Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes. Journal of Machine Learning Research, 8:2169-2231, 2007.
    • (2007) Journal of Machine Learning Research , vol.8 , pp. 2169-2231
    • Mahadevan, S.1    Maggioni, M.2
  • 15
    • 0036832953 scopus 로고    scopus 로고
    • Variable resolution discretization in optimal control
    • R. Munos and A. Moore. Variable resolution discretization in optimal control. Machine Learning, 49:291-323, 2002.
    • (2002) Machine Learning , vol.49 , pp. 291-323
    • Munos, R.1    Moore, A.2
  • 16
    • 0001766834 scopus 로고    scopus 로고
    • Prediction of spatiotemporal time series based on reconstructed local states
    • U. Parlitz and C. Merkwirth. Prediction of spatiotemporal time series based on reconstructed local states. Physical Review Letters, 84(9):1890-1893, 2000.
    • (2000) Physical Review Letters , vol.84 , Issue.9 , pp. 1890-1893
    • Parlitz, U.1    Merkwirth, C.2
  • 20
    • 34547971837 scopus 로고    scopus 로고
    • Explicit manifold representations for value-functions in reinforcement learning
    • W. Smart. Explicit manifold representations for value-functions in reinforcement learning. In Proceedings of ISAIM, 2004.
    • (2004) Proceedings of ISAIM
    • Smart, W.1
  • 21
    • 0000738499 scopus 로고    scopus 로고
    • Delay embeddings for forced systems. I. Deterministic forcing
    • J. Stark. Delay embeddings for forced systems. I. Deterministic forcing. Journal of Nonlinear Science, 9:255-332, 1999.
    • (1999) Journal of Nonlinear Science , vol.9 , pp. 255-332
    • Stark, J.1
  • 24
    • 0000779360 scopus 로고
    • Detecting strange attractors in turbulence
    • D. A. Rand & L. S. Young, editor Warwick
    • F. Takens. Detecting strange attractors in turbulence. In D. A. Rand & L. S. Young, editor, Dynamical Systems and Turbulence, volume 898, pages 366-381. Warwick, 1980.
    • (1980) Dynamical Systems and Turbulence , vol.898 , pp. 366-381
    • Takens, F.1
  • 25
    • 60349110114 scopus 로고    scopus 로고
    • On discovery and learning of models with predictive state representations of state for agents with continuous actions and observations
    • D. Wingate and S. Singh. On discovery and learning of models with predictive state representations of state for agents with continuous actions and observations. In Proceedings of AAMAS, 2007.
    • (2007) Proceedings of AAMAS
    • Wingate, D.1    Singh, S.2


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.