SCOPUS Information Search Platform

Advances in Neural Information Processing Systems 22 - Proceedings of the 2009 Conference

Volume , Issue , 2009, Pages 189-197

Manifold embeddings for model-based reinforcement learning under partial observability

(2) Bush, Keith (a); Pineau, Joelle (a)

(a) MCGILL UNIVERSITY (Canada)

Author keywords

[No Author keywords available]

Indexed keywords

OBSERVABILITY;

CONTROLLED SYSTEM; EMBEDDINGS; FIRST PRINCIPLES; LEARN+; MODEL-BASED REINFORCEMENT LEARNING; NEUROSTIMULATION; OFFLINE MODELS; PARTIAL OBSERVABILITY; REAL-WORLD DATASETS; STATE-SPACE;

REINFORCEMENT LEARNING;

EID: 79957616613 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (16)

References (25)

1
- 0031073475
- Locally weighted learning for control
- Christopher G. Atkeson, Andrew W. Moore, and Stefan Schaal. Locally weighted learning for control. Artificial Intelligence Review, 11:75-113, 1997.
- (1997) Artificial Intelligence Review , vol.11 , pp. 75-113
- Atkeson, C.G.¹ Moore, A.W.² Schaal, S.³

2
- 84898983672
- Nonparametric representation of policies and value functions: A trajectory-based approach
- Christopher G. Atkeson and Jun Morimoto. Nonparametric representation of policies and value functions: A trajectory-based approach. In Advances in Neural Information Processing, 2003.
- (2003) Advances in Neural Information Processing
- Atkeson, C.G.¹ Morimoto, J.²

3
- 31844456089
- Action respecting embedding
- M. Bowling, A. Ghodsi, and D. Wilkinson. Action respecting embedding. In Proceedings of ICML, 2005.
- (2005) Proceedings of ICML
- Bowling, M.¹ Ghodsi, A.² Wilkinson, D.³

4
- 0038238310
- Dynamical diseases of brain systems: Different routes to epileptic seizures
- F. Lopes da Silva, W. Blanes, S. Kalitzin, J. Parra, P. Suffczynski, and D. Velis. Dynamical diseases of brain systems: Different routes to epileptic seizures. IEEE Transactions on Biomedical Engineering, 50(5):540-548, 2003.
- (2003) IEEE Transactions on Biomedical Engineering , vol.50 , Issue.5 , pp. 540-548
- Lopes Da Silva, F.¹ Blanes, W.² Kalitzin, S.³ Parra, J.⁴ Suffczynski, P.⁵ Velis, D.⁶

5
- 17444388562
- Repetitive low-frequency stimulation reduces epileptiform synchronization in limbic neuronal networks
- G. D'Arcangelo, G. Panuccio, V. Tancredi, and M. Avoli. Repetitive low-frequency stimulation reduces epileptiform synchronization in limbic neuronal networks. Neurobiology of Disease, 19:119-128, 2005.
- (2005) Neurobiology of Disease , vol.19 , pp. 119-128
- D'Arcangelo, G.¹ Panuccio, G.² Tancredi, V.³ Avoli, M.⁴

6
- 21844465127
- Tree-based batch mode reinforcement learning
- Damien Ernst, Pierre Geurts, and Louis Wehenkel. Tree-based batch mode reinforcement learning. Journal of Machine Learning Research, 6:503-556, 2005.
- (2005) Journal of Machine Learning Research , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

7
- 0003479769
- Topics in Nonlinear Time Series Analysis: With Implications for EEG Analysis
- A. Galka. Topics in Nonlinear Time Series Analysis: with implications for EEG Analysis. World Scientific, 2000.
- (2000) Topics in Nonlinear Time Series Analysis: With Implications for EEG Analysis
- Galka, A.¹

8
- 33947369542
- Embedding nonlinear dynamical systems: A guide to Takens' theorem
- J.P. Huke. Embedding nonlinear dynamical systems: A guide to Takens' Theorem. Technical report, Manchester Institute for Mathematical Sciences, University of Manchester, March, 2006.
- (2006) Technical Report, Manchester Institute for Mathematical Sciences
- Huke, J.P.¹

9
- 0028958484
- Periodic pacing and in vitro epileptic focus
- K. Jerger and S. Schiff. Periodic pacing and in vitro epileptic focus. Journal of Neurophysiology, 73(2):876-879, 1995.
- (1995) Journal of Neurophysiology , vol.73 , Issue.2 , pp. 876-879
- Jerger, K.¹ Schiff, S.²

10
- 60349084848
- Model-based function approximation in reinforcement learning
- Nicholas K. Jong and Peter Stone. Model-based function approximation in reinforcement learning. In Proceedings of AAMAS, 2007.
- (2007) Proceedings of AAMAS
- Jong, N.K.¹ Stone, P.²

11
- 33749263205
- Automatic basis function construction for approximate dynamic programming and reinforcement learning
- P.W. Keller, S. Mannor, and D. Precup. Automatic basis function construction for approximate dynamic programming and reinforcement learning. In Proceedings of ICML, 2006.
- (2006) Proceedings of ICML
- Keller, P.W.¹ Mannor, S.² Precup, D.³

12
- 41349096005
- False neighbors and false strands: A reliable minimum embedding dimension algorithm
- M. Kennel and H. Abarbanel. False neighbors and false strands: A reliable minimum embedding dimension algorithm. Physical Review E, 66:026209, 2002.
- (2002) Physical Review E , vol.66 , pp. 026209
- Kennel, M.¹ Abarbanel, H.²

13
- 35748957806
- Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes
- S. Mahadevan and M. Maggioni. Proto-value functions: A Laplacian framework for learning representation and control in Markov decision processes. Journal of Machine Learning Research, 8:2169-2231, 2007.
- (2007) Journal of Machine Learning Research , vol.8 , pp. 2169-2231
- Mahadevan, S.¹ Maggioni, M.²

14
- 0003932121
- Reinforcement Learning with Selective Perception and Hidden State
- A. K. McCallum. Reinforcement Learning with Selective Perception and Hidden State. PhD thesis, University of Rochester, 1996.
- (1996) Reinforcement Learning with Selective Perception and Hidden State
- McCallum, A.K.¹

15
- 0036832953
- Variable resolution discretization in optimal control
- R. Munos and A. Moore. Variable resolution discretization in optimal control. Machine Learning, 49:291-323, 2002.
- (2002) Machine Learning , vol.49 , pp. 291-323
- Munos, R.¹ Moore, A.²

16
- 0001766834
- Prediction of spatiotemporal time series based on reconstructed local states
- U. Parlitz and C. Merkwirth. Prediction of spatiotemporal time series based on reconstructed local states. Physical Review Letters, 84(9):1890-1893, 2000.
- (2000) Physical Review Letters , vol.84 , Issue.9 , pp. 1890-1893
- Parlitz, U.¹ Merkwirth, C.²

17
- 79951606113
- Natural actor-critic
- Jan Peters, Sethu Vijayakumar, and Stefan Schaal. Natural actor-critic. In Proceedings of ECML, 2005.
- (2005) Proceedings of ECML
- Peters, J.¹ Vijayakumar, S.² Schaal, S.³

18
- 33751278790
- Embedology
- Tim Sauer, James A. Yorke, and Martin Casdagli. Embedology. Journal of Statistical Physics, 65(3/4):579-616, 1991.
- (1991) Journal of Statistical Physics , vol.65 , Issue.3-4 , pp. 579-616
- Sauer, T.¹ Yorke, J.A.² Casdagli, M.³

19
- 1942452236
- Learning predictive state representations
- S. Singh, M. L. Littman, N. K. Jong, D. Pardoe, and P. Stone. Learning predictive state representations. In Proceedings of ICML, 2003.
- (2003) Proceedings of ICML
- Singh, S.¹ Littman, M.L.² Jong, N.K.³ Pardoe, D.⁴ Stone, P.⁵

20
- 34547971837
- Explicit manifold representations for value-functions in reinforcement learning
- W. Smart. Explicit manifold representations for value-functions in reinforcement learning. In Proceedings of ISAIM, 2004.
- (2004) Proceedings of ISAIM
- Smart, W.¹

21
- 0000738499
- Delay embeddings for forced systems. I. Deterministic forcing
- J. Stark. Delay embeddings for forced systems. I. Deterministic forcing. Journal of Nonlinear Science, 9:255-332, 1999.
- (1999) Journal of Nonlinear Science , vol.9 , pp. 255-332
- Stark, J.¹

22
- 84867960650
- Delay embeddings for forced systems. II. Stochastic forcing
- J. Stark, D.S. Broomhead, M.E. Davies, and J. Huke. Delay embeddings for forced systems. II. Stochastic forcing. Journal of Nonlinear Science, 13:519-577, 2003.
- (2003) Journal of Nonlinear Science , vol.13 , pp. 519-577
- Stark, J.¹ Broomhead, D.S.² Davies, M.E.³ Huke, J.⁴

23
- 0004102479
- Reinforcement learning: An introduction
- R. Sutton and A. Barto. Reinforcement learning: An introduction. The MIT Press, Cambridge, MA, 1998.
- (1998) Reinforcement Learning: An Introduction
- Sutton, R.¹ Barto, A.²

24
- 0000779360
- Detecting strange attractors in turbulence
- F. Takens. Detecting strange attractors in turbulence. In D. A. Rand & L. S. Young, editors, Dynamical Systems and Turbulence, volume 898, pages 366-381. Warwick, 1980.
- (1980) Dynamical Systems and Turbulence , vol.898 , pp. 366-381
- Takens, F.¹

25
- 60349110114
- On discovery and learning of models with predictive state representations of state for agents with continuous actions and observations
- D. Wingate and S. Singh. On discovery and learning of models with predictive state representations of state for agents with continuous actions and observations. In Proceedings of AAMAS, 2007.
- (2007) Proceedings of AAMAS
- Wingate, D.¹ Singh, S.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS DB.