SCOPUS 정보 검색 플랫폼

International Journal of Robotics Research

Volumn 30, Issue 7, 2011, Pages 954-966

Closing the learning-planning loop with predictive state representations

(3) Boots, Byron a MSiddiqi, Sajid b Gordon, Geoffrey J a

a CARNEGIE MELLON UNIVERSITY (United States)

Author keywords

Latent variable discovery; Planning under uncertainty; Point based value iteration; POMDPs; Predictive state representations; Singular value decomposition; Subspace identification

Indexed keywords

LATENT VARIABLE; PLANNING UNDER UNCERTAINTY; POINT-BASED VALUE ITERATIONS; POMDPS; PREDICTIVE STATE REPRESENTATION; SINGULAR VALUES; SUBSPACE IDENTIFICATION;

ARTIFICIAL INTELLIGENCE; LEARNING ALGORITHMS; SINGULAR VALUE DECOMPOSITION;

ROBOT PROGRAMMING;

EID: 80052249260 PISSN: 02783649 EISSN: 17413176 Source Type: Journal
DOI: 10.1177/0278364911404092 Document Type: Conference Paper

Times cited : (180)

References (37)

1
- 0003857778
- The International Computer Science Institute: Technical Report ICSI-TR 97-021
- Bilmes J (1997) A Gentle Tutorial on the EM Algorithm and Its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models. The International Computer Science Institute: Technical Report ICSI-TR-97-021.
- (1997) A Gentle Tutorial on the em Algorithm and Its Application to Parameter Estimation for Gaussian Mixture and Hidden Markov Models
- Bilmes, J.¹

2
- 33749258231
- Learning predictive state representations using nonblind policies
- Bowling M, McCracken P, James M, Neufeld J and Wilkinson D (2006) Learning predictive state representations using nonblind policies. In Proceedings of ICML.
- (2006) Proceedings of ICML
- Bowling, M.¹ McCracken, P.² James, M.³ Neufeld, J.⁴ Wilkinson, D.⁵

3
- 0028564629
- Acting optimally in partially observable stochastic domains
- Cassandra AR, Kaelbling LP and Littman MR (1994) Acting optimally in partially observable stochastic domains. In Proceedings of AAAI.
- (1994) Proceedings of AAAI
- Cassandra, A.R.¹ Kaelbling, L.P.² Littman, M.R.³

4
- 33750367918
- Planning in POMDPs using multiplicity automata
- Even-Dar E, Kakade SM and Mansour Y (2005) Planning in POMDPs using multiplicity automata. In Proceedings of UAI
- (2005) Proceedings of UAI
- Even-Dar, E.¹ Kakade, S.M.² Mansour, Y.³

5
- 84898066687
- A spectral algorithm for learning hidden
- Z.
- Hsu D, Kakade S and Zhang T (2009) A spectral algorithm for learning hidden Markov models. In Proceedings of COLT.
- (2009) Markov Models. in Proceedings of COLT
- Hsu, D.¹ Kakade, S.² Zhang, T.³

6
- 80052191460
- Point-based planning for predictive state representations
- IzadiMT and Precup D (2008) Point-based planning for predictive state representations. In Proceedings of Canadian AI.
- (2008) Proceedings of Canadian AI
- Precup, D.¹

7
- 0034198996
- Observable operator models for discrete stochastic time series
- Jaeger H (2000) Observable operator models for discrete stochastic time series. Neural Computation 12: 1371-1398.
- (2000) Neural Computation , vol.12 , pp. 1371-1398
- Jaeger, H.¹

8
- 84877198914
- Efficient training of OOMs
- Jaeger H, Zhao M and Kolling A (2005) Efficient training of OOMs. In Proceedings of NIPS.
- (2005) Proceedings of NIPS
- Jaeger, H.¹ Zhao, M.² Kolling, A.³

9
- 33750704593
- Improving approximate value iteration using memories and predictive state representations
- James MR, Wessling T and Vlassis NA (2006) Improving approximate value iteration using memories and predictive state representations. In Proceedings of AAAI.
- (2006) Proceedings of AAAI
- James, M.R.¹ Wessling, T.² Vlassis, N.A.³

10
- 80052199204
- Technical Report UT-AI-TR 04-309, University of Texas at Austin
- Jong NK and Stone P (2004) Towards Employing PSRs in a Continuous Domain. Technical Report UT-AI-TR-04-309, University of Texas at Austin.
- (2004) Towards Employing PSRs in A Continuous Domain
- Jong, N.K.¹ Stone, P.²

11
- 0028324717
- Cryptographic limitations on learning boolean formulae and finite automata
- Kearns M and Valiant L (1994) Cryptographic limitations on learning boolean formulae and finite automata. Journal of the ACM 41: 67-95.
- (1994) Journal of the ACM , vol.41 , pp. 67-95
- Kearns, M.¹ Valiant, L.²

12
- 84898982129
- Predictive representations of state
- Littman M, Sutton R and Singh S (2002) Predictive representations of state. In Advances in Neural Information Processing Systems (NIPS).
- (2002) Advances in Neural Information Processing Systems (NIPS)
- Littman, M.¹ Sutton, R.² Singh, S.³

13
- 0003932121
- PhD thesis, University of Rochester
- McCallum A (1995) Reinforcement Learning with Selective Perception and Hidden State. PhD thesis, University of Rochester.
- (1995) Reinforcement Learning with Selective Perception and Hidden State.
- McCallum, A.¹

14
- 84864070408
- Online discovery and learning of predictive state representations
- McCracken P and Bowling M (2005) Online discovery and learning of predictive state representations. In Proceedings of NIPS.
- (2005) Proceedings of NIPS
- McCracken, P.¹ Bowling, M.²

15
- 31844443291
- Inverted autonomous helicopter flight via reinforcement learning
- Ng AY, Coates A, Diel M, Ganapathi V, Schulte J, Tse B, (2004) Inverted autonomous helicopter flight via reinforcement learning. In International Symposium on Experimental Robotics.
- (2004) International Symposium on Experimental Robotics
- Ng, A.Y.¹ Coates, A.² Diel, M.³ Ganapathi, V.⁴ Schulte, J.⁵ Tse, B.⁶

16
- 0003398906
- Cambridge University Press
- Pearl J (2000) Causality: Models, Reasoning, and Inference. Cambridge University Press.
- (2000) Causality: Models, Reasoning, and Inference
- Pearl, J.¹

17
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- Pineau J, Gordon G and Thrun S (2003) Point-based value iteration: An anytime algorithm for POMDPs. In Proceedings of IJCAI.
- (2003) Proceedings of IJCAI
- Pineau, J.¹ Gordon, G.² Thrun, S.³

18
- 52249090123
- Anytime point-based approximations for large POMDPs
- Pineau J, Gordon G and Thrun S (2006) Anytime point-based approximations for large POMDPs. Journal of Artificial Intelligence Research 27: 335-380.
- (2006) Journal of Artificial Intelligence Research , vol.27 , pp. 335-380
- Pineau, J.¹ Gordon, G.² Thrun, S.³

19
- 14344256568
- Learning low dimensional predictive representations
- Rosencrantz M, Gordon GJ and Thrun S (2004) Learning low dimensional predictive representations. In Proceedings of ICML.
- (2004) Proceedings of ICML
- Rosencrantz, M.¹ Gordon, G.J.² Thrun, S.³

20
- 77955213275
- Model-based Bayesian reinforcement learning in large structured domains
- Ross S and Pineau J (2008) Model-based Bayesian reinforcement learning in large structured domains. In Proceedings of UAI.
- (2008) Proceedings of UAI
- Ross, S.¹ Pineau, J.²

21
- 79956088629
- Model-based online learning of POMDPs
- Shani G, Brafman RI and Shimony SE (2005) Model-based online learning of POMDPs. In Proceedings of ECML.
- (2005) Proceedings of ECML
- Shani, G.¹ Brafman, R.I.² Shimony, S.E.³

22
- 84860608661
- Reduced-rank hidden Markov models
- Siddiqi S, Boots B and Gordon GJ (2010) Reduced-rank hidden Markov models. In Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS-2010).
- (2010) Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics (AISTATS-2010)
- Siddiqi, S.¹ Boots, B.² Gordon, G.J.³

23
- 0003443397
- Chapman & Hall
- Silverman BW (1986) Density Estimation for Statistics and Data Analysis. London: Chapman & Hall.
- (1986) Density Estimation for Statistics and Data Analysis. London
- Silverman, B.W.¹

24
- 31844457132
- Predictive state representations: A new theory for modeling dynamical systems
- Singh S, James M and Rudary M (2004) Predictive state representations: A new theory for modeling dynamical systems. In Proceedings of UAI.
- (2004) Proceedings of UAI
- Singh, S.¹ James, M.² Rudary, M.³

25
- 1942452236
- Learning predictive state representations
- Singh S, Littman ML, Jong NK, Pardoe D and Stone P (2003) Learning predictive state representations. In Proceedings of ICML.
- (2003) Proceedings of ICML
- Singh, S.¹ Littman, M.L.² Jong, N.K.³ Pardoe, D.⁴ Stone, P.⁵

26
- 4243922811
- Technical Report UCLA
- Soatto S and Chiuso A (2001). Dynamic Data Factorization. Technical Report, UCLA.
- (2001) Dynamic Data Factorization.
- Soatto, S.¹ Chiuso, A.²

27
- 0003871607
- PhD thesis, Stanford University
- Sondik EJ (1971) The Optimal Control of Partially Observable Markov Processes. PhD thesis, Stanford University.
- (1971) The Optimal Control of Partially Observable Markov Processes.
- Sondik, E.J.¹

28
- 31144472319
- Perseus: Randomized point-based value iteration for POMDPs
- Spaan MTJ and Vlassis N (2005) Perseus: Randomized pointbased value iteration for POMDPs. Journal of Artificial Intelligence Research 24: 195-220. (Pubitemid 43130936)
- (2005) Journal of Artificial Intelligence Research , vol.24 , pp. 195-220
- Spaan, M.T.J.¹ Vlassis, N.²

29
- 80052189356
- Tedrake R, Jackowski Z, Cory R, Roberts JW and Hoburg W (2009) Learning to Fly Like a Bird. Under review.
- (2009) Learning to Fly Like A Bird. under Review
- Tedrake, R.¹ Jackowski, Z.² Cory, R.³ Roberts, J.W.⁴ Hoburg, W.⁵

30
- 84907301232
- On the learnability of hidden Markov models
- London: Springer Verlag
- Terwijn S (2002) On the learnability of hidden Markov models. In ICGI '02: Proceedings of the 6th International Colloquium on Grammatical Inference. London: Springer-Verlag, pp. 261-268.
- (2002) ICGI '02: Proceedings of the 6th International Colloquium on Grammatical Inference , pp. 261-268
- Terwijn, S.¹

31
- 0003426684
- Dordrecht Kluwer
- Van Overschee P and De Moor B (1996) Subspace Identification for Linear Systems: Theory, Implementation, Applications. Dordrecht: Kluwer.
- (1996) Subspace Identification for Linear Systems: Theory, Implementation, Applications
- Van Overschee, P.¹ De Moor, B.²

32
- 31844439543
- Learning predictive representations from a history
- Wiewiora E (2005) Learning predictive representations from a history. In Proceedings of ICML.
- (2005) Proceedings of ICML
- Wiewiora, E.¹

33
- 56449102993
- PhD thesis, University of Michigan
- Wingate D (2008) Exponential Family Predictive Representations of State. PhD thesis, University of Michigan.
- (2008) Exponential Family Predictive Representations of State.
- Wingate, D.¹

34
- 60349110114
- On discovery and learning of models with predictive representations of state for agents with continuous actions and observations
- Wingate D and Singh S (2007) On discovery and learning of models with predictive representations of state for agents with continuous actions and observations. In Proceedings of AAMAS.
- (2007) Proceedings of AAMAS
- Wingate, D.¹ Singh, S.²

35
- 56449115195
- Efficiently learning linear-linear exponential family predictive representations of state
- Wingate D and Singh S (2008) Efficiently learning linear-linear exponential family predictive representations of state. In Proceedings of ICML.
- (2008) Proceedings of ICML
- Wingate, D.¹ Singh, S.²

36
- 31844453029
- Learning predictive state representations in dynamical systems without reset
- Wolfe B, James M and Singh S (2005) Learning predictive state representations in dynamical systems without reset. In Proceedings of ICML.
- (2005) Proceedings of ICML
- Wolfe, B.¹ James, M.² Singh, S.³

37
- 70349239285
- A bound on modeling error in observable operator models and an associated learning algorithm
- in press
- Zhao M, Jaeger H and Thon M (2009) A bound on modeling error in observable operator models and an associated learning algorithm. Neural Computation, in press.
- (2009) Neural Computation
- Zhao, M.¹ Jaeger, H.² Thon, M.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.