SCOPUS Information Search Platform

Uncertainty in Artificial Intelligence - Proceedings of the 28th Conference, UAI 2012

Volume , Issue , 2012, Pages 644-653

Hilbert space embeddings of POMDPs

(4) Nishiyama, Yu (a); Boularias, Abdeslam (b); Gretton, Arthur (b, c); Fukumizu, Kenji (a)

a INSTITUTE OF STATISTICAL MATHEMATICS (Japan)

b MAX PLANCK INSTITUTE FOR INTELLIGENT SYSTEMS (Germany)

c UNIVERSITY COLLEGE LONDON (United Kingdom)

Author keywords

[No Author keywords available]

Indexed keywords

BELLMAN EQUATIONS; FEATURE SPACE; NONPARAMETRIC APPROACHES; OPTIMAL VALUE FUNCTIONS; POLICY LEARNING; REPRODUCING KERNEL HILBERT SPACES; VALUE FUNCTIONS; VALUE ITERATION;

ARTIFICIAL INTELLIGENCE;

EID: 84879146831 PISSN: None EISSN: None Source Type: Conference Proceeding
DOI: None Document Type: Conference Paper

Times cited : (39)

References (26)

1
- 84880715629
- Reinforcement learning in POMDPs without resets
- [Even-Dar, 2005] Eyal Even-Dar. Reinforcement learning in POMDPs without resets. IJCAI, 690-695, 2005.
- (2005) IJCAI , pp. 690-695
- Even-Dar, E.¹

2
- 0041494125
- Efficient SVM training using low-rank kernel representations
- [Fine and Scheinberg, 2001] S. Fine and K. Scheinberg. Efficient SVM training using low-rank kernel representations. JMLR, 2:243-264, 2001.
- (2001) JMLR , vol.2 , pp. 243-264
- Fine, S.¹ Scheinberg, K.²

3
- 85161986095
- Kernel measures of conditional dependence
- [Fukumizu et al., 2008] K. Fukumizu, A. Gretton, X. Sun, and B. Scholkopf. Kernel measures of conditional dependence. In NIPS2008.
- (2008) NIPS
- Fukumizu, K.¹ Gretton, A.² Sun, X.³ Scholkopf, B.⁴

4
- 85162445686
- Kernel Bayes' Rule
- [Fukumizu et al., 2011] K. Fukumizu, L. Song, and A. Gretton. Kernel Bayes' Rule. In NIPS2011.
- (2011) NIPS
- Fukumizu, K.¹ Song, L.² Gretton, A.³

5
- 84867112855
- Kernel Bayes rule: Bayesian inference with positive definite kernels
- [Fukumizu et al., 2011] K. Fukumizu, L. Song, and A. Gretton. Kernel Bayes rule: Bayesian inference with positive definite kernels. arXiv:1009.5736.
- (2011) Kernel Bayes Rule: Bayesian Inference with Positive Definite Kernels
- Fukumizu, K.¹ Song, L.² Gretton, A.³

6
- 84864063983
- A kernel method for the two-sample-problem
- [Gretton et al., 2007] A. Gretton, K. Borgwardt, M. Rasch, B. Scholkopf, and A. Smola. A kernel method for the two-sample-problem. In NIPS2007.
- (2007) NIPS
- Gretton, A.¹ Borgwardt, K.² Rasch, M.³ Scholkopf, B.⁴ Smola, A.⁵

7
- 85162060108
- A kernel statistical test of independence
- [Gretton et al., 2008] A. Gretton, K. Fukumizu, C. Teo, L. Song, B. Scholkopf, and A. Smola. A kernel statistical test of independence. In NIPS2008.
- (2008) NIPS
- Gretton, A.¹ Fukumizu, K.² Teo, C.³ Song, L.⁴ Scholkopf, B.⁵ Smola, A.⁶

8
- 84859477054
- A kernel two-sample test
- [Gretton et al., 2012] A. Gretton, K. Borgwardt, M. Rasch, B. Scholkopf and A. Smola. A Kernel Two-Sample Test. JMLR, 13, 671-721, 2012.
- (2012) JMLR , vol.13 , pp. 671-721
- Gretton, A.¹ Borgwardt, K.² Rasch, M.³ Scholkopf, B.⁴ Smola, A.⁵

9
- 84867133646
- Modelling transition dynamics in MDPs with RKHS embeddings
- [Grunewalder et al., 2012] S. Grunewalder, G. Lever, L. Baldassarre, M. Pontil and A. Gretton. Modelling transition dynamics in MDPs with RKHS embeddings. In ICML2012.
- (2012) ICML
- Grunewalder, S.¹ Lever, G.² Baldassarre, L.³ Pontil, M.⁴ Gretton, A.⁵

10
- 0001770240
- Value-function approximations for partially observable Markov decision processes
- [Hauskrecht, 2000] M. Hauskrecht. Value-Function Approximations for Partially Observable Markov Decision Processes. In JAIR, vol 13, pages 33-94, 2000.
- (2000) JAIR , vol.13 , pp. 33-94
- Hauskrecht, M.¹

11
- 85138579181
- Learning policies for partially observable environments: Scaling up
- [Littman, 1995] M. Littman, A. Cassandra, and L. Kaelbling. Learning policies for partially observable environments: Scaling up. In ICML1995.
- (1995) ICML
- Littman, M.¹ Cassandra, A.² Kaelbling, L.³

12
- 84880772945
- Point-based value iteration: An anytime algorithm for POMDPs
- [Pineau et al., 2003] J. Pineau, G. Gordon, and S. Thrun. Point-based value iteration: an anytime algorithm for POMDPs. In IJCAI, pages 1025-1032, 2003.
- (2003) IJCAI , pp. 1025-1032
- Pineau, J.¹ Gordon, G.² Thrun, S.³

13
- 33750724397
- Point-based value iteration for continuous POMDPs
- [Porta et al., 2006] J. M. Porta, N. Vlassis, and P. Poupart. Point-based value iteration for continuous POMDPs. JMLR, 7:2329-2367, 2006.
- (2006) JMLR , vol.7 , pp. 2329-2367
- Porta, J.M.¹ Vlassis, N.² Poupart, P.³

14
- 33749251297
- An analytic solution to discrete Bayesian reinforcement learning
- [Poupart et al., 2006] Pascal Poupart, Nikos A. Vlassis, Jesse Hoey, and Kevin Regan. An analytic solution to discrete Bayesian reinforcement learning. ICML2006.
- (2006) ICML
- Poupart, P.¹ Vlassis, N.A.² Hoey, J.³ Regan, K.⁴

15
- 52249086942
- Online planning algorithms for POMDPs
- [Ross et al., 2008] S. Ross, J. Pineau, S. Paquet, and B. Chaib-draa. Online planning algorithms for POMDPs. JAIR, 32(1):663-704, 2008.
- (2008) JAIR , vol.32 , Issue.1 , pp. 663-704
- Ross, S.¹ Pineau, J.² Paquet, S.³ Chaib-Draa, B.⁴

16
- 85161963598
- Monte-Carlo planning in large POMDPs
- [Silver and Veness, 2010] David Silver and Joel Veness. Monte-Carlo Planning in Large POMDPs. NIPS2010.
- (2010) NIPS
- Silver, D.¹ Veness, J.²

17
- 33750297371
- Heuristic search value iteration for POMDPs
- [Smith and Simmons, 2004] T. Smith and R. Simmons. Heuristic search value iteration for POMDPs. In UAI2004.
- (2004) UAI
- Smith, T.¹ Simmons, R.²

18
- 70049118151
- A Hilbert space embedding for distributions
- [Smola et al., 2007] A. Smola, A. Gretton, L. Song, and B. Scholkopf. A Hilbert space embedding for distributions. In ALT2007.
- (2007) ALT
- Smola, A.¹ Gretton, A.² Song, L.³ Scholkopf, B.⁴

19
- 0003871607
- The optimal control of partially observable Markov processes
- [Sondik, 1971] E. J. Sondik. The Optimal Control of Partially Observable Markov Processes. PhD thesis, Stanford University, 1971.
- (1971) The Optimal Control of Partially Observable Markov Processes
- Sondik, E.J.¹

20
- 71149099279
- Hilbert space embeddings of conditional distributions with applications to dynamical systems
- [Song et al., 2009] L. Song, J. Huang, A. Smola, and K. Fukumizu. Hilbert space embeddings of conditional distributions with applications to dynamical systems. In ICML2009.
- (2009) ICML
- Song, L.¹ Huang, J.² Smola, A.³ Fukumizu, K.⁴

21
- 77956540831
- Hilbert space embeddings of hidden Markov models
- [Song et al., 2010] L. Song, B. Boots, S. Siddiqi, G. Gordon, and A. Smola. Hilbert space embeddings of hidden Markov models. In ICML2010.
- (2010) ICML
- Song, L.¹ Boots, B.² Siddiqi, S.³ Gordon, G.⁴ Smola, A.⁵

22
- 84860645997
- Nonparametric tree graphical models via kernel embeddings
- [Song et al., 2010] L. Song, A. Gretton, and C. Guestrin. Nonparametric tree graphical models via kernel embeddings. In AISTATS, pages 765-772, 2010.
- (2010) AISTATS , pp. 765-772
- Song, L.¹ Gretton, A.² Guestrin, C.³

23
- 84867126508
- Kernel belief propagation
- [Song et al., 2011] L. Song, A. Gretton, D. Bickson, Y. Low, and C. Guestrin. Kernel Belief Propagation. In AISTATS, 2011.
- (2011) AISTATS
- Song, L.¹ Gretton, A.² Bickson, D.³ Low, Y.⁴ Guestrin, C.⁵

24
- 31144472319
- Perseus: Randomized point-based value iteration for POMDPs
- [Spaan and Vlassis, 2005] M. T. J. Spaan and N. Vlassis. Perseus: Randomized point-based value iteration for POMDPs. JAIR, 24:195-220, 2005.
- (2005) JAIR , vol.24 , pp. 195-220
- Spaan, M.T.J.¹ Vlassis, N.²

25
- 77951953755
- Hilbert space embeddings and metrics on probability measures
- [Sriperumbudur et al., 2010] B. Sriperumbudur, A. Gretton, K. Fukumizu, G. Lanckriet, and B. Scholkopf. Hilbert space embeddings and metrics on probability measures. JMLR, 11:1517-1561, 2010.
- (2010) JMLR , vol.11 , pp. 1517-1561
- Sriperumbudur, B.¹ Gretton, A.² Fukumizu, K.³ Lanckriet, G.⁴ Scholkopf, B.⁵

26
- 84898978676
- Monte Carlo POMDPs
- [Thrun, 2000] S. Thrun. Monte Carlo POMDPs. In NIPS2000.
- (2000) NIPS
- Thrun, S.¹

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.