메뉴 건너뛰기




Volumn , Issue , 2010, Pages 599-606

Bayesian multi-task reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

BAYESIAN; BELONG TO; GAUSSIAN PROCESSES; HIERARCHICAL BAYESIAN; HIERARCHICAL BAYESIAN MODELS; INFERENCE ALGORITHM; NUMBER OF SAMPLES; VALUE FUNCTION MODEL; VALUE FUNCTIONS;

EID: 77956497402     PISSN: None     EISSN: None     Source Type: Conference Proceeding    
DOI: None     Document Type: Conference Paper
Times cited : (109)

References (15)
  • 2
    • 85161977902 scopus 로고    scopus 로고
    • Multi-task gaussian process prediction
    • Bonilla, E., Chai, K., and Williams, C. Multi-task Gaussian process prediction. In Proceedings of NIPS 20, pp. 153-160, 2008.
    • (2008) Proceedings of NIPS , vol.20 , pp. 153-160
    • Bonilla, E.1    Chai, K.2    Williams, C.3
  • 3
    • 0031189914 scopus 로고    scopus 로고
    • Multitask learning
    • Caruana, R. Multitask learning. Machine Learning, 28(1):41-75, 1997.
    • (1997) Machine Learning , vol.28 , Issue.1 , pp. 41-75
    • Caruana, R.1
  • 4
    • 31844451013 scopus 로고    scopus 로고
    • Reinforcement learning with gaussian processes
    • Engel, Y., Mannor, S., and Meir, R. Reinforcement learning with Gaussian processes. In Proceedings of ICML 22, pp. 201-208, 2005.
    • (2005) Proceedings of ICML , vol.22 , pp. 201-208
    • Engel, Y.1    Mannor, S.2    Meir, R.3
  • 5
    • 1842816362 scopus 로고    scopus 로고
    • Sampling methods for stick-breaking priors
    • Ishwaran, H. and James, L. Gibbs sampling methods for stick-breaking priors. J. Amer. Statistical Assoc., 96:161-173, 2001.
    • (2001) J. Amer. Statistical Assoc. , vol.96 , pp. 161-173
    • Ishwaran, H.1    James, L.G.2
  • 6
    • 4644323293 scopus 로고    scopus 로고
    • Least-squares policy iteration
    • Lagoudakis, M. and Parr, R. Least-squares policy iteration. JMLR, 4:1107-1149, 2003.
    • (2003) JMLR , vol.4 , pp. 1107-1149
    • Lagoudakis, M.1    Parr, R.2
  • 8
    • 56049125072 scopus 로고    scopus 로고
    • Transfer of samples in batch reinforcement learning
    • Lazaric, A., Restelli, M., and Bonarini, A. Transfer of samples in batch reinforcement learning. In Proceedings of ICML 25, pp. 544-551, 2008.
    • (2008) Proceedings of ICML , vol.25 , pp. 544-551
    • Lazaric, A.1    Restelli, M.2    Bonarini, A.3
  • 9
    • 55149090494 scopus 로고    scopus 로고
    • Transfer in variable-reward hierarchical reinforcement learning
    • Mehta, N., Natarajan, S., Tadepalli, R, and Fern, A. Transfer in variable-reward hierarchical reinforcement learning. Machine Learning, 73(3):289-312, 2008.
    • (2008) Machine Learning , vol.73 , Issue.3 , pp. 289-312
    • Mehta, N.1    Natarajan, S.2    Tadepalli, R.3    Fern, A.4
  • 10
    • 77950032550 scopus 로고    scopus 로고
    • Markov chain sampling methods for dirichlet process mixture models
    • Neal, R. Markov chain sampling methods for Dirichlet process mixture models. Journal of Computational and Graphical Statistics, 9(2):249-265, 2000.
    • (2000) Journal of Computational and Graphical Statistics , vol.9 , Issue.2 , pp. 249-265
    • Neal, R.1
  • 12
    • 34848816477 scopus 로고    scopus 로고
    • Transfer learning via inter-task mappings for temporal difference learning
    • Taylor, M., Stone, P., and Liu, Y. Transfer learning via inter-task mappings for temporal difference learning. JMLR, 8:2125-2167, 2007.
    • (2007) JMLR , vol.8 , pp. 2125-2167
    • Taylor, M.1    Stone, P.2    Liu, Y.3
  • 13
    • 78650267403 scopus 로고    scopus 로고
    • Multitask reinforcement learning: A hierarchical Bayesian approach
    • Wilson, A., Fern, A., Ray, S., and Tadepalli, P. Multitask reinforcement learning: A hierarchical Bayesian approach. In Proceedings of ICML 24, pp. 1015- 1022, 2007.
    • (2007) Proceedings of ICML , vol.24 , pp. 1015-1022
    • Wilson, A.1    Fern, A.2    Ray, S.3    Tadepalli, P.4
  • 15
    • 31844442664 scopus 로고    scopus 로고
    • Learning Gaussian processes from multiple tasks
    • Yu, K., Tresp, V., and Schwaighofer, A. Learning Gaussian processes from multiple tasks. In Proceedings of ICML 22, pp. 1012-1019, 2005.
    • (2005) Proceedings of ICML , vol.22 , pp. 1012-1019
    • Yu, K.1    Tresp, V.2    Schwaighofer, A.3


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.