SCOPUS Information Search Platform

Neurocomputing

Volume 72, Issue 7-9, 2009, Pages 1508-1524

Gaussian process dynamic programming

(3) Deisenroth, Marc Peter a,b; Rasmussen, Carl Edward a,c; Peters, Jan c,d

a UNIVERSITY OF CAMBRIDGE (United Kingdom)

b UNIVERSITY OF KARLSRUHE (Germany)

c MAX PLANCK INSTITUTE FOR BIOLOGICAL CYBERNETICS (Germany)

d UNIVERSITY OF SOUTHERN CALIFORNIA (United States)

Author keywords

Bayesian active learning; Dynamic programming; Gaussian processes; Optimal control; Policy learning; Reinforcement learning

Indexed keywords

A-PRIORI; APPROXIMATE VALUE FUNCTIONS; APPROXIMATION TECHNIQUES; BAYESIAN ACTIVE LEARNING; CONTINUOUS STATE; GAUSSIAN PROCESSES; INITIAL STATE; ON THE FLIES; OPTIMAL CONTROL; OPTIMAL CONTROL PROBLEMS; POLICY LEARNING; PRIOR KNOWLEDGE; PROBABILISTIC MODELS; STATE SPACES; SWING-UP; TRANSITION DYNAMICS; UNKNOWN VALUES; VALUE FUNCTIONS;

BAYESIAN NETWORKS; DYNAMIC PROGRAMMING; EDUCATION; GAUSSIAN DISTRIBUTION; GAUSSIAN NOISE (ELECTRONIC); OPTIMAL CONTROL SYSTEMS; OPTIMIZATION; REINFORCEMENT; REINFORCEMENT LEARNING; SYSTEMS ENGINEERING; TRELLIS CODES;

PROCESS CONTROL;

ARTICLE; ARTIFICIAL INTELLIGENCE; BAYESIAN LEARNING; CONTROL SYSTEM; CONTROLLED STUDY; GAUSSIAN PROCESS DYNAMIC PROGRAMMING ALGORITHM; INFORMATION PROCESSING; LEARNING ALGORITHM; MACHINE LEARNING; ONLINE SYSTEM; PRIORITY JOURNAL; PROBABILITY; PROCESS OPTIMIZATION;

EID: 61849173491 PISSN: 09252312 EISSN: None Source Type: Journal
DOI: 10.1016/j.neucom.2008.12.019 Document Type: Article

Times cited : (189)

References (56)

1
- 0039816976
- Using local trajectory optimizers to speed up global optimization in dynamic programming
- J.E. Hanson, S.J. Moody, R.P. Lippmann Eds, Morgan Kaufmann, Los Altos, CA
- C.G. Atkeson, Using local trajectory optimizers to speed up global optimization in dynamic programming, in: J.E. Hanson, S.J. Moody, R.P. Lippmann (Eds.), Advances in Neural Information Processing Systems, vol. 6, Morgan Kaufmann, Los Altos, CA, 1994, pp. 503-521.
- (1994) Advances in Neural Information Processing Systems , vol.6 , pp. 503-521
- Atkeson, C.G.¹

2
- 0030691430
- A comparison of direct and model-based reinforcement learning
- C.G. Atkeson, J.C. Santamaría, A comparison of direct and model-based reinforcement learning, in: Proceedings of the International Conference on Robotics and Automation, 1997.
- (1997) Proceedings of the International Conference on Robotics and Automation
- Atkeson, C.G.¹ Santamaría, J.C.²

3
- 0002130986
- Robot learning from demonstration
- Fisher Jr. D.H. (Ed), Morgan Kaufmann, Nashville, TN, USA
- Atkeson C.G., and Schaal S. Robot learning from demonstration. In: Fisher Jr. D.H. (Ed). Proceedings of the 14th International Conference on Machine Learning (July 1997), Morgan Kaufmann, Nashville, TN, USA 12-20
- (1997) Proceedings of the 14th International Conference on Machine Learning , pp. 12-20
- Atkeson, C.G.¹ Schaal, S.²

4
- 85012688561
- Princeton University Press, Princeton, NJ, USA
- Bellman R.E. Dynamic Programming (1957), Princeton University Press, Princeton, NJ, USA
- (1957) Dynamic Programming
- Bellman, R.E.¹

5
- 61849143544
- Dynamic Programming and Optimal Control
- third ed, Athena Scientific, Belmont, MA, USA
- D.P. Bertsekas, Dynamic Programming and Optimal Control, Optimization and Computation Series, vol. 1, third ed., Athena Scientific, Belmont, MA, USA, 2005.
- (2005) Optimization and Computation Series , vol.1
- Bertsekas, D.P.¹

6
- 61849185818
- Dynamic Programming and Optimal Control
- third ed, Athena Scientific, Belmont, MA, USA
- D.P. Bertsekas, Dynamic Programming and Optimal Control, Optimization and Computation Series, vol. 2, third ed., Athena Scientific, Belmont, MA, USA, 2007.
- (2007) Optimization and Computation Series , vol.2
- Bertsekas, D.P.¹

7
- 0003487482
- Neuro-Dynamic Programming
- Athena Scientific, Belmont, MA, USA
- Bertsekas D.P., and Tsitsiklis J.N. Neuro-Dynamic Programming. Optimization and Computation (1996), Athena Scientific, Belmont, MA, USA
- (1996) Optimization and Computation
- Bertsekas, D.P.¹ Tsitsiklis, J.N.²

8
- 0003507691
- Hemisphere, New York City, NY, USA
- Bryson A.E., and Ho Y.-C. Applied Optimal Control: Optimization, Estimation, and Control (1975), Hemisphere, New York City, NY, USA
- (1975) Applied Optimal Control: Optimization, Estimation, and Control
- Bryson, A.E.¹ Ho, Y.-C.²

9
- 84972528615
- Bayesian experimental design: a review
- Chaloner K., and Verdinelli I. Bayesian experimental design: a review. Statistical Science 10 (1995) 273-304
- (1995) Statistical Science , vol.10 , pp. 273-304
- Chaloner, K.¹ Verdinelli, I.²

10
- 52449085771
- Approximate dynamic programming with gaussian processes
- Seattle, WA, USA, June
- M.P. Deisenroth, J. Peters, C.E. Rasmussen, Approximate dynamic programming with gaussian processes, in: Proceedings of the 2008 American Control Conference, Seattle, WA, USA, June 2008, pp. 4480-4485.
- (2008) Proceedings of the 2008 American Control Conference , pp. 4480-4485
- Deisenroth, M.P.¹ Peters, J.² Rasmussen, C.E.³

11
- 61849106123
- Model-based reinforcement learning with continuous states and actions
- Bruges, Belgium, April
- M.P. Deisenroth, C.E. Rasmussen, J. Peters, Model-based reinforcement learning with continuous states and actions, in: Proceedings of the 16th European Symposium on Artificial Neural Networks, Bruges, Belgium, April 2008, pp. 19-24.
- (2008) Proceedings of the 16th European Symposium on Artificial Neural Networks , pp. 19-24
- Deisenroth, M.P.¹ Rasmussen, C.E.² Peters, J.³

12
- 0033629916
- Reinforcement learning in continuous time and space
- Doya K. Reinforcement learning in continuous time and space. Neural Computation 12 1 (2000) 219-245
- (2000) Neural Computation , vol.12 , Issue.1 , pp. 219-245
- Doya, K.¹

13
- 1942421151
- Bayes meets Bellman: The Gaussian process approach to temporal difference learning
- Washington, DC, USA, August
- Y. Engel, S. Mannor, R. Meir, Bayes meets Bellman: the Gaussian process approach to temporal difference learning, in: Proceedings of the 20th International Conference on Machine Learning, Washington, DC, USA, vol. 20, August 2003, pp. 154-161.
- (2003) Proceedings of the 20th International Conference on Machine Learning , vol.20 , pp. 154-161
- Engel, Y.¹ Mannor, S.² Meir, R.³

14
- 31844451013
- Reinforcement learning with Gaussian processes
- Bonn, Germany, August
- Y. Engel, S. Mannor, R. Meir, Reinforcement learning with Gaussian processes, in: Proceedings of the 22nd International Conference on Machine Learning, Bonn, Germany, vol. 22, August 2005, pp. 201-208.
- (2005) Proceedings of the 22nd International Conference on Machine Learning , vol.22 , pp. 201-208
- Engel, Y.¹ Mannor, S.² Meir, R.³

15
- 21844465127
- Tree-based batch mode reinforcement learning
- Ernst D., Geurts P., and Wehenkel L. Tree-based batch mode reinforcement learning. Journal of Machine Learning Research 6 (2005) 503-556
- (2005) Journal of Machine Learning Research , vol.6 , pp. 503-556
- Ernst, D.¹ Geurts, P.² Wehenkel, L.³

16
- 84864065133
- Bayesian policy gradient algorithms
- Schölkopf B., Platt J.C., and Hoffman T. (Eds), The MIT Press, Cambridge, MA, USA
- Ghavamzadeh M., and Engel Y. Bayesian policy gradient algorithms. In: Schölkopf B., Platt J.C., and Hoffman T. (Eds). Advances in Neural Information Processing Systems, vol. 19 (2007), The MIT Press, Cambridge, MA, USA 457-464
- (2007) Advances in Neural Information Processing Systems, vol. 19 , pp. 457-464
- Ghavamzadeh, M.¹ Engel, Y.²

17
- 84867040604
- Gaussian process priors with uncertain inputs-application to multiple-step ahead time series forecasting
- Becker S., Thrun S., and Obermayer K. (Eds), The MIT Press, Cambridge, MA, USA
- Girard A., Rasmussen C.E., Quiñonero Candela J., and Murray-Smith R. Gaussian process priors with uncertain inputs-application to multiple-step ahead time series forecasting. In: Becker S., Thrun S., and Obermayer K. (Eds). Advances in Neural Information Processing Systems, vol. 15 (2003), The MIT Press, Cambridge, MA, USA 529-536
- (2003) Advances in Neural Information Processing Systems, vol. 15 , pp. 529-536
- Girard, A.¹ Rasmussen, C.E.² Quiñonero Candela, J.³ Murray-Smith, R.⁴

18
- 84880694195
- Stable function approximation in dynamic programming
- Prieditis A., and Russell S. (Eds), Morgan Kaufmann, San Francisco, CA, USA
- Gordon G.J. Stable function approximation in dynamic programming. In: Prieditis A., and Russell S. (Eds). Proceedings of the 12th International Conference on Machine Learning (1995), Morgan Kaufmann, San Francisco, CA, USA 261-268
- (1995) Proceedings of the 12th International Conference on Machine Learning , pp. 261-268
- Gordon, G.J.¹

19
- 0003644124
- The MIT Press, Cambridge, MA, USA
- Howard R.A. Dynamic Programming and Markov Processes (1960), The MIT Press, Cambridge, MA, USA
- (1960) Dynamic Programming and Markov Processes
- Howard, R.A.¹

20
- 0001940458
- Adaptive mixtures of local experts
- Jacobs R.A., Jordan M.I., Nowlan S.J., and Hinton G.E. Adaptive mixtures of local experts. Neural Computation 3 (1991) 79-87
- (1991) Neural Computation , vol.3 , pp. 79-87
- Jacobs, R.A.¹ Jordan, M.I.² Nowlan, S.J.³ Hinton, G.E.⁴

21
- 21244437999
- Unscented filtering and nonlinear estimation
- Julier S.J., and Uhlmann J.K. Unscented filtering and nonlinear estimation. Proceedings of the IEEE 92 3 (2004) 401-422
- (2004) Proceedings of the IEEE , vol.92 , Issue.3 , pp. 401-422
- Julier, S.J.¹ Uhlmann, J.K.²

22
- 85024429815
- A new approach to linear filtering and prediction problems
- Kalman R.E. A new approach to linear filtering and prediction problems. Transactions of the ASME-Journal of Basic Engineering 82 Series D (1960) 35-45
- (1960) Transactions of the ASME-Journal of Basic Engineering , vol.82 , Issue.Series D , pp. 35-45
- Kalman, R.E.¹

23
- 69549111759
- GP-BayesFilters: Bayesian filtering using Gaussian process prediction and observation models
- Nice, France, September
- J. Ko, D. Fox, GP-BayesFilters: Bayesian filtering using Gaussian process prediction and observation models, in: Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), Nice, France, September 2008, pp. 3471-3476.
- (2008) Proceedings of the 2008 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) , pp. 3471-3476
- Ko, J.¹ Fox, D.²

24
- 36348997154
- Gaussian processes and reinforcement learning for identification and control of an autonomous blimp
- Rome, Italy, April
- J. Ko, D.J. Klein, D. Fox, D. Haehnel, Gaussian processes and reinforcement learning for identification and control of an autonomous blimp, in: Proceedings of the International Conference on Robotics and Automation, Rome, Italy, April 2007, pp. 742-747.
- (2007) Proceedings of the International Conference on Robotics and Automation , pp. 742-747
- Ko, J.¹ Klein, D.J.² Fox, D.³ Haehnel, D.⁴

25
- 51349131913
- GP-UKF: Unscented Kalman filters with Gaussian process prediction and observation models
- San Diego, CA, USA, October
- J. Ko, D.J. Klein, D. Fox, D. Haehnel, GP-UKF: unscented Kalman filters with Gaussian process prediction and observation models, in: Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems, San Diego, CA, USA, October 2007, pp. 1901-1907.
- (2007) Proceedings of the 2007 IEEE/RSJ International Conference on Intelligent Robots and Systems , pp. 1901-1907
- Ko, J.¹ Klein, D.J.² Fox, D.³ Haehnel, D.⁴

26
- 84945567712
- Predictive control with Gaussian process models
- B. Zajc, M. Tkalčič Eds, Piscataway, NJ, USA, September
- J. Kocijan, R. Murray-Smith, C.E. Rasmussen, B. Likar, Predictive control with Gaussian process models, in: B. Zajc, M. Tkalčič (Eds.), Proceedings of IEEE Region 8 Eurocon 2003: Computer as a Tool, Piscataway, NJ, USA, September 2003, pp. 352-356.
- (2003) Proceedings of IEEE Region 8 Eurocon 2003: Computer as a Tool , pp. 352-356
- Kocijan, J.¹ Murray-Smith, R.² Rasmussen, C.E.³ Likar, B.⁴

27
- 41549146576
- Near-optimal sensor placements in Gaussian processes: theory, efficient algorithms and empirical studies
- Krause A., Singh A., and Guestrin C. Near-optimal sensor placements in Gaussian processes: theory, efficient algorithms and empirical studies. Journal of Machine Learning Research 9 (2008) 235-284
- (2008) Journal of Machine Learning Research , vol.9 , pp. 235-284
- Krause, A.¹ Singh, A.² Guestrin, C.³

28
- 51949083432
- Ph.D. Thesis, Technische Universität Darmstadt, Germany, February
- M. Kuss, Gaussian process models for robust regression, classification, and reinforcement learning, Ph.D. Thesis, Technische Universität Darmstadt, Germany, February 2006.
- (2006) Gaussian process models for robust regression, classification, and reinforcement learning
- Kuss, M.¹

29
- 0000695404
- Information-based objective functions for active data selection
- MacKay D.J.C. Information-based objective functions for active data selection. Neural Computation 4 (1992) 590-604
- (1992) Neural Computation , vol.4 , pp. 590-604
- MacKay, D.J.C.¹

30
- 0000597408
- Comparison of approximate methods for handling hyperparameters
- MacKay D.J.C. Comparison of approximate methods for handling hyperparameters. Neural Computation 11 5 (1999) 1035-1068
- (1999) Neural Computation , vol.11 , Issue.5 , pp. 1035-1068
- MacKay, D.J.C.¹

31
- 0004272772
- Cambridge University Press, The Edinburgh Building, Cambridge, UK
- MacKay D.J.C. Information Theory, Inference, and Learning Algorithms (2003), Cambridge University Press, The Edinburgh Building, Cambridge, UK
- (2003) Information Theory, Inference, and Learning Algorithms
- MacKay, D.J.C.¹

32
- 84959309936
- Active policy learning for robot planning and exploration under uncertainty
- Atlanta, GA, USA, June
- R. Martinez-Cantin, N. de Freitas, A. Doucet, J. Castellanos, Active policy learning for robot planning and exploration under uncertainty, in: Proceedings of Robotics: Science and Systems III, Atlanta, GA, USA, June 2007.
- (2007) Proceedings of Robotics: Science and Systems , vol.3
- Martinez-Cantin, R.¹ de Freitas, N.² Doucet, A.³ Castellanos, J.⁴

33
- 0015764255
- The intrinsic random functions and their applications
- Matheron G. The intrinsic random functions and their applications. Advances in Applied Probability 5 (1973) 439-468
- (1973) Advances in Applied Probability , vol.5 , pp. 439-468
- Matheron, G.¹

34
- 0038387331
- Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, January
- T.P. Minka, A family of algorithms for approximate Bayesian inference, Ph.D. Thesis, Massachusetts Institute of Technology, Cambridge, MA, USA, January 2001.
- (2001) A family of algorithms for approximate Bayesian inference
- Minka, T.P.¹

35
- 84945582505
- Nonlinear adaptive control using non-parametric Gaussian process prior models
- Academic Press, Barcelona, Spain
- Murray-Smith R., and Sbarbaro D. Nonlinear adaptive control using non-parametric Gaussian process prior models. Proceedings of the 15th IFAC World Congress vol. 15 (July 2002), Academic Press, Barcelona, Spain
- (2002) Proceedings of the 15th IFAC World Congress , vol.15
- Murray-Smith, R.¹ Sbarbaro, D.²

36
- 79953133012
- Adaptive, cautious, predictive control with Gaussian process priors
- Rotterdam, Netherlands, August
- R. Murray-Smith, D. Sbarbaro, C.E. Rasmussen, A. Girard, Adaptive, cautious, predictive control with Gaussian process priors, in: 13th IFAC Symposium on System Identification, Rotterdam, Netherlands, August 2003.
- (2003) 13th IFAC Symposium on System Identification
- Murray-Smith, R.¹ Sbarbaro, D.² Rasmussen, C.E.³ Girard, A.⁴

37
- 0007302261
- Bayes-Hermite quadrature
- O'Hagan A. Bayes-Hermite quadrature. Journal of Statistical Planning and Inference 29 (1991) 245-260
- (1991) Journal of Statistical Planning and Inference , vol.29 , pp. 245-260
- O'Hagan, A.¹

38
- 0036832956
- Kernel-based reinforcement learning
- Ormoneit D., and Sen S. Kernel-based reinforcement learning. Machine Learning 49 2-3 (2002) 161-178
- (2002) Machine Learning , vol.49 , Issue.2-3 , pp. 161-178
- Ormoneit, D.¹ Sen, S.²

39
- 38649095925
- Learning to control in operational space
- Peters J., and Schaal S. Learning to control in operational space. The International Journal of Robotics Research 27 2 (2008) 197-212
- (2008) The International Journal of Robotics Research , vol.27 , Issue.2 , pp. 197-212
- Peters, J.¹ Schaal, S.²

40
- 40649106649
- Natural actor-critic
- Peters J., and Schaal S. Natural actor-critic. Neurocomputing 71 7-9 (2008) 1180-1190
- (2008) Neurocomputing , vol.71 , Issue.7-9 , pp. 1180-1190
- Peters, J.¹ Schaal, S.²

41
- 44949241322
- Reinforcement learning of motor skills with policy gradients
- Peters J., and Schaal S. Reinforcement learning of motor skills with policy gradients. Neural Networks 21 (2008) 682-697
- (2008) Neural Networks , vol.21 , pp. 682-697
- Peters, J.¹ Schaal, S.²

42
- 33750297394
- T. Pfingsten, Bayesian active learning for sensitivity analysis, in: Proceedings of the 17th European Conference on Machine Learning, September 2006, pp. 353-364.

43
- 29144453489
- A unifying view of sparse approximate Gaussian process regression
- Quiñonero-Candela J., and Rasmussen C.E. A unifying view of sparse approximate Gaussian process regression. Journal of Machine Learning Research 6 2 (2005) 1939-1960
- (2005) Journal of Machine Learning Research , vol.6 , Issue.2 , pp. 1939-1960
- Quiñonero-Candela, J.¹ Rasmussen, C.E.²

44
- 0004085699
- Ph.D. Thesis, Department of Computer Science, University of Toronto
- C.E. Rasmussen, Evaluation of Gaussian processes and other methods for non-linear regression, Ph.D. Thesis, Department of Computer Science, University of Toronto, 1996.
- (1996) Evaluation of Gaussian processes and other methods for non-linear regression
- Rasmussen, C.E.¹

45
- 58449109750
- Probabilistic inference for fast learning in control
- S. Girgin, M. Loth, R. Munos, P. Preux, D. Ryabko Eds, Recent Advances in Reinforcement Learning, Springer, Berlin, November
- C.E. Rasmussen, M.P. Deisenroth, Probabilistic inference for fast learning in control, in: S. Girgin, M. Loth, R. Munos, P. Preux, D. Ryabko (Eds.), Recent Advances in Reinforcement Learning, Lecture Notes on Computer Science, vol. 5323, Springer, Berlin, November 2008, pp. 229-242.
- (2008) Lecture Notes on Computer Science , vol.5323 , pp. 229-242
- Rasmussen, C.E.¹ Deisenroth, M.P.²

46
- 84899013778
- Bayesian Monte Carlo
- Becker S., Thrun S., and Obermayer K. (Eds), The MIT Press, Cambridge, MA, USA
- Rasmussen C.E., and Ghahramani Z. Bayesian Monte Carlo. In: Becker S., Thrun S., and Obermayer K. (Eds). Advances in Neural Information Processing Systems, vol. 15 (2003), The MIT Press, Cambridge, MA, USA 489-496
- (2003) Advances in Neural Information Processing Systems, vol. 15 , pp. 489-496
- Rasmussen, C.E.¹ Ghahramani, Z.²

47
- 84899026055
- Gaussian processes in reinforcement learning
- Thrun S., Saul L.K., and Schölkopf B. (Eds), The MIT Press, Cambridge, MA, USA
- Rasmussen C.E., and Kuss M. Gaussian processes in reinforcement learning. In: Thrun S., Saul L.K., and Schölkopf B. (Eds). Advances in Neural Information Processing Systems, vol. 16 (2004), The MIT Press, Cambridge, MA, USA 751-759
- (2004) Advances in Neural Information Processing Systems, vol. 16 , pp. 751-759
- Rasmussen, C.E.¹ Kuss, M.²

48
- 34247621089
- Gaussian processes for machine learning
- The MIT Press, Cambridge, MA, USA. URL: http://www.gaussianprocess.org/gpml/
- Rasmussen C.E., and Williams C.K.I. Gaussian processes for machine learning. Adaptive Computation and Machine Learning (2006), The MIT Press, Cambridge, MA, USA. URL: http://www.gaussianprocess.org/gpml/
- (2006) Adaptive Computation and Machine Learning
- Rasmussen, C.E.¹ Williams, C.K.I.²

49
- 0033233953
- Concepts and facilities of a neural reinforcement learning control architecture for technical process control
- Riedmiller M. Concepts and facilities of a neural reinforcement learning control architecture for technical process control. Neural Computation and Application 8 (2000) 323-338
- (2000) Neural Computation and Application , vol.8 , pp. 323-338
- Riedmiller, M.¹

50
- 33646687423
- Neural Fitted Q iteration-first experiences with a data efficient neural reinforcement learning method
- Porto, Portugal
- M. Riedmiller, Neural Fitted Q iteration-first experiences with a data efficient neural reinforcement learning method, in: Proceedings of the 16th European Conference on Machine Learning, Porto, Portugal, 2005.
- (2005) Proceedings of the 16th European Conference on Machine Learning
- Riedmiller, M.¹

51
- 84943274699
- A direct adaptive method for faster backpropagation learning the RPROP algorithm
- Riedmiller M., and Braun H. A direct adaptive method for faster backpropagation learning the RPROP algorithm. Proceedings of the IEEE International Conference on Neural Networks (1993) 586-591
- (1993) Proceedings of the IEEE International Conference on Neural Networks , pp. 586-591
- Riedmiller, M.¹ Braun, H.²

52
- 84864038646
- Sparse Gaussian processes using pseudo-inputs
- Weiss Y., Schölkopf B., and Platt J.C. (Eds), The MIT Press, Cambridge, MA, USA
- Snelson E., and Ghahramani Z. Sparse Gaussian processes using pseudo-inputs. In: Weiss Y., Schölkopf B., and Platt J.C. (Eds). Advances in Neural Information Processing Systems, vol. 18 (2006), The MIT Press, Cambridge, MA, USA 1257-1264
- (2006) Advances in Neural Information Processing Systems, vol. 18 , pp. 1257-1264
- Snelson, E.¹ Ghahramani, Z.²

53
- 4143121578
- Reinforcement Learning: An Introduction
- The MIT Press, Cambridge, MA, USA
- Sutton R.S., and Barto A.G. Reinforcement Learning: An Introduction. Adaptive Computation and Machine Learning (1998), The MIT Press, Cambridge, MA, USA
- (1998) Adaptive Computation and Machine Learning
- Sutton, R.S.¹ Barto, A.G.²

54
- 0001671445
- Bayesian designs for maximizing information and outcome
- Verdinelli I., and Kadane J.B. Bayesian designs for maximizing information and outcome. Journal of the American Statistical Association 87 418 (1992) 510-515
- (1992) Journal of the American Statistical Association , vol.87 , Issue.418 , pp. 510-515
- Verdinelli, I.¹ Kadane, J.B.²

55
- 33749242004
- Springer Science+Business Media, Inc., New York, NY, USA
- Wasserman L. All of Nonparametric Statistics. Springer Texts in Statistics (2006), Springer Science+Business Media, Inc., New York, NY, USA
- (2006) All of Nonparametric Statistics. Springer Texts in Statistics
- Wasserman, L.¹

56
- 0002295913
- Gaussian processes for regression
- Touretzky D.S., Mozer M.C., and Hasselmo M.E. (Eds), The MIT Press, Cambridge, MA, USA
- Williams C.K.I., and Rasmussen C.E. Gaussian processes for regression. In: Touretzky D.S., Mozer M.C., and Hasselmo M.E. (Eds). Advances in Neural Information Processing Systems, vol. 8 (1996), The MIT Press, Cambridge, MA, USA 598-604
- (1996) Advances in Neural Information Processing Systems, vol. 8 , pp. 598-604
- Williams, C.K.I.¹ Rasmussen, C.E.²

* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.