SCOPUS 정보 검색 플랫폼

Neural Networks

Volumn 12, Issue 4-5, 1999, Pages 727-753

Multi-agent reinforcement learning: Weighting and partitioning

(2) Sun, R a Peterson, T a

a Department of Computer Science (United States)

Author keywords

Averaging; Gating; Neural networks; Partitioning; Reinforcement learning; Weighting

Indexed keywords

APPROXIMATION THEORY; ARTIFICIAL INTELLIGENCE; COMPUTATIONAL COMPLEXITY; FUNCTIONS; HEURISTIC METHODS; MATHEMATICAL MODELS; NEURAL NETWORKS; STATE SPACE METHODS;

MULTIPLE AGENTS; PARTITIONING; REINFORCEMENT LEARNING;

LEARNING SYSTEMS;

ALGORITHM; ARTICLE; ARTIFICIAL NEURAL NETWORK; AVERAGING; DISCRIMINANT ANALYSIS; MATHEMATICAL MODEL; PRIORITY JOURNAL;

EID: 0032772352 PISSN: 08936080 EISSN: None Source Type: Journal
DOI: 10.1016/S0893-6080(99)00024-6 Document Type: Article

Times cited : (53)

References (70)

1
- 0016556021
- A new approach to manipulator control: The cerebellar model articulation control
- Albus J. A new approach to manipulator control: the cerebellar model articulation control. Journal of Dynamic Systems Measure and Control. 97:1975;270-277.
- (1975) Journal of Dynamic Systems Measure and Control , vol.97 , pp. 270-277
- Albus, J.¹

2
- 0344633755
- Locally weighted regression
- Atkeson, C., Moore, A., and Schaal, S. (1997). Locally weighted regression. Artificial Intelligence Review.
- (1997) Artificial Intelligence Review
- Atkeson, C.¹ Moore, A.² Schaal, S.³

3
- 0003787146
- Princeton, NJ: Princeton University Press
- Bellman R. Dynamic programming. 1957;Princeton University Press, Princeton, NJ.
- (1957) Dynamic Programming
- Bellman, R.¹

4
- 0003487482
- Belmont, MA: Athena Scientific
- Bertsekas D., Tsitsiklis J. Neuro-dynamic programming. 1996;Athena Scientific, Belmont, MA.
- (1996) Neuro-dynamic Programming
- Bertsekas, D.¹ Tsitsiklis, J.²

5
- 0007609723
- Learning radial basis function networks on-line
- San Francisco, CA: Morgan Kaufmann. pp. 37-45
- Blanzieri E., Katenkamp P. Learning radial basis function networks on-line. Proceedings of International Conference on machine Learning. 1996;Morgan Kaufmann, San Francisco, CA. pp. 37-45.
- (1996) Proceedings of International Conference on Machine Learning
- Blanzieri, E.¹ Katenkamp, P.²

6
- 85153940465
- Generalization in reinforcement learning: Safely approximating the value function
- J. Tesauro, D. Touretzky, Leen T. Cambridge, MA: MIT Press
- Boyan J., Moore A. Generalization in reinforcement learning: safely approximating the value function. Tesauro J., Touretzky D., Leen T. Neural Information Processing Systems. 7:1995;369 MIT Press, Cambridge, MA.
- (1995) Neural Information Processing Systems , vol.7 , pp. 369
- Boyan, J.¹ Moore, A.²

7
- 0030211964
- Bagging predictors
- Breiman L. Bagging predictors. Machine Learning. 24:1996;123-140.
- (1996) Machine Learning , vol.24 , pp. 123-140
- Breiman, L.¹

8
- 0030196364
- Stacked regressions
- Breiman L. Stacked regressions. Machine Learning. 24:1996;49-64.
- (1996) Machine Learning , vol.24 , pp. 49-64
- Breiman, L.¹

9
- 0003619255
- Bias, variance and arcing classifiers
- Berkeley: University of California
- Breiman, L. (1996c). Bias, variance and arcing classifiers. Technical Report 460. Berkeley: University of California.
- (1996) Technical Report , vol.460
- Breiman, L.¹

10
- 0004291566
- Belmont, CA: Wadsworth
- Breiman L., Friedman L., Stone P. Classification and regression. 1984;Wadsworth, Belmont, CA.
- (1984) Classification and Regression
- Breiman, L.¹ Friedman, L.² Stone, P.³

11
- 0026998041
- Reinforcement learning with perceptual aliasing: The perceptual distinction approach
- San Francisco, CA: Morgan Kaufmann. pp. 183-188
- Chrisman L. Reinforcement learning with perceptual aliasing: the perceptual distinction approach. Proceedings of AAAI. 1993;Morgan Kaufmann, San Francisco, CA. pp. 183-188.
- (1993) Proceedings of AAAI
- Chrisman, L.¹

12
- 0001234682
- Feudal reinforcement learning
- Cambridge, MA: MIT Press
- Dayan P., Hinton G. Feudal reinforcement learning. Neural Information Processing Systems. 1993;MIT Press, Cambridge, MA.
- (1993) Neural Information Processing Systems
- Dayan, P.¹ Hinton, G.²

13
- 0344074989
- Technical report
- Dietterich, T. (1997). Hierarchical reinforcement learning with MAXQ value function decomposition. Technical report. ftp://www.cs.orst.edu/.
- (1997) Hierarchical Reinforcement Learning with MAXQ Value Function Decomposition
- Dietterich, T.¹

14
- 0345064332
- Ant-Q: A reinforcement learning approach to combinatorial optimization
- Belgium: Universite Libre de Bruxelles
- Dorigo, M., and Gambardella, L. (1995). Ant-Q: a reinforcement learning approach to combinatorial optimization. Technical Report 95-01. Belgium: Universite Libre de Bruxelles.
- (1995) Technical Report 95-01
- Dorigo, M.¹ Gambardella, L.²

15
- 0000201141
- Improving regressors using boosting techniques
- San Francicso, CA: Morgan Kaufmann. pp. 107-115
- Drucker H. Improving regressors using boosting techniques. Proceedings of ICML'97. 1997;Morgan Kaufmann, San Francicso, CA. pp. 107-115.
- (1997) Proceedings of ICML'97
- Drucker, H.¹

16
- 0345064331
- Manuscript
- Erickson, M., and Kruschke, J. (1996). Rules and examplars in category learning (Manuscript).
- (1996) Rules and Examplars in Category Learning
- Erickson, M.¹ Kruschke, J.²

17
- 0002978642
- Experiments with a new boosting algorithm
- San Francisco, CA: Morgan Kaufmann. pp. 148-156
- Freund Y., Schapire R. Experiments with a new boosting algorithm. Proceedings of ICML'97. 1996;Morgan Kaufmann, San Francisco, CA. pp. 148-156.
- (1996) Proceedings of ICML'97
- Freund, Y.¹ Schapire, R.²

18
- 0010272483
- PhD Thesis, Purdue, Indiana: Purdue University
- Hashem, S. (1993). Optimal linear combinations of neural networks. PhD Thesis, Purdue, Indiana: Purdue University.
- (1993) Optimal Linear Combinations of Neural Networks
- Hashem, S.¹

19
- 0003979924
- Reading, MA: Addison-Wesley
- Hertz J., Krogh A., Palmer R. Introduction to the theory of neural computation. 1991;Addison-Wesley, Reading, MA.
- (1991) Introduction to the Theory of Neural Computation
- Hertz, J.¹ Krogh, A.² Palmer, R.³

20
- 0007214322
- W-learning: A simple RL-based society of mind
- Cambridge, UK: University of Cambridge, Computer Laboratory
- Humphrys, M. (1996). W-learning: a simple RL-based society of mind. Technical report 362, Cambridge, UK: University of Cambridge, Computer Laboratory.
- (1996) Technical Report , vol.362
- Humphrys, M.¹

21
- 0031568357
- Bias/variance analysis of mixtures-of-experts architectures
- Jacobs R. Bias/variance analysis of mixtures-of-experts architectures. Neural Computation. 9:1997;369-383.
- (1997) Neural Computation , vol.9 , pp. 369-383
- Jacobs, R.¹

22
- 0001940458
- Adaptive mixtures of local experts
- Jacobs R., Jordan M., Nowlan S., Hinton G. Adaptive mixtures of local experts. Neural Computation. 3:1991;79-87.
- (1991) Neural Computation. , vol.3 , pp. 79-87
- Jacobs, R.¹ Jordan, M.² Nowlan, S.³ Hinton, G.⁴

23
- 0000262562
- Hierarchical mixtures of experts and the EM algorithm
- Jordan M., Jacobs R. Hierarchical mixtures of experts and the EM algorithm. Neural Computation. 6:1994;181-214.
- (1994) Neural Computation , vol.6 , pp. 181-214
- Jordan, M.¹ Jacobs, R.²

24
- 0029679044
- Reinforcement learning: A survey
- Kaelbling L., Littman M., Moore A. Reinforcement learning: A survey. Journal of Artificial Intelligence Research. 4:1996;237-285.
- (1996) Journal of Artificial Intelligence Research , vol.4 , pp. 237-285
- Kaelbling, L.¹ Littman, M.² Moore, A.³

25
- 85054435084
- Neural network ensembles, cross validation, and active learning
- Cambridge, MA: MIT Press. pp. 231-238
- Krogh A., Vedelsby J. Neural network ensembles, cross validation, and active learning. Neural Information Processing Systems. 1995;MIT Press, Cambridge, MA. pp. 231-238.
- (1995) Neural Information Processing Systems
- Krogh, A.¹ Vedelsby, J.²

26
- 85020933666
- Manuscript
- Kubat, M. (1997). Decision trees can initialize radial-basis-function networks (Manuscript).
- (1997) Decision Trees Can Initialize Radial-basis-function Networks
- Kubat, M.¹

27
- 0026852133
- Theory and development of higher-order CMAC neural networks
- Lane, S., Handelman, D., & Gelfand, J. (1992). Theory and development of higher-order CMAC neural networks. IEEE Control Systems, pp. 23-31.
- (1992) IEEE Control Systems , pp. 23-31
- Lane, S.¹ Handelman, D.² Gelfand, J.³

28
- 0000123778
- Self-improving reactive agents based on reinforcement learning, planning, and teaching
- Lin L. Self-improving reactive agents based on reinforcement learning, planning, and teaching. Machine Learning. 8:1992;293-321.
- (1992) Machine Learning , vol.8 , pp. 293-321
- Lin, L.¹

29
- 0002289220
- Pruning adaptive boosting
- San Francisco, CA: Morgan Kaufmann. pp. 211-218
- Margineantu D., Dietterich T. Pruning adaptive boosting. Proceedings of ICML. 1997;Morgan Kaufmann, San Francisco, CA. pp. 211-218.
- (1997) Proceedings of ICML
- Margineantu, D.¹ Dietterich, T.²

30
- 0002064848
- Reward functions for accelerated learning
- Cambridge, MA: MIT Press
- Mataric M. Reward functions for accelerated learning. Proceedings Conference on Simulation of Adaptive Behaviour. 1995;MIT Press, Cambridge, MA.
- (1995) Proceedings Conference on Simulation of Adaptive Behaviour
- Mataric, M.¹

31
- 0002242826
- Learning to use selective attention and short-term memory in sequential tasks
- Cambridge, MA: MIT Press. pp. 315-324
- McCallum A. Learning to use selective attention and short-term memory in sequential tasks. Proceedings of the conference on Simulation of Adaptive Behavior. 1996;MIT Press, Cambridge, MA. pp. 315-324.
- (1996) Proceedings of the Conference on Simulation of Adaptive Behavior
- McCallum, A.¹

32
- 85153941282
- Bias, variance and the combination of least squares estimators
- Cambridge, MA: MIT Press. pp. 295-302
- Meir R. Bias, variance and the combination of least squares estimators. Neural Information Processing Systems. 1995;MIT Press, Cambridge, MA. pp. 295-302.
- (1995) Neural Information Processing Systems
- Meir, R.¹

33
- 0030352275
- Reducing variance of committee prediction with resampling techniques
- Parmanto B., Munro P., Doyle H. Reducing variance of committee prediction with resampling techniques. Connection Science. 8:(3/4):1996;405-426.
- (1996) Connection Science , vol.8 , Issue.3-4 , pp. 405-426
- Parmanto, B.¹ Munro, P.² Doyle, H.³

34
- 0003512020
- PhD Thesis, Brown University. Providence, RI
- Perrone, M. (1993). Improving regression estimation: averaging methods for variance reduction with extensions to general convex measure optimization. PhD Thesis, Brown University. Providence, RI.
- (1993) Improving Regression Estimation: Averaging Methods for Variance Reduction with Extensions to General Convex Measure Optimization
- Perrone, M.¹

35
- 0031639872
- An RBF network alternative to a hybrid architecture. Anchorage, Alaska
- Piscataway, NJ: IEEE Press
- Peterson T., Sun R. An RBF network alternative to a hybrid architecture. Anchorage, Alaska. Proceedings of IEEE International Conference on Neural Networks. 1998;IEEE Press, Piscataway, NJ.
- (1998) Proceedings of IEEE International Conference on Neural Networks
- Peterson, T.¹ Sun, R.²

36
- 0025490985
- Networks for approximation and learning
- Poggio T., Girosi F. Networks for approximation and learning. Proceedings of IEEE. 78:(9):1990;1481-1497.
- (1990) Proceedings of IEEE , vol.78 , Issue.9 , pp. 1481-1497
- Poggio, T.¹ Girosi, F.²

37
- 33744584654
- Inductive learning of decision trees
- Quinlan R. Inductive learning of decision trees. Machine Learning. 1:1986;81-106.
- (1986) Machine Learning , vol.1 , pp. 81-106
- Quinlan, R.¹

38
- 0030370417
- Bagging, Boosting and C4.5
- San Francisco, CA: Morgan Kaufmann. pp. 725-730
- Quinlan R. Bagging, Boosting and C4.5. Proceedings of AAAI'96. 1996;Morgan Kaufmann, San Francisco, CA. pp. 725-730.
- (1996) Proceedings of AAAI'96
- Quinlan, R.¹

39
- 0030374103
- Bootstrapping with noise: An effective regularization technique
- Raviv Y., Intrator N. Bootstrapping with noise: an effective regularization technique. Connection Science. 8:(3/4):1996;355-372.
- (1996) Connection Science , vol.8 , Issue.3-4 , pp. 355-372
- Raviv, Y.¹ Intrator, N.²

40
- 13444280906
- Learning goal-decomposition rules using exercises
- San Francisco, CA: Morgan Kaufmann. pp. 278-286
- Reddy C., Tadepalli P. Learning goal-decomposition rules using exercises. Proceedings of ICML'97. 1997;Morgan Kaufmann, San Francisco, CA. pp. 278-286.
- (1997) Proceedings of ICML'97
- Reddy, C.¹ Tadepalli, P.²

41
- 0242430831
- PhD Thesis, Rochester, NY: Department of Computer Science, University of Rochester
- Rosca, J. (1997). Hierarchical learning with procedural abstraction mechanisms. PhD Thesis, Rochester, NY: Department of Computer Science, University of Rochester.
- (1997) Hierarchical Learning with Procedural Abstraction Mechanisms
- Rosca, J.¹

42
- 0030367578
- Ensemble learning using decorrelated neural networks
- Rosen B. Ensemble learning using decorrelated neural networks. Connection Science. 8:(3/4):1996;373-384.
- (1996) Connection Science , vol.8 , Issue.3-4 , pp. 373-384
- Rosen, B.¹

43
- 0026118624
- Tree-structured adaptive networks for function approximation in high-dimensional spaces
- Sanger T. Tree-structured adaptive networks for function approximation in high-dimensional spaces. IEEE Transaction on Neural Networks. 2:(2):1991;285-293.
- (1991) IEEE Transaction on Neural Networks , vol.2 , Issue.2 , pp. 285-293
- Sanger, T.¹

44
- 84964009081
- From isolation to cooperation: An alternative view of a system of experts
- Cambridge, MA: MIT Press. pp. 605-611
- Schaal S., Atkeson C. From isolation to cooperation: an alternative view of a system of experts. Advances in Neural Information Processing Systems. 1996;MIT Press, Cambridge, MA. pp. 605-611.
- (1996) Advances in Neural Information Processing Systems
- Schaal, S.¹ Atkeson, C.²

45
- 0002595663
- Boosting the margin: A new explanation for the effectiveness of voting methods
- San Francisco: Morgan Kaufmann. pp. 322-330
- Shcapire R., Freund Y., Bartlett P., Lee W. Boosting the margin: a new explanation for the effectiveness of voting methods. Proceedings of International Conference on Machine Learning. 1997;Morgan Kaufmann, San Francisco. pp. 322-330.
- (1997) Proceedings of International Conference on Machine Learning
- Shcapire, R.¹ Freund, Y.² Bartlett, P.³ Lee, W.⁴

46
- 0003824303
- PhD Thesis, Amherst, MA: University of Massachusetts
- Singh, S. (1994). Learning to solve Markovian decision processes. PhD Thesis, Amherst, MA: University of Massachusetts.
- (1994) Learning to Solve Markovian Decision Processes
- Singh, S.¹

47
- 85153965130
- Reinforcement learning with soft state aggregation
- S.L. Hanson, J.C. Cowan, & L. Giles. San Mateo, CA: Morgan Kaufmann
- Singh S., Jaakkola T., Jordan M. Reinforcement learning with soft state aggregation. Hanson S.L., Cowan J.C., Giles L. Advances in Neural Information Processing Systems. 1994;Morgan Kaufmann, San Mateo, CA.
- (1994) Advances in Neural Information Processing Systems
- Singh, S.¹ Jaakkola, T.² Jordan, M.³

48
- 0345064317
- Planning from reinforcement learning
- Tuscaloosa, AL: University of Alabama
- Sun, R. (1997). Planning from reinforcement learning. Technical report TR-CS-97-0027, Tuscaloosa, AL: University of Alabama.
- (1997) Technical Report TR-CS-97-0027
- Sun, R.¹

49
- 0003690625
- R. Sun, & L. Bookman. Norwell, MA: Kluwer Academic Publishers
- Sun R., Bookman L. Computational architectures integrating neural and symbolic procedures. 1994;Kluwer Academic Publishers, Norwell, MA.
- (1994) Computational Architectures Integrating Neural and Symbolic Procedures

50
- 0003773492
- A hybrid agent architecture for reactive sequential decision making
- R. Sun, & F. Alexandre. Hillsdale, NJ: Lawrence Erlbaum Associates
- Sun R., Peterson T. A hybrid agent architecture for reactive sequential decision making. Sun R., Alexandre F. Connectionist-symbolic integration. 1997;Lawrence Erlbaum Associates, Hillsdale, NJ.
- (1997) Connectionist-symbolic Integration
- Sun, R.¹ Peterson, T.²

51
- 0030675538
- A hybrid model for learning sequential navigation
- Monterey, CA. Piscateway, NJ: IEEE Press. pp. 234-239
- Sun R., Peterson T. A hybrid model for learning sequential navigation. Monterey, CA Proceedings of IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA'97). 1997;IEEE Press, Piscateway, NJ. pp. 234-239.
- (1997) Proceedings of IEEE International Symposium on Computational Intelligence in Robotics and Automation (CIRA'97)
- Sun, R.¹ Peterson, T.²

52
- 0032203235
- Some experiments with a hybrid model for learning sequential decision making
- Sun R., Peterson T. Some experiments with a hybrid model for learning sequential decision making. Information Sciences. 111:1998;83-107.
- (1998) Information Sciences , vol.111 , pp. 83-107
- Sun, R.¹ Peterson, T.²

53
- 0001842850
- Bottom-up skill learning in reactive sequential decision tasks
- Hillsdale, NJ: Lawrence Erlbaum Associates. pp. 684-690
- Sun R., Peterson T., Merrill E. Bottom-up skill learning in reactive sequential decision tasks. Proceedings of 18th Cognitive Science Society Conference. 1996;Lawrence Erlbaum Associates, Hillsdale, NJ. pp. 684-690.
- (1996) Proceedings of 18th Cognitive Science Society Conference
- Sun, R.¹ Peterson, T.² Merrill, E.³

54
- 0344633742
- Integrated architectures for learning, planning
- San Meteo, CA: Morgan Kaufmann
- Sutton R. Integrated architectures for learning, planning. Proceedings of Seventh International Conference on Machine Learning and reacting based on approximating dynamic programming. 1990;Morgan Kaufmann, San Meteo, CA.
- (1990) Proceedings of Seventh International Conference on Machine Learning and Reacting Based on Approximating Dynamic Programming
- Sutton, R.¹

55
- 0000723997
- Generalization in reinforcement learning: Successful examples using sparse coarse coding
- Cambridge, MA: MIT Press
- Sutton R. Generalization in reinforcement learning: successful examples using sparse coarse coding. Neural Information Processing Systems. 8:1996;MIT Press, Cambridge, MA.
- (1996) Neural Information Processing Systems , vol.8
- Sutton, R.¹

56
- 0038145105
- Hierarchical explanation-based reinforcement learning
- San Francisco: Morgan Kaufmann. pp. 358-386
- Tadepalli P., Dietterich T. Hierarchical explanation-based reinforcement learning. Proceedings International Conference on machine Learning. 1997;Morgan Kaufmann, San Francisco. pp. 358-386.
- (1997) Proceedings International Conference on Machine Learning
- Tadepalli, P.¹ Dietterich, T.²

57
- 0021892282
- Fuzzy identification of systems and its applications to modeling and control
- Takagi T., Sugeno M. Fuzzy identification of systems and its applications to modeling and control. IEEE Transactions on Systems Man and Cybernetics. 15:(1):1985;116-132.
- (1985) IEEE Transactions on Systems Man and Cybernetics , vol.15 , Issue.1 , pp. 116-132
- Takagi, T.¹ Sugeno, M.²

58
- 0000078841
- Averaging regularized estimators
- Taniguchi M., Tresp V. Averaging regularized estimators. Neural Computation. 9:1997;1163-1178.
- (1997) Neural Computation , vol.9 , pp. 1163-1178
- Taniguchi, M.¹ Tresp, V.²

59
- 0029390263
- Reinforcement learning of multiple tasks using a hierarchical CMAC architecture
- Tham C. Reinforcement learning of multiple tasks using a hierarchical CMAC architecture. Robotics and Autonomous Systems. 15:1995;247-274.
- (1995) Robotics and Autonomous Systems , vol.15 , pp. 247-274
- Tham, C.¹

60
- 0000277836
- Finding structure in reinforcement learning
- Cambridge, MA: MIT Press
- Thrun S., Schwartz A. Finding structure in reinforcement learning. Neural Information Processing Systems. 7:1995;MIT Press, Cambridge, MA.
- (1995) Neural Information Processing Systems , vol.7
- Thrun, S.¹ Schwartz, A.²

61
- 0040639069
- Stacking bagged and dagged models
- San Francisco, CA: Morgan Kaufmann. pp. 367-375
- Ting W.K., Witten I. Stacking bagged and dagged models. Proceedings of ICML'97. 1997;Morgan Kaufmann, San Francisco, CA. pp. 367-375.
- (1997) Proceedings of ICML'97
- Ting, W.K.¹ Witten, I.²

62
- 85153970023
- Combining estimators using non-constant weighting functions
- Cambridge, MA: MIT Press. pp. 419-426
- Tresp V., Taniguchi M. Combining estimators using non-constant weighting functions. Neural Information Processing Systems. 7:1995;MIT Press, Cambridge, MA. pp. 419-426.
- (1995) Neural Information Processing Systems , vol.7
- Tresp, V.¹ Taniguchi, M.²

63
- 0030365938
- Error correlation and error reduction in ensemble classifiers
- Tumer K., Ghosh J. Error correlation and error reduction in ensemble classifiers. Connection Science. 8:(3/4):1996;385-404.
- (1996) Connection Science , vol.8 , Issue.3-4 , pp. 385-404
- Tumer, K.¹ Ghosh, J.²

64
- 0029727747
- Generalization error of ensemble estimators
- Piscateway, NJ: IEEE Press. pp. 90-95
- Ueda N., Nakano R. Generalization error of ensemble estimators. IEEE International Conference on Neural Networks. 1996;IEEE Press, Piscateway, NJ. pp. 90-95.
- (1996) IEEE International Conference on Neural Networks
- Ueda, N.¹ Nakano, R.²

65
- 0029517697
- Approximation with neural networks
- Piscateway, NJ: IEEE Press
- van der Smagt P., Groen F. Approximation with neural networks. Proceedings of 1995 International Conference on Neural Networks. 1995;IEEE Press, Piscateway, NJ.
- (1995) Proceedings of 1995 International Conference on Neural Networks
- Van Der Smagt, P.¹ Groen, F.²

66
- 0004049895
- PhD Thesis, Cambridge, UK: Cambridge University
- Watkins, C. (1989). Learning with delayed rewards. PhD Thesis, Cambridge, UK: Cambridge University.
- (1989) Learning with Delayed Rewards
- Watkins, C.¹

67
- 0345495820
- HQ-learning
- Wiering, M., and Schmidhuber, J. (1996). HQ-learning. TR IDSIA-95-96.
- (1996) TR IDSIA-95-96
- Wiering, M.¹ Schmidhuber, J.²

68
- 85158158334
- A complexity analysis of cooperative mechanisms in reinforcement learning
- San Francisco, CA: Morgan Kaufmann. pp. 607-613
- Whitehead A. A complexity analysis of cooperative mechanisms in reinforcement learning. Proceedings of the AAAI'93. 1993;Morgan Kaufmann, San Francisco, CA. pp. 607-613.
- (1993) Proceedings of the AAAI'93
- Whitehead, A.¹

69
- 0026692226
- Stacked generalization
- Wolpert D. Stacked generalization. Neural Networks. 5:1992;241-259.
- (1992) Neural Networks , vol.5 , pp. 241-259
- Wolpert, D.¹

70
- 85140116568
- An alternative model for mixtures of experts
- Cambridge, MA: MIT Press. pp. 633-640
- Xu L., Jordan M., Hinton G. An alternative model for mixtures of experts. Neural Information Processing Systems. 7:1995;MIT Press, Cambridge, MA. pp. 633-640.
- (1995) Neural Information Processing Systems , vol.7
- Xu, L.¹ Jordan, M.² Hinton, G.³

* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.