메뉴 건너뛰기




Volumn 11, Issue 1-5, 1997, Pages 75-113

Locally Weighted Learning for Control

Author keywords

Dynamic programming; Forward models; Inverse models; Lazy learning; Least commitment learning; Linear quadratic regulation (LQR); Locally weighted regression; LOESS; LWR; Memory based learning; Shifting setpoint algorithm

Indexed keywords

ADAPTIVE CONTROL SYSTEMS; COMPUTER SIMULATION; DATA STORAGE EQUIPMENT; DYNAMIC PROGRAMMING; INTELLIGENT CONTROL; LARGE SCALE SYSTEMS; LEARNING ALGORITHMS; LINEAR CONTROL SYSTEMS; REGRESSION ANALYSIS;

EID: 0031073475     PISSN: 02692821     EISSN: None     Source Type: Journal    
DOI: 10.1007/978-94-017-2053-3_3     Document Type: Article
Times cited : (495)

References (53)
  • 3
    • 2342560362 scopus 로고
    • Using local models to control movement
    • Touretzky, D. S. (ed.), Morgan Kaufmann, San Mateo, CA
    • Atkeson, C. G. (1990). Using local models to control movement. In Touretzky, D. S. (ed.), Advances in Neural Information Processing Systems 2, pp. 316-323. Morgan Kaufmann, San Mateo, CA.
    • (1990) Advances in Neural Information Processing Systems 2 , pp. 316-323
    • Atkeson, C.G.1
  • 4
    • 0039816976 scopus 로고
    • Using local trajectory optimizers to speed up global optimization in dynamic programming
    • Hanson, S. J., Cowan, J. D. & Giles, C. L. (eds.), Morgan Kaufmann, San Mateo, CA
    • Atkeson, C. G. (1994). Using local trajectory optimizers to speed up global optimization in dynamic programming. In Hanson, S. J., Cowan, J. D. & Giles, C. L. (eds.), Advances in Neural Information Processing Systems 6, pp. 663-670. Morgan Kaufmann, San Mateo, CA.
    • (1994) Advances in Neural Information Processing Systems 6 , pp. 663-670
    • Atkeson, C.G.1
  • 6
    • 0002201501 scopus 로고
    • Learning and Sequential Decision Making
    • Gabriel, M. & Moore, J. W. (eds.), MIT Press, Cambridge, MA
    • Barto, A. G., Sutton, R. S. & Watkins, C. J. C. H. (1990). Learning and Sequential Decision Making. In Gabriel, M. & Moore, J. W. (eds.), Learning and Computational Neuroscience, pp. 539-602. MIT Press, Cambridge, MA.
    • (1990) Learning and Computational Neuroscience , pp. 539-602
    • Barto, A.G.1    Sutton, R.S.2    Watkins, C.J.C.H.3
  • 7
    • 0029210635 scopus 로고
    • Learning to act using real-time dynamic programming
    • Barto, A. G., Bradtke, S. J. & Singh, S. P. (1995). Learning to act using real-time dynamic programming. Artificial Intelligence 72(1): 81-138.
    • (1995) Artificial Intelligence , vol.72 , Issue.1 , pp. 81-138
    • Barto, A.G.1    Bradtke, S.J.2    Singh, S.P.3
  • 8
    • 0003787146 scopus 로고
    • Princeton University Press, Princeton, NJ
    • Bellman, R. E. (1957). Dynamic Programming. Princeton University Press, Princeton, NJ.
    • (1957) Dynamic Programming
    • Bellman, R.E.1
  • 12
    • 0008864314 scopus 로고
    • Learning to control a dynamic physical system
    • Seattle, WA. Morgan Kaufmann, San Mateo, CA
    • Connell, M. E. & Utgoff, P. E. (1987). Learning to control a dynamic physical system. In Sixth National Conference on Artificial Intelligence, pp. 456-460, Seattle, WA. Morgan Kaufmann, San Mateo, CA.
    • (1987) Sixth National Conference on Artificial Intelligence , pp. 456-460
    • Connell, M.E.1    Utgoff, P.E.2
  • 16
    • 2342522500 scopus 로고
    • LOESS: Multivariate Smoothing by Moving Least Squares
    • C. K. Chul, L. L. S. & Ward, J. D. (eds.), Academic Press
    • Grosse, E. (1989). LOESS: Multivariate Smoothing by Moving Least Squares. In C. K. Chul, L. L. S. & Ward, J. D. (eds.), Approximation Theory VI. Academic Press.
    • (1989) Approximation Theory VI
    • Grosse, E.1
  • 17
    • 84972525897 scopus 로고
    • Local regression: Automatic kernel carpentry
    • Hastie, T. & Loader, C. (1993). Local regression: Automatic kernel carpentry. Statistical Science 8(2): 120-143.
    • (1993) Statistical Science , vol.8 , Issue.2 , pp. 120-143
    • Hastie, T.1    Loader, C.2
  • 18
    • 0000676676 scopus 로고
    • Learning to control an unstable system with forward modeling
    • Touretzky, D. (ed.), Morgan Kaufmann, San Mateo, CA
    • Jordan, M. I. & Jacobs, R. A. (1990). Learning to control an unstable system with forward modeling. In Touretzky, D. (ed.), Advances in Neural Information Processing Systems 2, pp. 324-331. Morgan Kaufmann, San Mateo, CA.
    • (1990) Advances in Neural Information Processing Systems 2 , pp. 324-331
    • Jordan, M.I.1    Jacobs, R.A.2
  • 19
    • 44049116478 scopus 로고
    • Forward Models: Supervised Learning with a Distal Teacher
    • Jordan, M. I. & Rumelhart, D. E. (1992). Forward Models: Supervised Learning with a Distal Teacher. Cognitive Science 16: 307-354.
    • (1992) Cognitive Science , vol.16 , pp. 307-354
    • Jordan, M.I.1    Rumelhart, D.E.2
  • 21
    • 0024284080 scopus 로고
    • Neural Model of Adaptive Hand-Eye Coordination for Single Postures
    • Kuperstein, M. (1988). Neural Model of Adaptive Hand-Eye Coordination for Single Postures. Science 239: 1308-3111.
    • (1988) Science , vol.239 , pp. 1308-3111
    • Kuperstein, M.1
  • 22
    • 0000372206 scopus 로고
    • Bayesian Model Comparison and Backprop Nets
    • Moody, J. E., Hanson, S. J. & Lippman, R. P. (eds.), Morgan Kaufmann, San Mateo, CA
    • MacKay, D. J. C. (1992). Bayesian Model Comparison and Backprop Nets. In Moody, J. E., Hanson, S. J. & Lippman, R. P. (eds.), Advances in Neural Information Processing Systems 4, pp. 839-846. Morgan Kaufmann, San Mateo, CA.
    • (1992) Advances in Neural Information Processing Systems 4 , pp. 839-846
    • MacKay, D.J.C.1
  • 23
    • 0348132949 scopus 로고
    • Enhancing Transfer in Reinforcement Learning by Building Stochastic Models of Robot Actions
    • Morgan Kaufmann
    • Mahadevan, S. (1992). Enhancing Transfer in Reinforcement Learning by Building Stochastic Models of Robot Actions. In Machine Learning: Proceedings of the Ninth International Conference, pp. 290-299. Morgan Kaufmann.
    • (1992) Machine Learning: Proceedings of the Ninth International Conference , pp. 290-299
    • Mahadevan, S.1
  • 24
    • 0001923944 scopus 로고
    • Hoeffding Races: Accelerating Model Selection Search for Classification and Function Approximation
    • Morgan Kaufmann, San Mateo, CA
    • Maron, O. & Moore, A. (1994). Hoeffding Races: Accelerating Model Selection Search for Classification and Function Approximation. In Advances in Neural Information Processing Systems 6, pp. 59-66. Morgan Kaufmann, San Mateo, CA.
    • (1994) Advances in Neural Information Processing Systems 6 , pp. 59-66
    • Maron, O.1    Moore, A.2
  • 25
    • 2342482919 scopus 로고
    • Instance-based utile distinctions for reinforcement learning with hidden state
    • McCallum, R. A. (1995). Instance-based utile distinctions for reinforcement learning with hidden state. In Prieditis and Russell (1995), pp. 387-395.
    • (1995) Prieditis and Russell , Issue.1995 , pp. 387-395
    • McCallum, R.A.1
  • 27
    • 0024705432 scopus 로고
    • Real-Time Application of Neural Networks for Sensor-Based Control of Robots with Vision
    • Miller, W. T. (1989). Real-Time Application of Neural Networks for Sensor-Based Control of Robots with Vision. IEEE Transactions on Systems, Man and Cybernetics 19(4): 825-831.
    • (1989) IEEE Transactions on Systems, Man and Cybernetics , vol.19 , Issue.4 , pp. 825-831
    • Miller, W.T.1
  • 30
    • 33747997674 scopus 로고
    • Variable Resolution Dynamic Programming: Efficiently Learning Action Maps in Multivariate Real-valued State-spaces
    • Birnbaum, L. & Collins, G. (eds.), Morgan Kaufmann
    • Moore, A. W. (1991b). Variable Resolution Dynamic Programming: Efficiently Learning Action Maps in Multivariate Real-valued State-spaces. In Birnbaum, L. & Collins, G. (eds.), Machine Learning: Proceedings of the Eighth International Workshop, pp. 333-337. Morgan Kaufmann.
    • (1991) Machine Learning: Proceedings of the Eighth International Workshop , pp. 333-337
    • Moore, A.W.1
  • 31
    • 0003971885 scopus 로고
    • Fast, Robust Adaptive Control by Learning only Forward Models
    • Moody, J. E., Hanson, S. J. & Lippman, R. P. (eds.), Morgan Kaufmann, San Mateo, CA
    • Moore, A. W. (1992). Fast, Robust Adaptive Control by Learning only Forward Models. In Moody, J. E., Hanson, S. J. & Lippman, R. P. (eds.), Advances in Neural Information Processing Systems 4, pp. 571-578. Morgan Kaufmann, San Mateo, CA.
    • (1992) Advances in Neural Information Processing Systems 4 , pp. 571-578
    • Moore, A.W.1
  • 32
    • 0027684215 scopus 로고
    • Prioritized Sweeping: Reinforcement Learning with Less Data and Less Real Time
    • Moore, A. W. & Atkeson, C. G. (1993). Prioritized Sweeping: Reinforcement Learning with Less Data and Less Real Time. Machine Learning 13: 103-130.
    • (1993) Machine Learning , vol.13 , pp. 103-130
    • Moore, A.W.1    Atkeson, C.G.2
  • 33
    • 2342512684 scopus 로고
    • An Empirical Investigation of Brute Force to Choose Features, Smoothers and Function Approximators
    • Hanson, S., Judd, S. & Petsche, T. (eds.), MIT Press
    • Moore, A. W., Hill, D. J. & Johnson, M. P. (1992). An Empirical Investigation of Brute Force to Choose Features, Smoothers and Function Approximators. In Hanson, S., Judd, S. & Petsche, T. (eds.), Computational Learning Theory and Natural Learning Systems, Volume 3. MIT Press.
    • (1992) Computational Learning Theory and Natural Learning Systems , vol.3
    • Moore, A.W.1    Hill, D.J.2    Johnson, M.P.3
  • 36
    • 0043023536 scopus 로고
    • Efficient Algorithms with Neural Network Behaviour
    • Omohundro, S. M. (1987). Efficient Algorithms with Neural Network Behaviour. Journal of Complex Systems 1(2): 273-347.
    • (1987) Journal of Complex Systems , vol.1 , Issue.2 , pp. 273-347
    • Omohundro, S.M.1
  • 37
    • 0003241739 scopus 로고
    • Bumptrees for Efficient Function, Constraint, and Classification Learning
    • Lippmann, R. P., Moody, J. E. & Touretzky, D. S. (eds.), Morgan Kaufmann, San Mateo, CA
    • Omohundro, S. M. (1991). Bumptrees for Efficient Function, Constraint, and Classification Learning. In Lippmann, R. P., Moody, J. E. & Touretzky, D. S. (eds.), Advances in Neural Information Processing Systems 3, pp. 693-699. Morgan Kaufmann, San Mateo, CA.
    • (1991) Advances in Neural Information Processing Systems 3 , pp. 693-699
    • Omohundro, S.M.1
  • 39
    • 2342514644 scopus 로고
    • Efficient memory-based dynamic programming
    • Peng, J. (1995). Efficient memory-based dynamic programming. In Prieditis and Russell (1995), pp. 438-446.
    • (1995) Prieditis and Russell , Issue.1995 , pp. 438-446
    • Peng, J.1
  • 41
    • 0028416621 scopus 로고
    • Reliability estimation for neural network based autonomous driving
    • Pomerleau, D. (1994). Reliability estimation for neural network based autonomous driving. Robotics and Autonomous Systems, 12.
    • (1994) Robotics and Autonomous Systems , pp. 12
    • Pomerleau, D.1
  • 46
    • 0028374275 scopus 로고
    • Robot Juggling: An Implementation of Memory-based Learning
    • Schaal, S. & Atkeson, C. (1994a). Robot Juggling: An Implementation of Memory-based Learning. Control Systems Magazine 14(1): 57-71.
    • (1994) Control Systems Magazine , vol.14 , Issue.1 , pp. 57-71
    • Schaal, S.1    Atkeson, C.2
  • 47
    • 0343486227 scopus 로고
    • Assessing the Quality of Local Linear Models
    • Cowan, J. D., Tesauro, G. & Alspector, J. (eds.), Morgan Kaufmann
    • Schaal, S. & Atkeson, C. G. (1994b). Assessing the Quality of Local Linear Models. In Cowan, J. D., Tesauro, G. & Alspector, J. (eds.), Advances in Neural Information Processing Systems 6, pp. 160-167. Morgan Kaufmann.
    • (1994) Advances in Neural Information Processing Systems 6 , pp. 160-167
    • Schaal, S.1    Atkeson, C.G.2
  • 48
    • 0022909661 scopus 로고
    • Towards Memory-Based Reasoning
    • Stanfill, C. & Waltz, D. (1986). Towards Memory-Based Reasoning. Communications of the ACM 29(12): 1213-1228.
    • (1986) Communications of the ACM , vol.29 , Issue.12 , pp. 1213-1228
    • Stanfill, C.1    Waltz, D.2
  • 50
    • 33847202724 scopus 로고
    • Learning to Predict by the Methods of Temporal Differences
    • Sutton, R. S. (1988). Learning to Predict by the Methods of Temporal Differences. Machine Learning 3: 9-44.
    • (1988) Machine Learning , vol.3 , pp. 9-44
    • Sutton, R.S.1
  • 51
    • 85132026293 scopus 로고
    • Integrated Architecture for Learning, Planning, and Reacting Based on Approximating Dynamic Programming
    • Morgan Kaufmann
    • Sutton, R. S. (1990). Integrated Architecture for Learning, Planning, and Reacting Based on Approximating Dynamic Programming. In Proceedings of the 7th International Conference on Machine Learning, pp. 216-224. Morgan Kaufmann.
    • (1990) Proceedings of the 7th International Conference on Machine Learning , pp. 216-224
    • Sutton, R.S.1
  • 53
    • 2342487200 scopus 로고
    • Geometric and neuromorphic learning for nonlinear modeling, control and forecasting
    • Glasgow, Scotland. IEEE catalog number 92CH3110-4
    • Zografski, Z. (1992). Geometric and neuromorphic learning for nonlinear modeling, control and forecasting. In Proceedings of the 1992 IEEE International Symposium on Intelligent Control, pp. 158-163. Glasgow, Scotland. IEEE catalog number 92CH3110-4.
    • (1992) Proceedings of the 1992 IEEE International Symposium on Intelligent Control , pp. 158-163
    • Zografski, Z.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.