메뉴 건너뛰기




Volumn 9, Issue 3, 1998, Pages 354-368

Recurrent neural-network training by a learning automaton approach for trajectory learning and control system design

Author keywords

Control system design; Learning systems; Neural network learning; Neuro controllers; Recurrent neural networks; Stochastic learning automata; Trajectory learning

Indexed keywords

BACKPROPAGATION; COMPUTATIONAL COMPLEXITY; CONTROL SYSTEM ANALYSIS; FEEDBACK CONTROL; LEARNING ALGORITHMS; LEARNING SYSTEMS; ROBUSTNESS (CONTROL SYSTEMS); STOCHASTIC CONTROL SYSTEMS;

EID: 0032072476     PISSN: 10459227     EISSN: None     Source Type: Journal    
DOI: 10.1109/72.668879     Document Type: Article
Times cited : (32)

References (45)
  • 1
    • 0004469897 scopus 로고
    • Neurons with graded response have collective computational properties like those of two-state neurons
    • J. J. Hopfield, "Neurons with graded response have collective computational properties like those of two-state neurons," in Proc. Nat. Academy Sci., 1984, vol. 81, pp. 3008-3090.
    • (1984) Proc. Nat. Academy Sci. , vol.81 , pp. 3008-3090
    • Hopfield, J.J.1
  • 2
    • 0026218699 scopus 로고
    • Equilibrium characterization of dynamical neural networks and a systematic synthesis procedure for associative memories
    • Sept.
    • S. I. Sudharsanan and M. K. Sundareshan, "Equilibrium characterization of dynamical neural networks and a systematic synthesis procedure for associative memories," IEEE Trans. Neural Networks, vol. 2, pp. 509-522, Sept. 1991.
    • (1991) IEEE Trans. Neural Networks , vol.2 , pp. 509-522
    • Sudharsanan, S.I.1    Sundareshan, M.K.2
  • 3
    • 0041401479 scopus 로고
    • Neural computation of decision optimization problems
    • J. J. Hopfield and D. W. Tank, "Neural computation of decision optimization problems," Biol. Cybern., vol. 52, pp. 1-12, 1985.
    • (1985) Biol. Cybern. , vol.52 , pp. 1-12
    • Hopfield, J.J.1    Tank, D.W.2
  • 4
    • 0025749653 scopus 로고
    • Exponential stability and a systematic synthesis of a neural network for quadratic minimization
    • S. I. Sudharsanan and M. K. Sundareshan, "Exponential stability and a systematic synthesis of a neural network for quadratic minimization," Neural Networks, vol. 4, pp. 599-613, 1991.
    • (1991) Neural Networks , vol.4 , pp. 599-613
    • Sudharsanan, S.I.1    Sundareshan, M.K.2
  • 5
    • 0000442791 scopus 로고
    • Generalization of backpropagation in recurrent neural networks
    • F. J. Pineda, "Generalization of backpropagation in recurrent neural networks," Phys. Rev. Lett., vol. 59, no. 19, pp. 2229-2232, 1987.
    • (1987) Phys. Rev. Lett. , vol.59 , Issue.19 , pp. 2229-2232
    • Pineda, F.J.1
  • 6
    • 0023563286 scopus 로고
    • A learning rule for asynchronous perceptrons with feedback in a combinatorial environment
    • San Diego, CA
    • L. B. Almeida, "A learning rule for asynchronous perceptrons with feedback in a combinatorial environment," in Proc. IEEE 1st Annu. Int. Conf. Neural Networks, San Diego, CA, 1987, pp. 609-618.
    • (1987) Proc. IEEE 1st Annu. Int. Conf. Neural Networks , pp. 609-618
    • Almeida, L.B.1
  • 9
    • 0000502181 scopus 로고
    • On the behavior of finite automata in random media
    • M. Tsetlin, "On the behavior of finite automata in random media," Automat. Remote Contr., vol. 22, pp. 1210-1219, 1962.
    • (1962) Automat. Remote Contr. , vol.22 , pp. 1210-1219
    • Tsetlin, M.1
  • 12
    • 0002109138 scopus 로고
    • A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement
    • A. H. Black and W. R. Prokasy, Eds. New York: Appleton-Century-Crofts
    • R. A. Rescorla and A. R. Wagner, "A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement," Classical Conditioning II: Current Research and Theory, A. H. Black and W. R. Prokasy, Eds. New York: Appleton-Century-Crofts, 1972.
    • (1972) Classical Conditioning II: Current Research and Theory
    • Rescorla, R.A.1    Wagner, A.R.2
  • 13
    • 0003799456 scopus 로고
    • Reinforcement learning control and pattern recognition systems
    • J. M. Mendal and K. S. Fu, Eds. New York: Academic
    • J. M. Mendel and R. N. McLearn, "Reinforcement learning control and pattern recognition systems," Adaptive, Learning, and Pattern Recognition Systems, J. M. Mendal and K. S. Fu, Eds. New York: Academic, 1970.
    • (1970) Adaptive, Learning, and Pattern Recognition Systems
    • Mendel, J.M.1    McLearn, R.N.2
  • 14
    • 0002355083 scopus 로고
    • Connectionist learning for control: An overview
    • W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press
    • A. G. Barto, "Connectionist learning for control: An overview," Neural Networks for Control, W. T. Miller, R. S. Sutton, and P. J. Werbos, Eds. Cambridge, MA: MIT Press, 1990.
    • (1990) Neural Networks for Control
    • Barto, A.G.1
  • 15
    • 0020970738 scopus 로고
    • Neuronlike elements that can solve difficult learning control problems
    • A. G. Barto, R. S. Sutton, and C. W. Anderson, "Neuronlike elements that can solve difficult learning control problems," IEEE Trans. Syst., Man, Cybern., vol. SMC-13, pp. 834-846, 1983.
    • (1983) IEEE Trans. Syst., Man, Cybern. , vol.SMC-13 , pp. 834-846
    • Barto, A.G.1    Sutton, R.S.2    Anderson, C.W.3
  • 16
    • 0024646143 scopus 로고
    • Learning to control an inverted pendulum using neural networks
    • C. W. Anderson, "Learning to control an inverted pendulum using neural networks," IEEE Contr. Syst. Mag., pp. 31-47, 1989.
    • (1989) IEEE Contr. Syst. Mag. , pp. 31-47
    • Anderson, C.W.1
  • 17
  • 18
    • 0001202594 scopus 로고
    • A learning algorithm for continually running fully recurrent neural networks
    • R. Williams and D. Zipser, "A learning algorithm for continually running fully recurrent neural networks," Neural Computa., vol. 1, pp. 270-270, 1989.
    • (1989) Neural Computa. , vol.1 , pp. 270-270
    • Williams, R.1    Zipser, D.2
  • 19
    • 0001202597 scopus 로고
    • Learning state-space trajectories in recurrent neural networks
    • B. Pearlmutter, "Learning state-space trajectories in recurrent neural networks," Neural Computa., vol. 1, pp. 263-269, 1989.
    • (1989) Neural Computa. , vol.1 , pp. 263-269
    • Pearlmutter, B.1
  • 20
    • 0026685372 scopus 로고
    • Learning a trajectory using adjoint functions and teacher forcing
    • N. Toomarian and J. Barhen, "Learning a trajectory using adjoint functions and teacher forcing," Neural Networks, vol. 5, pp. 473-484, 1992.
    • (1992) Neural Networks , vol.5 , pp. 473-484
    • Toomarian, N.1    Barhen, J.2
  • 22
    • 0025399567 scopus 로고
    • Identification and control of dynamical systems using neural networks
    • K. S. Narendra and K. Parthasarathy, "Identification and control of dynamical systems using neural networks," IEEE Trans. Neural Networks, vol. 1, pp. 4-27, 1990.
    • (1990) IEEE Trans. Neural Networks , vol.1 , pp. 4-27
    • Narendra, K.S.1    Parthasarathy, K.2
  • 24
    • 0027699924 scopus 로고
    • Identification and decentralized adaptive control using dynamical neural networks with application to robotic manipulators
    • Nov.
    • A. Karakasoglu, S. I. Sudharsanan, and M. K. Sundareshan, "Identification and decentralized adaptive control using dynamical neural networks with application to robotic manipulators," IEEE Trans. Neural Networks, vol. 4, pp. 919-931, Nov. 1993.
    • (1993) IEEE Trans. Neural Networks , vol.4 , pp. 919-931
    • Karakasoglu, A.1    Sudharsanan, S.I.2    Sundareshan, M.K.3
  • 25
    • 84943222933 scopus 로고
    • Self-tuning adaptive control of multiinput multioutput systems using multilayer recurrent neural networks with application to synchronous power generators
    • San Francisco, CA, Mar.
    • S. I. Sudharsanan, I. Muhsin, and M. K. Sundareshan, "Self-tuning adaptive control of multiinput multioutput systems using multilayer recurrent neural networks with application to synchronous power generators," in Proc. IEEE Conf. Neural Networks, San Francisco, CA, Mar. 1993.
    • (1993) Proc. IEEE Conf. Neural Networks
    • Sudharsanan, S.I.1    Muhsin, I.2    Sundareshan, M.K.3
  • 26
    • 0000238813 scopus 로고
    • On the behavior of stochastic automata with variable structure
    • V. I. Varshavskii and I. P. Vorontsova, "On the behavior of stochastic automata with variable structure," Automat. Remote Contr., vol. 24, pp. 327-333, 1963.
    • (1963) Automat. Remote Contr. , vol.24 , pp. 327-333
    • Varshavskii, V.I.1    Vorontsova, I.P.2
  • 29
    • 84976805978 scopus 로고
    • Generalized feedback shift register pseudorandom number algorithms
    • July
    • T. Lewis and W. H. Payne, "Generalized feedback shift register pseudorandom number algorithms," J. ACM, vol. 20, no. 3, pp. 456-468, July 1973.
    • (1973) J. ACM , vol.20 , Issue.3 , pp. 456-468
    • Lewis, T.1    Payne, W.H.2
  • 31
    • 0024765392 scopus 로고
    • Adaptive control of linearizable systems
    • S. S. Sastry and A. Isidori, "Adaptive control of linearizable systems," IEEE Trans. Automat. Contr., vol. 34, pp. 1123-1131, 1989.
    • (1989) IEEE Trans. Automat. Contr. , vol.34 , pp. 1123-1131
    • Sastry, S.S.1    Isidori, A.2
  • 32
    • 0026835531 scopus 로고
    • Neural-network application for direct feedback controllers
    • Mar.
    • Y. Ichikawa and T. Sawa, "Neural-network application for direct feedback controllers," IEEE Trans. Neural Networks, vol. 3, pp. 224-232, Mar. 1992.
    • (1992) IEEE Trans. Neural Networks , vol.3 , pp. 224-232
    • Ichikawa, Y.1    Sawa, T.2
  • 33
    • 84943277419 scopus 로고
    • Learning trajectories with a hierarchy of oscillatory modules
    • San Francisco, CA, June
    • P. Baldi and N. Toomarian, "Learning trajectories with a hierarchy of oscillatory modules," in Proc. IEEE Conf. Neural Networks, San Francisco, CA, June 1993, pp. 1171-1176.
    • (1993) Proc. IEEE Conf. Neural Networks , pp. 1171-1176
    • Baldi, P.1    Toomarian, N.2
  • 34
    • 0000562031 scopus 로고
    • A heuristic approach to learning control systems
    • M. D. Waltz and K. S. Fu, "A heuristic approach to learning control systems," IEEE Trans. Automat. Contr., vol. AC-10, pp. 390-398, 1965.
    • (1965) IEEE Trans. Automat. Contr. , vol.AC-10 , pp. 390-398
    • Waltz, M.D.1    Fu, K.S.2
  • 35
    • 0015667648 scopus 로고
    • Punish/reward: Learning with a critic in adaptive threshold systems
    • B. Widrow, N. K. Gupta, and S. Maitra, "Punish/reward: Learning with a critic in adaptive threshold systems," IEEE Trans. Syst., Man, Cybern., vol. SMC-3, pp. 455-463, 1973.
    • (1973) IEEE Trans. Syst., Man, Cybern. , vol.SMC-3 , pp. 455-463
    • Widrow, B.1    Gupta, N.K.2    Maitra, S.3
  • 37
    • 0028381374 scopus 로고
    • Acquiring robot skills via reinforcement learning
    • V. Gullapalli, J. A. Franklin, and H. Benbrahim, "Acquiring robot skills via reinforcement learning," IEEE Contr. Syst. Mag., vol. 14, no. 1, pp. 13-24, 1994.
    • (1994) IEEE Contr. Syst. Mag. , vol.14 , Issue.1 , pp. 13-24
    • Gullapalli, V.1    Franklin, J.A.2    Benbrahim, H.3
  • 38
  • 39
    • 0028506509 scopus 로고
    • Supervised training of dynamical neural networks for associative memory design and identification of nonlinear maps
    • Sept.
    • S. I. Sudharsanan and M. K. Sundareshan, "Supervised training of dynamical neural networks for associative memory design and identification of nonlinear maps," Int. J. Neural Syst., vol. 5, pp. 165-180, Sept. 1994.
    • (1994) Int. J. Neural Syst. , vol.5 , pp. 165-180
    • Sudharsanan, S.I.1    Sundareshan, M.K.2
  • 40
    • 0029375851 scopus 로고
    • Gradient calculation for dynamic neural networks: A survey
    • B. Pearlmutter, "Gradient calculation for dynamic neural networks: A survey," IEEE Trans. Neural Networks, vol. 6, pp. 1212-1228, 1995.
    • (1995) IEEE Trans. Neural Networks , vol.6 , pp. 1212-1228
    • Pearlmutter, B.1
  • 41
    • 0028498606 scopus 로고
    • Synthetic approach to optimal filtering
    • J. T. H. Lo, "Synthetic approach to optimal filtering," IEEE Trans. Neural Networks, vol. 5, pp. 803-811, 1994.
    • (1994) IEEE Trans. Neural Networks , vol.5 , pp. 803-811
    • Lo, J.T.H.1
  • 42
    • 0029777583 scopus 로고    scopus 로고
    • On-line training of recurrent neural networks with continuous topology adaptation
    • D. Obradivic, "On-line training of recurrent neural networks with continuous topology adaptation," IEEE Trans. Neural Networks, vol. 6, pp. 222-228, 1996.
    • (1996) IEEE Trans. Neural Networks , vol.6 , pp. 222-228
    • Obradivic, D.1
  • 44
    • 0025401954 scopus 로고
    • Unsupervised learning in noise
    • B. Kosko, "Unsupervised learning in noise," IEEE Trans. Neural Networks, vol. 1, pp. 44-57, 1990.
    • (1990) IEEE Trans. Neural Networks , vol.1 , pp. 44-57
    • Kosko, B.1
  • 45
    • 0025503558 scopus 로고
    • Backpropagation through time: What it does and how to do it
    • P. Werbos, "Backpropagation through time: What it does and how to do it," Proc. IEEE, vol. 78, pp. 1550-1560, 1990.
    • (1990) Proc. IEEE , vol.78 , pp. 1550-1560
    • Werbos, P.1


* 이 정보는 Elsevier사의 SCOPUS DB에서 KISTI가 분석하여 추출한 것입니다.