Volume 10, Issue 6, 2013, Pages

Towards autonomous neuroprosthetic control using Hebbian reinforcement learning

Author keywords

[No Author keywords available]

Indexed keywords

ADAPTIVE CONTROLLERS; CLOSED-LOOP CONTROL; CONNECTIONIST NETWORKS; CONVERGENCE PROPERTIES; OPEN-LOOP SIMULATIONS; ROBUST PERFORMANCE; SATISFACTORY CONTROL; SEQUENTIAL LEARNING;

EID: 84889056477     PISSN: 17412560     EISSN: 17412552     Source Type: Journal    
DOI: 10.1088/1741-2560/10/6/066005     Document Type: Article
Times cited: 30

References (61)
  • 1
    • 84873710744
    • High-performance neuroprosthetic control by an individual with tetraplegia
    • 10.1016/S0140-6736(12)61816-9
    • Collinger J L et al 2012 High-performance neuroprosthetic control by an individual with tetraplegia Lancet 381 557-64
    • (2012) Lancet , vol.381 , pp. 557-564
    • Collinger, J.L.1
  • 2
    • 84861049949
    • Reach and grasp by people with tetraplegia using a neurally controlled robotic arm
    • 10.1038/nature11076
    • Hochberg L R et al 2012 Reach and grasp by people with tetraplegia using a neurally controlled robotic arm Nature 485 372-5
    • (2012) Nature , vol.485 , pp. 372-375
    • Hochberg, L.R.1
  • 3
    • 58149390799
    • Decoding trajectories from posterior parietal cortex ensembles
    • 10.1523/JNEUROSCI.1463-08.2008
    • Mulliken G H, Musallam S and Andersen R A 2008 Decoding trajectories from posterior parietal cortex ensembles J. Neurosci. 28 12913-26
    • (2008) J. Neurosci. , vol.28 , pp. 12913-12926
    • Mulliken, G.H.1    Musallam, S.2    Andersen, R.A.3
  • 4
  • 5
    • 84861045873
    • Restoration of grasp following paralysis through brain-controlled stimulation of muscles
    • 10.1038/nature10987
    • Ethier C et al 2012 Restoration of grasp following paralysis through brain-controlled stimulation of muscles Nature 485 368-71
    • (2012) Nature , vol.485 , pp. 368-371
    • Ethier, C.1
  • 6
    • 81555197988
    • Neuroplasticity of the sensorimotor cortex during learning
    • 10.1155/2011/310737
    • Francis J T and Song W 2011 Neuroplasticity of the sensorimotor cortex during learning Neural Plast. 2011 310737
    • (2011) Neural Plast. , vol.2011
    • Francis, J.T.1    Song, W.2
  • 7
    • 77956709297
    • Learning in closed-loop brain-machine interfaces: Modeling and experimental validation
    • 10.1109/TSMCB.2009.2036931 1083-4419 B
    • Heliot R et al 2010 Learning in closed-loop brain-machine interfaces: modeling and experimental validation IEEE Trans. Syst. Man Cybern. B 40 1387-97
    • (2010) IEEE Trans. Syst. Man Cybern. , vol.40 , pp. 1387-1397
    • Heliot, R.1
  • 8
    • 84866930516
    • Comprehensive characterization and failure modes of tungsten microwire arrays in chronic neural implants
    • 10.1088/1741-2560/9/5/056015 1741-2552 056015
    • Prasad A et al 2012 Comprehensive characterization and failure modes of tungsten microwire arrays in chronic neural implants J. Neural Eng. 9 056015
    • (2012) J. Neural Eng. , vol.9 , Issue.5
    • Prasad, A.1
  • 9
    • 26444445003
    • Response of brain tissue to chronically implanted neural electrodes
    • DOI 10.1016/j.jneumeth.2005.08.015, PII S0165027005002931
    • Polikov V S, Tresco P A and Reichert W M 2005 Response of brain tissue to chronically implanted neural electrodes J. Neurosci. Methods 148 1-18 (Pubitemid 41423453)
    • (2005) Journal of Neuroscience Methods , vol.148 , Issue.1 , pp. 1-18
    • Polikov, V.S.1    Tresco, P.A.2    Reichert, W.M.3
  • 11
    • 67651008634
    • The science of neural interface systems
    • 10.1146/annurev.neuro.051508.135241
    • Hatsopoulos N G and Donoghue J P 2009 The science of neural interface systems Annu. Rev. Neurosci. 32 249-66
    • (2009) Annu. Rev. Neurosci. , vol.32 , pp. 249-266
    • Hatsopoulos, N.G.1    Donoghue, J.P.2
  • 12
    • 84863067405
    • Adaptive decoding for brain-machine interfaces through Bayesian parameter updates
    • 10.1162/NECO-a-00207
    • Li Z et al 2011 Adaptive decoding for brain-machine interfaces through Bayesian parameter updates Neural Comput. 23 3162-204
    • (2011) Neural Comput. , vol.23 , pp. 3162-3204
    • Li, Z.1
  • 13
    • 3943058267
    • Cortical neural prosthetics
    • DOI 10.1146/annurev.neuro.27.070203.144233
    • Schwartz A B 2004 Cortical neural prosthetics Annu. Rev. Neurosci. 27 487-507 (Pubitemid 39050411)
    • (2004) Annual Review of Neuroscience , vol.27 , pp. 487-507
    • Schwartz, A.B.1
  • 14
    • 33747625847
    • Towards On-line adaptation of neuro-prostheses with neuronal evaluation signals
    • DOI 10.1007/s00422-006-0083-7
    • Rotermund D, Ernst U A and Pawelzik K R 2006 Towards on-line adaptation of neuro-prostheses with neuronal evaluation signals Biol. Cybern. 95 243-57 (Pubitemid 44267893)
    • (2006) Biological Cybernetics , vol.95 , Issue.3 , pp. 243-257
    • Rotermund, D.1    Ernst, U.A.2    Pawelzik, K.R.3
  • 16
    • 84870499789
    • A high-performance neural prosthesis enabled by control algorithm design
    • 10.1038/nn.3265
    • Gilja V et al 2012 A high-performance neural prosthesis enabled by control algorithm design Nat. Neurosci. 15 1752-7
    • (2012) Nat. Neurosci. , vol.15 , pp. 1752-1757
    • Gilja, V.1
  • 17
    • 84863766742
    • Closed-loop decoder adaptation on intermediate time-scales facilitates rapid BMI performance improvements independent of decoder initialization conditions
    • 10.1109/TNSRE.2012.2185066 1534-4320
    • Orsborn A L et al 2012 Closed-loop decoder adaptation on intermediate time-scales facilitates rapid BMI performance improvements independent of decoder initialization conditions IEEE Trans. Neural Syst. Rehabil. Eng. 20 468-77
    • (2012) IEEE Trans. Neural Syst. Rehabil. Eng. , vol.20 , pp. 468-477
    • Orsborn, A.L.1
  • 18
    • 84870871612
    • Unsupervised adaptation of brain-machine interface decoders
    • Gurel T and Mehring C 2012 Unsupervised adaptation of brain-machine interface decoders Front. Neurosci. 6 16
    • (2012) Front. Neurosci. , vol.6 , pp. 16
    • Gurel, T.1    Mehring, C.2
  • 20
    • 60549097572
    • Coadaptive brain-machine interface via reinforcement learning
    • 10.1109/TBME.2008.926699 0018-9294
    • DiGiovanna J et al 2009 Coadaptive brain-machine interface via reinforcement learning IEEE Trans. Biomed. Eng. 56 54-64
    • (2009) IEEE Trans. Biomed. Eng. , vol.56 , pp. 54-64
    • Digiovanna, J.1
  • 21
    • 79952679719
    • A symbiotic brain-machine interface through value-based decision making
    • 10.1371/journal.pone.0014760
    • Mahmoudi B and Sanchez J C 2011 A symbiotic brain-machine interface through value-based decision making PLoS One 6 e14760
    • (2011) PLoS One , vol.6 , pp. 14760
    • Mahmoudi, B.1    Sanchez, J.C.2
  • 22
    • 0036592026
    • Actor-critic models of the basal ganglia: New anatomical and computational perspectives
    • DOI 10.1016/S0893-6080(02)00047-3, PII S0893608002000473
    • Joel D, Niv Y and Ruppin E 2002 Actor-critic models of the basal ganglia: new anatomical and computational perspectives Neural Netw. 15 535-47 (Pubitemid 34947463)
    • (2002) Neural Networks , vol.15 , Issue.4-6 , pp. 535-547
    • Joel, D.1    Niv, Y.2    Ruppin, E.3
  • 23
    • 21344442393
    • Actor-critic models of reinforcement learning in the basal ganglia: From natural to artificial rats
    • DOI 10.1177/105971230501300205
    • Khamassi M et al 2005 Actor-critic models of reinforcement learning in the basal ganglia: from natural to artificial rats Adapt. Behav. 13 131-48 (Pubitemid 40907085)
    • (2005) Adaptive Behavior , vol.13 , Issue.2 , pp. 131-148
    • Khamassi, M.1    Lacheze, L.2    Girard, B.3    Berthoz, A.4    Guillot, A.5
  • 25
    • 84857467273
    • Dynamically repairing and replacing neural networks: Using hybrid computational and biological tools
    • 10.1109/MPUL.2011.2175640
    • Sanchez J C et al 2012 Dynamically repairing and replacing neural networks: using hybrid computational and biological tools Pulse IEEE 3 57-9
    • (2012) Pulse IEEE , vol.3 , pp. 57-59
    • Sanchez, J.C.1
  • 26
    • 34447648725
    • Multiple representations of belief states and action values in corticobasal ganglia loops
    • DOI 10.1196/annals.1390.024, Reward and Decision Making in Corticobasal Ganglia Networks
    • Samejima K and Doya K 2007 Multiple representations of belief states and action values in corticobasal ganglia loops Reward and Decision Making in Corticobasal Ganglia Networks (Oxford: Blackwell) pp 213-28 (Pubitemid 47092747)
    • (2007) Annals of the New York Academy of Sciences , vol.1104 , pp. 213-228
    • Samejima, K.1    Doya, K.2
  • 27
    • 0031748491
    • Reward prediction in primate basal ganglia and frontal cortex
    • DOI 10.1016/S0028-3908(98)00071-9, PII S0028390898000719
    • Schultz W, Tremblay L and Hollerman J R 1998 Reward prediction in primate basal ganglia and frontal cortex Neuropharmacology 37 421-9 (Pubitemid 28306931)
    • (1998) Neuropharmacology , vol.37 , Issue.4-5 , pp. 421-429
    • Schultz, W.1    Tremblay, L.2    Hollerman, J.R.3
  • 28
    • 26644463216
    • Neural correlates of the proximity and quantity of anticipated food rewards in the ventral striatum of domestic chicks
    • DOI 10.1111/j.1460-9568.2005.04311.x
    • Izawa E I, Aoki N and Matsushima T 2005 Neural correlates of the proximity and quantity of anticipated food rewards in the ventral striatum of domestic chicks Eur. J. Neurosci. 22 1502-12 (Pubitemid 41442421)
    • (2005) European Journal of Neuroscience , vol.22 , Issue.6 , pp. 1502-1512
    • Izawa, E.-I.1    Aoki, N.2    Matsushima, T.3
  • 29
    • 0028442414
    • Associative reinforcement learning: A generate and test algorithm
    • 10.1007/BF00993348 0885-6125
    • Kaelbling L P 1994 Associative reinforcement learning: a generate and test algorithm Mach. Learn. 15 299-319
    • (1994) Mach. Learn. , vol.15 , Issue.3 , pp. 299-319
    • Kaelbling, L.P.1
  • 31
    • 0031559957
    • Reinforcement learning by Hebbian synapses with adaptive thresholds
    • DOI 10.1016/S0306-4522(97)00118-8, PII S0306452297001188
    • Pennartz C M A 1997 Reinforcement learning by Hebbian synapses with adaptive thresholds Neuroscience 81 303-19 (Pubitemid 27381989)
    • (1997) Neuroscience , vol.81 , Issue.2 , pp. 303-319
    • Pennartz, C.M.A.1
  • 32
    • 74549209037
    • Spike-based reinforcement learning in continuous state and action space: When policy gradient methods fail
    • 10.1371/journal.pcbi.1000586
    • Vasilaki E et al 2009 Spike-based reinforcement learning in continuous state and action space: when policy gradient methods fail PLoS Comput. Biol. 5 e1000586
    • (2009) PLoS Comput. Biol. , vol.5 , pp. 1000586
    • Vasilaki, E.1
  • 34
    • 77950297907
    • Parameter-exploring policy gradients
    • 10.1016/j.neunet.2009.12.004 0893-6080
    • Sehnke F et al 2010 Parameter-exploring policy gradients Neural Netw. 23 551-9
    • (2010) Neural Netw. , vol.23 , pp. 551-559
    • Sehnke, F.1
  • 35
    • 84898939480
    • Policy gradient methods for reinforcement learning with function approximation
    • 1049-5258
    • Sutton R S et al 2000 Policy gradient methods for reinforcement learning with function approximation Adv. Neural Inf. Process. Syst. 12 1057-63
    • (2000) Adv. Neural Inf. Process. Syst. , vol.12 , pp. 1057-1063
    • Sutton, R.S.1
  • 36
    • 40649106649
    • Natural actor-critic
    • 10.1016/j.neucom.2007.11.026 0925-2312
    • Peters J and Schaal S 2008 Natural actor-critic Neurocomputing 71 1180-90
    • (2008) Neurocomputing , vol.71 , pp. 1180-1190
    • Peters, J.1    Schaal, S.2
  • 37
    • 0000337576
    • Simple statistical gradient-following algorithms for connectionist reinforcement learning
    • 10.1007/BF00992696 0885-6125
    • Williams R J 1992 Simple statistical gradient-following algorithms for connectionist reinforcement learning Mach. Learn. 8 229-56
    • (1992) Mach. Learn. , vol.8 , Issue.3-4 , pp. 229-256
    • Williams, R.J.1
  • 39
    • 84927461265
    • Pattern-recognizing stochastic learning automata
    • 10.1109/TSMC.1985.6313371 0018-9472
    • Barto A G and Anandan P 1985 Pattern-recognizing stochastic learning automata IEEE Trans. Syst. Man Cybern. SMC-15 360-75
    • (1985) IEEE Trans. Syst. Man Cybern. , vol.15 , pp. 360-375
    • Barto, A.G.1    Anandan, P.2
  • 43
    • 84857501996
    • Experience replay for real-time reinforcement learning control
    • 10.1109/TSMCC.2011.2106494 C
    • Adam S, Busoniu L and Babuska R 2012 Experience replay for real-time reinforcement learning control IEEE Trans. Syst. Man Cybern. C 42 201-12
    • (2012) IEEE Trans. Syst. Man Cybern. , vol.42 , pp. 201-212
    • Adam, S.1    Busoniu, L.2    Babuska, R.3
  • 44
    • 71749106087
    • Real-time reinforcement learning by sequential actor-critics and experience replay
    • 10.1016/j.neunet.2009.05.011 0893-6080
    • Wawrzyński P 2009 Real-time reinforcement learning by sequential actor-critics and experience replay Neural Netw. 22 1484-97
    • (2009) Neural Netw. , vol.22 , pp. 1484-1497
    • Wawrzyński, P.1
  • 45
    • 0742268989
    • Simple model of spiking neurons
    • 10.1109/TNN.2003.820440 1045-9227
    • Izhikevich E M 2003 Simple model of spiking neurons IEEE Trans. Neural Netw. 14 1569-72
    • (2003) IEEE Trans. Neural Netw. , vol.14 , pp. 1569-1572
    • Izhikevich, E.M.1
  • 48
    • 76849106481
    • A wireless brain-machine interface for real-time speech synthesis
    • 10.1371/journal.pone.0008218
    • Guenther F H et al 2009 A wireless brain-machine interface for real-time speech synthesis PLoS One 4 e8218
    • (2009) PLoS One , vol.4 , pp. 8218
    • Guenther, F.H.1
  • 49
    • 67650599543
    • Using unconstrained tongue motion as an alternative control mechanism for wheeled mobility
    • 10.1109/TBME.2009.2018632 0018-9294
    • Xueliang H and Ghovanloo M 2009 Using unconstrained tongue motion as an alternative control mechanism for wheeled mobility IEEE Trans. Biomed. Eng. 56 1719-26
    • (2009) IEEE Trans. Biomed. Eng. , vol.56 , pp. 1719-1726
    • Xueliang, H.1    Ghovanloo, M.2
  • 50
    • 0034576323
    • Multiple reward signals in the brain
    • 10.1038/35044563
    • Schultz W 2000 Multiple reward signals in the brain Nature Rev. Neurosci. 1 199-207
    • (2000) Nature Rev. Neurosci. , vol.1 , pp. 199-207
    • Schultz, W.1
  • 51
    • 0034625673
    • Dissociating the role of the dorsolateral prefrontal and anterior cingulate cortex in cognitive control
    • DOI 10.1126/science.288.5472.1835
    • MacDonald A W et al 2000 Dissociating the role of the dorsolateral prefrontal and anterior cingulate cortex in cognitive control Science 288 1835-8 (Pubitemid 30399023)
    • (2000) Science , vol.288 , Issue.5472 , pp. 1835-1838
    • MacDonald III, A.W.1    Cohen, J.D.2    Andrew Stenger, V.3    Carter, C.S.4
  • 52
    • 39749132355
    • Error-related EEG potentials generated during simulated brain-computer interaction
    • DOI 10.1109/TBME.2007.908083
    • Ferrez P W and del R Millan J 2008 Error-related EEG potentials generated during simulated brain-computer interaction IEEE Trans. Biomed. Eng. 55 923-9 (Pubitemid 351301222)
    • (2008) IEEE Transactions on Biomedical Engineering , vol.55 , Issue.3 , pp. 923-929
    • Ferrez, P.W.1    Del R. Millan, J.2
  • 53
    • 0019188610
    • From motivation to action: Functional interface between the limbic system and the motor system
    • DOI 10.1016/0301-0082(80)90018-0
    • Mogenson G J, Jones D L and Yim C Y 1980 From motivation to action: functional interface between the limbic system and the motor system Prog. Neurobiol. 14 69-97 (Pubitemid 10053405)
    • (1980) Progress in Neurobiology , vol.14 , Issue.2-3 , pp. 69-97
    • Mogenson, G.J.1    Jones, D.L.2    Yim, C.Y.3
  • 54
    • 84886486028
    • Feature extraction and unsupervised classification of neural population reward signals for reinforcement based BMI
    • Prins N et al 2013 Feature extraction and unsupervised classification of neural population reward signals for reinforcement based BMI Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. (EMBC) (Osaka, Japan, 2013) at press
    • (2013) Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. (EMBC)
    • Prins, N.1
  • 56
    • 84880955826
    • Brain-machine interface control of a robot arm using actor-critic reinforcement learning
    • Pohlmeyer E A et al 2012 Brain-machine interface control of a robot arm using actor-critic reinforcement learning Annu. Int. Conf. Proc. IEEE Eng. Med. Biol. Soc. pp 4108-11
    • (2012) Annu. Int. Conf. Proc. IEEE Eng. Med. Biol. Soc. , pp. 4108-4111
    • Pohlmeyer, E.A.1
  • 57
    • 0034641916
    • Learning of action through adaptive combination of motor primitives
    • 10.1038/35037588
    • Thoroughman K A and Shadmehr R 2000 Learning of action through adaptive combination of motor primitives Nature 407 742-7
    • (2000) Nature , vol.407 , pp. 742-747
    • Thoroughman, K.A.1    Shadmehr, R.2
  • 58
    • 75349098441
    • A neural basis for motor primitives in the spinal cord
    • 10.1523/JNEUROSCI.5894-08.2010
    • Hart C B and Giszter S F 2010 A neural basis for motor primitives in the spinal cord J. Neurosci. 30 1322-36
    • (2010) J. Neurosci. , vol.30 , pp. 1322-1336
    • Hart, C.B.1    Giszter, S.F.2
  • 59
    • 68049146044
    • Emergence of a stable cortical map for neuroprosthetic control
    • 10.1371/journal.pbio.1000153
    • Ganguly K and Carmena J M 2009 Emergence of a stable cortical map for neuroprosthetic control PLoS Biol. 7 e1000153
    • (2009) PLoS Biol. , vol.7 , pp. 1000153
    • Ganguly, K.1    Carmena, J.M.2
  • 60
    • 79960637995
    • A neural signature of hierarchical reinforcement learning
    • 10.1016/j.neuron.2011.05.042
    • Ribas-Fernandes J J F et al 2011 A neural signature of hierarchical reinforcement learning Neuron 71 370-9
    • (2011) Neuron , vol.71 , pp. 370-379
    • Ribas-Fernandes, J.J.F.1
  • 61
    • 0141988716
    • Recent advances in hierarchical reinforcement learning
    • 10.1023/A:1022140919877
    • Barto A G and Mahadevan S 2003 Recent advances in hierarchical reinforcement learning Discrete Event Dyn. Syst. 13 41-77
    • (2003) Discrete Event Dyn. Syst. , vol.13 , pp. 41-77
    • Barto, A.G.1    Mahadevan, S.2


* This information was analyzed and extracted by KISTI from Elsevier's SCOPUS database.