TY - JOUR
T1 - Learning in spiking neural networks by reinforcement of stochastic synaptic transmission
AU - Seung, Hyunjune Sebastian
PY - 2003/12/18
Y1 - 2003/12/18
N2 - It is well-known that chemical synaptic transmission is an unreliable process, but the function of such unreliability remains unclear. Here I consider the hypothesis that the randomness of synaptic transmission is harnessed by the brain for learning, in analogy to the way that genetic mutation is utilized by Darwinian evolution. This is possible if synapses are "hedonistic," responding to a global reward signal by increasing their probabilities of vesicle release or failure, depending on which action immediately preceded reward. Hedonistic synapses learn by computing a stochastic approximation to the gradient of the average reward. They are compatible with synaptic dynamics such as short-term facilitation and depression and with the intricacies of dendritic integration and action potential generation. A network of hedonistic synapses can be trained to perform a desired computation by administering reward appropriately, as illustrated here through numerical simulations of integrate-and-fire model neurons.
AB - It is well-known that chemical synaptic transmission is an unreliable process, but the function of such unreliability remains unclear. Here I consider the hypothesis that the randomness of synaptic transmission is harnessed by the brain for learning, in analogy to the way that genetic mutation is utilized by Darwinian evolution. This is possible if synapses are "hedonistic," responding to a global reward signal by increasing their probabilities of vesicle release or failure, depending on which action immediately preceded reward. Hedonistic synapses learn by computing a stochastic approximation to the gradient of the average reward. They are compatible with synaptic dynamics such as short-term facilitation and depression and with the intricacies of dendritic integration and action potential generation. A network of hedonistic synapses can be trained to perform a desired computation by administering reward appropriately, as illustrated here through numerical simulations of integrate-and-fire model neurons.
UR - http://www.scopus.com/inward/record.url?scp=0347362917&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=0347362917&partnerID=8YFLogxK
U2 - 10.1016/S0896-6273(03)00761-X
DO - 10.1016/S0896-6273(03)00761-X
M3 - Article
C2 - 14687542
AN - SCOPUS:0347362917
SN - 0896-6273
VL - 40
SP - 1063
EP - 1073
JO - Neuron
JF - Neuron
IS - 6
ER -