TY - JOUR
T1 - Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances
AU - Fiete, Ila R.
AU - Fee, Michale S.
AU - Seung, H. Sebastian
PY - 2007/10
Y1 - 2007/10
N2 - We propose a model of songbird learning that focuses on avian brain areas HVC and RA, involved in song production, and area LMAN, important for generating song variability. Plasticity at HVC → RA synapses is driven by hypothetical "rules" depending on three signals: activation of HVC → RA synapses, activation of LMAN → RA synapses, and reinforcement from an internal critic that compares the bird's own song with a memorized template of an adult tutor's song. Fluctuating glutamatergic input to RA from LMAN generates behavioral variability for trial-and-error learning. The plasticity rules perform gradient-based reinforcement learning in a spiking neural network model of song production. Although the reinforcement signal is delayed, temporally imprecise, and binarized, the model learns in a reasonable amount of time in numerical simulations. Varying the number of neurons in HVC and RA has little effect on learning time. The model makes specific predictions for the induction of bidirectional long-term plasticity at HVC → RA synapses.
AB - We propose a model of songbird learning that focuses on avian brain areas HVC and RA, involved in song production, and area LMAN, important for generating song variability. Plasticity at HVC → RA synapses is driven by hypothetical "rules" depending on three signals: activation of HVC → RA synapses, activation of LMAN → RA synapses, and reinforcement from an internal critic that compares the bird's own song with a memorized template of an adult tutor's song. Fluctuating glutamatergic input to RA from LMAN generates behavioral variability for trial-and-error learning. The plasticity rules perform gradient-based reinforcement learning in a spiking neural network model of song production. Although the reinforcement signal is delayed, temporally imprecise, and binarized, the model learns in a reasonable amount of time in numerical simulations. Varying the number of neurons in HVC and RA has little effect on learning time. The model makes specific predictions for the induction of bidirectional long-term plasticity at HVC → RA synapses.
UR - http://www.scopus.com/inward/record.url?scp=35348872545&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=35348872545&partnerID=8YFLogxK
U2 - 10.1152/jn.01311.2006
DO - 10.1152/jn.01311.2006
M3 - Article
C2 - 17652414
AN - SCOPUS:35348872545
SN - 0022-3077
VL - 98
SP - 2038
EP - 2057
JO - Journal of neurophysiology
JF - Journal of neurophysiology
IS - 4
ER -