Implications of Synaptic Digitisation and Error For Neocortical
Function
Recent work in hippocampus (1,2) has shown that correlation-based
synaptic strengthening occurs digitally and may spread to nearby, inactive,
connections. We are exploring a model in which co-activity strengthens connections
by adding synapses. The added synapses usually appear at the co-active connection,
but with some low probability E they appear at neighboring cells (3). Regular
Hebbian learning is thus a special case of our model, with E = 0. We assume
that the total number of synapses formed by a given presynaptic cell upon
a row of postsynaptic cells is constant. With E = 0, the equation that describes
our model is dy/dt = (w - <w>)y, where y is the number of synapses at
a connection, w is the "fitness" of a synapse (which depends on
the degree of correlation across it), and <w> the mean fitness of all
synapses. Allowing errors in placing new synapses, the equation becomes ¶y/¶t
= (w - <w>)y + 0.5 w E ¶2y/¶x2,
where x is position along the postsynaptic row, and w is a function of x.
This equation represents the random left/right placement of erroneous synapses
as a diffusion process. If the presynaptic neuron is well correlated (w = wm)
with a small mesa of neurons, and poorly correlated (w = wp) with a large
surrounding plateau of neurons, then in the steady state most of the synapses
will accumulate in the high fitness zone, but there will be an exponential
fringe of synapses trailing off into the low fitness surround. This fringe
of synapses will lower the accuracy of the connection compared to the zero
error case, to an extent that depends on E and wm/wp. We have verified this
using a computer simulation in which synapses appear according to a mesa-plateau
fitness function, with a probability E of misplacement to the right or left
(Fig 1a). As expected we found that increasing E or decreasing (wm/wp) increased
the spread of the connection beyond the mesa. We also simulated the transfer
of synapses from an initial high fitness region at one end of the row to a
new high fitness region at the other. As expected we observed (Fig 1b) a variable
delay in the transfer process that was most evident with small numbers of
synapses.
Fig 1. (A) Steady state synapse spread beyond the target (cell
7) increases with E (circles: 0.1, squares: 0.2, up triangles: 0.3, down triangles:
0.4; wm/wp =1.4). (B) 13,000 synapses were placed initially on cell 1 and
the number of synapses at the target (cell 13) is plotted for 5 repeated runs
with E = 0.1, (wm/wp) = 1.05.
The conclusion that the accuracy of correlation-dependent wiring depends on
both the fidelity with which new synapses are placed and upon the sharpness
of the correlation profile is not surprising but it has interesting implications
for the formation of large networks such as neocortex. The useful size of
networks may be limited by the accuracy with which they can be wired. Evolution
has presumably lowered error rates as far as possible, by placing synapses
on spines isolated from neighbors. However, errors ultimately reflect molecular
noise, and cannot be eliminated without making synapses bulky and sparse.
It is doubtful if neocortex has appreciably more reliable synapses than other
brain regions. The other obvious way to improve the accuracy of cortical wiring
is to enhance the sharpness of correlations. This strategy is also unavailable
to cortex ( in the short term) because correlations depends on patterns in
the real world and on analysis by previous layers. However, although a given
cortical area cannot improve the presented correlations, at least it can measure
them. If it determines that the profile of correlations is sharp (wm/wp exceeds
some threshold) , then Hebbian learning can procede. If however the profile
is blunt, learning should be disabled until changes in the world, or in preceding
layers, produce clearer patterns.
Fig 2 shows a circuit that ensures any desired degree of wiring accuracy.
A top layer of cells projects to a middle, output, layer, and the matrix of
digital synapses between these layers transforms input vectors to output vectors
in the standard connectionist manner. Consider an input cell which is currently
wired to an output cell (shown as solid circles), but which could become wired
(open circles) to the neighbors of the output cell as a result of errors in
the strengthening of the existing connection. The bottom cells act as correlation
detectors. Thus the bottom central cell is excited when it receives nearly
simultaneous spikes from the currently connected pair of cells, but is inhibited
by its neighbors if they receive nearly simultaneous spikes from the top neuron
and the neighbors of the output neuron. The depicted distal placement of inputs
from the top layer and the proximal placement of inputs from the middle layer
will assist in such coincidence detection, but could be supplemented by various
nonlinearities. If the difference in the correlation signals measured by the
central bottom neuron and its flanking neighbors is sufficiently great that
accidental formation of erroneous connections will not spread, then the central
bottom cell fires and signals the top layer cell that all its synaptic connections
within the middle layer can be plastic. This could be achieved if the bottom
cells send axons back to the top cells to control their firing mode, and if
the firing mode controls the plasticity of the synapses formed by top layer
cells. If the correlation profile measured by the bottom cell is too low,
then it remains silent, and the top-to-middle synapses remain implastic. This
arrangement hinges on the distinction between "connected" and "unconnected"
inherent in synaptic digitisation. Only if connections are sparse is it feasible
to use a layer of correlation detectors to control learning. In the simplest
case, there need only be nXm bottom cells, where n is the number of cells
in the top or middle rows, and m is the number of neighbors.
The arrangement shown in Fig 2 is similar to that in neocortex, with the top
row being the thalamic relay cells, the middle row, layer 4, and the bottom
row, layer 6. Firing of layer 6 cells switches relay cells from burst to tonic
mode. A similar solution should also apply to the subsequent layers of cortical
processing, from 4 to 2/3 and from 2/3 to 5. However now the feedforward connections
whose plasticity must be regulated are entirely intracortical, so that the
burst/tonic transition of thalamic relay cells cannot be directly used. Instead
the plasticity of these connections must be regulated postsynaptically. It
is likely this is done by layer 1 inputs to apical tufts of layer 2,3 and
5 pyramidal cells, which set up local calcium spikes, which facilitate backpropagation
along the apical dendrite. These arrangements also agree well with the observation
that there are, in cat visual cortex, "simple" layer 6 cells (which
would compute the T-4 correlations) and "complex" layer 6 cells
(which would compute the 4-2/3 and 2/3-5 correlations). Interestingly these
simple and complex layer 6 cells send collaterals to layers 4 and 2/3/5 respectively,
where they make shaft "drumstick" synapses like those formed on
the distal dendrites of relay cells. These synapses could use the same correlation
sharpness signals to modulate within-layer recurrent excitation.
Fig 2. Proposed thalamocorticothalamic circuitry to contain error. Vertical
arrow controls presynaptic plasticity. Horizontal arrows show interactions
computing correlation ratios.
This solution to the problem posed by synaptic error carries 2 interesting
drawbacks. First, neocortex will learn more slowly than if it were a fully-connected
error-free network. Second, the solution does not prevent error, but merely
contains it. If the connections are plastic, errant synapses can still occur
(though they will likely be eliminated by competition with "correct"
synapses). Errant synapses can serve as seeds of future learning, so that
a "mutating" partly-connected network can eventually attain the
weights achievable directly in a fully-connected perfect network. Consider
the extreme case where, as a result of daytime learning, an errant synapse
flourishes and a relay cell transfers its allegiance to a neighbor of the
original layer 4 cell (Fig 3). Before any further plasticity is allowed, the
connections of the layer 6 cells must be updated, so they can continue to
limit error spread. This updating, which involves breaking the dotted connections
and making the dashed connections in Fig 3, must be done offline, with the
feedforward T-to-4 connections rendered implastic. Layer 4 to layer 6 connections,
which define the columns of cortex, are permanent. The connections from layer
T to layer 6 can be updated if the T cell fires in bursts, since this will
fire the newly connected layer 4 cell, which will fire the correct layer 6
cell. The connection from the new layer 4 cell to its partner layer 6 cell
will undergo Hebbian strengthening, and errant synapses will form onto the
neighbors of that layer 6 cell, as required. If this bursting activity advances
as a calibrating wave across thalamic nuclei, it will automatically update
all the T-to-6 connections. Updating the return pathway to thalamus is trickier,
because the new layer 6 cell must find all the relay cells that comprise the
receptive field of its partner layer 4 cell. Because the layer 6 cells act
as correlation detectors, they can do reverse correlation analysis to determine
this receptive field. White noise must be played into thalamus to perform
offline updating of the corticothalamic connections. Now it is required that
a group of relay cells fire the new layer 4 cell, so the calibrating, white
noise, input should be in tonic mode. These requirements match the features
of slow wave and paradoxical sleep. In the former case traveling bursts are
imposed on thalamus by its reticular nucleus, in the latter, irregularly discharging
brainstem cholinergic neurons bombard relay cells with brief nicotinic epsps.
Fig 3 shows the start and end of the allegiance transfer, but not intermediate
points, when the thalamic cell makes synapses on both layer 4 cells, which
at the end of the day comprise an extended high fitness zone. Now it is the
average correlation in the high fitness zone that should be compared to the
flanking correlations, and used to control the plasticity of the thalamic
neuron. This can be achieved if there is also offline updating of the lateral
interactions between neighboring layer 6 cells, so that during the transfer
the layer 6 cells marked 0 and 1 act as a unit, comparing their average activity
to that of the flanking cells (marked -1 and 2), and both feeding back to
the relay cell. This (and similar updating in layers 2, 3 and 5) can again
be accomplished in slow wave sleep. Because the connections of the layer 6
complex-type cells that control postsynaptic plasticity must also be updated
offline in a similar manner during sleep, and because the required calibration
signals (burst waves and white noise) are generated thalamically, layer 6
control of postsynaptic plasticity must also loop back through thalamus, via
matrix relay cells that synapse in layer 1. In this case white noise input
must be played into the postsynaptic cells, presumably by random matrix cell
spikes fired into the apical tufts under conditions where apical epsps initiate
somatic spikes.
Fig. 3. Rewiring after an allegiance shift. Dotted (A) connections
change to dashed (B).
There are gaps in the above reasoning. First, new synapses may be placed with
perfect fidelity. Under these conditions there is no way to explore new connections,
and by shrinking synapses to the point where errors occur, great increases
in computational power could be obtained. Second, implicit in the notion that
large networks need accurate wiring is an assumption of nonlinearity. Our
simple model of synapse formation is linear, and wiring degrades gracefully
with increasing error rates or decreasing correlation sharpness. However,
real computation requires nonlinearities, as does slow learning in backpropagation
networks. We are looking for wiring catastrophes in nonlinear models. Third,
the exact mechanisms of presynaptic plasticity control are unclear. Fourth,
although the proposed detailed wiring is consistent with known anatomy, it
goes slightly beyond it. Nevertheless, the principle of error containment
by correlation ratio measurement offers an attractive new view of cortical
function.
References
1. C.C.H. Petersen, R.C. Malenka, R.A. Nicoll and J.J. Hopfield.
All-or-none potentiation at CA3-CA1 synapses. Proc. Natl. Acad. Sci. 95 (1998),
4732-4737
2. F. Engert and T. Bonhoeffer. Synapse specificity of long-term potentiation
breaks down at short distances. Nature. 388 (1997), 279-284.
3. P.R. Adams. Hebb and Darwin. J.Theor.Biol.195(1998), 419-438.
BIOGRAPHIES Kingsley Cox did his PhD at Sussex University cloning
peptide receptors in snail neurons. At Stony Brook he has done postdoctoral
work on calcium imaging in zebra fish, and most recently on modeling synaptic
darwinism.
Paul Adams is a Professor in the Neurobiology Department. His early work was
on the biophysics of synaptic transmission, spiking and calcium signalling.
More recently he has worked on cellular aspects of thalamus. He has a PhD
from London University.