CONNECTIONS

Together with units, the connections are the core of a neural network. The activities of the units are computed from the activities of other units and the connection weights of the connections between units (see the Units section). Basically, a connection is a 3-tuple (from_unit, to_unit, weight). The "from_unit" and the "to_unit" are determined by the topology of the network and the "weight" is a real number that is optimized in the network optimization (or learning) process.

The syntax of the command that creates connections is:

USAGE: Connect [option] From To Net
       Option                                                        Default
       -d delay     Delay connections (or lookahead if negative)     (0 delay)
       -D from to   All delays between 'From' and 'To'               (off)
       -r bound     Bound on random initialisation                   (0.10)
       -f           Full connection, i.e. both ways                  (off)
       -w w         Constant connection (only between single units)  (rand)
       -s frac      Sparse conection. 'frac' is % connection prob.   (off)
"From" and "To" can be the names of two units. Then one connection will be created between those units. But a more common and useful use of the Connect command is to connect between two groups. For example: will connect all units in the group "my_first_group" with all units in the group "my_second_group".

The connection weight is by default a square distributed random variable. The range of this random initialization variable can be changed with the -r option. Also for connections between two named units, the weight can be set to a specific value using the -w option.

Time-delay

One of the strengts of NICO toolkit is how it handles dynamic networks. Connections can be delayed a number of frames with the -d option, but also look-ahead connections can be created (by specifying a negative delay). It is common to let a unit "see" a window of time-delay/look-ahead of other units in the network. This is most easily done with the -D option. Here are some examples:

Connect from group "input to group "hidden" with delay 1:

 Connect from group "input to group "hidden" with look-ahead 3: Connect from group "input to group "hidden" with a window of -3 to +3 frames (creating seven connections between the pairs of units): Of cource, when the the network runs and during the back-propagation training there can be no "real" look-ahead connections because then it would not be a feed-forward network, but the NICO toolkit automatically converts all connections to time-delays when the network runs by delaying some of the units. This, however is completely transparent to the user. If the user attempts to make commections that would create a "zero-time loop" in the network, i.e. add connections so that the activation of some unit would depend on itself at the present or future times, the conversion to a feed-forward network is impossible and the NICO toolkit refuses to add the new connections.

Plasticity

Besides their weight, connections have an additional property: plasticity. This is a value controlling how fast this particular connection should be changed in the learning process. By default, all connections have plasticity 1.0, but it can be changed with the SetPlast command. Setting the plasticity to 0.0 prevents this connection to be altered at all -- higher values increases the learning rate. This is practical if you want to train only some part of a big network and keep the rest of the network constant. For example, the two commands: Forces the network to change only connections from the bias unit ("network" is the root group -- all ather groups are sub-groups of it, see the Groups section).

Besides Connect, the following commands can be used to create connections:

Sparse Connectivity

The toolkit has some powerful tools for sparse connectivity. When you connect the units in one group to another, you can choose to randomly create only a fraction of the connections.  The -s option of the Connect command does this basic sparse connection. You can also connect units in the groups with varying probability depending on their 'distance' from each other with the command Metricnct. The 'distance is simply based on the ordering of the units in their respective groups, e.g., unit #5 of group1 and unit #21 of group2 have a 'distance' of 16.

Pruning

The command Prune can be used to reduce the number of connections in a trained network by removing connections with small weights or alternatively, both small weight and small gradient over a trining set. This can be used to dramatically reduce the size and runtime of trained networks.