Neural Turing Machines.
Neural Turing Machines
The paper focused on the experiments and the results -- this notebook provides an overview of some of the implementation details.
The controller has a number of heads which each other read or write from the memory. The number and purpose of the heads is static (part of the network configuration). The generic API for the controller as a whole is only a sequence of vectors (describing a problem) and then the controller will output an answer for the given problem once a termination symbol is reached. There is no intermittent guidance on how the controller uses the NTM to organize information. The authors hypothesized, and at least to some degree demonstrated, that the controller and the NTM in concert learn to construct generalizable programs.
The read operation is a weighted sum of the stateful memory
The write is more complicated; it uses something akin to the gating mechanism in an LSTM to update memory positions. The update is applied across all of its memory, but again, the update is controlled using the same mask as above.
The NTM uses a novel mechanism for addressing content inside its memory.