Neural-Turing-Machine

A modular implementation of the Neural Turing Machine introduced by Alex Graves et al.

Currently, two tasks have been implemented, Copy Task and Associative Recall Task as tf.keras.Model wrapper, available in the NTM_Model.py

Use them as showed in the Training Notebooks

Architecture Implemented

Since the paper only provides the mathematical operations for the generation and use of the Heads' Weighings, not the full architecture, thus the complete architecture becomes an open ended problem, where I've used the following architecture:

Task Results

1. Copy Task

Training the above NTM on randomized sequence length between 1 and 20 yields the following results.

1.1. Till 10,000 epochs on Cross Entropy Loss.

Test 1:-

Input:

Sequence Length = 9, including the Start Of File and End Of File delimeters.

Output:

Test 2:-

Input:

Sequence Length = 33, including the Start Of File and End Of File delimeters.

Note that it is more than what the above NTM is trained upon.

Output:

Test 3:-

Input:

Sequence Length = 73, including the Start Of File and End Of File delimeters.

Output:

1.2. Till 20,000 epochs on Cross Entropy Loss.

Input:

Sequence Length = 90, including the Start Of File and End Of File delimeters.

Output:

Error incurred

Memory Matrix for this input after last timestep:

Results on more tasks to follow soon...

2. Associative Recall Task

Training the Associative Recall Model for 158,000 episodes on randomized item numbers between 2 and 6 yields the following results:

Input

Output from NTM

Write Weighing while Reading over time

Memory Matrix compared with Read Vectors while Writing

Memory Matrix Evolution over Time

Progress Timeline:

Wed, Jan 15:-

Completed the NTMCell Implementation along with various Vector Generation Tasks.
Also tested it's result with dynamic_RNN, observed some NaN values in the result, was fixed by initializing states by a considerably low (0.5 in this case) value.

Sun, Jan 19:-

Added sigmoid layer on Heads_w_t which produced much better results on one time step passes (not the training)
Random Initialization works well now too
In the process of finalizing the training schedule.

Fri, Jan 31:-

First Complete version, added Inputs Generator for Copy Task and some minor bug fixes.
One still needs to train this though, there maybe some problems during training which one needs to solve.

Sun, Feb 2:-

Training with Cross Entropy Loss Function proved to be difficult as loss seem to be stuck somewhere between 0.4 - 0.55
*Using Huber Loss Function seem to generate much better results, as loss seem to decrease linearly from 1.2 to 0.6 on max sequence length in about 10,000 epochs, after 1 injection of randomized initial states while preserving the weights.

Wed, Feb 6:-

More careful analysis brought some more subtle bugs, which were holding back the generalisation of the model, removing those increases generalization much better with Cross Entropy Loss now.

Sun, Feb 16:-

Added Associative Recall Task.

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
Development Process		Development Process
RESULTS		RESULTS
Training Notebooks		Training Notebooks
Batch_Focusing.py		Batch_Focusing.py
Batch_RWV_Generation.py		Batch_RWV_Generation.py
Focusing.py		Focusing.py
NTM Familiarization.ipynb		NTM Familiarization.ipynb
NTMCell.py		NTMCell.py
NTM_Model.py		NTM_Model.py
README.md		README.md
ReadWriteVectorGeneration.py		ReadWriteVectorGeneration.py
utilities.py		utilities.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Neural-Turing-Machine

Architecture Implemented

Task Results

1. Copy Task

1.1. Till 10,000 epochs on Cross Entropy Loss.

Test 1:-

Input:

Output:

Test 2:-

Input:

Output:

Test 3:-

Input:

Output:

1.2. Till 20,000 epochs on Cross Entropy Loss.

Input:

Output:

Error incurred

Memory Matrix for this input after last timestep:

Results on more tasks to follow soon...

2. Associative Recall Task

Input

Output from NTM

Write Weighing while Reading over time

Memory Matrix compared with Read Vectors while Writing

Memory Matrix Evolution over Time

Progress Timeline:

About

Releases

Packages

Languages

WhenDustSettles/Neural-Turing-Machine

Folders and files

Latest commit

History

Repository files navigation

Neural-Turing-Machine

Architecture Implemented

Task Results

1. Copy Task

1.1. Till 10,000 epochs on Cross Entropy Loss.

Test 1:-

Input:

Output:

Test 2:-

Input:

Output:

Test 3:-

Input:

Output:

1.2. Till 20,000 epochs on Cross Entropy Loss.

Input:

Output:

Error incurred

Memory Matrix for this input after last timestep:

Results on more tasks to follow soon...

2. Associative Recall Task

Input

Output from NTM

Write Weighing while Reading over time

Memory Matrix compared with Read Vectors while Writing

Memory Matrix Evolution over Time

Progress Timeline:

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages