harpreif

Visual Representation Learning by solving Jigsaw puzzles using Deep Reinforcement Learning

Dataset

We take 240 objects and randomly choose 80 images from each of them. Then divide it into (50/10/20) for Training/Validation/Testing respectively. Then for testing for transfer learning, we take 30 images from the rest 16 object categories, and use that for transfer testing.

Input Construction

For input construction, a windowed HOG gradient (across 8 directions) is calculated for the image and then subsequently discretized, which gives us a state representation, as shown below:

Deep Q Network

The Deep Q network is used for evaluation function for Reinforcement Learning. The network is shown below:

Experimental Results

Test Images

The T-Sne plot for the image features (penultimate layer activation - FC3 layer) for the test images are plot across iterations. The results shows that RL agent learns to generate cluster to improve Learning.

20 neighbors

100 neighbors

Transfer Learning Test Images

The T-Sne plot for the image features (penultimate layer activation - FC3 layer) for the transfer test images are plot across iterations. The results shows that RL agent learns to generate cluster to improve Learning. The images were not used for training, and thus this shows transfer learning.

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
.idea		.idea
harpreif		harpreif
images		images
model		model
rep_viz		rep_viz
.gitignore		.gitignore
LICENCE.md		LICENCE.md
README.md		README.md
test_feature_creator.py		test_feature_creator.py
test_nb_finder.py		test_nb_finder.py
train.py		train.py
train_val_test_split.py		train_val_test_split.py
vizualize_network.py		vizualize_network.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

harpreif

Contents

Dataset

Input Construction

Deep Q Network