Zero-shot Image Recognition using Convolutions and Knowledge Graphs

This project implements the method described in Zero-shot Recognition via Semantic Embeddings and Knowledge Graphs and applies it to identifying different classes of snakes!

Dependencies

In order to run this project you must have first installed the following dependencies:

Python 3
numpy
matplotlib
torch
torchvision

Problem Definition

Traditional Models (closed-world):

new class → new data needed (usually lots), redo fine-tuning
Models take a lot of time to fine-tune (assuming we even have the extra data)

Solution: Zero-shot learning - Infer knowledge from past training

Implicit knowledge: learn vector representation of categories using text data → learn mapping connecting vector representation to visual classifier
Explicit knowledge: relations from KGs and using them as zero-shot classifiers

Our task: combine both implicit and explicit knowledge using a KG and GCN (following the method of [Wang, Ye, and Gupta 2018]). Applying zero-shot approach to small/noisy/hard-to-learn dataset of snake images.

Datasets

ImageNet 2012 1k
Snake dataset
SnakeKG
- a custom subset of Wikidata

Methodology

ResNet

ResNet is used for visual feature extraction as well as providing a baseline method for comparison.

Knowledge Graphs (KG)

The KG we use for our project is constructed as a subset of the Wikidata knowledge graph. We use the mapping from the ImageNet dataset to Wikidata entities created by Filipiak, D., Fensel, A., & Filipowska, A. to select the nodes from the ImageNet dataset. Wikidata identifiers for the 10 classes not in the ImageNet dataset are identified to include those nodes in the subgraph.

Graph Convolutional Networks (GCN)

To integrate the information from our KG as part of the classifications, this method employs a 6-layer GCN.

References

While most of the referenced work is linked directly above, it is also more formally collected below:

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
data		data
imgs		imgs
.gitignore		.gitignore
README.md		README.md
construct_graph.py		construct_graph.py
create_id_list.py		create_id_list.py
gcn.py		gcn.py
graph_stats.py		graph_stats.py
gtc.py		gtc.py
main.py		main.py
preprocess_graph.py		preprocess_graph.py
preprocess_imagenet.py		preprocess_imagenet.py
subset_vects.py		subset_vects.py
test.py		test.py
word_embeddings.py		word_embeddings.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Zero-shot Image Recognition using Convolutions and Knowledge Graphs

Dependencies

Problem Definition

Datasets

Methodology

ResNet

Knowledge Graphs (KG)

Graph Convolutional Networks (GCN)

References

About

Releases

Packages

Contributors 2

Languages

bpark2/zero_shot_kg_cnn

Folders and files

Latest commit

History

Repository files navigation

Zero-shot Image Recognition using Convolutions and Knowledge Graphs

Dependencies

Problem Definition

Datasets

Methodology

ResNet

Knowledge Graphs (KG)

Graph Convolutional Networks (GCN)

References

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages