Skip to content

This repository contains all the information, code and data about the NFT classification with visual feature extraction by computer vision.

Notifications You must be signed in to change notification settings

DemocracyStudio/NFT_similarity_search_engine

Repository files navigation

NFT similarity search engine with visual feature extraction using VGG16 Convolutional Neural Network

This repository contains all the information, code and data about the NFT classification with visual feature extraction by computer vision, started as a group project at the Machine Learning degree of opencampus.sh. Team mates are: Lennert Jessen, Julien Carbonnell.

The video recording of the class presentation is available here

Project Description:

The recent increase of digital art and digitalization of historic artworks hosted in museums, combined with the improvement of machine learning techniques to train computers on achieving performative visual recognition tasks, allows us to envision large scale art classifiers for different purposes. The detection of authentic and fake paintings, the recommendation of similar artworks from a given one, As for our concern, we will try to use computer vision to identify what makes an NFT valuable or not.

As a nascent market of $50B capitalization, NFT sales remain highly volatile and hardly predictable. We are wondering if NFTs values forecasting could benefit from a visual feature classification performed by artificial intelligence. We will use Convolutional Neural Networks pre-trained both on real-world images classification and on art classification. As the enhancement of the earlier improve the laster, we are expecting that NFT classification will benefit from transfer learning of previously trained models. However, our purpose will not be to detect the belonging on an NFT to a collection or an artist, but how much it would be valued on the market.

A pitch deck is available here.

Datasets:

There are more NFTs on NFT marketplaces today than there were websites in 2010. Covering the whole available NFTs' data would have been impossible. Our model ambitions to experiment an undiscovered approach to NFT sales forecasting by using feature extraction on images, which induces some computational costs due to a high data size.

To collect full datasets including images, we combined existing datasets Kaggle, with OpenSea API requests, and data scraping with selenium. Here is a link to the kaggle datasets:

Protocol:

  • Collect datasets
  • Resize the images to 128x128 pixels
  • Pixel-to-pixel comparison with Euclidean Distance (based on colors)
  • Visual feature extraction with VGG16 CNN (based on shapes) and Euclidean Distance again
  • Unlabelled image clustering with K-Nearest Neighbor on VGG16 output
  • Compare the similarity searches methods

Literature:

2021

2020

2015

A literature review is available here.

About

This repository contains all the information, code and data about the NFT classification with visual feature extraction by computer vision.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published