Skip to content

jgcarrasco/dino-ink-detection

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Unsupervised Ink Detection with DINOv2

This repository will contain experiments related to unsupervised ink detection with DINO in an effort to implement an ink detector that does not require and has not been trained with any labels. For now, it includes a Colab notebook Open In Colab that lets you easily load some crop of a segment and visualize the output.

One of the problems with current supervised ink detection models is that they only work if we have ink labels, i.e. only on Scroll 1. The idea here is that, if we are able to obtain a somewhat decent detector that does not require labels, we could be able to find some letter/traces of ink in other scrolls and repeat the same ML/Human interaction loop that led to the Grand Prize results!

What is DINO?

Briefly, DINO ([1], [2]) is a vision transformer that has been trained on lots of images in a self-supervised way (without any sort of labels). The authors of DINO show that the model learns to pay attention to objects, we can think about it as a "rudimentary object detector".

alt text Source: DINOv2 paper

Given these results, I decided to test if it works on scroll data. Similar to the paper, what I did is to take a crop of a segment and use it as an input to DINO. Then, I applied PCA to the output for every layer of the segment. As it can be seen in the attached image, that the first and third components obtained by applying PCA to the 34th layer of the crop containing Casey's pi reveal traces of ink. I have also found some examples in other segments:

alt text

alt text

The cool thing about this is that it doesn't require labels at all and the model has not been trained without any sort of scroll data! The not-so-cool thing is that it only works on really obvious examples, where we can visually see crackle. However, the PCA output is easier to visualize, and given this preliminary results, I think that they can be improved.

About

Unsupervised Ink Detection with DINOv2

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors