diCaptcha - Diffusion based Captcha

A repo for our team participating in AssemblyAI hackathon 2022 Video that describes the demo: https://youtu.be/ENv-qlfjSe8

Inspiration 💡

Ever tried to select all boxes that have traffic lights showing up and confused if you're supposed to check the very edge of it? The inspiration comes from providing a better Captcha user experience while keeping security also in mind.

What it does 🤖

It shows a user an AI generated image and ask to select keywords/tags that are associated with the image. Tags will have 3 correct tags/keywords (that are actually associated with the image and part of the prompt) Tags will also have 3 negative tags/keywords (that are randomly generated and have nothing to do with the image)

So here we use the fact that the human is supposed to decipher what sort of tags are associated with the image directly or indirectly. The image and keywords are associated in a more complex way than just literally asking to classify the image.

How we built it 🛠️

We used AWS to host the API's, and we divided our product into 3 components described below:-

AI component: We use diffusion models generated images from prompts | Convert prompts to tags using POS model and some preprocessing steps
Backend component: We randomly pick Image and Prompt from an API | We convert prompt to tags from another API
Frontend component: We display a Captcha like experience but with a modern touch to it using

Challenges we ran into

Challenges were in creating a NLP pipeline so we can select truly relevant keywords from the prompt (which is used to generate the image).
The Negative keywords were being selected from a random word generator we created. Even with carefully picking correct keywords from prompts and picking random words as negative keywords, we can sometimes look at the image and be confused as to what correct tags are. This needs further improvement.

Accomplishments that we're proud of ✨

We proud to create something creative out of diffusion models in the security space that is scalable at internet scale. Provides value to the internet security.

What we learned 🧠

More about language models, diffusion models and its ability to generate images. It was also our first Hackathon, Amir and I learnt many things both from ML engineering and time-management.

What's next for diCaptcha Diffusion Captcha For Creative Tastes 🔜

Make the Tag generation model better and more untuitive for making it easier.
Protect our service from scraping and attacks.
To add user specific Captcha, called personalized Captcha security. A user's intrests are used to show related images and asked to clear the task of selecting correct tags.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
Backend API		Backend API
MidJourney data		MidJourney data
backend		backend
frontend		frontend
instance code		instance code
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

diCaptcha - Diffusion based Captcha

Inspiration 💡

What it does 🤖

How we built it 🛠️

Challenges we ran into

Accomplishments that we're proud of ✨

What we learned 🧠

What's next for diCaptcha Diffusion Captcha For Creative Tastes 🔜

About

Releases

Packages

Contributors 2

Languages

salman-moh/diCaptcha

Folders and files

Latest commit

History

Repository files navigation

diCaptcha - Diffusion based Captcha

Inspiration 💡

What it does 🤖

How we built it 🛠️

Challenges we ran into

Accomplishments that we're proud of ✨

What we learned 🧠

What's next for diCaptcha Diffusion Captcha For Creative Tastes 🔜

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages