GitHub - Tobi-De/leerming: An implementation of the `Leitner box` that can generate flashcards using llms from documents, youtube videos and web page links.

Unlocking Understanding, One Card at a Time

Note

Alpha quality software!.

Description

Leerming is an open-source Django-based web app that follows the Leitner box method. Create flashcards effortlessly from PDFs, videos, and web links. Supercharge your learning experience.

Leitner Box Method

The Leitner box method is a simple yet effective technique for learning and retaining information. It works by organizing flashcards into different boxes or levels. As you study, correctly answered flashcards move to higher boxes, while incorrect ones move down. This spaced repetition system helps reinforce your memory over time.

For a more detailed explanation of the Leitner box method, check out Wikipedia.

Leitner Box Algorithm Implementation

Flashcards are organized into seven distinct levels. Each card starts at Level 1. The transition between levels is based on performance during reviews.
Each level corresponds to a specific number of days between reviews. For example, Level 1 cards are reviewed daily, while Level 2 cards are reviewed every two days. The exact mapping can be found in the codebase here.
During a review, when a card is answered correctly, it moves up to the next level. Once a card reaches Level 7, it is marked as mastered.
On the other hand, if a card is answered incorrectly during a review, it is downgraded to Level 1, regardless of its previous level. This ensures that challenging material is revisited frequently, while mastered content is reviewed less frequently.

Card Generation from Documents

Leerming can currently generate flashcards from web pages, YouTube videos, PDF files and Microsoft Word documents.

Text Extraction: Uploaded documents, regardless of their original format, undergo automated text extraction, transforming the content into a common text format.
Text Segmentation and Storage: The extracted text is divided into smaller, manageable chunks. For each chunk, we generate embeddings using OpenAI's models. These embeddings, along with the original text content, are then stored in a PostgreSQL database equipped with pgvector. This step is executed by a dedicated worker process.
Key Question as Focal Point: Users provide a key question that serve as a central topic for generating flashcards. Additionally, users select one of their uploaded documents.
Chunk Matching with L2Distance: Leerming identifies document chunks that are closest to the user's key question using L2Distance, ensuring the relevance of the generated flashcards.
Prompt Generation with Language Models (LLM): Using the key question and the identified document chunks, Leerming generates an LLM prompt. This prompt is then sent to Language Models (LLM) to generate flashcards.

Local Development Setup

Requirements

Ensure you have the following prerequisites in place:

PostgreSQL database with the pgvector extension. If you use Docker, you can find a suitable image available.
Rye for streamlined dependency management. While not mandatory, it simplifies the process. You can use the requirements-dev.lock in the project root with any tool that supports the Python requirements.txt format.
An openai API key, you can get one at https://platform.openai.com/account/api-keys.

Setup and Run

Follow these steps to set up and run Leerming locally:

Clone the repository: git clone https://github.com/tobi-de/leerming.git
Navigate to the project directory: cd leerming
Create and activate a virtual environment: rye shell
Install dependencies: rye sync
Create a .env file by copying from .env.template and fill it out: cp .env.template .env
Apply migrations: python manage.py migrate
Create the cache table: python manage.py createcachetable
Install Watson for full-text search: python manage.py installwatson
Create a superuser: python manage.py makesuperuser
Start the development server: python manage.py runserver

Name		Name	Last commit message	Last commit date
Latest commit History 170 Commits
.github		.github
.idea		.idea
config		config
docker		docker
leerming		leerming
.env.template		.env.template
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
.watchmanconfig		.watchmanconfig
LICENSE		LICENSE
README.md		README.md
manage.py		manage.py
package-lock.json		package-lock.json
package.json		package.json
pyproject.toml		pyproject.toml
requirements-dev.lock		requirements-dev.lock
requirements.lock		requirements.lock
tailwind.config.js		tailwind.config.js

License

Tobi-De/leerming

Folders and files

Latest commit

History

Repository files navigation

Description

Leitner Box Method

Leitner Box Algorithm Implementation

Card Generation from Documents

Local Development Setup

Requirements

Setup and Run

About

Topics

Resources

License

Stars

Watchers

Forks

Languages