Starred repositories
Toolkit for linearizing PDFs for LLM datasets/training
A text extraction library supporting PDFs, images, office documents and more
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
Python tool for converting files and office documents to Markdown.
🚫 Stop saying "you forgot to …" in code review (in Ruby)
Basis set optimization library for quantum chemistry
⚡️ A curated list of awesome things related to marimo
A Python package for accessing various properties of elements, ions and isotopes in the periodic table of elements.
A Git-compatible VCS that is both simple and powerful
Demonstrating the Dashboard++ method of organizing a vault in Obsidian
Blender Addon: Differential Growth
Everything about the SmolLM2 and SmolVLM family of models
Sphinx extension for BibTeX-style references.
This is a repo with links to everything you'd ever want to learn about data engineering
Opinionated RAG for integrating GenAI in your apps 🧠 Focus on your product rather than the RAG. Easy integration in existing products with customisation! Any LLM: GPT4, Groq, Llama. Any Vectorstore: …
Command line program which provides a suite of tools to create shopping lists and maintain recipes.
Edit and display CookLang recipes in Obsidian
ETL, Analytics, Versioning for Unstructured Data
Comprehensive country code information, including ISO 3166 codes, ITU dialing codes, ISO 4217 currency codes, and many others
Trio – a friendly Python library for async concurrency and I/O
🦀 Small exercises to get you used to reading and writing Rust code!
skchange provides sktime-compatible change detection and changepoint-based anomaly detection algorithms
A curated collection of tools to aid transcriptionists and subtitlers.