Skip to content

dylanhogg/awesome-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Awesome Python

Awesome Last commit License: MIT

Hand-picked awesome Python libraries and frameworks, organised by category ๐Ÿ

Interactive version: www.awesomepython.org

Updated 14 Apr 2024

Categories

  • Newly Created Repositories - Awesome Python is regularly updated, and this category lists the most recently created GitHub repositories from all the other repositories here (10 repos)
  • Code Quality - Code quality tooling: linters, formatters, pre-commit hooks, unused code removal (16 repos)
  • Crypto and Blockchain - Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity (13 repos)
  • Data - General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks (97 repos)
  • Debugging - Debugging and tracing tools (9 repos)
  • Diffusion Text to Image - Text-to-image diffusion model libraries, tools and apps for generating images from natural language (36 repos)
  • Finance - Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives (31 repos)
  • Game Development - Game development tools, engines and libraries (6 repos)
  • GIS - Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections (28 repos)
  • Graph - Graphs and network libraries: network analysis, graph machine learning, visualisation (6 repos)
  • GUI - Graphical user interface libraries and toolkits (8 repos)
  • Jupyter - Jupyter and JupyterLab and Notebook tools, libraries and plugins (24 repos)
  • LLMs and ChatGPT - Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integations. Also see the Natural Language Processing category for crossover (220 repos)
  • Math and Science - Mathematical, numerical and scientific libraries (22 repos)
  • Machine Learning - General - General and classical machine learning libraries. See below for other sections covering specialised ML areas (153 repos)
  • Machine Learning - Deep Learning - Machine learning libraries that cross over with deep learning in some way (71 repos)
  • Machine Learning - Interpretability - Machine learning interpretability libraries. Covers explainability, prediction explainations, dashboards, understanding knowledge development in training (13 repos)
  • Machine Learning - Ops - MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models (40 repos)
  • Machine Learning - Reinforcement - Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environemnts, RLHF (22 repos)
  • Machine Learning - Time Series - Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics (18 repos)
  • Natural Language Processing - Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover (82 repos)
  • Packaging - Python packaging, dependency management and bundling (28 repos)
  • Pandas - Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations (23 repos)
  • Performance - Performance, parallelisation and low level libraries (28 repos)
  • Profiling - Memory and CPU/GPU profiling tools and libraries (11 repos)
  • Security - Security related libraries: vulnerability discovery, SQL injection, environment auditing (14 repos)
  • Simulation - Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Maths and Science category for crossover (27 repos)
  • Study - Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials (52 repos)
  • Template - Template tools and libraries: cookiecutter repos, generators, quick-starts (8 repos)
  • Terminal - Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars (14 repos)
  • Testing - Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins (22 repos)
  • Typing - Typing libraries: static and run-time type checking, annotations (12 repos)
  • Utility - General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools (185 repos)
  • Vizualisation - Vizualisation tools and libraries. Application frameworks, 2D/3D plotting, dashboards, WebGL (33 repos)
  • Web - Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management (56 repos)

Newly Created Repositories

Awesome Python is regularly updated, and this category lists the most recently created GitHub repositories from all the other repositories here.

  1. xai-org/grok-1 โญ 47,201
    This repository contains JAX example code for loading and running the Grok-1 open-weights model.

  2. karpathy/llm.c โญ 13,635
    LLM training in simple, pure C/CUDA. There is no need for 245MB of PyTorch or 107MB of cPython

  3. karpathy/minbpe โญ 7,869
    Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

  4. skyvern-ai/skyvern โญ 3,280
    Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows, replacing brittle or unreliable automation solutions.
    ๐Ÿ”— www.skyvern.com

  5. databricks/dbrx โญ 2,301
    Code examples and resources for DBRX, a large language model developed by Databricks
    ๐Ÿ”— www.databricks.com

  6. amazon-science/chronos-forecasting โญ 1,503
    Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

  7. luijait/DarkGPT โญ 1,336
    DarkGPT is an OSINT assistant based on GPT-4-200K (recommended use) designed to perform queries on leaked databases, thus providing an artificial intelligence assistant that can be useful in your traditional OSINT processes.

  8. time-series-foundation-models/lag-llama โญ 913
    Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

  9. dagworks-inc/burr โญ 371
    Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, persist, and execute on your own infrastructure.
    ๐Ÿ”— burr.dagworks.io

  10. lightning-ai/litdata โญ 175
    Streamline data pipelines for AI. Process datasets across 1000s of machines, and optimize data for blazing fast model training.

Code Quality

Code quality tooling: linters, formatters, pre-commit hooks, unused code removal.

  1. psf/black โญ 37,325
    The uncompromising Python code formatter
    ๐Ÿ”— black.readthedocs.io/en/stable

  2. astral-sh/ruff โญ 26,071
    An extremely fast Python linter and code formatter, written in Rust.
    ๐Ÿ”— docs.astral.sh/ruff

  3. google/yapf โญ 13,637
    A formatter for Python files

  4. pre-commit/pre-commit โญ 11,967
    A framework for managing and maintaining multi-language pre-commit hooks.
    ๐Ÿ”— pre-commit.com

  5. sqlfluff/sqlfluff โญ 7,186
    A modular SQL linter and auto-formatter with support for multiple dialects and templated code.
    ๐Ÿ”— www.sqlfluff.com

  6. pycqa/isort โญ 6,305
    A Python utility / library to sort imports.
    ๐Ÿ”— pycqa.github.io/isort

  7. davidhalter/jedi โญ 5,650
    Awesome autocompletion, static analysis and refactoring library for python
    ๐Ÿ”— jedi.readthedocs.io

  8. pycqa/pylint โญ 5,096
    It's not just a linter that annoys you!
    ๐Ÿ”— pylint.readthedocs.io/en/latest

  9. asottile/pyupgrade โญ 3,316
    A tool (and pre-commit hook) to automatically upgrade syntax for newer versions of the language.

  10. pycqa/flake8 โญ 3,248
    flake8 is a python tool that glues together pycodestyle, pyflakes, mccabe, and third-party plugins to check the style and quality of some python code.
    ๐Ÿ”— flake8.pycqa.org

  11. jendrikseipp/vulture โญ 2,996
    Find dead Python code

  12. wemake-services/wemake-python-styleguide โญ 2,413
    The strictest and most opinionated python linter ever!
    ๐Ÿ”— wemake-python-styleguide.rtfd.io

  13. python-lsp/python-lsp-server โญ 1,643
    Fork of the python-language-server project, maintained by the Spyder IDE team and the community

  14. sourcery-ai/sourcery โญ 1,475
    Instant AI code reviews to speed up your pull requests
    ๐Ÿ”— sourcery.ai

  15. akaihola/darker โญ 608
    Apply black reformatting to Python files only in regions changed since a given commit. For a practical usage example, see the blog post at https://dev.to/akaihola/improving-python-code-incrementally-3f7a
    ๐Ÿ”— pypi.org/project/darker

  16. tconbeer/sqlfmt โญ 340
    sqlfmt formats your dbt SQL files so you don't have to
    ๐Ÿ”— sqlfmt.com

Crypto and Blockchain

Cryptocurrency and blockchain libraries: trading bots, API integration, Ethereum virtual machine, solidity.

  1. ccxt/ccxt โญ 31,218
    A JavaScript / TypeScript / Python / C# / PHP cryptocurrency trading API with support for more than 100 bitcoin/altcoin exchanges
    ๐Ÿ”— docs.ccxt.com

  2. freqtrade/freqtrade โญ 25,353
    Free, open source crypto trading bot
    ๐Ÿ”— www.freqtrade.io

  3. crytic/slither โญ 4,972
    Static Analyzer for Solidity and Vyper
    ๐Ÿ”— blog.trailofbits.com/2018/10/19/slither-a-solidity-static-analysis-framework

  4. ethereum/web3.py โญ 4,768
    A python interface for interacting with the Ethereum blockchain and ecosystem.
    ๐Ÿ”— web3py.readthedocs.io

  5. ethereum/consensus-specs โญ 3,401
    Ethereum Proof-of-Stake Consensus Specifications

  6. cyberpunkmetalhead/Binance-volatility-trading-bot โญ 3,343
    This is a fully functioning Binance trading bot that measures the volatility of every coin on Binance and places trades with the highest gaining coins If you like this project consider donating though the Brave browser to allow me to continuously improve the script.

  7. ethereum/py-evm โญ 2,180
    A Python implementation of the Ethereum Virtual Machine
    ๐Ÿ”— py-evm.readthedocs.io/en/latest

  8. bmoscon/cryptofeed โญ 2,060
    Cryptocurrency Exchange Websocket Data Feed Handler

  9. binance/binance-public-data โญ 1,323
    Details on how to get Binance public data

  10. ofek/bit โญ 1,205
    Bitcoin made easy.
    ๐Ÿ”— ofek.dev/bit

  11. man-c/pycoingecko โญ 1,032
    Python wrapper for the CoinGecko API

  12. palkeo/panoramix โญ 748
    Ethereum decompiler

  13. dylanhogg/awesome-crypto โญ 64
    A list of awesome crypto and blockchain projects
    ๐Ÿ”— www.awesomecrypto.xyz

Data

General data libraries: data processing, serialisation, formats, databases, SQL, connectors, web crawlers, data generation/augmentation/checks.

  1. scrapy/scrapy โญ 50,773
    Scrapy, a fast high-level web crawling & scraping framework for Python.
    ๐Ÿ”— scrapy.org

  2. apache/spark โญ 38,244
    Apache Spark - A unified analytics engine for large-scale data processing
    ๐Ÿ”— spark.apache.org

  3. getredash/redash โญ 24,885
    Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.
    ๐Ÿ”— redash.io

  4. jaidedai/EasyOCR โญ 21,780
    Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
    ๐Ÿ”— www.jaided.ai

  5. mindsdb/mindsdb โญ 21,145
    The platform for customizing AI from enterprise data
    ๐Ÿ”— mindsdb.com

  6. qdrant/qdrant โญ 17,686
    Qdrant - High-performance, massive-scale Vector Database for the next generation of AI. Also available in the cloud https://cloud.qdrant.io/
    ๐Ÿ”— qdrant.tech

  7. joke2k/faker โญ 17,070
    Faker is a Python package that generates fake data for you.
    ๐Ÿ”— faker.readthedocs.io

  8. humansignal/label-studio โญ 16,340
    Label Studio is an open source data labeling tool. It lets you label data types like audio, text, images, videos, and time series with a simple and straightforward UI and export to various model formats.
    ๐Ÿ”— labelstud.io

  9. binux/pyspider โญ 16,297
    A Powerful Spider(Web Crawler) System in Python.
    ๐Ÿ”— docs.pyspider.org

  10. twintproject/twint โญ 15,522
    An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations.

  11. airbytehq/airbyte โญ 13,820
    The leading data integration platform for ETL / ELT data pipelines from APIs, databases & files to data warehouses, data lakes & data lakehouses. Both self-hosted and Cloud-hosted.
    ๐Ÿ”— airbyte.com

  12. apache/arrow โญ 13,441
    Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
    ๐Ÿ”— arrow.apache.org

  13. tiangolo/sqlmodel โญ 12,848
    SQL databases in Python, designed for simplicity, compatibility, and robustness.
    ๐Ÿ”— sqlmodel.tiangolo.com

  14. redis/redis-py โญ 12,213
    Redis Python client

  15. chroma-core/chroma โญ 12,009
    the AI-native open-source embedding database
    ๐Ÿ”— www.trychroma.com

  16. coleifer/peewee โญ 10,771
    a small, expressive orm -- supports postgresql, mysql, sqlite and cockroachdb
    ๐Ÿ”— docs.peewee-orm.com

  17. s0md3v/Photon โญ 10,473
    Incredibly fast crawler designed for OSINT.

  18. simonw/datasette โญ 8,863
    An open source multi-tool for exploring and publishing data
    ๐Ÿ”— datasette.io

  19. sqlalchemy/sqlalchemy โญ 8,708
    The Database Toolkit for Python
    ๐Ÿ”— www.sqlalchemy.org

  20. bigscience-workshop/petals โญ 8,620
    ๐ŸŒธ Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading
    ๐Ÿ”— petals.dev

  21. avaiga/taipy โญ 8,226
    Turns Data and AI algorithms into production-ready web applications in no time.
    ๐Ÿ”— www.taipy.io

  22. yzhao062/pyod โญ 7,922
    A Comprehensive and Scalable Python Library for Outlier Detection (Anomaly Detection)
    ๐Ÿ”— pyod.readthedocs.io

  23. gristlabs/grist-core โญ 6,160
    Grist is the evolution of spreadsheets.
    ๐Ÿ”— www.getgrist.com

  24. alirezamika/autoscraper โญ 5,913
    A Smart, Automatic, Fast and Lightweight Web Scraper for Python

  25. kaggle/kaggle-api โญ 5,886
    Official Kaggle API

  26. vi3k6i5/flashtext โญ 5,528
    Extract Keywords from sentence or Replace keywords in sentences.

  27. madmaze/pytesseract โญ 5,482
    A Python wrapper for Google Tesseract

  28. airbnb/knowledge-repo โญ 5,429
    A next-generation curated knowledge sharing platform for data scientists and other technical professions.

  29. tobymao/sqlglot โญ 5,384
    Python SQL Parser and Transpiler
    ๐Ÿ”— sqlglot.com

  30. facebookresearch/AugLy โญ 4,898
    A data augmentations library for audio, image, text, and video.
    ๐Ÿ”— ai.facebook.com/blog/augly-a-new-data-augmentation-library-to-help-build-more-robust-ai-models

  31. jazzband/tablib โญ 4,517
    Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c.
    ๐Ÿ”— tablib.readthedocs.io

  32. superduperdb/superduperdb โญ 4,304
    ๐Ÿ”ฎ SuperDuperDB: Bring AI to your database! Build, deploy and manage any AI application directly with your existing data infrastructure, without moving your data. Including streaming inference, scalable model training and vector search.
    ๐Ÿ”— superduperdb.com

  33. lk-geimfari/mimesis โญ 4,296
    Mimesis is a powerful Python library that empowers developers to generate massive amounts of synthetic data efficiently.
    ๐Ÿ”— mimesis.name

  34. amundsen-io/amundsen โญ 4,266
    Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.
    ๐Ÿ”— www.amundsen.io/amundsen

  35. mongodb/mongo-python-driver โญ 4,041
    PyMongo - the Official MongoDB Python driver
    ๐Ÿ”— pymongo.readthedocs.io

  36. ibis-project/ibis โญ 4,040
    Ibis is a Python library that provides a lightweight, universal interface for data wrangling. It helps Python users explore and transform data of any size, stored anywhere.
    ๐Ÿ”— ibis-project.org

  37. andialbrecht/sqlparse โญ 3,572
    A non-validating SQL parser module for Python

  38. jmcnamara/XlsxWriter โญ 3,481
    A Python module for creating Excel XLSX files.
    ๐Ÿ”— xlsxwriter.readthedocs.io

  39. run-llama/llama-hub โญ 3,386
    A library of data loaders for LLMs made by the community -- to be used with LlamaIndex and/or LangChain
    ๐Ÿ”— llamahub.ai

  40. deepchecks/deepchecks โญ 3,325
    Deepchecks: Tests for Continuous Validation of ML Models & Data. Deepchecks is a holistic open-source solution for all of your AI & ML validation needs, enabling to thoroughly test your data and models from research to production.
    ๐Ÿ”— docs.deepchecks.com/stable

  41. praw-dev/praw โญ 3,307
    PRAW, an acronym for "Python Reddit API Wrapper", is a python package that allows for simple access to Reddit's API.
    ๐Ÿ”— praw.readthedocs.io

  42. rom1504/img2dataset โญ 3,210
    Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

  43. pyeve/cerberus โญ 3,103
    Lightweight, extensible data validation library for Python
    ๐Ÿ”— python-cerberus.org

  44. giskard-ai/giskard โญ 3,001
    ๐Ÿข Open-Source Evaluation & Testing framework for LLMs and ML models
    ๐Ÿ”— docs.giskard.ai

  45. zoomeranalytics/xlwings โญ 2,831
    xlwings is a Python library that makes it easy to call Python from Excel and vice versa. It works with Excel on Windows and macOS as well as with Google Sheets and Excel on the web.
    ๐Ÿ”— www.xlwings.org

  46. datafold/data-diff โญ 2,826
    Compare tables within or across databases
    ๐Ÿ”— docs.datafold.com

  47. pallets/itsdangerous โญ 2,819
    Safely pass trusted data to untrusted environments and back.
    ๐Ÿ”— itsdangerous.palletsprojects.com

  48. goldsmith/Wikipedia โญ 2,808
    A Pythonic wrapper for the Wikipedia API
    ๐Ÿ”— wikipedia.readthedocs.org

  49. docarray/docarray โญ 2,723
    Represent, send, store and search multimodal data
    ๐Ÿ”— docs.docarray.org

  50. awslabs/amazon-redshift-utils โญ 2,706
    Amazon Redshift Utils contains utilities, scripts and view which are useful in a Redshift environment

  51. lancedb/lancedb โญ 2,691
    Developer-friendly, serverless vector database for AI applications. Easily add long-term memory to your LLM apps!
    ๐Ÿ”— lancedb.github.io/lancedb

  52. sqlalchemy/alembic โญ 2,441
    A database migrations tool for SQLAlchemy.

  53. pynamodb/PynamoDB โญ 2,371
    A pythonic interface to Amazon's DynamoDB
    ๐Ÿ”— pynamodb.readthedocs.io

  54. kayak/pypika โญ 2,362
    PyPika is a python SQL query builder that exposes the full richness of the SQL language using a syntax that reflects the resulting query. PyPika excels at all sorts of SQL queries but is especially useful for data analysis.
    ๐Ÿ”— pypika.readthedocs.io/en/latest

  55. emirozer/fake2db โญ 2,254
    Generate fake but valid data filled databases for test purposes using most popular patterns(AFAIK). Current support is sqlite, mysql, postgresql, mongodb, redis, couchdb.

  56. uqfoundation/dill โญ 2,128
    serialize all of Python
    ๐Ÿ”— dill.rtfd.io

  57. sdv-dev/SDV โญ 2,105
    Synthetic data generation for tabular data
    ๐Ÿ”— docs.sdv.dev/sdv

  58. accenture/AmpliGraph โญ 2,079
    Python library for Representation Learning on Knowledge Graphs https://docs.ampligraph.org

  59. graphistry/pygraphistry โญ 2,045
    PyGraphistry is a Python library to quickly load, shape, embed, and explore big graphs with the GPU-accelerated Graphistry visual graph analyzer

  60. rapidai/RapidOCR โญ 1,919
    Awesome OCR multiple programing languages toolkits based on ONNXRuntime, OpenVION and PaddlePaddle.
    ๐Ÿ”— rapidai.github.io/rapidocrdocs/docs

  61. samuelcolvin/arq โญ 1,901
    Fast job queuing and RPC in python with asyncio and redis.
    ๐Ÿ”— arq-docs.helpmanual.io

  62. sfu-db/connector-x โญ 1,757
    Fastest library to load data from DB to DataFrames in Rust and Python
    ๐Ÿ”— sfu-db.github.io/connector-x/intro.html

  63. uber/petastorm โญ 1,744
    Petastorm library enables single machine or distributed training and evaluation of deep learning models from datasets in Apache Parquet format. It supports ML frameworks such as Tensorflow, Pytorch, and PySpark and can be used from pure Python code.

  64. agronholm/sqlacodegen โญ 1,698
    Automatic model code generator for SQLAlchemy

  65. aio-libs/aiomysql โญ 1,696
    aiomysql is a library for accessing a MySQL database from the asyncio
    ๐Ÿ”— aiomysql.rtfd.io

  66. pathwaycom/pathway โญ 1,689
    Pathway is a high-throughput, low-latency data processing framework that handles live data & streaming for you. Made with โค๏ธ for Python & ML/AI developers.
    ๐Ÿ”— pathway.com

  67. dlt-hub/dlt โญ 1,675
    data load tool (dlt) is an open source Python library that makes data loading easy ๐Ÿ› ๏ธ
    ๐Ÿ”— dlthub.com/docs

  68. milvus-io/bootcamp โญ 1,602
    Dealing with all unstructured data, such as reverse image search, audio search, molecular search, video analysis, question and answer systems, NLP, etc.
    ๐Ÿ”— milvus.io

  69. simple-salesforce/simple-salesforce โญ 1,591
    A very simple Salesforce.com REST API client for Python

  70. collerek/ormar โญ 1,565
    python async orm with fastapi in mind and pydantic validation
    ๐Ÿ”— collerek.github.io/ormar

  71. aminalaee/sqladmin โญ 1,558
    SQLAlchemy Admin for FastAPI and Starlette
    ๐Ÿ”— aminalaee.dev/sqladmin

  72. simonw/sqlite-utils โญ 1,486
    Python CLI utility and library for manipulating SQLite databases
    ๐Ÿ”— sqlite-utils.datasette.io

  73. sdispater/orator โญ 1,422
    The Orator ORM provides a simple yet beautiful ActiveRecord implementation.
    ๐Ÿ”— orator-orm.com

  74. mchong6/JoJoGAN โญ 1,403
    Official PyTorch repo for JoJoGAN: One Shot Face Stylization

  75. eleutherai/the-pile โญ 1,396
    The Pile is a large, diverse, open source language modelling data set that consists of many smaller datasets combined together.

  76. aio-libs/aiopg โญ 1,372
    aiopg is a library for accessing a PostgreSQL database from the asyncio
    ๐Ÿ”— aiopg.readthedocs.io

  77. zarr-developers/zarr-python โญ 1,326
    An implementation of chunked, compressed, N-dimensional arrays for Python.
    ๐Ÿ”— zarr.readthedocs.io

  78. google/tensorstore โญ 1,279
    Library for reading and writing large multi-dimensional arrays.
    ๐Ÿ”— google.github.io/tensorstore

  79. ydataai/ydata-synthetic โญ 1,277
    Synthetic data generators for tabular and time-series data
    ๐Ÿ”— docs.synthetic.ydata.ai

  80. huggingface/datatrove โญ 1,118
    Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

  81. pytorch/data โญ 1,067
    A PyTorch repo for data loading and utilities to be shared by the PyTorch domain libraries.

  82. eliasdabbas/advertools โญ 1,044
    advertools - online marketing productivity and analysis tools
    ๐Ÿ”— advertools.readthedocs.io

  83. uber/fiber โญ 1,037
    Distributed Computing for AI Made Simple
    ๐Ÿ”— uber.github.io/fiber

  84. brettkromkamp/contextualise โญ 1,034
    Contextualise is an effective tool particularly suited for organising information-heavy projects and activities consisting of unstructured and widely diverse data and information resources
    ๐Ÿ”— contextualise.dev

  85. aio-libs/aiocache โญ 1,021
    Asyncio cache manager for redis, memcached and memory
    ๐Ÿ”— aiocache.readthedocs.io

  86. intake/intake โญ 977
    Intake is a lightweight package for finding, investigating, loading and disseminating data.
    ๐Ÿ”— intake.readthedocs.io

  87. scikit-hep/awkward โญ 792
    Manipulate JSON-like data with NumPy-like idioms.
    ๐Ÿ”— awkward-array.org

  88. koaning/human-learn โญ 772
    Natural Intelligence is still a pretty good idea.
    ๐Ÿ”— koaning.github.io/human-learn

  89. macbre/sql-metadata โญ 725
    Uses tokenized query returned by python-sqlparse and generates query metadata
    ๐Ÿ”— pypi.python.org/pypi/sql-metadata

  90. hyperqueryhq/whale โญ 724
    ๐Ÿณ The stupidly simple CLI workspace for your data warehouse.
    ๐Ÿ”— rsyi.gitbook.io/whale

  91. duckdb/dbt-duckdb โญ 713
    dbt (http://getdbt.com) adapter for DuckDB (http://duckdb.org)

  92. googleapis/python-bigquery โญ 706
    Python Client for Google BigQuery

  93. goccy/bigquery-emulator โญ 697
    BigQuery emulator provides a way to launch a BigQuery server on your local machine for testing and development.

  94. mcfunley/pugsql โญ 661
    A HugSQL-inspired database library for Python
    ๐Ÿ”— pugsql.org

  95. dgarnitz/vectorflow โญ 634
    VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of your choice.
    ๐Ÿ”— www.getvectorflow.com

  96. kagisearch/vectordb โญ 534
    A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search.
    ๐Ÿ”— vectordb.com

  97. koaning/bulk โญ 440
    Bulk is a quick UI developer tool to apply some bulk labels.

Debugging

Debugging and tracing tools.

  1. cool-rr/PySnooper โญ 16,249
    Never use print for debugging again

  2. gruns/icecream โญ 8,405
    ๐Ÿฆ Never use print() to debug again.

  3. shobrook/rebound โญ 4,071
    Command-line tool that instantly fetches Stack Overflow results when an exception is thrown

  4. inducer/pudb โญ 2,860
    Full-screen console debugger for Python
    ๐Ÿ”— documen.tician.de/pudb

  5. gotcha/ipdb โญ 1,809
    Integration of IPython pdb

  6. alexmojaki/heartrate โญ 1,727
    Simple real time visualisation of the execution of a Python program.

  7. alexmojaki/birdseye โญ 1,629
    Graphical Python debugger which lets you easily view the values of all evaluated expressions
    ๐Ÿ”— birdseye.readthedocs.io

  8. alexmojaki/snoop โญ 1,194
    A powerful set of Python debugging tools, based on PySnooper

  9. samuelcolvin/python-devtools โญ 930
    Dev tools for python
    ๐Ÿ”— python-devtools.helpmanual.io

Diffusion Text to Image

Text-to-image diffusion model libraries, tools and apps for generating images from natural language.

  1. automatic1111/stable-diffusion-webui โญ 128,387
    Stable Diffusion web UI

  2. compvis/stable-diffusion โญ 65,113
    A latent text-to-image diffusion model
    ๐Ÿ”— ommer-lab.com/research/latent-diffusion-models

  3. stability-ai/stablediffusion โญ 35,979
    High-Resolution Image Synthesis with Latent Diffusion Models

  4. comfyanonymous/ComfyUI โญ 31,762
    The most powerful and modular stable diffusion GUI, api and backend with a graph/nodes interface.

  5. lllyasviel/ControlNet โญ 27,633
    Let us control diffusion models!

  6. huggingface/diffusers โญ 22,298
    ๐Ÿค— Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
    ๐Ÿ”— huggingface.co/docs/diffusers

  7. invoke-ai/InvokeAI โญ 21,171
    InvokeAI is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and create visual media using the latest AI-driven technologies. The solution offers an industry leading WebUI, supports terminal use through a CLI, and serves as the foundation for multi...
    ๐Ÿ”— invoke-ai.github.io/invokeai

  8. apple/ml-stable-diffusion โญ 16,055
    Stable Diffusion with Core ML on Apple Silicon

  9. borisdayma/dalle-mini โญ 14,623
    DALLยทE Mini - Generate images from a text prompt
    ๐Ÿ”— www.craiyon.com

  10. divamgupta/diffusionbee-stable-diffusion-ui โญ 11,879
    Diffusion Bee is the easiest way to run Stable Diffusion locally on your M1 Mac. Comes with a one-click installer. No dependencies or technical knowledge needed.
    ๐Ÿ”— diffusionbee.com

  11. lucidrains/DALLE2-pytorch โญ 10,795
    Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch

  12. compvis/latent-diffusion โญ 10,457
    High-Resolution Image Synthesis with Latent Diffusion Models

  13. instantid/InstantID โญ 9,597
    InstantID : Zero-shot Identity-Preserving Generation in Seconds ๐Ÿ”ฅ
    ๐Ÿ”— instantid.github.io

  14. ashawkey/stable-dreamfusion โญ 7,773
    Text-to-3D & Image-to-3D & Mesh Exportation with NeRF + Diffusion.

  15. facebookresearch/dinov2 โญ 7,735
    PyTorch code and models for the DINOv2 self-supervised learning method.

  16. carson-katri/dream-textures โญ 7,570
    Stable Diffusion built-in to Blender

  17. xavierxiao/Dreambooth-Stable-Diffusion โญ 7,433
    Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) with Stable Diffusion

  18. timothybrooks/instruct-pix2pix โญ 5,917
    PyTorch implementation of InstructPix2Pix, an instruction-based image editing model, based on the original CompVis/stable_diffusion repo.

  19. openai/consistency_models โญ 5,909
    Official repo for consistency models.

  20. idea-research/GroundingDINO โญ 4,842
    Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
    ๐Ÿ”— arxiv.org/abs/2303.05499

  21. salesforce/BLIP โญ 4,220
    PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

  22. nateraw/stable-diffusion-videos โญ 4,194
    Create ๐Ÿ”ฅ videos with Stable Diffusion by exploring the latent space and morphing between text prompts

  23. jina-ai/discoart โญ 3,839
    ๐Ÿชฉ Create Disco Diffusion artworks in one line

  24. lkwq007/stablediffusion-infinity โญ 3,797
    Outpainting with Stable Diffusion on an infinite canvas

  25. openai/glide-text2im โญ 3,455
    GLIDE: a diffusion-based text-conditional image synthesis model

  26. mlc-ai/web-stable-diffusion โญ 3,415
    Bringing stable diffusion models to web browsers. Everything runs inside the browser with no server support.
    ๐Ÿ”— mlc.ai/web-stable-diffusion

  27. openai/improved-diffusion โญ 2,767
    Release for Improved Denoising Diffusion Probabilistic Models

  28. saharmor/dalle-playground โญ 2,763
    A playground to generate images from any text prompt using Stable Diffusion (past: using DALL-E Mini)

  29. stability-ai/stability-sdk โญ 2,397
    SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
    ๐Ÿ”— platform.stability.ai

  30. divamgupta/stable-diffusion-tensorflow โญ 1,567
    Stable Diffusion in TensorFlow / Keras

  31. coyote-a/ultimate-upscale-for-automatic1111 โญ 1,471
    Ultimate SD Upscale extension for AUTOMATIC1111 Stable Diffusion web UI

  32. nvlabs/prismer โญ 1,284
    The implementation of "Prismer: A Vision-Language Model with Multi-Task Experts".
    ๐Ÿ”— shikun.io/projects/prismer

  33. chenyangqiqi/FateZero โญ 1,036
    [ICCV 2023 Oral] "FateZero: Fusing Attentions for Zero-shot Text-based Video Editing"
    ๐Ÿ”— fate-zero-edit.github.io

  34. thereforegames/unprompted โญ 735
    Templating language written for Stable Diffusion workflows. Available as an extension for the Automatic1111 WebUI.

  35. sharonzhou/long_stable_diffusion โญ 672
    Long-form text-to-images generation, using a pipeline of deep generative models (GPT-3 and Stable Diffusion)

  36. tanelp/tiny-diffusion โญ 526
    A minimal PyTorch implementation of probabilistic diffusion models for 2D datasets.

Finance

Financial and quantitative libraries: investment research tools, market data, algorithmic trading, backtesting, financial derivatives.

  1. openbb-finance/OpenBBTerminal โญ 25,968
    Investment Research for Everyone, Everywhere.
    ๐Ÿ”— openbb.co

  2. quantopian/zipline โญ 17,032
    Zipline, a Pythonic Algorithmic Trading Library
    ๐Ÿ”— www.zipline.io

  3. microsoft/qlib โญ 14,096
    Qlib is an AI-oriented quantitative investment platform that aims to realize the potential, empower research, and create value using AI technologies in quantitative investment, from exploring ideas to implementing productions. Qlib supports diverse machine learning modeling paradigms. including supervised learning, ...
    ๐Ÿ”— qlib.readthedocs.io/en/latest

  4. mementum/backtrader โญ 12,921
    Python Backtesting library for trading strategies
    ๐Ÿ”— www.backtrader.com

  5. ranaroussi/yfinance โญ 11,723
    Download market data from Yahoo! Finance's API
    ๐Ÿ”— aroussi.com/post/python-yahoo-finance

  6. ai4finance-foundation/FinGPT โญ 11,320
    FinGPT: Open-Source Financial Large Language Models! Revolutionize ๐Ÿ”ฅ We release the trained model on HuggingFace.
    ๐Ÿ”— ai4finance.org

  7. ai4finance-foundation/FinRL โญ 8,991
    FinRL: Financial Reinforcement Learning. ๐Ÿ”ฅ
    ๐Ÿ”— ai4finance.org

  8. ta-lib/ta-lib-python โญ 8,957
    Python wrapper for TA-Lib (http://ta-lib.org/).
    ๐Ÿ”— ta-lib.github.io/ta-lib-python

  9. quantconnect/Lean โญ 8,635
    Lean Algorithmic Trading Engine by QuantConnect (Python, C#)
    ๐Ÿ”— lean.io

  10. quantopian/pyfolio โญ 5,412
    Portfolio and risk analytics in Python
    ๐Ÿ”— quantopian.github.io/pyfolio

  11. kernc/backtesting.py โญ 4,787
    ๐Ÿ”Ž ๐Ÿ“ˆ ๐Ÿ ๐Ÿ’ฐ Backtest trading strategies in Python.
    ๐Ÿ”— kernc.github.io/backtesting.py

  12. twopirllc/pandas-ta โญ 4,699
    Technical Analysis Indicators - Pandas TA is an easy to use Python 3 Pandas Extension with 150+ Indicators
    ๐Ÿ”— twopirllc.github.io/pandas-ta

  13. gbeced/pyalgotrade โญ 4,286
    Python Algorithmic Trading Library
    ๐Ÿ”— gbeced.github.io/pyalgotrade

  14. google/tf-quant-finance โญ 4,258
    High-performance TensorFlow library for quantitative finance.

  15. ranaroussi/quantstats โญ 4,218
    Portfolio analytics for quants, written in Python

  16. borisbanushev/stockpredictionai โญ 3,926
    In this noteboook I will create a complete process for predicting stock price movements. Follow along and we will achieve some pretty good results. For that purpose we will use a Generative Adversarial Network (GAN) with LSTM, a type of Recurrent Neural Network, as generator, and a Convolutional Neural Networ...

  17. polakowo/vectorbt โญ 3,692
    Find your trading edge, using the fastest engine for backtesting, algorithmic trading, and research.
    ๐Ÿ”— vectorbt.dev

  18. cuemacro/finmarketpy โญ 3,344
    Python library for backtesting trading strategies & analyzing financial markets (formerly pythalesians)
    ๐Ÿ”— www.cuemacro.com

  19. matplotlib/mplfinance โญ 3,333
    Financial Markets Data Visualization using Matplotlib
    ๐Ÿ”— pypi.org/project/mplfinance

  20. quantopian/alphalens โญ 3,067
    Performance analysis of predictive (alpha) stock factors
    ๐Ÿ”— quantopian.github.io/alphalens

  21. zvtvz/zvt โญ 2,972
    modular quant framework.
    ๐Ÿ”— zvt.readthedocs.io/en/latest

  22. goldmansachs/gs-quant โญ 2,443
    Python toolkit for quantitative finance
    ๐Ÿ”— developer.gs.com/discover/products/gs-quant

  23. robcarver17/pysystemtrade โญ 2,369
    Systematic Trading in python

  24. quantopian/research_public โญ 2,302
    Quantitative research and educational materials
    ๐Ÿ”— www.quantopian.com/lectures

  25. pmorissette/bt โญ 2,006
    bt - flexible backtesting for Python
    ๐Ÿ”— pmorissette.github.io/bt

  26. blankly-finance/blankly โญ 1,963
    ๐Ÿš€ ๐Ÿ’ธ Easily build, backtest and deploy your algo in just a few lines of code. Trade stocks, cryptos, and forex across exchanges w/ one package.
    ๐Ÿ”— package.blankly.finance

  27. domokane/FinancePy โญ 1,888
    A Python Finance Library that focuses on the pricing and risk-management of Financial Derivatives, including fixed-income, equity, FX and credit derivatives.
    ๐Ÿ”— financepy.com

  28. pmorissette/ffn โญ 1,770
    ffn - a financial function library for Python
    ๐Ÿ”— pmorissette.github.io/ffn

  29. cuemacro/findatapy โญ 1,559
    Python library to download market data via Bloomberg, Eikon, Quandl, Yahoo etc.

  30. quantopian/empyrical โญ 1,214
    Common financial risk and performance metrics. Used by zipline and pyfolio.
    ๐Ÿ”— quantopian.github.io/empyrical

  31. idanya/algo-trader โญ 737
    Trading bot with support for realtime trading, backtesting, custom strategies and much more.

Game Development

Game development tools, engines and libraries.

  1. kitao/pyxel โญ 13,116
    A retro game engine for Python

  2. pygame/pygame โญ 6,923
    ๐Ÿ๐ŸŽฎ pygame (the library) is a Free and Open Source python programming language library for making multimedia applications like games built on top of the excellent SDL library. C, Python, Native, OpenGL.
    ๐Ÿ”— www.pygame.org

  3. panda3d/panda3d โญ 4,254
    Powerful, mature open-source cross-platform game engine for Python and C++, developed by Disney and CMU
    ๐Ÿ”— www.panda3d.org

  4. pokepetter/ursina โญ 2,081
    A game engine powered by python and panda3d.
    ๐Ÿ”— pokepetter.github.io/ursina

  5. pyglet/pyglet โญ 1,738
    pyglet is a cross-platform windowing and multimedia library for Python, for developing games and other visually rich applications.
    ๐Ÿ”— pyglet.org

  6. pythonarcade/arcade โญ 1,604
    Easy to use Python library for creating 2D arcade games.
    ๐Ÿ”— arcade.academy

GIS

Geospatial libraries: raster and vector data formats, interactive mapping and visualisation, computing frameworks for processing images, projections.

  1. domlysz/BlenderGIS โญ 7,196
    Blender addons to make the bridge between Blender and geographic data

  2. python-visualization/folium โญ 6,652
    Python Data. Leaflet.js Maps.
    ๐Ÿ”— python-visualization.github.io/folium

  3. gboeing/osmnx โญ 4,649
    OSMnx is a Python package to easily download, model, analyze, and visualize street networks and other geospatial features from OpenStreetMap.
    ๐Ÿ”— osmnx.readthedocs.io

  4. osgeo/gdal โญ 4,462
    GDAL is an open source MIT licensed translator library for raster and vector geospatial data formats.
    ๐Ÿ”— gdal.org

  5. geopandas/geopandas โญ 4,160
    Python tools for geographic data
    ๐Ÿ”— geopandas.org

  6. shapely/shapely โญ 3,652
    Manipulation and analysis of geometric objects
    ๐Ÿ”— shapely.readthedocs.io/en/stable

  7. holoviz/datashader โญ 3,199
    Quickly and accurately render even the largest data.
    ๐Ÿ”— datashader.org

  8. giswqs/geemap โญ 3,166
    A Python package for interactive geospatial analysis and visualization with Google Earth Engine.
    ๐Ÿ”— geemap.org

  9. opengeos/leafmap โญ 2,892
    A Python package for interactive mapping and geospatial analysis with minimal coding in a Jupyter environment
    ๐Ÿ”— leafmap.org

  10. opengeos/segment-geospatial โญ 2,644
    A Python package for segmenting geospatial data with the Segment Anything Model (SAM)
    ๐Ÿ”— samgeo.gishub.org

  11. google/earthengine-api โญ 2,520
    Python and JavaScript bindings for calling the Earth Engine API.

  12. microsoft/torchgeo โญ 2,208
    TorchGeo: datasets, samplers, transforms, and pre-trained models for geospatial data
    ๐Ÿ”— www.osgeo.org/projects/torchgeo

  13. rasterio/rasterio โญ 2,135
    Rasterio reads and writes geospatial raster datasets
    ๐Ÿ”— rasterio.readthedocs.io

  14. mcordts/cityscapesScripts โญ 2,087
    README and scripts for the Cityscapes Dataset

  15. azavea/raster-vision โญ 1,987
    An open source library and framework for deep learning on satellite and aerial imagery.
    ๐Ÿ”— docs.rastervision.io

  16. plant99/felicette โญ 1,816
    Satellite imagery for dummies.

  17. apache/sedona โญ 1,766
    A cluster computing framework for processing large-scale geospatial data
    ๐Ÿ”— sedona.apache.org

  18. jupyter-widgets/ipyleaflet โญ 1,450
    A Jupyter - Leaflet.js bridge
    ๐Ÿ”— ipyleaflet.readthedocs.io

  19. gboeing/osmnx-examples โญ 1,450
    Gallery of OSMnx tutorials, usage examples, and feature demonstations.
    ๐Ÿ”— osmnx.readthedocs.io

  20. pysal/pysal โญ 1,273
    PySAL: Python Spatial Analysis Library Meta-Package
    ๐Ÿ”— pysal.org/pysal

  21. microsoft/GlobalMLBuildingFootprints โญ 1,264
    Worldwide building footprints derived from satellite imagery

  22. anitagraser/movingpandas โญ 1,135
    Movement trajectory classes and functions built on top of GeoPandas
    ๐Ÿ”— movingpandas.org

  23. residentmario/geoplot โญ 1,115
    High-level geospatial data visualization library for Python.
    ๐Ÿ”— residentmario.github.io/geoplot/index.html

  24. sentinel-hub/eo-learn โญ 1,075
    Earth observation processing framework for machine learning in Python
    ๐Ÿ”— eo-learn.readthedocs.io/en/latest

  25. makepath/xarray-spatial โญ 770
    Raster-based Spatial Analytics for Python
    ๐Ÿ”— xarray-spatial.org

  26. osgeo/grass โญ 762
    GRASS GIS - free and open-source geospatial processing engine
    ๐Ÿ”— grass.osgeo.org

  27. scikit-mobility/scikit-mobility โญ 690
    scikit-mobility: mobility analysis in Python
    ๐Ÿ”— scikit-mobility.github.io/scikit-mobility

  28. developmentseed/titiler โญ 683
    Build your own Raster dynamic map tile services
    ๐Ÿ”— developmentseed.org/titiler

Graph

Graphs and network libraries: network analysis, graph machine learning, visualisation.

  1. networkx/networkx โญ 14,127
    Network Analysis in Python
    ๐Ÿ”— networkx.org

  2. stellargraph/stellargraph โญ 2,885
    StellarGraph - Machine Learning on Graphs
    ๐Ÿ”— stellargraph.readthedocs.io

  3. westhealth/pyvis โญ 908
    Python package for creating and visualizing interactive network graphs.
    ๐Ÿ”— pyvis.readthedocs.io/en/latest

  4. rampasek/GraphGPS โญ 584
    Recipe for a General, Powerful, Scalable Graph Transformer

  5. microsoft/graspologic โญ 494
    graspologic is a package for graph statistical algorithms
    ๐Ÿ”— microsoft.github.io/graspologic/latest

  6. dylanhogg/llmgraph โญ 80
    Create knowledge graphs with LLMs

GUI

Graphical user interface libraries and toolkits.

  1. pysimplegui/PySimpleGUI โญ 13,088
    Python GUIs for Humans! PySimpleGUI is the top-rated Python application development environment. Launched in 2018 and actively developed, maintained, and supported in 2024. Transforms tkinter, Qt, WxPython, and Remi into a simple, intuitive, and fun experience for both hobbyists and expert users.
    ๐Ÿ”— www.pysimplegui.com

  2. hoffstadt/DearPyGui โญ 12,187
    Dear PyGui: A fast and powerful Graphical User Interface Toolkit for Python with minimal dependencies
    ๐Ÿ”— dearpygui.readthedocs.io/en/latest

  3. parthjadhav/Tkinter-Designer โญ 8,225
    An easy and fast way to create a Python GUI ๐Ÿ

  4. samuelcolvin/FastUI โญ 7,163
    FastUI is a new way to build web application user interfaces defined by declarative Python code.
    ๐Ÿ”— fastui-demo.onrender.com

  5. r0x0r/pywebview โญ 4,286
    Build GUI for your Python program with JavaScript, HTML, and CSS
    ๐Ÿ”— pywebview.flowrl.com

  6. beeware/toga โญ 4,088
    A Python native, OS native GUI toolkit.
    ๐Ÿ”— toga.readthedocs.io/en/latest

  7. dddomodossola/remi โญ 3,448
    Python REMote Interface library. Platform independent. In about 100 Kbytes, perfect for your diet.

  8. wxwidgets/Phoenix โญ 2,176
    wxPython's Project Phoenix. A new implementation of wxPython, better, stronger, faster than he was before.
    ๐Ÿ”— wxpython.org

Jupyter

Jupyter and JupyterLab and Notebook tools, libraries and plugins.

  1. jupyterlab/jupyterlab โญ 13,746
    JupyterLab computational environment.
    ๐Ÿ”— jupyterlab.readthedocs.io

  2. jupyter/notebook โญ 11,126
    Jupyter Interactive Notebook
    ๐Ÿ”— jupyter-notebook.readthedocs.io

  3. mwouts/jupytext โญ 6,403
    Jupyter Notebooks as Markdown Documents, Julia, Python or R scripts
    ๐Ÿ”— jupytext.readthedocs.io

  4. nteract/papermill โญ 5,615
    ๐Ÿ“š Parameterize, execute, and analyze notebooks
    ๐Ÿ”— papermill.readthedocs.io/en/latest

  5. connorferster/handcalcs โญ 5,349
    Python library for converting Python calculations into rendered latex.

  6. voila-dashboards/voila โญ 5,192
    Voilร  turns Jupyter notebooks into standalone web applications
    ๐Ÿ”— voila.readthedocs.io

  7. executablebooks/jupyter-book โญ 3,678
    Create beautiful, publication-quality books and documents from computational content.
    ๐Ÿ”— jupyterbook.org

  8. jupyterlite/jupyterlite โญ 3,644
    Wasm powered Jupyter running in the browser ๐Ÿ’ก
    ๐Ÿ”— jupyterlite.rtfd.io/en/stable/try/lab

  9. jupyterlab/jupyterlab-desktop โญ 3,341
    JupyterLab desktop application, based on Electron.

  10. jupyter-widgets/ipywidgets โญ 3,042
    Interactive Widgets for the Jupyter Notebook
    ๐Ÿ”— ipywidgets.readthedocs.io

  11. quantopian/qgrid โญ 3,027
    An interactive grid for sorting, filtering, and editing DataFrames in Jupyter notebooks

  12. jupyterlab/jupyter-ai โญ 2,821
    A generative AI extension for JupyterLab
    ๐Ÿ”— jupyter-ai.readthedocs.io

  13. jupyter/nbdime โญ 2,593
    Tools for diffing and merging of Jupyter notebooks.
    ๐Ÿ”— nbdime.readthedocs.io

  14. mito-ds/mito โญ 2,205
    The mitosheet package, trymito.io, and other public Mito code.
    ๐Ÿ”— trymito.io

  15. jupyter/nbviewer โญ 2,162
    nbconvert as a web service: Render Jupyter Notebooks as static web pages
    ๐Ÿ”— nbviewer.jupyter.org

  16. maartenbreddels/ipyvolume โญ 1,907
    3d plotting for Python in the Jupyter notebook based on IPython widgets using WebGL

  17. jupyter-lsp/jupyterlab-lsp โญ 1,723
    Coding assistance for JupyterLab (code navigation + hover suggestions + linters + autocompletion + rename) using Language Server Protocol

  18. jupyter/nbconvert โญ 1,660
    Jupyter Notebook Conversion
    ๐Ÿ”— nbconvert.readthedocs.io

  19. jupyter/nbgrader โญ 1,256
    A system for assigning and grading notebooks
    ๐Ÿ”— nbgrader.readthedocs.io

  20. nbqa-dev/nbQA โญ 959
    Run ruff, isort, pyupgrade, mypy, pylint, flake8, and more on Jupyter Notebooks
    ๐Ÿ”— nbqa.readthedocs.io/en/latest/index.html

  21. vizzuhq/ipyvizzu โญ 922
    Build animated charts in Jupyter Notebook and similar environments with a simple Python syntax.
    ๐Ÿ”— ipyvizzu.vizzuhq.com

  22. koaning/drawdata โญ 703
    Draw datasets from within Jupyter.
    ๐Ÿ”— calmcode.io/labs/drawdata.html

  23. aws/graph-notebook โญ 679
    Library extending Jupyter notebooks to integrate with Apache TinkerPop, openCypher, and RDF SPARQL.
    ๐Ÿ”— github.com/aws/graph-notebook

  24. linealabs/lineapy โญ 652
    Move fast from data science prototype to pipeline. Capture, analyze, and transform messy notebooks into data pipelines with just two lines of code.
    ๐Ÿ”— lineapy.org

LLMs and ChatGPT

Large language model and GPT libraries and frameworks: auto-gpt, agents, QnA, chain-of-thought workflows, API integations. Also see the Natural Language Processing category for crossover.

  1. significant-gravitas/AutoGPT โญ 160,768
    AutoGPT is the vision of accessible AI for everyone, to use and to build on. Our mission is to provide the tools, so that you can focus on what matters.
    ๐Ÿ”— agpt.co

  2. hwchase17/langchain โญ 81,978
    ๐Ÿฆœ๐Ÿ”— Build context-aware reasoning applications
    ๐Ÿ”— python.langchain.com

  3. nomic-ai/gpt4all โญ 63,527
    gpt4all: run open-source LLMs anywhere
    ๐Ÿ”— gpt4all.io

  4. xtekky/gpt4free โญ 56,750
    The official gpt4free repository | various collection of powerful language models
    ๐Ÿ”— g4f.ai

  5. ggerganov/llama.cpp โญ 55,086
    LLM inference in C/C++

  6. facebookresearch/llama โญ 52,234
    Inference code for Llama models

  7. imartinez/private-gpt โญ 51,363
    Interact with your documents using the power of GPT, 100% privately, no data leaks
    ๐Ÿ”— docs.privategpt.dev

  8. gpt-engineer-org/gpt-engineer โญ 50,301
    Specify what you want it to build, the AI asks for clarification, and then builds it.

  9. xai-org/grok-1 โญ 47,201
    This repository contains JAX example code for loading and running the Grok-1 open-weights model.

  10. killianlucas/open-interpreter โญ 47,133
    A natural language interface for computers
    ๐Ÿ”— openinterpreter.com

  11. thudm/ChatGLM-6B โญ 39,042
    ChatGLM-6B: An Open Bilingual Dialogue Language Model | ๅผ€ๆบๅŒ่ฏญๅฏน่ฏ่ฏญ่จ€ๆจกๅž‹

  12. geekan/MetaGPT โญ 38,717
    ๐ŸŒŸ The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming
    ๐Ÿ”— deepwisdom.ai

  13. hpcaitech/ColossalAI โญ 37,746
    Making large AI models cheaper, faster and more accessible
    ๐Ÿ”— www.colossalai.org

  14. laion-ai/Open-Assistant โญ 36,591
    OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
    ๐Ÿ”— open-assistant.io

  15. oobabooga/text-generation-webui โญ 35,572
    A Gradio web UI for Large Language Models. Supports transformers, GPTQ, AWQ, EXL2, llama.cpp (GGUF), Llama models.

  16. moymix/TaskMatrix โญ 34,508
    Connects ChatGPT and a series of Visual Foundation Models to enable sending and receiving images during chatting.

  17. lm-sys/FastChat โญ 33,470
    An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.

  18. quivrhq/quivr โญ 31,742
    Your GenAI Second Brain ๐Ÿง  A personal productivity assistant (RAG) โšก๏ธ๐Ÿค– Chat with your docs (PDF, CSV, ...) & apps using Langchain, GPT 3.5 / 4 turbo, Private, Anthropic, VertexAI, Ollama, LLMs, Groq that you can share with users ! Local & Private alternative to OpenAI GPTs & ChatGPT powered by retrieval-augmented...
    ๐Ÿ”— quivr.app

  19. karpathy/nanoGPT โญ 31,295
    The simplest, fastest repository for training/finetuning medium-sized GPTs.

  20. jerryjliu/llama_index โญ 30,544
    LlamaIndex is a data framework for your LLM applications
    ๐Ÿ”— docs.llamaindex.ai

  21. tatsu-lab/stanford_alpaca โญ 28,705
    Code and documentation to train Stanford's Alpaca models, and generate the data.
    ๐Ÿ”— crfm.stanford.edu/2023/03/13/alpaca.html

  22. pythagora-io/gpt-pilot โญ 27,666
    The first real AI developer

  23. vision-cair/MiniGPT-4 โญ 24,819
    Open-sourced codes for MiniGPT-4 and MiniGPT-v2 (https://minigpt-4.github.io, https://minigpt-v2.github.io/)
    ๐Ÿ”— minigpt-4.github.io

  24. microsoft/autogen โญ 24,414
    A programming framework for agentic AI. Discord: https://aka.ms/autogen-dc. Roadmap: https://aka.ms/autogen-roadmap
    ๐Ÿ”— microsoft.github.io/autogen

  25. microsoft/JARVIS โญ 22,960
    JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

  26. openai/gpt-2 โญ 21,044
    Code for the paper "Language Models are Unsupervised Multitask Learners"
    ๐Ÿ”— openai.com/blog/better-language-models

  27. openai/chatgpt-retrieval-plugin โญ 20,813
    The ChatGPT Retrieval Plugin lets you easily find personal or work documents by asking questions in natural language.

  28. yoheinakajima/babyagi โญ 19,078
    GPT-4 powered task-driven autonomous agent

  29. karpathy/minGPT โญ 18,673
    A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training

  30. tloen/alpaca-lora โญ 18,135
    Instruct-tune LLaMA on consumer hardware

  31. microsoft/semantic-kernel โญ 17,952
    Integrate cutting-edge LLM technology quickly and easily into your apps
    ๐Ÿ”— aka.ms/semantic-kernel

  32. rasahq/rasa โญ 17,892
    ๐Ÿ’ฌ Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
    ๐Ÿ”— rasa.com/docs/rasa

  33. vllm-project/vllm โญ 17,597
    A high-throughput and memory-efficient inference and serving engine for LLMs
    ๐Ÿ”— docs.vllm.ai

  34. guidance-ai/guidance โญ 17,114
    A guidance language for controlling large language models.

  35. mlc-ai/mlc-llm โญ 16,612
    Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.
    ๐Ÿ”— llm.mlc.ai/docs

  36. logspace-ai/langflow โญ 16,540
    โ›“๏ธ Langflow is a dynamic graph where each node is an executable unit. Its modular and interactive design fosters rapid experimentation and prototyping, pushing hard on the limits of creativity.
    ๐Ÿ”— www.langflow.org

  37. hiyouga/LLaMA-Factory โญ 16,202
    Unify Efficient Fine-Tuning of 100+ LLMs

  38. hiyouga/LLaMA-Factory โญ 16,202
    Unify Efficient Fine-Tuning of 100+ LLMs

  39. haotian-liu/LLaVA โญ 15,768
    [NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
    ๐Ÿ”— llava.hliu.cc

  40. thudm/ChatGLM2-6B โญ 15,433
    ChatGLM2-6B: An Open Bilingual Chat LLM | ๅผ€ๆบๅŒ่ฏญๅฏน่ฏ่ฏญ่จ€ๆจกๅž‹

  41. karpathy/llama2.c โญ 15,391
    Inference Llama 2 in one file of pure C

  42. mayooear/gpt4-pdf-chatbot-langchain โญ 14,488
    GPT4 & LangChain Chatbot for large PDF docs
    ๐Ÿ”— www.youtube.com/watch?v=ih9pbgvvoo4

  43. transformeroptimus/SuperAGI โญ 14,347
    <โšก๏ธ> SuperAGI - A dev-first open source autonomous AI agent framework. Enabling developers to build, manage & run useful autonomous agents quickly and reliably.
    ๐Ÿ”— superagi.com

  44. facebookresearch/codellama โญ 14,248
    Inference code for CodeLlama models

  45. fauxpilot/fauxpilot โญ 14,209
    FauxPilot - an open-source alternative to GitHub Copilot server

  46. openai/evals โญ 13,765
    Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

  47. karpathy/llm.c โญ 13,635
    LLM training in simple, pure C/CUDA. There is no need for 245MB of PyTorch or 107MB of cPython

  48. huggingface/peft โญ 13,577
    ๐Ÿค— PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
    ๐Ÿ”— huggingface.co/docs/peft

  49. deepset-ai/haystack โญ 13,503
    ๐Ÿ” LLM orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data. With advanced retrieval methods, it's best suited for building RAG, question answering, semantic search or conv...
    ๐Ÿ”— haystack.deepset.ai

  50. idea-research/Grounded-Segment-Anything โญ 13,316
    Grounded-SAM: Marrying Grounding-DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
    ๐Ÿ”— arxiv.org/abs/2401.14159

  51. openlmlab/MOSS โญ 11,804
    An open-source tool-augmented conversational language model from Fudan University
    ๐Ÿ”— txsun1997.github.io/blogs/moss.html

  52. joaomdmoura/crewAI โญ 11,727
    Framework for orchestrating role-playing, autonomous AI agents. By fostering collaborative intelligence, CrewAI empowers agents to work together seamlessly, tackling complex tasks.
    ๐Ÿ”— crewai.com

  53. smol-ai/developer โญ 11,628
    the first library to let you embed a developer agent in your own app!
    ๐Ÿ”— twitter.com/smolmodels

  54. blinkdl/RWKV-LM โญ 11,539
    RWKV is an RNN with transformer-level LLM performance. It can be directly trained like a GPT (parallelizable). So it's combining the best of RNN and transformer - great performance, fast inference, saves VRAM, fast training, "infinite" ctx_len, and free sentence embedding.

  55. paddlepaddle/PaddleNLP โญ 11,337
    ๐Ÿ‘‘ Easy-to-use and powerful NLP and LLM library with ๐Ÿค— Awesome model zoo, supporting wide-range of NLP tasks from research to industrial applications, including ๐Ÿ—‚Text Classification, ๐Ÿ” Neural Search, โ“ Question Answering, โ„น๏ธ Information Extraction, ๐Ÿ“„ Document Intelligence, ๐Ÿ’Œ Sentiment Analysis etc.
    ๐Ÿ”— paddlenlp.readthedocs.io

  56. databrickslabs/dolly โญ 10,779
    Databricksโ€™ Dolly, a large language model trained on the Databricks Machine Learning Platform
    ๐Ÿ”— www.databricks.com/blog/2023/03/24/hello-dolly-democratizing-magic-chatgpt-open-models.html

  57. dao-ailab/flash-attention โญ 10,565
    Fast and memory-efficient exact attention

  58. h2oai/h2ogpt โญ 10,295
    Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://codellama.h2o.ai/
    ๐Ÿ”— h2o.ai

  59. shishirpatil/gorilla โญ 9,849
    Enables LLMs to use tools by invoking APIs. Given a query, Gorilla comes up with the semantically and syntactically correct API.
    ๐Ÿ”— gorilla.cs.berkeley.edu

  60. stanfordnlp/dspy โญ 9,847
    DSPy: The framework for programmingโ€”not promptingโ€”foundation models
    ๐Ÿ”— dspy-docs.vercel.app

  61. danielmiessler/fabric โญ 9,530
    fabric is an open-source framework for augmenting humans using AI. It provides a modular framework for solving specific problems using a crowdsourced set of AI prompts that can be used anywhere.
    ๐Ÿ”— danielmiessler.com/p/fabric-origin-story

  62. artidoro/qlora โญ 9,318
    QLoRA: Efficient Finetuning of Quantized LLMs
    ๐Ÿ”— arxiv.org/abs/2305.14314

  63. blinkdl/ChatRWKV โญ 9,262
    ChatRWKV is like ChatGPT but powered by RWKV (100% RNN) language model, and open source.

  64. mlc-ai/web-llm โญ 8,969
    Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.
    ๐Ÿ”— mlc.ai/web-llm

  65. microsoft/LoRA โญ 8,892
    Code for loralib, an implementation of "LoRA: Low-Rank Adaptation of Large Language Models"
    ๐Ÿ”— arxiv.org/abs/2106.09685

  66. mistralai/mistral-src โญ 8,547
    Reference implementation of Mistral AI 7B v0.1 model.
    ๐Ÿ”— mistral.ai

  67. nvidia/Megatron-LM โญ 8,416
    Ongoing research training transformer models at scale
    ๐Ÿ”— docs.nvidia.com/megatron-core/developer-guide/latest/user-guide/index.html#quick-start

  68. embedchain/embedchain โญ 8,387
    Personalizing LLM Responses
    ๐Ÿ”— docs.embedchain.ai

  69. assafelovic/gpt-researcher โญ 8,239
    GPT based autonomous agent that does online comprehensive research on any given topic
    ๐Ÿ”— gptr.dev

  70. facebookresearch/llama-recipes โญ 8,197
    Scripts for fine-tuning Llama2 with composable FSDP & PEFT methods to cover single/multi-node GPUs. Supports default & custom datasets for applications such as summarization & question answering. Supporting a number of candid inference solutions such as HF TGI, VLLM for local or cloud deployment.Demo apps to showcas...

  71. eleutherai/gpt-neo โญ 8,136
    An implementation of model parallel GPT-2 and GPT-3-style models using the mesh-tensorflow library.
    ๐Ÿ”— www.eleuther.ai

  72. mshumer/gpt-prompt-engineer โญ 7,975
    Simply input a description of your task and some test cases, and the system will generate, test, and rank a multitude of prompts to find the ones that perform the best.

  73. optimalscale/LMFlow โญ 7,948
    An Extensible Toolkit for Finetuning and Inference of Large Foundation Models. Large Models for All.
    ๐Ÿ”— optimalscale.github.io/lmflow

  74. microsoft/promptflow โญ 7,947
    Build high-quality LLM apps - from prototyping, testing to production deployment and monitoring.
    ๐Ÿ”— microsoft.github.io/promptflow

  75. lvwerra/trl โญ 7,914
    Train transformer language models with reinforcement learning.
    ๐Ÿ”— hf.co/docs/trl

  76. karpathy/minbpe โญ 7,869
    Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

  77. berriai/litellm โญ 7,800
    Call all LLM APIs using the OpenAI format. Use Bedrock, Azure, OpenAI, Cohere, Anthropic, Ollama, Sagemaker, HuggingFace, Replicate (100+ LLMs)
    ๐Ÿ”— docs.litellm.ai/docs

  78. thudm/CodeGeeX โญ 7,734
    CodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
    ๐Ÿ”— codegeex.cn

  79. apple/ml-ferret โญ 7,690
    Ferret: Refer and Ground Anything Anywhere at Any Granularity

  80. huggingface/text-generation-inference โญ 7,689
    Large Language Model Text Generation Inference
    ๐Ÿ”— hf.co/docs/text-generation-inference

  81. thudm/GLM-130B โญ 7,599
    GLM-130B: An Open Bilingual Pre-Trained Model (ICLR 2023)

  82. lianjiatech/BELLE โญ 7,486
    BELLE: Be Everyone's Large Language model Engine๏ผˆๅผ€ๆบไธญๆ–‡ๅฏน่ฏๅคงๆจกๅž‹๏ผ‰

  83. openlm-research/open_llama โญ 7,175
    OpenLLaMA: An Open Reproduction of LLaMA

  84. plachtaa/VALL-E-X โญ 7,107
    An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io

  85. bigcode-project/starcoder โญ 7,085
    Home of StarCoder: fine-tuning & inference!

  86. sweepai/sweep โญ 7,010
    Sweep: AI-powered Junior Developer for small features and bug fixes.
    ๐Ÿ”— sweep.dev

  87. sjtu-ipads/PowerInfer โญ 6,889
    High-speed Large Language Model Serving on PCs with Consumer-grade GPUs

  88. jzhang38/TinyLlama โญ 6,674
    The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

  89. bhaskatripathi/pdfGPT โญ 6,667
    PDF GPT allows you to chat with the contents of your PDF file by using GPT capabilities. The most effective open source solution to turn your pdf files in a chatbot!
    ๐Ÿ”— huggingface.co/spaces/bhaskartripathi/pdfgpt_turbo

  90. eleutherai/gpt-neox โญ 6,534
    An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.

  91. zilliztech/GPTCache โญ 6,367
    Semantic cache for LLMs. Fully integrated with LangChain and llama_index.
    ๐Ÿ”— gptcache.readthedocs.io

  92. vanna-ai/vanna โญ 6,267
    ๐Ÿค– Chat with your SQL database ๐Ÿ“Š. Accurate Text-to-SQL Generation via LLMs using RAG ๐Ÿ”„.
    ๐Ÿ”— vanna.ai/docs

  93. vaibhavs10/insanely-fast-whisper โญ 6,261
    An opinionated CLI to transcribe Audio files w/ Whisper on-device! Powered by ๐Ÿค— Transformers, Optimum & flash-attn

  94. abetlen/llama-cpp-python โญ 6,242
    Python bindings for llama.cpp
    ๐Ÿ”— llama-cpp-python.readthedocs.io

  95. mit-han-lab/streaming-llm โญ 6,146
    [ICLR 2024] Efficient Streaming Language Models with Attention Sinks
    ๐Ÿ”— arxiv.org/abs/2309.17453

  96. nat/openplayground โญ 6,059
    An LLM playground you can run on your laptop

  97. langchain-ai/opengpts โญ 6,048
    An open source effort to create a similar experience to OpenAI's GPTs and Assistants API.

  98. lightning-ai/litgpt โญ 5,899
    Pretrain, finetune, deploy 20+ LLMs on your own data. Uses state-of-the-art techniques: flash attention, FSDP, 4-bit, LoRA, and more.

  99. run-llama/rags โญ 5,825
    Build ChatGPT over your data, all with natural language

  100. lightning-ai/lit-llama โญ 5,761
    Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.

  101. unslothai/unsloth โญ 5,576
    2-5X faster 80% less memory QLoRA & LoRA finetuning
    ๐Ÿ”— unsloth.ai

  102. skypilot-org/skypilot โญ 5,558
    SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed executionโ€”all with a simple interface.
    ๐Ÿ”— skypilot.readthedocs.io

  103. dsdanielpark/Bard-API โญ 5,392
    The unofficial python package that returns response of Google Bard through cookie value.
    ๐Ÿ”— pypi.org/project/bardapi

  104. chainlit/chainlit โญ 5,252
    Build Conversational AI in minutes โšก๏ธ
    ๐Ÿ”— docs.chainlit.io

  105. minedojo/Voyager โญ 5,122
    An Open-Ended Embodied Agent with Large Language Models
    ๐Ÿ”— voyager.minedojo.org

  106. pytorch-labs/gpt-fast โญ 5,027
    Simple and efficient pytorch-native transformer text generation in <1000 LOC of python.

  107. microsoft/promptbase โญ 5,020
    promptbase is an evolving collection of resources, best practices, and example scripts for eliciting the best performance from foundation models.

  108. eleutherai/lm-evaluation-harness โญ 4,812
    A framework for few-shot evaluation of language models.
    ๐Ÿ”— www.eleuther.ai

  109. langchain-ai/chat-langchain โญ 4,707
    Locally hosted chatbot specifically focused on question answering over the LangChain documentation
    ๐Ÿ”— chat.langchain.com

  110. jxnl/instructor โญ 4,515
    Instructor is a Python library that makes it a breeze to work with structured outputs from large language models (LLMs). Built on top of Pydantic, it provides a simple, transparent, and user-friendly API to manage validation, retries, and streaming responses.
    ๐Ÿ”— python.useinstructor.com

  111. explodinggradients/ragas โญ 4,365
    Evaluation framework for your Retrieval Augmented Generation (RAG) pipelines
    ๐Ÿ”— docs.ragas.io

  112. openbmb/ToolBench โญ 4,353
    [ICLR'24 spotlight] An open platform for training, serving, and evaluating large language model for tool learning.
    ๐Ÿ”— openbmb.github.io/toolbench

  113. togethercomputer/RedPajama-Data โญ 4,296
    The RedPajama-Data repository contains code for preparing large datasets for training large language models.

  114. microsoft/BioGPT โญ 4,231
    Implementation of BioGPT: Generative Pre-trained Transformer for Biomedical Text Generation and Mining

  115. mnotgod96/AppAgent โญ 4,086
    AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
    ๐Ÿ”— appagent-official.github.io

  116. kyegomez/tree-of-thoughts โญ 4,013
    Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
    ๐Ÿ”— discord.gg/qutxnk2nmf

  117. instruction-tuning-with-gpt-4/GPT-4-LLM โญ 3,940
    Instruction Tuning with GPT-4
    ๐Ÿ”— instruction-tuning-with-gpt-4.github.io

  118. allenai/OLMo โญ 3,871
    OLMo is a repository for training and using AI2's state-of-the-art open language models. It is built by scientists, for scientists.
    ๐Ÿ”— allenai.org/olmo

  119. ravenscroftj/turbopilot โญ 3,832
    Turbopilot is an open source large-language-model based code completion engine that runs locally on CPU

  120. mshumer/gpt-llm-trainer โญ 3,750
    Input a description of your task, and the system will generate a dataset, parse it, and fine-tune a LLaMA 2 model for you

  121. 1rgs/jsonformer โญ 3,745
    A Bulletproof Way to Generate Structured JSON from Language Models

  122. yizhongw/self-instruct โญ 3,715
    Aligning pretrained language models with instruction data generated by themselves.

  123. whitead/paper-qa โญ 3,589
    LLM Chain for answering questions from documents with citations

  124. microsoft/LLMLingua โญ 3,585
    To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
    ๐Ÿ”— llmlingua.com

  125. mmabrouk/llm-workflow-engine โญ 3,579
    Power CLI and Workflow manager for LLMs (core package)

  126. h2oai/h2o-llmstudio โญ 3,532
    H2O LLM Studio - a framework and no-code GUI for fine-tuning LLMs. Documentation: https://h2oai.github.io/h2o-llmstudio/
    ๐Ÿ”— gpt-gm.h2o.ai

  127. luodian/Otter โญ 3,435
    ๐Ÿฆฆ Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.
    ๐Ÿ”— otter-ntu.github.io

  128. minimaxir/gpt-2-simple โญ 3,377
    Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

  129. minimaxir/simpleaichat โญ 3,358
    Python package for easily interfacing with chat apps, with robust features and minimal code complexity.

  130. nvidia/NeMo-Guardrails โญ 3,303
    NeMo Guardrails is an open-source toolkit for easily adding programmable guardrails to LLM-based conversational systems.

  131. skyvern-ai/skyvern โญ 3,280
    Skyvern automates browser-based workflows using LLMs and computer vision. It provides a simple API endpoint to fully automate manual workflows, replacing brittle or unreliable automation solutions.
    ๐Ÿ”— www.skyvern.com

  132. eth-sri/lmql โญ 3,261
    A language for constraint-guided and efficient LLM programming.
    ๐Ÿ”— lmql.ai

  133. guardrails-ai/guardrails โญ 3,246
    Open-source Python package for specifying structure and type, validating and correcting the outputs of large language models (LLMs)
    ๐Ÿ”— www.guardrailsai.com/docs

  134. deep-diver/LLM-As-Chatbot โญ 3,224
    LLM as a Chatbot Service

  135. cg123/mergekit โญ 3,216
    Tools for merging pretrained large language models.

  136. microsoft/LMOps โญ 3,152
    General technology for enabling AI capabilities w/ LLMs and MLLMs
    ๐Ÿ”— aka.ms/generalai

  137. llmware-ai/llmware โญ 3,056
    Providing enterprise-grade LLM-based development framework, tools, and fine-tuned models.
    ๐Ÿ”— pypi.org/project/llmware

  138. baichuan-inc/Baichuan-13B โญ 2,954
    A 13B large language model developed by Baichuan Intelligent Technology
    ๐Ÿ”— huggingface.co/baichuan-inc/baichuan-13b-chat

  139. iryna-kondr/scikit-llm โญ 2,917
    Seamlessly integrate LLMs into scikit-learn.
    ๐Ÿ”— beastbyte.ai

  140. microsoft/torchscale โญ 2,906
    Foundation Architecture for (M)LLMs
    ๐Ÿ”— aka.ms/generalai

  141. phidatahq/phidata โญ 2,876
    Phidata is a toolkit for building AI Assistants using function calling.
    ๐Ÿ”— docs.phidata.com

  142. freedomintelligence/LLMZoo โญ 2,858
    โšกLLM Zoo is a project that provides data, models, and evaluation benchmark for large language models.โšก

  143. next-gpt/NExT-GPT โญ 2,826
    Code and models for NExT-GPT: Any-to-Any Multimodal Large Language Model
    ๐Ÿ”— next-gpt.github.io

  144. simonw/llm โญ 2,812
    Access large language models from the command-line
    ๐Ÿ”— llm.datasette.io

  145. juncongmoo/pyllama โญ 2,779
    LLaMA: Open and Efficient Foundation Language Models

  146. defog-ai/sqlcoder โญ 2,663
    SoTA LLM for converting natural language questions to SQL queries

  147. paperswithcode/galai โญ 2,637
    Model API for GALACTICA

  148. li-plus/chatglm.cpp โญ 2,546
    C++ implementation of ChatGLM-6B & ChatGLM2-6B & ChatGLM3 & more LLMs

  149. alpha-vllm/LLaMA2-Accessory โญ 2,452
    An Open-source Toolkit for LLM Development
    ๐Ÿ”— llama2-accessory.readthedocs.io

  150. pathwaycom/llm-app โญ 2,444
    LLM App templates for RAG, knowledge mining, and stream analytics. Ready to run with Docker,โšกin sync with your data sources.
    ๐Ÿ”— pathway.com/developers/showcases/llm-app-pathway

  151. hegelai/prompttools โญ 2,412
    Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chroma, Weaviate, LanceDB).
    ๐Ÿ”— prompttools.readthedocs.io

  152. promptfoo/promptfoo โญ 2,373
    Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models with CI/CD integration.
    ๐Ÿ”— www.promptfoo.dev

  153. open-compass/opencompass โญ 2,371
    OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
    ๐Ÿ”— opencompass.org.cn

  154. ofa-sys/OFA โญ 2,313
    Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework

  155. databricks/dbrx โญ 2,301
    Code examples and resources for DBRX, a large language model developed by Databricks
    ๐Ÿ”— www.databricks.com

  156. langchain-ai/langgraph โญ 2,258
    LangGraph is a library for building stateful, multi-actor applications with LLMs, built on top of (and intended to be used with) LangChain.

  157. civitai/sd_civitai_extension โญ 2,254
    All of the Civitai models inside Automatic 1111 Stable Diffusion Web UI

  158. young-geng/EasyLM โญ 2,218
    Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

  159. sgl-project/sglang โญ 2,203
    SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.

  160. weaviate/Verba โญ 2,125
    Retrieval Augmented Generation (RAG) chatbot powered by Weaviate

  161. openai/finetune-transformer-lm โญ 2,070
    Code and model for the paper "Improving Language Understanding by Generative Pre-Training"
    ๐Ÿ”— s3-us-west-2.amazonaws.com/openai-assets/research-covers/language-unsupervised/language_understanding_paper.pdf

  162. bclavie/RAGatouille โญ 2,037
    Bridging the gap between state-of-the-art research and alchemical RAG pipeline practices.

  163. openai/image-gpt โญ 1,999
    Archived. Code and models from the paper "Generative Pretraining from Pixels"

  164. huggingface/text-embeddings-inference โญ 1,942
    A blazing fast inference solution for text embeddings models
    ๐Ÿ”— huggingface.co/docs/text-embeddings-inference/quick_tour

  165. intel/neural-compressor โญ 1,933
    SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime
    ๐Ÿ”— intel.github.io/neural-compressor

  166. noahshinn/reflexion โญ 1,919
    [NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning

  167. cheshire-cat-ai/core โญ 1,916
    Production ready AI assistant framework
    ๐Ÿ”— cheshirecat.ai

  168. tairov/llama2.mojo โญ 1,903
    Inference Llama 2 in one file of pure ๐Ÿ”ฅ
    ๐Ÿ”— www.modular.com/blog/community-spotlight-how-i-built-llama2-by-aydyn-tairov

  169. openai/gpt-2-output-dataset โญ 1,879
    Dataset of GPT-2 outputs for research in detection, biases, and more

  170. lucidrains/toolformer-pytorch โญ 1,877
    Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI

  171. spcl/graph-of-thoughts โญ 1,829
    Official Implementation of "Graph of Thoughts: Solving Elaborate Problems with Large Language Models"
    ๐Ÿ”— arxiv.org/pdf/2308.09687.pdf

  172. minimaxir/aitextgen โญ 1,826
    A robust Python tool for text-based AI training and generation using GPT-2.
    ๐Ÿ”— docs.aitextgen.io

  173. neulab/prompt2model โญ 1,818
    prompt2model - Generate Deployable Models from Natural Language Instructions

  174. openai/gpt-discord-bot โญ 1,701
    Example Discord bot written in Python that uses the completions API to have conversations with the text-davinci-003 model, and the moderations API to filter the messages.

  175. ist-daslab/gptq โญ 1,683
    Code for the ICLR 2023 paper "GPTQ: Accurate Post-training Quantization of Generative Pretrained Transformers".
    ๐Ÿ”— arxiv.org/abs/2210.17323

  176. microsoft/Megatron-DeepSpeed โญ 1,573
    Ongoing research training transformer language models at scale, including: BERT & GPT-2

  177. truera/trulens โญ 1,562
    Evaluation and Tracking for LLM Experiments
    ๐Ÿ”— www.trulens.org

  178. epfllm/meditron โญ 1,518
    Meditron is a suite of open-source medical Large Language Models (LLMs).
    ๐Ÿ”— huggingface.co/epfl-llm

  179. ray-project/llm-applications โญ 1,457
    A comprehensive guide to building RAG-based LLM applications for production.

  180. jina-ai/thinkgpt โญ 1,457
    Agent techniques to augment your LLM and push it beyong its limits

  181. predibase/lorax โญ 1,440
    Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
    ๐Ÿ”— loraexchange.ai

  182. cstankonrad/long_llama โญ 1,424
    LongLLaMA is a large language model capable of handling long contexts. It is based on OpenLLaMA and fine-tuned with the Focused Transformer (FoT) method.

  183. farizrahman4u/loopgpt โญ 1,385
    Re-implementation of Auto-GPT as a python package, written with modularity and extensibility in mind.

  184. akariasai/self-rag โญ 1,355
    This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
    ๐Ÿ”— selfrag.github.io

  185. explosion/spacy-transformers โญ 1,312
    ๐Ÿ›ธ Use pretrained transformers like BERT, XLNet and GPT-2 in spaCy
    ๐Ÿ”— spacy.io/usage/embeddings-transformers

  186. run-llama/llama-lab โญ 1,288
    Llama Lab is a repo dedicated to building cutting-edge projects using LlamaIndex

  187. bigscience-workshop/Megatron-DeepSpeed โญ 1,228
    Ongoing research training transformer language models at scale, including: BERT & GPT-2

  188. chatarena/chatarena โญ 1,210
    ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
    ๐Ÿ”— www.chatarena.org

  189. srush/MiniChain โญ 1,169
    A tiny library for coding with large language models.
    ๐Ÿ”— srush-minichain.hf.space

  190. ray-project/ray-llm โญ 1,118
    RayLLM - LLMs on Ray
    ๐Ÿ”— aviary.anyscale.com

  191. ibm/Dromedary โญ 1,085
    Dromedary: towards helpful, ethical and reliable LLMs.

  192. nomic-ai/pygpt4all โญ 1,022
    Official supported Python bindings for llama.cpp + gpt4all
    ๐Ÿ”— nomic-ai.github.io/pygpt4all

  193. rlancemartin/auto-evaluator โญ 1,014
    Evaluation tool for LLM QA chains
    ๐Ÿ”— autoevaluator.langchain.com

  194. lupantech/chameleon-llm โญ 1,012
    Codes for "Chameleon: Plug-and-Play Compositional Reasoning with Large Language Models".
    ๐Ÿ”— chameleon-llm.github.io

  195. linksoul-ai/AutoAgents โญ 999
    Generate different roles for GPTs to form a collaborative entity for complex tasks.
    ๐Ÿ”— huggingface.co/spaces/linksoul/autoagents

  196. ctlllll/LLM-ToolMaker โญ 997
    Large Language Models as Tool Makers

  197. meetkai/functionary โญ 979
    Chat language model that can interpret and execute functions/plugins

  198. microsoft/Llama-2-Onnx โญ 978
    A Microsoft optimized version of the Llama 2 model, available from Meta

  199. keirp/automatic_prompt_engineer โญ 974
    Large Language Models Are Human-Level Prompt Engineers

  200. hao-ai-lab/LookaheadDecoding โญ 960
    Break the Sequential Dependency of LLM Inference Using Lookahead Decoding

  201. ajndkr/lanarky โญ 937
    The web framework for building LLM microservices
    ๐Ÿ”— lanarky.ajndkr.com

  202. explosion/spacy-llm โญ 917
    ๐Ÿฆ™ Integrating LLMs into structured NLP pipelines
    ๐Ÿ”— spacy.io/usage/large-language-models

  203. pinecone-io/canopy โญ 865
    Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
    ๐Ÿ”— www.pinecone.io

  204. cerebras/modelzoo โญ 846
    Examples of common deep learning models that can be trained on Cerebras hardware

  205. muennighoff/sgpt โญ 806
    SGPT: GPT Sentence Embeddings for Semantic Search
    ๐Ÿ”— arxiv.org/abs/2202.08904

  206. agenta-ai/agenta โญ 799
    The all-in-one LLM developer platform: prompt management, evaluation, human feedback, and deployment all in one place.
    ๐Ÿ”— www.agenta.ai

  207. huggingface/nanotron โญ 748
    Minimalistic large language model 3D-parallelism training

  208. oliveirabruno01/babyagi-asi โญ 745
    BabyAGI: an Autonomous and Self-Improving agent, or BASI

  209. opengenerativeai/GenossGPT โญ 726
    One API for all LLMs either Private or Public (Anthropic, Llama V2, GPT 3.5/4, Vertex, GPT4ALL, HuggingFace ...) ๐ŸŒˆ๐Ÿ‚ Replace OpenAI GPT with any LLMs in your app with one line.
    ๐Ÿ”— genoss.ai

  210. salesforce/xgen โญ 711
    Salesforce open-source LLMs with 8k sequence length.

  211. datadreamer-dev/DataDreamer โญ 627
    DataDreamer is a powerful open-source Python library for prompting, synthetic data generation, and training workflows. It is designed to be simple, extremely efficient, and research-grade.
    ๐Ÿ”— datadreamer.dev

  212. langchain-ai/langsmith-cookbook โญ 582
    LangSmith is a platform for building production-grade LLM applications.
    ๐Ÿ”— langsmith-cookbook.vercel.app

  213. squeezeailab/SqueezeLLM โญ 556
    SqueezeLLM: Dense-and-Sparse Quantization
    ๐Ÿ”— arxiv.org/abs/2306.07629

  214. opengvlab/OmniQuant โญ 545
    [ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.

  215. lupantech/ScienceQA โญ 540
    Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".

  216. hazyresearch/ama_prompting โญ 529
    Ask Me Anything language model prompting

  217. zhudotexe/kani โญ 522
    kani (ใ‚ซใƒ‹) is a highly hackable microframework for chat-based language models with tool use/function calling. (NLP-OSS @ EMNLP 2023)
    ๐Ÿ”— kani.readthedocs.io

  218. continuum-llms/chatgpt-memory โญ 509
    Allows to scale the ChatGPT API to multiple simultaneous sessions with infinite contextual and adaptive memory powered by GPT and Redis datastore.

  219. vahe1994/SpQR โญ 507
    Quantization algorithm and the model evaluation code for SpQR method for LLM compression

  220. huggingface/lighteval โญ 302
    LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and LLM training library nanotron.

Math and Science

Mathematical, numerical and scientific libraries.

  1. numpy/numpy โญ 26,239
    The fundamental package for scientific computing with Python.
    ๐Ÿ”— numpy.org

  2. taichi-dev/taichi โญ 24,709
    Productive, portable, and performant GPU programming in Python: Taichi Lang is an open-source, imperative, parallel programming language for high-performance numerical computation.
    ๐Ÿ”— taichi-lang.org

  3. scipy/scipy โญ 12,403
    SciPy library main repository
    ๐Ÿ”— scipy.org

  4. sympy/sympy โญ 12,349
    A computer algebra system written in pure Python
    ๐Ÿ”— sympy.org

  5. google/or-tools โญ 10,396
    Google Optimization Tools (a.k.a., OR-Tools) is an open-source, fast and portable software suite for solving combinatorial optimization problems.
    ๐Ÿ”— developers.google.com/optimization

  6. z3prover/z3 โญ 9,690
    Z3 is a theorem prover from Microsoft Research with a Python language binding.

  7. cupy/cupy โญ 7,740
    NumPy & SciPy for GPU
    ๐Ÿ”— cupy.dev

  8. google-deepmind/alphageometry โญ 3,603
    Solving Olympiad Geometry without Human Demonstrations

  9. mikedh/trimesh โญ 2,736
    Python library for loading and using triangular meshes.
    ๐Ÿ”— trimesh.org

  10. mckinsey/causalnex โญ 2,134
    A Python library that helps data scientists to infer causation rather than observing correlation.
    ๐Ÿ”— causalnex.readthedocs.io

  11. pyomo/pyomo โญ 1,830
    An object-oriented algebraic modeling language in Python for structured optimization problems.
    ๐Ÿ”— www.pyomo.org

  12. facebookresearch/theseus โญ 1,590
    A library for differentiable nonlinear optimization

  13. google-research/torchsde โญ 1,458
    Differentiable SDE solvers with GPU support and efficient sensitivity analysis.

  14. dynamicslab/pysindy โญ 1,277
    A package for the sparse identification of nonlinear dynamical systems from data
    ๐Ÿ”— pysindy.readthedocs.io/en/latest

  15. geomstats/geomstats โญ 1,145
    Computations and statistics on manifolds with geometric structures.
    ๐Ÿ”— geomstats.ai

  16. cma-es/pycma โญ 1,022
    pycma is a Python implementation of CMA-ES and a few related numerical optimization tools.

  17. sj001/AI-Feynman โญ 583
    Implementation of AI Feynman: a Physics-Inspired Method for Symbolic Regression

  18. willianfuks/tfcausalimpact โญ 573
    Python Causal Impact Implementation Based on Google's R Package. Built using TensorFlow Probability.

  19. brandondube/prysm โญ 233
    Prysm is an open-source library for physical and first-order modeling of optical systems and analysis of related data: numerical and physical optics, integrated modeling, phase retrieval, segmented systems, polynomials and fitting, sequential raytracing.
    ๐Ÿ”— prysm.readthedocs.io/en/stable

  20. lean-dojo/ReProver โญ 158
    Retrieval-Augmented Theorem Provers for Lean
    ๐Ÿ”— leandojo.org

  21. albahnsen/pycircular โญ 86
    pycircular is a Python module for circular data analysis

  22. gbillotey/Fractalshades โญ 26
    Arbitrary-precision fractal explorer - Python package

Machine Learning - General

General and classical machine learning libraries. See below for other sections covering specialised ML areas.

  1. scikit-learn/scikit-learn โญ 57,982
    scikit-learn: machine learning in Python
    ๐Ÿ”— scikit-learn.org

  2. openai/openai-cookbook โญ 55,606
    Examples and guides for using the OpenAI API
    ๐Ÿ”— cookbook.openai.com

  3. tencentarc/GFPGAN โญ 34,476
    GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

  4. google-research/google-research โญ 32,697
    This repository contains code released by Google Research
    ๐Ÿ”— research.google

  5. suno-ai/bark โญ 32,180
    ๐Ÿ”Š Text-Prompted Generative Audio Model

  6. facebookresearch/faiss โญ 27,913
    A library for efficient similarity search and clustering of dense vectors.
    ๐Ÿ”— faiss.ai

  7. google/jax โญ 27,774
    Composable transformations of Python+NumPy programs: differentiate, vectorize, JIT to GPU/TPU, and more
    ๐Ÿ”— jax.readthedocs.io

  8. open-mmlab/mmdetection โญ 27,631
    OpenMMLab Detection Toolbox and Benchmark
    ๐Ÿ”— mmdetection.readthedocs.io

  9. ageron/handson-ml2 โญ 26,834
    A series of Jupyter notebooks that walk you through the fundamentals of Machine Learning and Deep Learning in Python using Scikit-Learn, Keras and TensorFlow 2.

  10. lutzroeder/netron โญ 25,962
    Visualizer for neural network, deep learning and machine learning models
    ๐Ÿ”— netron.app

  11. dmlc/xgboost โญ 25,529
    Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow
    ๐Ÿ”— xgboost.readthedocs.io/en/stable

  12. google/mediapipe โญ 25,321
    Cross-platform, customizable ML solutions for live and streaming media.
    ๐Ÿ”— mediapipe.dev

  13. harisiqbal88/PlotNeuralNet โญ 21,002
    Latex code for making neural networks diagrams

  14. jina-ai/jina โญ 19,876
    โ˜๏ธ Build multimodal AI applications with cloud-native stack
    ๐Ÿ”— docs.jina.ai

  15. onnx/onnx โญ 16,750
    Open standard for machine learning interoperability
    ๐Ÿ”— onnx.ai

  16. microsoft/LightGBM โญ 16,022
    A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning tasks.
    ๐Ÿ”— lightgbm.readthedocs.io/en/latest

  17. tensorflow/tensor2tensor โญ 14,841
    Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

  18. ddbourgin/numpy-ml โญ 14,533
    Machine learning, in numpy
    ๐Ÿ”— numpy-ml.readthedocs.io

  19. aleju/imgaug โญ 14,113
    Image augmentation for machine learning experiments.
    ๐Ÿ”— imgaug.readthedocs.io

  20. ml-explore/mlx โญ 13,848
    MLX is an array framework for machine learning on Apple silicon, brought to you by Apple machine learning research.
    ๐Ÿ”— ml-explore.github.io/mlx

  21. roboflow/supervision โญ 13,796
    We write your reusable computer vision tools. ๐Ÿ’œ
    ๐Ÿ”— supervision.roboflow.com

  22. microsoft/nni โญ 13,709
    An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.
    ๐Ÿ”— nni.readthedocs.io

  23. microsoft/Swin-Transformer โญ 12,869
    This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
    ๐Ÿ”— arxiv.org/abs/2103.14030

  24. jindongwang/transferlearning โญ 12,804
    Transfer learning / domain adaptation / domain generalization / multi-task learning etc. Papers, codes, datasets, applications, tutorials.-่ฟ็งปๅญฆไน 
    ๐Ÿ”— transferlearning.xyz

  25. deepmind/deepmind-research โญ 12,760
    This repository contains implementations and illustrative code to accompany DeepMind publications

  26. spotify/annoy โญ 12,640
    Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk

  27. microsoft/onnxruntime โญ 12,556
    ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
    ๐Ÿ”— onnxruntime.ai

  28. neonbjb/tortoise-tts โญ 11,635
    A multi-voice TTS system trained with an emphasis on quality

  29. deepmind/alphafold โญ 11,622
    Implementation of the inference pipeline of AlphaFold v2

  30. facebookresearch/AnimatedDrawings โญ 10,133
    Code to accompany "A Method for Animating Children's Drawings of the Human Figure"

  31. twitter/the-algorithm-ml โญ 9,871
    Source code for Twitter's Recommendation Algorithm
    ๐Ÿ”— blog.twitter.com/engineering/en_us/topics/open-source/2023/twitter-recommendation-algorithm

  32. optuna/optuna โญ 9,595
    A hyperparameter optimization framework
    ๐Ÿ”— optuna.org

  33. ggerganov/ggml โญ 9,527
    Tensor library for machine learning

  34. statsmodels/statsmodels โญ 9,506
    Statsmodels: statistical modeling and econometrics in Python
    ๐Ÿ”— www.statsmodels.org/devel

  35. epistasislab/tpot โญ 9,485
    A Python Automated Machine Learning tool that optimizes machine learning pipelines using genetic programming.
    ๐Ÿ”— epistasislab.github.io/tpot

  36. megvii-basedetection/YOLOX โญ 8,971
    YOLOX is a high-performance anchor-free YOLO, exceeding yolov3~v5 with MegEngine, ONNX, TensorRT, ncnn, and OpenVINO supported. Documentation: https://yolox.readthedocs.io/

  37. cleanlab/cleanlab โญ 8,396
    The standard data-centric AI package for data quality and machine learning with messy, real-world data and labels.
    ๐Ÿ”— cleanlab.ai

  38. pycaret/pycaret โญ 8,372
    An open-source, low-code machine learning library in Python
    ๐Ÿ”— www.pycaret.org

  39. wandb/wandb โญ 8,152
    ๐Ÿ”ฅ A tool for visualizing and tracking your machine learning experiments. This repo contains the CLI and Python API.
    ๐Ÿ”— wandb.ai

  40. pymc-devs/pymc โญ 8,128
    Bayesian Modeling and Probabilistic Programming in Python
    ๐Ÿ”— docs.pymc.io

  41. uberi/speech_recognition โญ 8,014
    Speech recognition module for Python, supporting several engines and APIs, online and offline.
    ๐Ÿ”— pypi.python.org/pypi/speechrecognition

  42. catboost/catboost โญ 7,719
    A fast, scalable, high performance Gradient Boosting on Decision Trees library, used for ranking, classification, regression and other machine learning tasks for Python, R, Java, C++. Supports computation on CPU and GPU.
    ๐Ÿ”— catboost.ai

  43. facebookresearch/xformers โญ 7,471
    Hackable and optimized Transformers building blocks, supporting a composable construction.
    ๐Ÿ”— facebookresearch.github.io/xformers

  44. automl/auto-sklearn โญ 7,389
    Automated Machine Learning with scikit-learn
    ๐Ÿ”— automl.github.io/auto-sklearn

  45. open-mmlab/mmsegmentation โญ 7,339
    OpenMMLab Semantic Segmentation Toolbox and Benchmark.
    ๐Ÿ”— mmsegmentation.readthedocs.io/en/main

  46. hyperopt/hyperopt โญ 7,072
    Distributed Asynchronous Hyperparameter Optimization in Python
    ๐Ÿ”— hyperopt.github.io/hyperopt

  47. awslabs/autogluon โญ 7,045
    AutoGluon: Fast and Accurate ML in 3 Lines of Code
    ๐Ÿ”— auto.gluon.ai

  48. featurelabs/featuretools โญ 7,012
    An open source python library for automated feature engineering
    ๐Ÿ”— www.featuretools.com

  49. lmcinnes/umap โญ 6,920
    Uniform Manifold Approximation and Projection

  50. huggingface/accelerate โญ 6,881
    ๐Ÿš€ A simple way to launch, train, and use PyTorch models on almost any device and distributed configuration, automatic mixed precision (including fp8), and easy-to-configure FSDP and DeepSpeed support
    ๐Ÿ”— huggingface.co/docs/accelerate

  51. hips/autograd โญ 6,773
    Efficiently computes derivatives of numpy code.

  52. py-why/dowhy โญ 6,704
    DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
    ๐Ÿ”— www.pywhy.org/dowhy

  53. scikit-learn-contrib/imbalanced-learn โญ 6,682
    A Python Package to Tackle the Curse of Imbalanced Datasets in Machine Learning
    ๐Ÿ”— imbalanced-learn.org

  54. open-mmlab/mmagic โญ 6,543
    OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic ๐Ÿช„: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image generation, image/video restoration/enhancement, etc.
    ๐Ÿ”— mmagic.readthedocs.io/en/latest

  55. probml/pyprobml โญ 6,216
    Python code for "Probabilistic Machine learning" book by Kevin Murphy

  56. nicolashug/Surprise โญ 6,169
    A Python scikit for building and analyzing recommender systems
    ๐Ÿ”— surpriselib.com

  57. google/automl โญ 6,143
    Google Brain AutoML

  58. cleverhans-lab/cleverhans โญ 6,070
    An adversarial example library for constructing attacks, building defenses, and benchmarking both

  59. kevinmusgrave/pytorch-metric-learning โญ 5,743
    The easiest way to use deep metric learning in your application. Modular, flexible, and extensible. Written in PyTorch.
    ๐Ÿ”— kevinmusgrave.github.io/pytorch-metric-learning

  60. open-mmlab/mmcv โญ 5,571
    OpenMMLab Computer Vision Foundation
    ๐Ÿ”— mmcv.readthedocs.io/en/latest

  61. project-monai/MONAI โญ 5,258
    AI Toolkit for Healthcare Imaging
    ๐Ÿ”— monai.io

  62. mdbloice/Augmentor โญ 5,018
    Image augmentation library in Python for machine learning.
    ๐Ÿ”— augmentor.readthedocs.io/en/stable

  63. ml-explore/mlx-examples โญ 4,807
    Examples in the MLX framework

  64. rasbt/mlxtend โญ 4,757
    A library of extension and helper modules for Python's data analysis and machine learning libraries.
    ๐Ÿ”— rasbt.github.io/mlxtend

  65. online-ml/river โญ 4,751
    ๐ŸŒŠ Online machine learning in Python
    ๐Ÿ”— riverml.xyz

  66. uber/causalml โญ 4,724
    Uplift modeling and causal inference with machine learning algorithms

  67. lucidrains/deep-daze โญ 4,387
    Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun

  68. google-deepmind/graphcast โญ 4,194
    GraphCast: Learning skillful medium-range global weather forecasting

  69. districtdatalabs/yellowbrick โญ 4,190
    Visual analysis and diagnostic tools to facilitate machine learning model selection.
    ๐Ÿ”— www.scikit-yb.org

  70. skvark/opencv-python โญ 4,145
    Automated CI toolchain to produce precompiled opencv-python, opencv-python-headless, opencv-contrib-python and opencv-contrib-python-headless packages.
    ๐Ÿ”— pypi.org/project/opencv-python

  71. nv-tlabs/GET3D โญ 4,110
    Generative Model of High Quality 3D Textured Shapes Learned from Images

  72. marqo-ai/marqo โญ 4,085
    Unified embedding generation and search engine. Also available on cloud - cloud.marqo.ai
    ๐Ÿ”— www.marqo.ai

  73. sanchit-gandhi/whisper-jax โญ 4,046
    JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

  74. apple/coremltools โญ 4,034
    Core ML tools contain supporting tools for Core ML model conversion, editing, and validation.
    ๐Ÿ”— coremltools.readme.io

  75. nmslib/hnswlib โญ 3,983
    Header-only C++/python library for fast approximate nearest neighbors
    ๐Ÿ”— github.com/nmslib/hnswlib

  76. cmusphinx/pocketsphinx โญ 3,725
    A small speech recognizer

  77. microsoft/FLAML โญ 3,661
    A fast library for AutoML and tuning. Join our Discord: https://discord.gg/Cppx2vSPVP.
    ๐Ÿ”— microsoft.github.io/flaml

  78. ourownstory/neural_prophet โญ 3,625
    NeuralProphet: A simple forecasting package
    ๐Ÿ”— neuralprophet.com

  79. py-why/EconML โญ 3,528
    ALICE (Automated Learning and Intelligence for Causation and Economics) is a Microsoft Research project aimed at applying Artificial Intelligence concepts to economic decision making. One of its goals is to build a toolkit that combines state-of-the-art machine learning techniques with econometrics in order to brin...
    ๐Ÿ”— www.microsoft.com/en-us/research/project/alice

  80. thudm/CogVideo โญ 3,477
    Text-to-video generation. The repo for ICLR2023 paper "CogVideo: Large-scale Pretraining for Text-to-Video Generation via Transformers"

  81. huggingface/notebooks โญ 3,244
    Notebooks using the Hugging Face libraries ๐Ÿค—

  82. facebookresearch/vissl โญ 3,224
    VISSL is FAIR's library of extensible, modular and scalable components for SOTA Self-Supervised Learning with images.
    ๐Ÿ”— vissl.ai

  83. rucaibox/RecBole โญ 3,154
    A unified, comprehensive and efficient recommendation library
    ๐Ÿ”— recbole.io

  84. yoheinakajima/instagraph โญ 3,152
    Converts text input or URL into knowledge graph and displays

  85. pytorch/glow โญ 3,132
    Compiler for Neural Network hardware accelerators

  86. hrnet/HRNet-Semantic-Segmentation โญ 3,047
    The OCR approach is rephrased as Segmentation Transformer: https://arxiv.org/abs/1909.11065. This is an official implementation of semantic segmentation for HRNet. https://arxiv.org/abs/1908.07919

  87. huggingface/autotrain-advanced โญ 3,007
    AutoTrain Advanced: faster and easier training and deployments of state-of-the-art machine learning models
    ๐Ÿ”— huggingface.co/autotrain

  88. lucidrains/musiclm-pytorch โญ 2,999
    Implementation of MusicLM, Google's new SOTA model for music generation using attention networks, in Pytorch

  89. mljar/mljar-supervised โญ 2,923
    Python package for AutoML on Tabular Data with Feature Engineering, Hyper-Parameters Tuning, Explanations and Automatic Documentation
    ๐Ÿ”— mljar.com

  90. zjunlp/DeepKE โญ 2,875
    [EMNLP 2022] An Open Toolkit for Knowledge Graph Extraction and Construction
    ๐Ÿ”— deepke.zjukg.cn

  91. lightly-ai/lightly โญ 2,731
    A python library for self-supervised learning on images.
    ๐Ÿ”— docs.lightly.ai/self-supervised-learning

  92. teamhg-memex/eli5 โญ 2,725
    A library for debugging/inspecting machine learning classifiers and explaining their predictions
    ๐Ÿ”— eli5.readthedocs.io

  93. scikit-optimize/scikit-optimize โญ 2,720
    Sequential model-based optimization with a scipy.optimize interface
    ๐Ÿ”— scikit-optimize.github.io

  94. scikit-learn-contrib/hdbscan โญ 2,667
    A high performance implementation of HDBSCAN clustering.
    ๐Ÿ”— hdbscan.readthedocs.io/en/latest

  95. shankarpandala/lazypredict โญ 2,656
    Lazy Predict help build a lot of basic models without much code and helps understand which models works better without any parameter tuning

  96. maif/shapash โญ 2,639
    ๐Ÿ”… Shapash: User-friendly Explainability and Interpretability to Develop Reliable and Transparent Machine Learning Models
    ๐Ÿ”— maif.github.io/shapash

  97. google-research/t5x โญ 2,466
    T5X is a modular, composable, research-friendly framework for high-performance, configurable, self-service training, evaluation, and inference of sequence models (starting with language) at many scales.

  98. apple/ml-ane-transformers โญ 2,458
    Reference implementation of the Transformer architecture optimized for Apple Neural Engine (ANE)

  99. huggingface/safetensors โญ 2,400
    Implements a new simple format for storing tensors safely (as opposed to pickle) and that is still fast (zero-copy).
    ๐Ÿ”— huggingface.co/docs/safetensors

  100. scikit-learn-contrib/category_encoders โญ 2,361
    A library of sklearn compatible categorical variable encoders
    ๐Ÿ”— contrib.scikit-learn.org/category_encoders

  101. freedmand/semantra โญ 2,264
    Semantra is a multipurpose tool for semantically searching documents. Query by meaning rather than just by matching text.

  102. huggingface/optimum โญ 2,115
    ๐Ÿš€ Accelerate training and inference of ๐Ÿค— Transformers and ๐Ÿค— Diffusers with easy to use hardware optimization tools
    ๐Ÿ”— huggingface.co/docs/optimum/main

  103. rom1504/clip-retrieval โญ 2,102
    Easily compute clip embeddings and build a clip retrieval system with them
    ๐Ÿ”— rom1504.github.io/clip-retrieval

  104. aws/sagemaker-python-sdk โญ 2,036
    A library for training and deploying machine learning models on Amazon SageMaker
    ๐Ÿ”— sagemaker.readthedocs.io

  105. rentruewang/koila โญ 1,817
    Prevent PyTorch's CUDA error: out of memory in just 1 line of code.
    ๐Ÿ”— rentruewang.github.io/koila

  106. contextlab/hypertools โญ 1,797
    A Python toolbox for gaining geometric insights into high-dimensional data
    ๐Ÿ”— hypertools.readthedocs.io/en/latest

  107. huggingface/evaluate โญ 1,795
    ๐Ÿค— Evaluate: A library for easily evaluating machine learning models and datasets.
    ๐Ÿ”— huggingface.co/docs/evaluate

  108. linkedin/greykite โญ 1,789
    A flexible, intuitive and fast forecasting library

  109. bmabey/pyLDAvis โญ 1,779
    Python library for interactive topic model visualization. Port of the R LDAvis package.

  110. scikit-learn-contrib/lightning โญ 1,706
    Large-scale linear classification, regression and ranking in Python
    ๐Ÿ”— contrib.scikit-learn.org/lightning

  111. tensorflow/addons โญ 1,680
    Useful extra functionality for TensorFlow 2.x maintained by SIG-addons

  112. huggingface/huggingface_hub โญ 1,643
    The official Python client for the Huggingface Hub.
    ๐Ÿ”— huggingface.co/docs/huggingface_hub

  113. microsoft/i-Code โญ 1,627
    The ambition of the i-Code project is to build integrative and composable multimodal AI. The "i" stands for integrative multimodal learning.

  114. eric-mitchell/direct-preference-optimization โญ 1,574
    Reference implementation for DPO (Direct Preference Optimization)

  115. castorini/pyserini โญ 1,428
    Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.
    ๐Ÿ”— pyserini.io

  116. kubeflow/katib โญ 1,419
    Automated Machine Learning on Kubernetes
    ๐Ÿ”— www.kubeflow.org/docs/components/katib

  117. jina-ai/finetuner โญ 1,414
    ๐ŸŽฏ Task-oriented embedding tuning for BERT, CLIP, etc.
    ๐Ÿ”— finetuner.jina.ai

  118. visual-layer/fastdup โญ 1,395
    fastdup is a powerful free tool designed to rapidly extract valuable insights from your image & video datasets. Assisting you to increase your dataset images & labels quality and reduce your data operations costs at an unparalleled scale.

  119. scikit-learn-contrib/metric-learn โญ 1,375
    Metric learning algorithms in Python
    ๐Ÿ”— contrib.scikit-learn.org/metric-learn

  120. laekov/fastmoe โญ 1,363
    A fast MoE impl for PyTorch
    ๐Ÿ”— fastmoe.ai

  121. googlecloudplatform/vertex-ai-samples โญ 1,328
    Sample code and notebooks for Vertex AI, the end-to-end machine learning platform on Google Cloud
    ๐Ÿ”— cloud.google.com/vertex-ai

  122. csinva/imodels โญ 1,284
    Interpretable ML package ๐Ÿ” for concise, transparent, and accurate predictive modeling (sklearn-compatible).
    ๐Ÿ”— csinva.io/imodels

  123. borealisai/advertorch โญ 1,270
    A Toolbox for Adversarial Robustness Research

  124. awslabs/dgl-ke โญ 1,231
    High performance, easy-to-use, and scalable package for learning large-scale knowledge graph embeddings.
    ๐Ÿ”— dglke.dgl.ai/doc

  125. microsoft/Olive โญ 1,202
    Olive is an easy-to-use hardware-aware model optimization tool that composes industry-leading techniques across model compression, optimization, and compilation.
    ๐Ÿ”— microsoft.github.io/olive

  126. google/vizier โญ 1,170
    Python-based research interface for blackbox and hyperparameter optimization, based on the internal Google Vizier Service.
    ๐Ÿ”— oss-vizier.readthedocs.io

  127. microsoft/Semi-supervised-learning โญ 1,169
    A Unified Semi-Supervised Learning Codebase (NeurIPS'22)
    ๐Ÿ”— usb.readthedocs.io

  128. patchy631/machine-learning โญ 1,154
    Machine Learning Tutorials Repository

  129. koaning/scikit-lego โญ 1,143
    Extra blocks for scikit-learn pipelines.
    ๐Ÿ”— koaning.github.io/scikit-lego

  130. spotify/voyager โญ 1,142
    ๐Ÿ›ฐ๏ธ An approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
    ๐Ÿ”— spotify.github.io/voyager

  131. automl/TabPFN โญ 1,079
    Official implementation of the TabPFN paper (https://arxiv.org/abs/2207.01848) and the tabpfn package.
    ๐Ÿ”— priorlabs.ai

  132. google-research/deeplab2 โญ 982
    DeepLab2 is a TensorFlow library for deep labeling, aiming to provide a unified and state-of-the-art TensorFlow codebase for dense pixel labeling tasks.

  133. lmcinnes/pynndescent โญ 837
    A Python nearest neighbor descent for approximate nearest neighbors

  134. hazyresearch/safari โญ 834
    Convolutions for Sequence Modeling

  135. davidmrau/mixture-of-experts โญ 811
    PyTorch Re-Implementation of "The Sparsely-Gated Mixture-of-Experts Layer" by Noam Shazeer et al. https://arxiv.org/abs/1701.06538

  136. opentensor/bittensor โญ 766
    Internet-scale Neural Networks
    ๐Ÿ”— www.bittensor.com

  137. nvidia/cuda-python โญ 766
    CUDA Python Low-level Bindings
    ๐Ÿ”— nvidia.github.io/cuda-python

  138. oml-team/open-metric-learning โญ 755
    OML is a PyTorch-based framework to train and validate the models producing high-quality embeddings.
    ๐Ÿ”— open-metric-learning.readthedocs.io/en/latest/index.html

  139. qdrant/fastembed โญ 741
    Fast, Accurate, Lightweight Python library to make State of the Art Embedding
    ๐Ÿ”— qdrant.github.io/fastembed

  140. criteo/autofaiss โญ 739
    Automatically create Faiss knn indices with the most optimal similarity search parameters.
    ๐Ÿ”— criteo.github.io/autofaiss

  141. facebookresearch/balance โญ 671
    The balance python package offers a simple workflow and methods for dealing with biased data samples when looking to infer from them to some target population of interest.
    ๐Ÿ”— import-balance.org

  142. awslabs/python-deequ โญ 638
    Python API for Deequ, a library built on Spark for defining "unit tests for data", which measure data quality in large datasets

  143. nicolas-hbt/pygraft โญ 636
    Configurable Generation of Synthetic Schemas and Knowledge Graphs at Your Fingertips
    ๐Ÿ”— pygraft.readthedocs.io/en/latest

  144. hpcaitech/EnergonAI โญ 630
    Large-scale model inference.

  145. qdrant/quaterion โญ 618
    Blazing fast framework for fine-tuning similarity learning models
    ๐Ÿ”— quaterion.qdrant.tech

  146. replicate/replicate-python โญ 617
    Python client for Replicate
    ๐Ÿ”— replicate.com

  147. microsoft/Focal-Transformer โญ 541
    [NeurIPS 2021 Spotlight] Official code for "Focal Self-attention for Local-Global Interactions in Vision Transformers"

  148. nevronai/MetisFL โญ 530
    The first open Federated Learning framework implemented in C++ and Python.
    ๐Ÿ”— metisfl.org

  149. googleapis/python-aiplatform โญ 524
    A Python SDK for Vertex AI, a fully managed, end-to-end platform for data science and machine learning.

  150. huggingface/exporters โญ 524
    Export Hugging Face models to Core ML and TensorFlow Lite

  151. cvxgrp/pymde โญ 516
    Minimum-distortion embedding with PyTorch
    ๐Ÿ”— pymde.org

  152. lightning-ai/litdata โญ 175
    Streamline data pipelines for AI. Process datasets across 1000s of machines, and optimize data for blazing fast model training.

  153. dylanhogg/gptauthor โญ 42
    GPTAuthor is an AI tool for writing long form, multi-chapter stories given a story prompt.

Machine Learning - Deep Learning

Machine learning libraries that cross over with deep learning in some way.

  1. tensorflow/tensorflow โญ 182,186
    An Open Source Machine Learning Framework for Everyone
    ๐Ÿ”— tensorflow.org

  2. pytorch/pytorch โญ 77,461
    Tensors and Dynamic neural networks in Python with strong GPU acceleration
    ๐Ÿ”— pytorch.org

  3. keras-team/keras โญ 60,866
    Deep Learning for humans
    ๐Ÿ”— keras.io

  4. openai/whisper โญ 59,616
    Robust Speech Recognition via Large-Scale Weak Supervision

  5. deepfakes/faceswap โญ 49,099
    Deepfakes Software For All
    ๐Ÿ”— www.faceswap.dev

  6. iperov/DeepFaceLab โญ 45,194
    DeepFaceLab is the leading software for creating deepfakes.

  7. facebookresearch/segment-anything โญ 43,735
    The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.

  8. microsoft/DeepSpeed โญ 32,429
    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
    ๐Ÿ”— www.deepspeed.ai

  9. rwightman/pytorch-image-models โญ 29,579
    PyTorch image models, scripts, pretrained weights -- ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNet-V3/V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
    ๐Ÿ”— huggingface.co/docs/timm

  10. facebookresearch/detectron2 โญ 28,567
    Detectron2 is a platform for object detection, segmentation and other visual recognition tasks.
    ๐Ÿ”— detectron2.readthedocs.io/en/latest

  11. lightning-ai/pytorch-lightning โญ 26,740
    The deep learning framework to pretrain, finetune and deploy AI models. PyTorch Lightning is just organized PyTorch - Lightning disentangles PyTorch code to decouple the science from the engineering.
    ๐Ÿ”— lightning.ai

  12. facebookresearch/Detectron โญ 26,132
    FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

  13. xinntao/Real-ESRGAN โญ 25,853
    Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

  14. matterport/Mask_RCNN โญ 24,097
    Mask R-CNN for object detection and instance segmentation on Keras and TensorFlow

  15. openai/CLIP โญ 21,902
    CLIP (Contrastive Language-Image Pretraining), Predict the most relevant text snippet given an image

  16. paddlepaddle/Paddle โญ 21,576
    PArallel Distributed Deep LEarning: Machine Learning Framework from Industrial Practice ๏ผˆใ€Ž้ฃžๆกจใ€ๆ ธๅฟƒๆก†ๆžถ๏ผŒๆทฑๅบฆๅญฆไน &ๆœบๅ™จๅญฆไน ้ซ˜ๆ€ง่ƒฝๅ•ๆœบใ€ๅˆ†ๅธƒๅผ่ฎญ็ปƒๅ’Œ่ทจๅนณๅฐ้ƒจ็ฝฒ๏ผ‰
    ๐Ÿ”— www.paddlepaddle.org

  17. apache/mxnet โญ 20,705
    Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
    ๐Ÿ”— mxnet.apache.org

  18. pyg-team/pytorch_geometric โญ 20,014
    Graph Neural Network Library for PyTorch
    ๐Ÿ”— pyg.org

  19. lucidrains/vit-pytorch โญ 17,781
    Implementation of Vision Transformer, a simple way to achieve SOTA in vision classification with only a single transformer encoder, in Pytorch

  20. sanster/IOPaint โญ 16,803
    Image inpainting tool powered by SOTA AI Model. Remove any unwanted object, defect, people from your pictures or erase and replace(powered by stable diffusion) any thing on your pictures.
    ๐Ÿ”— www.iopaint.com

  21. rasbt/deeplearning-models โญ 16,280
    A collection of various deep learning architectures, models, and tips

  22. danielgatis/rembg โญ 14,298
    Rembg is a tool to remove images background

  23. albumentations-team/albumentations โญ 13,360
    Fast image augmentation library and an easy-to-use wrapper around other libraries. Documentation: https://albumentations.ai/docs/ Paper about the library: https://www.mdpi.com/2078-2489/11/2/125
    ๐Ÿ”— albumentations.ai

  24. dmlc/dgl โญ 12,966
    Python package built to ease deep learning on graph, on top of existing DL frameworks.
    ๐Ÿ”— dgl.ai

  25. facebookresearch/detr โญ 12,786
    End-to-End Object Detection with Transformers

  26. nvidia/DeepLearningExamples โญ 12,564
    State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

  27. kornia/kornia โญ 9,316
    Geometric Computer Vision Library for Spatial AI
    ๐Ÿ”— kornia.readthedocs.io

  28. keras-team/autokeras โญ 9,061
    AutoML library for deep learning
    ๐Ÿ”— autokeras.com

  29. pyro-ppl/pyro โญ 8,339
    Deep universal probabilistic programming with Python and PyTorch
    ๐Ÿ”— pyro.ai

  30. mlfoundations/open_clip โญ 8,297
    An open source implementation of OpenAI's CLIP (Contrastive Language-Image Pre-training)

  31. facebookresearch/pytorch3d โญ 8,251
    PyTorch3D is FAIR's library of reusable components for deep learning with 3D data
    ๐Ÿ”— pytorch3d.org

  32. nvidia/apex โญ 8,000
    A PyTorch Extension: Tools for easy mixed precision and distributed training in Pytorch

  33. google/trax โญ 7,947
    Trax โ€” Deep Learning with Clear Code and Speed

  34. arogozhnikov/einops โญ 7,877
    Flexible and powerful tensor operations for readable and reliable code (for pytorch, jax, TF and others)
    ๐Ÿ”— einops.rocks

  35. facebookresearch/ImageBind โญ 7,831
    ImageBind One Embedding Space to Bind Them All

  36. lucidrains/imagen-pytorch โญ 7,752
    Implementation of Imagen, Google's Text-to-Image Neural Network, in Pytorch

  37. xpixelgroup/BasicSR โญ 6,134
    Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.
    ๐Ÿ”— basicsr.readthedocs.io/en/latest

  38. skorch-dev/skorch โญ 5,609
    A scikit-learn compatible neural network library that wraps PyTorch

  39. google/flax โญ 5,469
    Flax is a neural network library for JAX that is designed for flexibility.
    ๐Ÿ”— flax.readthedocs.io

  40. facebookresearch/mmf โญ 5,414
    A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
    ๐Ÿ”— mmf.sh

  41. mosaicml/composer โญ 4,968
    Supercharge Your Model Training
    ๐Ÿ”— docs.mosaicml.com

  42. pytorch/ignite โญ 4,448
    High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.
    ๐Ÿ”— pytorch-ignite.ai

  43. facebookincubator/AITemplate โญ 4,440
    AITemplate is a Python framework which renders neural network into high performance CUDA/HIP C++ code. Specialized for FP16 TensorCore (NVIDIA GPU) and MatrixCore (AMD GPU) inference.

  44. deci-ai/super-gradients โญ 4,312
    Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
    ๐Ÿ”— www.supergradients.com

  45. nvidiagameworks/kaolin โญ 4,196
    A PyTorch Library for Accelerating 3D Deep Learning Research

  46. williamyang1991/VToonify โญ 3,459
    [SIGGRAPH Asia 2022] VToonify: Controllable High-Resolution Portrait Video Style Transfer

  47. facebookresearch/PyTorch-BigGraph โญ 3,347
    Generate embeddings from large-scale graph-structured data.
    ๐Ÿ”— torchbiggraph.readthedocs.io

  48. alpa-projects/alpa โญ 2,970
    Training and serving large-scale neural networks with auto parallelization.
    ๐Ÿ”— alpa.ai

  49. pytorch/botorch โญ 2,939
    Bayesian optimization in PyTorch
    ๐Ÿ”— botorch.org

  50. cvg/LightGlue โญ 2,927
    LightGlue: Local Feature Matching at Light Speed (ICCV 2023)

  51. deepmind/dm-haiku โญ 2,799
    JAX-based neural network library
    ๐Ÿ”— dm-haiku.readthedocs.io

  52. explosion/thinc โญ 2,790
    ๐Ÿ”ฎ A refreshing functional take on deep learning, compatible with your favorite libraries
    ๐Ÿ”— thinc.ai

  53. nerdyrodent/VQGAN-CLIP โญ 2,568
    Just playing with getting VQGAN+CLIP running locally, rather than having to use colab.

  54. danielegrattarola/spektral โญ 2,343
    Graph Neural Networks with Keras and Tensorflow 2.
    ๐Ÿ”— graphneural.network

  55. google-research/electra โญ 2,286
    ELECTRA: Pre-training Text Encoders as Discriminators Rather Than Generators

  56. neuralmagic/sparseml โญ 1,965
    Libraries for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models

  57. fepegar/torchio โญ 1,949
    Medical imaging toolkit for deep learning
    ๐Ÿ”— www.torchio.org

  58. pytorch/torchrec โญ 1,716
    Pytorch domain library for recommendation systems

  59. tensorflow/mesh โญ 1,551
    Mesh TensorFlow: Model Parallelism Made Easier

  60. vt-vl-lab/FGVC โญ 1,544
    [ECCV 2020] Flow-edge Guided Video Completion

  61. tensorly/tensorly โญ 1,493
    TensorLy: Tensor Learning in Python.
    ๐Ÿ”— tensorly.org

  62. calculatedcontent/WeightWatcher โญ 1,388
    The WeightWatcher tool for predicting the accuracy of Deep Neural Networks

  63. hysts/pytorch_image_classification โญ 1,309
    PyTorch implementation of image classification models for CIFAR-10/CIFAR-100/MNIST/FashionMNIST/Kuzushiji-MNIST/ImageNet

  64. jeshraghian/snntorch โญ 1,053
    Deep and online learning with spiking neural networks in Python
    ๐Ÿ”— snntorch.readthedocs.io/en/latest

  65. xl0/lovely-tensors โญ 1,045
    Tensors, ready for human consumption
    ๐Ÿ”— xl0.github.io/lovely-tensors

  66. tensorflow/similarity โญ 994
    TensorFlow Similarity is a python package focused on making similarity learning quick and easy.

  67. deepmind/android_env โญ 949
    RL research on Android devices.

  68. keras-team/keras-cv โญ 941
    Industry-strength Computer Vision workflows with Keras

  69. deepmind/chex โญ 709
    Chex is a library of utilities for helping to write reliable JAX code
    ๐Ÿ”— chex.readthedocs.io

  70. kakaobrain/rq-vae-transformer โญ 680
    The official implementation of Autoregressive Image Generation using Residual Quantization (CVPR '22)

  71. whitead/dmol-book โญ 576
    Deep learning for molecules and materials book
    ๐Ÿ”— dmol.pub

Machine Learning - Interpretability

Machine learning interpretability libraries. Covers explainability, prediction explainations, dashboards, understanding knowledge development in training.

  1. slundberg/shap โญ 21,534
    A game theoretic approach to explain the output of any machine learning model.
    ๐Ÿ”— shap.readthedocs.io

  2. marcotcr/lime โญ 11,257
    Lime: Explaining the predictions of any machine learning classifier

  3. interpretml/interpret โญ 5,980
    Fit interpretable models. Explain blackbox machine learning.
    ๐Ÿ”— interpret.ml/docs

  4. tensorflow/lucid โญ 4,613
    A collection of infrastructure and tools for research in neural network interpretability.

  5. pytorch/captum โญ 4,542
    Model interpretability and understanding for PyTorch
    ๐Ÿ”— captum.ai

  6. pair-code/lit โญ 3,374
    The Learning Interpretability Tool: Interactively analyze ML models to understand their behavior in an extensible and framework agnostic interface.
    ๐Ÿ”— pair-code.github.io/lit

  7. arize-ai/phoenix โญ 2,531
    AI Observability & Evaluation
    ๐Ÿ”— docs.arize.com/phoenix

  8. seldonio/alibi โญ 2,283
    Algorithms for explaining machine learning models
    ๐Ÿ”— docs.seldon.io/projects/alibi/en/stable

  9. oegedijk/explainerdashboard โญ 2,221
    Quickly build Explainable AI dashboards that show the inner workings of so-called "blackbox" machine learning models.
    ๐Ÿ”— explainerdashboard.readthedocs.io

  10. eleutherai/pythia โญ 2,011
    Interpretability analysis and scaling laws to understand how knowledge develops and evolves during training in autoregressive transformers

  11. jalammar/ecco โญ 1,898
    Explain, analyze, and visualize NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2, BERT, RoBERTA, T5, and T0).
    ๐Ÿ”— ecco.readthedocs.io

  12. cdpierse/transformers-interpret โญ 1,199
    Model explainability that works seamlessly with ๐Ÿค— transformers. Explain your transformers model in just 2 lines of code.

  13. selfexplainml/PiML-Toolbox โญ 866
    PiML (Python Interpretable Machine Learning) toolbox for model development & diagnostics
    ๐Ÿ”— selfexplainml.github.io/piml-toolbox

Machine Learning - Ops

MLOps tools, frameworks and libraries: intersection of machine learning, data engineering and DevOps; deployment, health, diagnostics and governance of ML models.

  1. apache/airflow โญ 34,334
    Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
    ๐Ÿ”— airflow.apache.org

  2. ray-project/ray โญ 30,882
    Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a set of AI Libraries for accelerating ML workloads.
    ๐Ÿ”— ray.io

  3. spotify/luigi โญ 17,285
    Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.

  4. mlflow/mlflow โญ 17,178
    Open source platform for the machine learning lifecycle
    ๐Ÿ”— mlflow.org

  5. prefecthq/prefect โญ 14,495
    Prefect is a workflow orchestration tool empowering developers to build, observe, and react to data pipelines
    ๐Ÿ”— prefect.io

  6. horovod/horovod โญ 13,930
    Distributed training framework for TensorFlow, Keras, PyTorch, and Apache MXNet.
    ๐Ÿ”— horovod.ai

  7. iterative/dvc โญ 13,087
    ๐Ÿฆ‰ ML Experiments and Data Management with Git
    ๐Ÿ”— dvc.org

  8. ludwig-ai/ludwig โญ 10,759
    Low-code framework for building custom LLMs, neural networks, and other AI models
    ๐Ÿ”— ludwig.ai

  9. dagster-io/dagster โญ 10,114
    An orchestration platform for the development, production, and observation of data assets.
    ๐Ÿ”— dagster.io

  10. great-expectations/great_expectations โญ 9,418
    Always know what to expect from your data.
    ๐Ÿ”— docs.greatexpectations.io

  11. kedro-org/kedro โญ 9,337
    Kedro is a toolbox for production-ready data science. It uses software engineering best practices to help you create data engineering and data science pipelines that are reproducible, maintainable, and modular.
    ๐Ÿ”— kedro.org

  12. dbt-labs/dbt-core โญ 8,832
    dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.
    ๐Ÿ”— getdbt.com

  13. bentoml/OpenLLM โญ 8,650
    Run any open-source LLMs, such as Llama 2, Mistral, as OpenAI compatible API endpoint, locally and in the cloud.
    ๐Ÿ”— bentoml.com

  14. activeloopai/deeplake โญ 7,673
    Database for AI. Store Vectors, Images, Texts, Videos, etc. Use with LLMs/LangChain. Store, query, version, & visualize any AI data. Stream data in real-time to PyTorch/TensorFlow. https://activeloop.ai
    ๐Ÿ”— activeloop.ai

  15. netflix/metaflow โญ 7,538
    ๐Ÿš€ Build and manage real-life ML, AI, and data science projects with ease!
    ๐Ÿ”— metaflow.org

  16. mage-ai/mage-ai โญ 6,936
    ๐Ÿง™ Build, run, and manage data pipelines for integrating and transforming data.
    ๐Ÿ”— www.mage.ai

  17. bentoml/BentoML โญ 6,502
    The most flexible way to serve AI/ML models in production - Build Model Inference Service, LLM APIs, Inference Graph/Pipelines, Compound AI systems, Multi-Modal, RAG as a Service, and more!
    ๐Ÿ”— bentoml.com

  18. kestra-io/kestra โญ 6,209
    Infinitely scalable, event-driven, language-agnostic orchestration and scheduling platform to manage millions of workflows declaratively in code.
    ๐Ÿ”— kestra.io

  19. feast-dev/feast โญ 5,239
    Feature Store for Machine Learning
    ๐Ÿ”— feast.dev

  20. allegroai/clearml โญ 5,210
    ClearML - Auto-Magical CI/CD to streamline your AI workload. Experiment Management, Data Management, Pipeline, Orchestration, Scheduling & Serving in one MLOps/LLMOps solution
    ๐Ÿ”— clear.ml/docs

  21. aimhubio/aim โญ 4,748
    Aim ๐Ÿ’ซ โ€” An easy-to-use & supercharged open-source experiment tracker.
    ๐Ÿ”— aimstack.io

  22. flyteorg/flyte โญ 4,713
    Scalable and flexible workflow orchestration platform that seamlessly unifies data, ML and analytics stacks.
    ๐Ÿ”— flyte.org

  23. evidentlyai/evidently โญ 4,598
    Evaluate and monitor ML models from validation to production. Join our Discord: https://discord.com/invite/xZjKRaNp8b

  24. adap/flower โญ 4,104
    Flower: A Friendly Federated Learning Framework
    ๐Ÿ”— flower.ai

  25. orchest/orchest โญ 4,018
    Build data pipelines, the easy way ๐Ÿ› ๏ธ
    ๐Ÿ”— orchest.readthedocs.io/en/stable

  26. zenml-io/zenml โญ 3,633
    ZenML ๐Ÿ™: Build portable, production-ready MLOps pipelines. https://zenml.io.
    ๐Ÿ”— zenml.io

  27. polyaxon/polyaxon โญ 3,476
    MLOps Tools For Managing & Orchestrating The Machine Learning LifeCycle
    ๐Ÿ”— polyaxon.com

  28. kubeflow/pipelines โญ 3,431
    Machine Learning Pipelines for Kubeflow
    ๐Ÿ”— www.kubeflow.org/docs/components/pipelines

  29. ploomber/ploomber โญ 3,369
    The fastest โšก๏ธ way to build data pipelines. Develop iteratively, deploy anywhere. โ˜๏ธ
    ๐Ÿ”— docs.ploomber.io

  30. towhee-io/towhee โญ 2,952
    Towhee is a framework that is dedicated to making neural data processing pipelines simple and fast.
    ๐Ÿ”— towhee.io

  31. determined-ai/determined โญ 2,843
    Determined is an open-source machine learning platform that simplifies distributed training, hyperparameter tuning, experiment tracking, and resource management. Works with PyTorch and TensorFlow.
    ๐Ÿ”— determined.ai

  32. leptonai/leptonai โญ 2,405
    A Pythonic framework to simplify AI service building
    ๐Ÿ”— lepton.ai

  33. meltano/meltano โญ 1,573
    Meltano: the declarative code-first data integration engine that powers your wildest data and ML-powered product ideas. Say goodbye to writing, maintaining, and scaling your own API integrations.
    ๐Ÿ”— meltano.com

  34. hi-primus/optimus โญ 1,441
    ๐Ÿšš Agile Data Preparation Workflows madeย easy with Pandas, Dask, cuDF, Dask-cuDF, Vaex and PySpark
    ๐Ÿ”— hi-optimus.com

  35. kubeflow/examples โญ 1,371
    A repository to host extended examples and tutorials

  36. dagworks-inc/hamilton โญ 1,301
    Hamilton helps data scientists and engineers define testable, modular, self-documenting dataflows, that encode lineage and metadata. Runs and scales everywhere python does.
    ๐Ÿ”— hamilton.dagworks.io/en/latest

  37. azure/PyRIT โญ 1,232
    The Python Risk Identification Tool for generative AI (PyRIT) is an open access automation framework to empower security professionals and ML engineers to red team foundation models and their applications.

  38. dstackai/dstack โญ 1,063
    dstack is an open-source orchestration engine for cost-effectively running AI workloads in the cloud as well as on-premises. Discord: https://discord.gg/u8SmfwPpMd
    ๐Ÿ”— dstack.ai

  39. nccr-itmo/FEDOT โญ 602
    Automated modeling and machine learning framework FEDOT
    ๐Ÿ”— fedot.readthedocs.io

  40. dagworks-inc/burr โญ 371
    Build applications that make decisions (chatbots, agents, simulations, etc...). Monitor, persist, and execute on your own infrastructure.
    ๐Ÿ”— burr.dagworks.io

Machine Learning - Reinforcement

Machine learning libraries and toolkits that cross over with reinforcement learning in some way: agent reinforcement learning, agent environemnts, RLHF

  1. openai/gym โญ 33,819
    A toolkit for developing and comparing reinforcement learning algorithms.
    ๐Ÿ”— www.gymlibrary.dev

  2. unity-technologies/ml-agents โญ 16,279
    The Unity Machine Learning Agents Toolkit (ML-Agents) is an open-source project that enables games and simulations to serve as environments for training intelligent agents using deep reinforcement learning and imitation learning.
    ๐Ÿ”— unity.com/products/machine-learning-agents

  3. openai/baselines โญ 15,309
    OpenAI Baselines: high-quality implementations of reinforcement learning algorithms

  4. google/dopamine โญ 10,365
    Dopamine is a research framework for fast prototyping of reinforcement learning algorithms.
    ๐Ÿ”— github.com/google/dopamine

  5. deepmind/pysc2 โญ 7,904
    StarCraft II Learning Environment

  6. lucidrains/PaLM-rlhf-pytorch โญ 7,584
    Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

  7. thu-ml/tianshou โญ 7,353
    An elegant PyTorch deep reinforcement learning library.
    ๐Ÿ”— tianshou.org

  8. tensorlayer/TensorLayer โญ 7,289
    Deep Learning and Reinforcement Learning Library for Scientists and Engineers
    ๐Ÿ”— tensorlayerx.com

  9. farama-foundation/Gymnasium โญ 5,619
    An API standard for single-agent reinforcement learning environments, with popular reference environments and related utilities (formerly Gym)
    ๐Ÿ”— gymnasium.farama.org

  10. keras-rl/keras-rl โญ 5,487
    Deep Reinforcement Learning for Keras.
    ๐Ÿ”— keras-rl.readthedocs.io

  11. deepmind/dm_control โญ 3,525
    Google DeepMind's software stack for physics-based simulation and Reinforcement Learning environments, using MuJoCo.

  12. facebookresearch/ReAgent โญ 3,524
    A platform for Reasoning systems (Reinforcement Learning, Contextual Bandits, etc.)
    ๐Ÿ”— reagent.ai

  13. ai4finance-foundation/ElegantRL โญ 3,415
    Massively Parallel Deep Reinforcement Learning. ๐Ÿ”ฅ
    ๐Ÿ”— ai4finance.org

  14. deepmind/acme โญ 3,366
    A library of reinforcement learning components and agents

  15. eureka-research/Eureka โญ 2,578
    Official Repository for "Eureka: Human-Level Reward Design via Coding Large Language Models"
    ๐Ÿ”— eureka-research.github.io

  16. pettingzoo-team/PettingZoo โญ 2,351
    An API standard for multi-agent reinforcement learning environments, with popular reference environments and related utilities
    ๐Ÿ”— pettingzoo.farama.org

  17. kzl/decision-transformer โญ 2,133
    Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.

  18. pytorch/rl โญ 1,826
    A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
    ๐Ÿ”— pytorch.org/rl

  19. anthropics/hh-rlhf โญ 1,427
    Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"
    ๐Ÿ”— arxiv.org/abs/2204.05862

  20. humancompatibleai/imitation โญ 1,126
    Clean PyTorch implementations of imitation and reward learning algorithms
    ๐Ÿ”— imitation.readthedocs.io

  21. arise-initiative/robosuite โญ 1,068
    robosuite: A Modular Simulation Framework and Benchmark for Robot Learning
    ๐Ÿ”— robosuite.ai

  22. denys88/rl_games โญ 703
    RL Games: High performance RL library

Natural Language Processing

Natural language processing libraries and toolkits: text processing, topic modelling, tokenisers, chatbots. Also see the LLMs and ChatGPT category for crossover.

  1. huggingface/transformers โญ 124,169
    ๐Ÿค— Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.
    ๐Ÿ”— huggingface.co/transformers

  2. pytorch/fairseq โญ 29,156
    Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

  3. explosion/spaCy โญ 28,658
    ๐Ÿ’ซ Industrial-strength Natural Language Processing (NLP) in Python
    ๐Ÿ”— spacy.io

  4. huggingface/datasets โญ 18,345
    ๐Ÿค— The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
    ๐Ÿ”— huggingface.co/docs/datasets

  5. microsoft/unilm โญ 18,231
    Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
    ๐Ÿ”— aka.ms/generalai

  6. myshell-ai/OpenVoice โญ 16,993
    Instant voice cloning by MyShell.
    ๐Ÿ”— research.myshell.ai/open-voice

  7. rare-technologies/gensim โญ 15,196
    Topic Modelling for Humans
    ๐Ÿ”— radimrehurek.com/gensim

  8. gunthercox/ChatterBot โญ 13,883
    ChatterBot is a machine learning, conversational dialog engine for creating chat bots
    ๐Ÿ”— chatterbot.readthedocs.io

  9. ukplab/sentence-transformers โญ 13,657
    Multilingual Sentence & Image Embeddings with BERT
    ๐Ÿ”— www.sbert.net

  10. flairnlp/flair โญ 13,539
    A very simple framework for state-of-the-art Natural Language Processing (NLP)
    ๐Ÿ”— flairnlp.github.io/flair

  11. nltk/nltk โญ 12,991
    NLTK Source
    ๐Ÿ”— www.nltk.org

  12. jina-ai/clip-as-service โญ 12,167
    ๐Ÿ„ Scalable embedding, reasoning, ranking for images and sentences with CLIP
    ๐Ÿ”— clip-as-service.jina.ai

  13. allenai/allennlp โญ 11,686
    An open-source NLP research library, built on PyTorch.
    ๐Ÿ”— www.allennlp.org

  14. facebookresearch/ParlAI โญ 10,421
    A framework for training and evaluating AI models on a variety of openly available dialogue datasets.
    ๐Ÿ”— parl.ai

  15. facebookresearch/seamless_communication โญ 10,096
    Foundational Models for State-of-the-Art Speech and Text Translation

  16. nvidia/NeMo โญ 9,948
    NeMo: a framework for generative AI
    ๐Ÿ”— nvidia.github.io/nemo

  17. openai/tiktoken โญ 9,581
    tiktoken is a fast BPE tokeniser for use with OpenAI's models.

  18. google/sentencepiece โญ 9,397
    Unsupervised text tokenizer for Neural Network-based text generation.

  19. togethercomputer/OpenChatKit โญ 8,993
    OpenChatKit provides a powerful, open-source base to create both specialized and general purpose chatbots

  20. doccano/doccano โญ 8,936
    Open source annotation tool for machine learning practitioners.

  21. sloria/TextBlob โญ 8,913
    Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.
    ๐Ÿ”— textblob.readthedocs.io

  22. m-bain/whisperX โญ 8,792
    WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

  23. clips/pattern โญ 8,654
    Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
    ๐Ÿ”— github.com/clips/pattern/wiki

  24. facebookresearch/nougat โญ 7,937
    Implementation of Nougat Neural Optical Understanding for Academic Documents
    ๐Ÿ”— facebookresearch.github.io/nougat

  25. espnet/espnet โญ 7,827
    End-to-End Speech Processing Toolkit
    ๐Ÿ”— espnet.github.io/espnet

  26. speechbrain/speechbrain โญ 7,819
    A PyTorch-based Speech Toolkit
    ๐Ÿ”— speechbrain.github.io

  27. vikparuchuri/marker โญ 7,767
    Marker converts PDF, EPUB, and MOBI to markdown. It's 10x faster than nougat, more accurate on most documents, and has low hallucination risk.

  28. neuml/txtai โญ 6,903
    ๐Ÿ’ก All-in-one open-source embeddings database for semantic search, LLM orchestration and language model workflows
    ๐Ÿ”— neuml.github.io/txtai

  29. deeppavlov/DeepPavlov โญ 6,531
    An open source library for deep learning end-to-end dialog systems and chatbots.
    ๐Ÿ”— deeppavlov.ai

  30. facebookresearch/metaseq โญ 6,371
    A codebase for working with Open Pre-trained Transformers, originally forked from fairseq.

  31. kingoflolz/mesh-transformer-jax โญ 6,213
    Model parallel transformers in JAX and Haiku

  32. maartengr/BERTopic โญ 5,503
    Leveraging BERT and c-TF-IDF to create easily interpretable topics.
    ๐Ÿ”— maartengr.github.io/bertopic

  33. vikparuchuri/surya โญ 5,500
    OCR, layout analysis, and line detection in 90+ languages

  34. minimaxir/textgenrnn โญ 4,941
    Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.

  35. salesforce/CodeGen โญ 4,750
    CodeGen is a family of open-source model for program synthesis. Trained on TPU-v4. Competitive with OpenAI Codex.

  36. prefecthq/marvin โญ 4,669
    โœจ Build AI interfaces that spark joy
    ๐Ÿ”— askmarvin.ai

  37. aiwaves-cn/agents โญ 4,480
    An Open-source Framework for Autonomous Language Agents
    ๐Ÿ”— www.aiwaves-agents.com

  38. facebookresearch/DrQA โญ 4,456
    Reading Wikipedia to Answer Open-Domain Questions

  39. layout-parser/layout-parser โญ 4,426
    A Unified Toolkit for Deep Learning Based Document Image Analysis
    ๐Ÿ”— layout-parser.github.io

  40. makcedward/nlpaug โญ 4,288
    Data augmentation for NLP
    ๐Ÿ”— makcedward.github.io

  41. thilinarajapakse/simpletransformers โญ 3,971
    Transformers for Information Retrieval, Text Classification, NER, QA, Language Modelling, Language Generation, T5, Multi-Modal, and Conversational AI
    ๐Ÿ”— simpletransformers.ai

  42. life4/textdistance โญ 3,295
    ๐Ÿ“ Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

  43. jsvine/markovify โญ 3,266
    A simple, extensible Markov chain generator.

  44. maartengr/KeyBERT โญ 3,199
    Minimal keyword extraction with BERT
    ๐Ÿ”— maartengr.github.io/keybert

  45. argilla-io/argilla โญ 3,082
    Argilla is a collaboration platform for AI engineers and domain experts that require high-quality outputs, full data ownership, and overall efficiency.
    ๐Ÿ”— docs.argilla.io

  46. bytedance/lightseq โญ 3,077
    LightSeq: A High Performance Library for Sequence Processing and Generation

  47. errbotio/errbot โญ 3,058
    Errbot is a chatbot, a daemon that connects to your favorite chat service and bring your tools and some fun into the conversation.
    ๐Ÿ”— errbot.io

  48. promptslab/Promptify โญ 3,007
    Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engineering, LLMs and other latest research
    ๐Ÿ”— discord.gg/m88xfymbk6

  49. huawei-noah/Pretrained-Language-Model โญ 2,952
    Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

  50. jbesomi/texthero โญ 2,862
    Text preprocessing, representation and visualization from zero to hero.
    ๐Ÿ”— texthero.org

  51. neuralmagic/deepsparse โญ 2,858
    Sparsity-aware deep learning inference runtime for CPUs
    ๐Ÿ”— neuralmagic.com/deepsparse

  52. ddangelov/Top2Vec โญ 2,834
    Top2Vec learns jointly embedded topic, document and word vectors.

  53. huggingface/neuralcoref โญ 2,801
    โœจFast Coreference Resolution in spaCy with Neural Networks
    ๐Ÿ”— huggingface.co/coref

  54. salesforce/CodeT5 โญ 2,575
    Home of CodeT5: Open Code LLMs for Code Understanding and Generation
    ๐Ÿ”— arxiv.org/abs/2305.07922

  55. bigscience-workshop/promptsource โญ 2,476
    Toolkit for creating, sharing and using natural language prompts.

  56. jamesturk/jellyfish โญ 1,987
    ๐Ÿชผ a python library for doing approximate and phonetic matching of strings.
    ๐Ÿ”— jamesturk.github.io/jellyfish

  57. huggingface/setfit โญ 1,939
    Efficient few-shot learning with Sentence Transformers
    ๐Ÿ”— hf.co/docs/setfit

  58. alibaba/EasyNLP โญ 1,934
    EasyNLP: A Comprehensive and Easy-to-use NLP Toolkit

  59. thudm/P-tuning-v2 โญ 1,875
    An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

  60. deepset-ai/FARM โญ 1,722
    ๐Ÿก Fast & easy transfer learning for NLP. Harvesting language models for the industry. Focus on Question Answering.
    ๐Ÿ”— farm.deepset.ai

  61. marella/ctransformers โญ 1,677
    Python bindings for the Transformer models implemented in C/C++ using GGML library.

  62. franck-dernoncourt/NeuroNER โญ 1,675
    Named-entity recognition using neural networks. Easy-to-use and state-of-the-art results.
    ๐Ÿ”— neuroner.com

  63. featureform/featureform โญ 1,672
    The Virtual Feature Store. Turn your existing data infrastructure into a feature store.
    ๐Ÿ”— www.featureform.com

  64. plasticityai/magnitude โญ 1,610
    A fast, efficient universal vector embedding utility package.

  65. arxiv-vanity/arxiv-vanity โญ 1,594
    Renders papers from arXiv as responsive web pages so you don't have to squint at a PDF.
    ๐Ÿ”— www.arxiv-vanity.com

  66. google-research/language โญ 1,555
    Shared repository for open-sourced projects from the Google AI Language team.
    ๐Ÿ”— ai.google/research/teams/language

  67. explosion/spacy-models โญ 1,501
    ๐Ÿ’ซ Models for the spaCy Natural Language Processing (NLP) library
    ๐Ÿ”— spacy.io

  68. chrismattmann/tika-python โญ 1,405
    Tika-Python is a Python binding to the Apache Tikaโ„ข REST services allowing Tika to be called natively in the Python community.

  69. dmmiller612/bert-extractive-summarizer โญ 1,344
    Easy to use extractive text summarization with BERT

  70. gunthercox/chatterbot-corpus โญ 1,339
    A multilingual dialog corpus
    ๐Ÿ”— chatterbot-corpus.readthedocs.io

  71. jonasgeiping/cramming โญ 1,228
    Cramming the training of a (BERT-type) language model into limited compute.

  72. abertsch72/unlimiformer โญ 1,030
    Public repo for the NeurIPS 2023 paper "Unlimiformer: Long-Range Transformers with Unlimited Length Input"

  73. nomic-ai/nomic โญ 984
    Interact, analyze and structure massive text, image, embedding, audio and video datasets
    ๐Ÿ”— atlas.nomic.ai

  74. norskregnesentral/skweak โญ 909
    skweak: A software toolkit for weak supervision applied to NLP tasks

  75. intellabs/fastRAG โญ 873
    Efficient Retrieval Augmentation and Generation Framework

  76. pemistahl/lingua-py โญ 864
    The most accurate natural language detection library for Python, suitable for short text and mixed-language text

  77. openai/grade-school-math โญ 861
    GSM8K, a dataset of 8.5K high quality linguistically diverse grade school math word problems

  78. explosion/spacy-streamlit โญ 761
    ๐Ÿ‘‘ spaCy building blocks and visualizers for Streamlit apps
    ๐Ÿ”— share.streamlit.io/ines/spacy-streamlit-demo/master/app.py

  79. paddlepaddle/RocketQA โญ 739
    ๐Ÿš€ RocketQA, dense retrieval for information retrieval and question answering, including both Chinese and English state-of-the-art models.

  80. explosion/spacy-stanza โญ 710
    ๐Ÿ’ฅ Use the latest Stanza (StanfordNLP) research models directly in spaCy

  81. keras-team/keras-nlp โญ 689
    Modular Natural Language Processing workflows with Keras

  82. urchade/GLiNER โญ 543
    Generalist and Lightweight Model for Named Entity Recognition (Extract any entity types from texts) @ NAACL 24
    ๐Ÿ”— arxiv.org/abs/2311.08526

Packaging

Python packaging, dependency management and bundling.

  1. pyenv/pyenv โญ 36,490
    pyenv lets you easily switch between multiple versions of Python.

  2. python-poetry/poetry โญ 29,335
    Python packaging and dependency management made easy
    ๐Ÿ”— python-poetry.org

  3. pypa/pipenv โญ 24,569
    A virtualenv management tool that supports a multitude of systems and nicely bridges the gaps between pip, python and virtualenv.
    ๐Ÿ”— pipenv.pypa.io

  4. pyinstaller/pyinstaller โญ 11,246
    Freeze (package) Python programs into stand-alone executables
    ๐Ÿ”— www.pyinstaller.org

  5. mitsuhiko/rye โญ 11,159
    a Hassle-Free Python Experience
    ๐Ÿ”— rye-up.com

  6. astral-sh/uv โญ 10,810
    An extremely fast Python package installer and resolver, written in Rust. Designed as a drop-in replacement for pip and pip-compile.
    ๐Ÿ”— astral.sh

  7. pypa/pipx โญ 8,717
    Install and Run Python Applications in Isolated Environments
    ๐Ÿ”— pipx.pypa.io

  8. jazzband/pip-tools โญ 7,453
    A set of tools to keep your pinned Python dependencies fresh (pip-compile + pip-sync)
    ๐Ÿ”— pip-tools.rtfd.io

  9. pdm-project/pdm โญ 6,438
    A modern Python package and dependency manager supporting the latest PEP standards
    ๐Ÿ”— pdm-project.org

  10. mamba-org/mamba โญ 6,198
    The Fast Cross-Platform Package Manager: mamba is a reimplementation of the conda package manager in C++
    ๐Ÿ”— mamba.readthedocs.io

  11. conda/conda โญ 6,069
    A system-level, binary package and environment manager running on all major operating systems and platforms.
    ๐Ÿ”— docs.conda.io/projects/conda

  12. pypa/hatch โญ 5,267
    Modern, extensible Python project management
    ๐Ÿ”— hatch.pypa.io/latest

  13. conda-forge/miniforge โญ 5,204
    A conda-forge distribution.
    ๐Ÿ”— conda-forge.org/miniforge

  14. indygreg/PyOxidizer โญ 5,174
    A modern Python application packaging and distribution tool

  15. pypa/virtualenv โญ 4,708
    A tool to create isolated Python environments. Since Python 3.3, a subset of it has been integrated into the standard lib venv module.
    ๐Ÿ”— virtualenv.pypa.io

  16. spack/spack โญ 3,933
    A flexible package manager that supports multiple versions, configurations, platforms, and compilers.
    ๐Ÿ”— spack.io

  17. pantsbuild/pex โญ 2,448
    A tool for generating .pex (Python EXecutable) files, lock files and venvs.
    ๐Ÿ”— docs.pex-tool.org

  18. beeware/briefcase โญ 2,312
    Tools to support converting a Python project into a standalone native application.
    ๐Ÿ”— briefcase.readthedocs.io

  19. pypa/flit โญ 2,090
    Simplified packaging of Python modules
    ๐Ÿ”— flit.pypa.io

  20. prefix-dev/pixi โญ 1,833
    pixi is a cross-platform, multi-language package manager and workflow tool built on the foundation of the conda ecosystem.
    ๐Ÿ”— pixi.sh

  21. linkedin/shiv โญ 1,687
    shiv is a command line utility for building fully self contained Python zipapps as outlined in PEP 441, but with all their dependencies included.

  22. marcelotduarte/cx_Freeze โญ 1,239
    Create standalone executables from Python scripts, with the same performance and is cross-platform.
    ๐Ÿ”— marcelotduarte.github.io/cx_freeze

  23. ofek/pyapp โญ 1,039
    Runtime installer for Python applications
    ๐Ÿ”— ofek.dev/pyapp

  24. pypa/gh-action-pypi-publish โญ 832
    The blessed :octocat: GitHub Action, for publishing your ๐Ÿ“ฆ distribution files to PyPI: https://github.com/marketplace/actions/pypi-publish
    ๐Ÿ”— packaging.python.org/guides/publishing-package-distribution-releases-using-github-actions-ci-cd-workflows

  25. py2exe/py2exe โญ 742
    Create standalone Windows programs from Python code
    ๐Ÿ”— www.py2exe.org

  26. prefix-dev/rip โญ 614
    RIP is a library that allows the resolving and installing of Python PyPI packages from Rust into a virtual environment. It's based on our experience with building Rattler and aims to provide the same experience but for PyPI instead of Conda.
    ๐Ÿ”— prefix.dev

  27. snok/install-poetry โญ 526
    Github action for installing and configuring Poetry

  28. python-poetry/install.python-poetry.org โญ 172
    The official Poetry installation script
    ๐Ÿ”— install.python-poetry.org

Pandas

Pandas and dataframe libraries: data analysis, statistical reporting, pandas GUIs, pandas performance optimisations.

  1. pandas-dev/pandas โญ 41,850
    Flexible and powerful data analysis / manipulation library for Python, providing labeled data structures similar to R data.frame objects, statistical functions, and much more
    ๐Ÿ”— pandas.pydata.org

  2. pola-rs/polars โญ 25,822
    Dataframes powered by a multithreaded, vectorized query engine, written in Rust
    ๐Ÿ”— docs.pola.rs

  3. duckdb/duckdb โญ 16,322
    DuckDB is an in-process SQL OLAP Database Management System
    ๐Ÿ”— www.duckdb.org

  4. ydataai/ydata-profiling โญ 11,992
    1 Line of code data quality profiling & exploratory data analysis for Pandas and Spark DataFrames.
    ๐Ÿ”— docs.profiling.ydata.ai

  5. gventuri/pandas-ai โญ 10,803
    Chat with your database (SQL, CSV, pandas, polars, mongodb, noSQL, etc). PandasAI makes data analysis conversational using LLMs (GPT 3.5 / 4, Anthropic, VertexAI) and RAG.
    ๐Ÿ”— pandas-ai.com

  6. kanaries/pygwalker โญ 9,625
    PyGWalker: Turn your pandas dataframe into an interactive UI for visual analysis
    ๐Ÿ”— kanaries.net/pygwalker

  7. rapidsai/cudf โญ 7,235
    cuDF is a GPU DataFrame library for loading joining, aggregating, filtering, and otherwise manipulating data
    ๐Ÿ”— docs.rapids.ai/api/cudf/stable

  8. aws/aws-sdk-pandas โญ 3,791
    pandas on AWS - Easy integration with Athena, Glue, Redshift, Timestream, Neptune, OpenSearch, QuickSight, Chime, CloudWatchLogs, DynamoDB, EMR, SecretManager, PostgreSQL, MySQL, SQLServer and S3 (Parquet, CSV, JSON and EXCEL).
    ๐Ÿ”— aws-sdk-pandas.readthedocs.io

  9. nalepae/pandarallel โญ 3,482
    A simple and efficient tool to parallelize Pandas operations on all availableย CPUs
    ๐Ÿ”— nalepae.github.io/pandarallel

  10. blaze/blaze โญ 3,182
    NumPy and Pandas interface to Big Data
    ๐Ÿ”— blaze.pydata.org

  11. adamerose/PandasGUI โญ 3,123
    A GUI for Pandas DataFrames

  12. unionai-oss/pandera โญ 2,976
    A light-weight, flexible, and expressive statistical data testing library
    ๐Ÿ”— www.union.ai/pandera

  13. pydata/pandas-datareader โญ 2,811
    Extract data from a wide range of Internet sources into a pandas DataFrame.
    ๐Ÿ”— pydata.github.io/pandas-datareader/stable/index.html

  14. scikit-learn-contrib/sklearn-pandas โญ 2,785
    Pandas integration with sklearn

  15. jmcarpenter2/swifter โญ 2,457
    A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner

  16. fugue-project/fugue โญ 1,866
    A unified interface for distributed computing. Fugue executes SQL, Python, Pandas, and Polars code on Spark, Dask and Ray without any rewrites.
    ๐Ÿ”— fugue-tutorials.readthedocs.io

  17. delta-io/delta-rs โญ 1,797
    A native Rust library for Delta Lake, with bindings into Python
    ๐Ÿ”— delta-io.github.io/delta-rs

  18. eventual-inc/Daft โญ 1,653
    Distributed DataFrame for Python designed for the cloud, powered by Rust
    ๐Ÿ”— getdaft.io

  19. pyjanitor-devs/pyjanitor โญ 1,276
    Clean APIs for data cleaning. Python implementation of R package Janitor
    ๐Ÿ”— pyjanitor-devs.github.io/pyjanitor

  20. machow/siuba โญ 1,099
    Python library for using dplyr like syntax with pandas and SQL
    ๐Ÿ”— siuba.org

  21. renumics/spotlight โญ 1,001
    Interactively explore unstructured datasets from your dataframe.
    ๐Ÿ”— renumics.com

  22. holoviz/hvplot โญ 933
    A high-level plotting API for pandas, dask, xarray, and networkx built on HoloViews
    ๐Ÿ”— hvplot.holoviz.org

  23. tkrabel/bamboolib โญ 933
    bamboolib - a GUI for pandas DataFrames
    ๐Ÿ”— bamboolib.com

Performance

Performance, parallelisation and low level libraries.

  1. celery/celery โญ 23,395
    Distributed Task Queue (development branch)
    ๐Ÿ”— docs.celeryq.dev

  2. google/flatbuffers โญ 21,988
    FlatBuffers: Memory Efficient Serialization Library
    ๐Ÿ”— flatbuffers.dev

  3. pybind/pybind11 โญ 14,701
    Seamless operability between C++11 and Python
    ๐Ÿ”— pybind11.readthedocs.io

  4. exaloop/codon โญ 13,810
    A high-performance, zero-overhead, extensible Python compiler using LLVM
    ๐Ÿ”— docs.exaloop.io/codon

  5. dask/dask โญ 11,965
    Parallel computing with task scheduling
    ๐Ÿ”— dask.org

  6. modin-project/modin โญ 9,454
    Modin: Scale your Pandas workflows by changing a single line of code
    ๐Ÿ”— modin.readthedocs.io

  7. numba/numba โญ 9,401
    NumPy aware dynamic Python compiler using LLVM
    ๐Ÿ”— numba.pydata.org

  8. nebuly-ai/nebuly โญ 8,371
    The user analytics platform for LLMs
    ๐Ÿ”— www.nebuly.com

  9. vaexio/vaex โญ 8,170
    Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second ๐Ÿš€
    ๐Ÿ”— vaex.io

  10. mher/flower โญ 6,151
    Real-time monitor and web admin for Celery distributed task queue
    ๐Ÿ”— flower.readthedocs.io

  11. python-trio/trio โญ 5,857
    Trio โ€“ a friendly Python library for async concurrency and I/O
    ๐Ÿ”— trio.readthedocs.io

  12. ultrajson/ultrajson โญ 4,242
    Ultra fast JSON decoder and encoder written in C with Python bindings
    ๐Ÿ”— pypi.org/project/ujson

  13. facebookincubator/cinder โญ 3,365
    Cinder is Meta's internal performance-oriented production version of CPython.
    ๐Ÿ”— trycinder.com

  14. tlkh/asitop โญ 2,798
    Perf monitoring CLI tool for Apple Silicon
    ๐Ÿ”— tlkh.github.io/asitop

  15. ipython/ipyparallel โญ 2,547
    IPython Parallel: Interactive Parallel Computing in Python
    ๐Ÿ”— ipyparallel.readthedocs.io

  16. h5py/h5py โญ 1,991
    HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5 binary data format.
    ๐Ÿ”— www.h5py.org

  17. intel/intel-extension-for-transformers โญ 1,909
    โšก Build your chatbot within minutes on your favorite device; offer SOTA compression techniques for LLMs; run LLMs efficiently on Intel Platformsโšก

  18. airtai/faststream โญ 1,742
    FastStream is a powerful and easy-to-use Python framework for building asynchronous services interacting with event streams such as Apache Kafka, RabbitMQ, NATS and Redis.
    ๐Ÿ”— faststream.airt.ai/latest

  19. faster-cpython/ideas โญ 1,646
    Discussion and work tracker for Faster CPython project.

  20. agronholm/anyio โญ 1,597
    High level asynchronous concurrency and networking framework that works on top of either trio or asyncio

  21. dask/distributed โญ 1,539
    A distributed task scheduler for Dask
    ๐Ÿ”— distributed.dask.org

  22. tiangolo/asyncer โญ 1,424
    Asyncer, async and await, focused on developer experience.
    ๐Ÿ”— asyncer.tiangolo.com

  23. intel/intel-extension-for-pytorch โญ 1,317
    A Python package for extending the official PyTorch that can easily obtain performance on Intel platform

  24. nschloe/perfplot โญ 1,292
    ๐Ÿ“ˆ Performance analysis for Python snippets

  25. intel/scikit-learn-intelex โญ 1,152
    Intel(R) Extension for Scikit-learn is a seamless way to speed up your Scikit-learn application
    ๐Ÿ”— intel.github.io/scikit-learn-intelex

  26. markshannon/faster-cpython โญ 937
    How to make CPython faster.

  27. zerointensity/pointers.py โญ 883
    Bringing the hell of pointers to Python.
    ๐Ÿ”— pointers.zintensity.dev

  28. brandtbucher/specialist โญ 609
    Visualize CPython's specializing, adaptive interpreter. ๐Ÿ”ฅ

Profiling

Memory and CPU/GPU profiling tools and libraries.

  1. bloomberg/memray โญ 12,482
    Memray is a memory profiler for Python
    ๐Ÿ”— bloomberg.github.io/memray

  2. benfred/py-spy โญ 11,797
    Sampling profiler for Python programs

  3. plasma-umass/scalene โญ 11,074
    Scalene: a high-performance, high-precision CPU, GPU, and memory profiler for Python with AI-powered optimization proposals

  4. joerick/pyinstrument โญ 6,090
    ๐Ÿšดย Call stack profiler for Python. Shows you why your code is slow!
    ๐Ÿ”— pyinstrument.readthedocs.io

  5. gaogaotiantian/viztracer โญ 4,304
    VizTracer is a low-overhead logging/debugging/profiling tool that can trace and visualize your python code execution.
    ๐Ÿ”— viztracer.readthedocs.io

  6. pythonprofilers/memory_profiler โญ 4,201
    Monitor Memory usage of Python code
    ๐Ÿ”— pypi.python.org/pypi/memory_profiler

  7. reloadware/reloadium โญ 2,686
    Hot Reloading and Profiling for Python
    ๐Ÿ”— reloadium.io

  8. pyutils/line_profiler โญ 2,461
    Line-by-line profiling for Python

  9. jiffyclub/snakeviz โญ 2,222
    An in-browser Python profile viewer
    ๐Ÿ”— jiffyclub.github.io/snakeviz

  10. p403n1x87/austin โญ 1,350
    Python frame stack sampler for CPython
    ๐Ÿ”— pypi.org/project/austin-dist

  11. pythonspeed/filprofiler โญ 810
    A Python memory profiler for data processing and scientific computing applications
    ๐Ÿ”— pythonspeed.com/products/filmemoryprofiler

Security

Security related libraries: vulnerability discovery, SQL injection, environment auditing.

  1. swisskyrepo/PayloadsAllTheThings โญ 56,521
    A list of useful payloads and bypass for Web Application Security and Pentest/CTF
    ๐Ÿ”— swisskyrepo.github.io/payloadsallthethings

  2. certbot/certbot โญ 30,779
    Certbot is EFF's tool to obtain certs from Let's Encrypt and (optionally) auto-enable HTTPS on your server. It can also act as a client for any other CA that uses the ACME protocol.

  3. sqlmapproject/sqlmap โญ 30,444
    Automatic SQL injection and database takeover tool
    ๐Ÿ”— sqlmap.org

  4. aquasecurity/trivy โญ 21,213
    Find vulnerabilities, misconfigurations, secrets, SBOM in containers, Kubernetes, code repositories, clouds and more
    ๐Ÿ”— aquasecurity.github.io/trivy

  5. bridgecrewio/checkov โญ 6,490
    Checkov is a static code analysis tool for infrastructure as code (IaC) and also a software composition analysis (SCA) tool for images and open source packages.
    ๐Ÿ”— www.checkov.io

  6. nccgroup/ScoutSuite โญ 6,131
    Multi-Cloud Security Auditing Tool

  7. pycqa/bandit โญ 5,944
    Bandit is a tool designed to find common security issues in Python code.
    ๐Ÿ”— bandit.readthedocs.io

  8. stamparm/maltrail โญ 5,724
    Malicious traffic detection system

  9. rhinosecuritylabs/pacu โญ 4,003
    The AWS exploitation framework, designed for testing the security of Amazon Web Services environments.
    ๐Ÿ”— rhinosecuritylabs.com/aws/pacu-open-source-aws-exploitation-framework

  10. dashingsoft/pyarmor โญ 2,868
    A tool used to obfuscate python scripts, bind obfuscated scripts to fixed machine or expire obfuscated scripts.
    ๐Ÿ”— pyarmor.dashingsoft.com

  11. pyupio/safety โญ 1,623
    Safety checks Python dependencies for known security vulnerabilities and suggests the proper remediations for vulnerabilities detected.
    ๐Ÿ”— safetycli.com/product/safety-cli

  12. luijait/DarkGPT โญ 1,336
    DarkGPT is an OSINT assistant based on GPT-4-200K (recommended use) designed to perform queries on leaked databases, thus providing an artificial intelligence assistant that can be useful in your traditional OSINT processes.

  13. trailofbits/pip-audit โญ 912
    Audits Python environments, requirements files and dependency trees for known security vulnerabilities, and can automatically fix them
    ๐Ÿ”— pypi.org/project/pip-audit

  14. fadi002/de4py โญ 749
    toolkit for python reverse engineering
    ๐Ÿ”— de4py.000.pe

Simulation

Simulation libraries: robotics, economic, agent-based, traffic, physics, astronomy, chemistry, quantum simulation. Also see the Maths and Science category for crossover.

  1. atsushisakai/PythonRobotics โญ 21,568
    Python sample codes for robotics algorithms.
    ๐Ÿ”— atsushisakai.github.io/pythonrobotics

  2. bulletphysics/bullet3 โญ 11,861
    Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
    ๐Ÿ”— bulletphysics.org

  3. isl-org/Open3D โญ 10,415
    Open3D: A Modern Library for 3D Data Processing
    ๐Ÿ”— www.open3d.org

  4. qiskit/qiskit โญ 4,560
    Qiskit is an open-source SDK for working with quantum computers at the level of extended quantum circuits, operators, and primitives.
    ๐Ÿ”— www.ibm.com/quantum/qiskit

  5. astropy/astropy โญ 4,200
    Astronomy and astrophysics core library
    ๐Ÿ”— www.astropy.org

  6. quantumlib/Cirq โญ 4,114
    A python framework for creating, editing, and invoking Noisy Intermediate Scale Quantum (NISQ) circuits.

  7. openai/mujoco-py โญ 2,734
    MuJoCo is a physics engine for detailed, efficient rigid body simulations with contacts. mujoco-py allows using MuJoCo from Python 3.

  8. rdkit/rdkit โญ 2,399
    The official sources for the RDKit library

  9. taichi-dev/difftaichi โญ 2,384
    10 differentiable physical simulators built with Taichi differentiable programming (DiffTaichi, ICLR 2020)

  10. projectmesa/mesa โญ 2,204
    Mesa is an open-source Python library for agent-based modeling, ideal for simulating complex systems and exploring emergent behaviors.
    ๐Ÿ”— mesa.readthedocs.io

  11. google/brax โญ 2,047
    Massively parallel rigidbody physics simulation on accelerator hardware.

  12. quantecon/QuantEcon.py โญ 1,853
    A community based Python library for quantitative economics
    ๐Ÿ”— quantecon.org/quantecon-py

  13. microsoft/PromptCraft-Robotics โญ 1,693
    Community for applying LLMs to robotics and a robot simulator with ChatGPT integration
    ๐Ÿ”— aka.ms/chatgpt-robotics

  14. facebookresearch/habitat-lab โญ 1,692
    A modular high-level library to train embodied AI agents across a variety of tasks and environments.
    ๐Ÿ”— aihabitat.org

  15. nvidia/warp โญ 1,662
    A Python framework for high performance GPU simulation and graphics
    ๐Ÿ”— nvidia.github.io/warp

  16. nvidia-omniverse/IsaacGymEnvs โญ 1,584
    Isaac Gym Reinforcement Learning Environments

  17. deepmodeling/deepmd-kit โญ 1,352
    A deep learning package for many-body potential energy representation and molecular dynamics
    ๐Ÿ”— docs.deepmodeling.com/projects/deepmd

  18. sail-sg/envpool โญ 1,009
    C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
    ๐Ÿ”— envpool.readthedocs.io

  19. a-r-j/graphein โญ 971
    Protein Graph Library
    ๐Ÿ”— graphein.ai

  20. hardmaru/estool โญ 919
    Evolution Strategies Tool

  21. viblo/pymunk โญ 874
    Pymunk is a easy-to-use pythonic 2d physics library that can be used whenever you need 2d rigid body physics from Python
    ๐Ÿ”— www.pymunk.org

  22. facebookresearch/fairo โญ 829
    A modular embodied agent architecture and platform for building embodied agents

  23. bowang-lab/scGPT โญ 813
    scGPT: Towards Building a Foundation Model for Single-Cell Multi-omics Using Generative AI
    ๐Ÿ”— scgpt.readthedocs.io/en/latest

  24. google-deepmind/materials_discovery โญ 778
    Graph Networks for Materials Science (GNoME) is a project centered around scaling machine learning methods to tackle materials science.

  25. google/evojax โญ 776
    EvoJAX is a scalable, general purpose, hardware-accelerated neuroevolution toolkit built on the JAX library

  26. nvidia-omniverse/orbit โญ 719
    Unified framework for robot learning built on NVIDIA Isaac Sim
    ๐Ÿ”— isaac-orbit.github.io/orbit

  27. nvidia-omniverse/OmniIsaacGymEnvs โญ 658
    Reinforcement Learning Environments for Omniverse Isaac Gym

Study

Miscellaneous study resources: algorithms, general resources, system design, code repos for textbooks, best practices, tutorials.

  1. thealgorithms/Python โญ 177,754
    All Algorithms implemented in Python
    ๐Ÿ”— the-algorithms.com

  2. jakevdp/PythonDataScienceHandbook โญ 41,373
    Python Data Science Handbook: full text in Jupyter Notebooks
    ๐Ÿ”— jakevdp.github.io/pythondatasciencehandbook

  3. microsoft/generative-ai-for-beginners โญ 41,252
    18 Lessons, Get Started Building with Generative AI ๐Ÿ”— https://microsoft.github.io/generative-ai-for-beginners/
    ๐Ÿ”— microsoft.github.io/generative-ai-for-beginners

  4. mlabonne/llm-course โญ 28,052
    Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
    ๐Ÿ”— mlabonne.github.io/blog

  5. realpython/python-guide โญ 27,577
    Python best practices guidebook, written for humans.
    ๐Ÿ”— docs.python-guide.org

  6. christoschristofidis/awesome-deep-learning โญ 22,741
    A curated list of awesome Deep Learning tutorials, projects and communities.

  7. d2l-ai/d2l-en โญ 21,555
    Interactive deep learning book with multi-framework code, math, and discussions. Adopted at 500 universities from 70 countries including Stanford, MIT, Harvard, and Cambridge.
    ๐Ÿ”— d2l.ai

  8. wesm/pydata-book โญ 21,255
    Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media

  9. microsoft/recommenders โญ 17,892
    Best Practices on Recommendation Systems
    ๐Ÿ”— recommenders-team.github.io/recommenders/intro.html

  10. fchollet/deep-learning-with-python-notebooks โญ 17,735
    Jupyter notebooks for the code samples of the book "Deep Learning with Python"

  11. hannibal046/Awesome-LLM โญ 13,940
    Awesome-LLM: a curated list of Large Language Model

  12. graykode/nlp-tutorial โญ 13,649
    Natural Language Processing Tutorial for Deep Learning Researchers
    ๐Ÿ”— www.reddit.com/r/machinelearning/comments/amfinl/project_nlptutoral_repository_who_is_studying

  13. shangtongzhang/reinforcement-learning-an-introduction โญ 13,151
    Python Implementation of Reinforcement Learning: An Introduction

  14. karpathy/nn-zero-to-hero โญ 10,264
    Neural Networks: Zero to Hero

  15. eugeneyan/open-llms โญ 10,056
    ๐Ÿ“‹ A list of open LLMs available for commercial use.

  16. openai/spinningup โญ 9,595
    An educational resource to help anyone learn deep reinforcement learning.
    ๐Ÿ”— spinningup.openai.com

  17. mooler0410/LLMsPracticalGuide โญ 8,428
    A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
    ๐Ÿ”— arxiv.org/abs/2304.13712v2

  18. karpathy/micrograd โญ 8,140
    A tiny scalar-valued autograd engine and a neural net library on top of it with PyTorch-like API

  19. mrdbourke/pytorch-deep-learning โญ 7,818
    Materials for the Learn PyTorch for Deep Learning: Zero to Mastery course.
    ๐Ÿ”— learnpytorch.io

  20. nielsrogge/Transformers-Tutorials โญ 7,407
    This repository contains demos I made with the Transformers library by HuggingFace.

  21. firmai/industry-machine-learning โญ 7,008
    A curated list of applied machine learning and data science notebooks and libraries across different industries (by @firmai)
    ๐Ÿ”— www.linkedin.com/company/firmai

  22. zhanymkanov/fastapi-best-practices โญ 6,837
    FastAPI Best Practices and Conventions we used at our startup

  23. gkamradt/langchain-tutorials โญ 6,186
    Overview and tutorial of the LangChain Library

  24. udacity/deep-learning-v2-pytorch โญ 5,160
    Projects and exercises for the latest Deep Learning ND program https://www.udacity.com/course/deep-learning-nanodegree--nd101

  25. neetcode-gh/leetcode โญ 5,029
    Leetcode solutions for NeetCode.io

  26. srush/GPU-Puzzles โญ 4,973
    Teaching beginner GPU programming in a completely interactive fashion

  27. mrdbourke/tensorflow-deep-learning โญ 4,832
    All course materials for the Zero to Mastery Deep Learning with TensorFlow course.
    ๐Ÿ”— dbourke.link/ztmtfcourse

  28. udlbook/udlbook โญ 4,707
    Understanding Deep Learning - Simon J.D. Prince

  29. timofurrer/awesome-asyncio โญ 4,388
    A curated list of awesome Python asyncio frameworks, libraries, software and resources

  30. zotroneneis/machine_learning_basics โญ 4,194
    Plain python implementations of basic machine learning algorithms

  31. roboflow/notebooks โญ 4,103
    Examples and tutorials on using SOTA computer vision models and techniques. Learn everything from old-school ResNet, through YOLO and object-detection transformers like DETR, to the latest models like Grounding DINO and SAM.
    ๐Ÿ”— roboflow.com/models

  32. huggingface/deep-rl-class โญ 3,579
    This repo contains the syllabus of the Hugging Face Deep Reinforcement Learning Course.

  33. cosmicpython/book โญ 3,243
    A Book about Pythonic Application Architecture Patterns for Managing Complexity. Cosmos is the Opposite of Chaos you see. O'R. wouldn't actually let us call it "Cosmic Python" tho.
    ๐Ÿ”— www.cosmicpython.com

  34. alirezadir/Machine-Learning-Interviews โญ 3,227
    This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

  35. huggingface/diffusion-models-class โญ 3,195
    Materials for the Hugging Face Diffusion Models Course

  36. promptslab/Awesome-Prompt-Engineering โญ 3,150
    This repository contains a hand-curated resources for Prompt Engineering with a focus on Generative Pre-trained Transformer (GPT), ChatGPT, PaLM etc
    ๐Ÿ”— discord.gg/m88xfymbk6

  37. fluentpython/example-code-2e โญ 2,893
    Example code for Fluent Python, 2nd edition (O'Reilly 2022)
    ๐Ÿ”— amzn.to/3j48u2j

  38. rasbt/machine-learning-book โญ 2,820
    Code Repository for Machine Learning with PyTorch and Scikit-Learn
    ๐Ÿ”— sebastianraschka.com/books/#machine-learning-with-pytorch-and-scikit-learn

  39. amanchadha/coursera-deep-learning-specialization โญ 2,658
    Notes, programming assignments and quizzes from all courses within the Coursera Deep Learning specialization offered by deeplearning.ai: (i) Neural Networks and Deep Learning; (ii) Improving Deep Neural Networks: Hyperparameter tuning, Regularization and Optimization; (iii) Structuring Machine Learning Projects; (iv...

  40. mrdbourke/zero-to-mastery-ml โญ 2,555
    All course materials for the Zero to Mastery Machine Learning and Data Science course.
    ๐Ÿ”— dbourke.link/ztmmlcourse

  41. krzjoa/awesome-python-data-science โญ 2,296
    Probably the best curated list of data science software in Python.
    ๐Ÿ”— krzjoa.github.io/awesome-python-data-science

  42. cgpotts/cs224u โญ 2,055
    Code for CS224u: Natural Language Understanding

  43. cerlymarco/MEDIUM_NoteBook โญ 2,016
    Repository containing notebooks of my posts on Medium

  44. trananhkma/fucking-awesome-python โญ 1,961
    awesome-python with :octocat: โญ and ๐Ÿด

  45. gerdm/prml โญ 1,796
    Repository of notes, code and notebooks in Python for the book Pattern Recognition and Machine Learning by Christopher Bishop

  46. atcold/NYU-DLSP21 โญ 1,489
    NYU Deep Learning Spring 2021
    ๐Ÿ”— atcold.github.io/nyu-dlsp21

  47. chandlerbang/awesome-self-supervised-gnn โญ 1,453
    Papers about pretraining and self-supervised learning on Graph Neural Networks (GNN).

  48. huggingface/cookbook โญ 1,229
    Community-driven practical examples of building AI applications and solving various tasks with AI using open-source tools and models.
    ๐Ÿ”— huggingface.co/learn/cookbook

  49. patrickloeber/MLfromscratch โญ 1,152
    Machine Learning algorithm implementations from scratch.

  50. davidadsp/Generative_Deep_Learning_2nd_Edition โญ 832
    The official code repository for the second edition of the O'Reilly book Generative Deep Learning: Teaching Machines to Paint, Write, Compose and Play.
    ๐Ÿ”— www.oreilly.com/library/view/generative-deep-learning/9781098134174

  51. jackhidary/quantumcomputingbook โญ 759
    Companion site for the textbook Quantum Computing: An Applied Approach

  52. dylanhogg/awesome-python โญ 230
    ๐Ÿ Hand-picked awesome Python libraries and frameworks, organised by category
    ๐Ÿ”— www.awesomepython.org

Template

Template tools and libraries: cookiecutter repos, generators, quick-starts.

  1. tiangolo/full-stack-fastapi-template โญ 22,548
    Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.

  2. cookiecutter/cookiecutter โญ 21,520
    A cross-platform command-line utility that creates projects from cookiecutters (project templates), e.g. Python package projects, C projects.
    ๐Ÿ”— pypi.org/project/cookiecutter

  3. drivendata/cookiecutter-data-science โญ 7,546
    A logical, reasonably standardized, but flexible project structure for doing and sharing data science work.
    ๐Ÿ”— drivendata.github.io/cookiecutter-data-science

  4. buuntu/fastapi-react โญ 2,054
    ๐Ÿš€ Cookiecutter Template for FastAPI + React Projects. Using PostgreSQL, SQLAlchemy, and Docker

  5. pyscaffold/pyscaffold โญ 2,000
    ๐Ÿ›  Python project template generator with batteries included
    ๐Ÿ”— pyscaffold.org

  6. cjolowicz/cookiecutter-hypermodern-python โญ 1,713
    Cookiecutter template for a Python package based on the Hypermodern Python article series.
    ๐Ÿ”— cookiecutter-hypermodern-python.readthedocs.io

  7. tezromach/python-package-template โญ 1,068
    ๐Ÿš€ Your next Python package needs a bleeding-edge project structure.

  8. martinheinz/python-project-blueprint โญ 941
    Blueprint/Boilerplate For Python Projects

Terminal

Terminal and console tools and libraries: CLI tools, terminal based formatters, progress bars.

  1. willmcgugan/rich โญ 46,911
    Rich is a Python library for rich text and beautiful formatting in the terminal.
    ๐Ÿ”— rich.readthedocs.io/en/latest

  2. tqdm/tqdm โญ 27,353
    โšก A Fast, Extensible Progress Bar for Python and CLI
    ๐Ÿ”— tqdm.github.io

  3. google/python-fire โญ 26,249
    Python Fire is a library for automatically generating command line interfaces (CLIs) from absolutely any Python object.

  4. willmcgugan/textual โญ 23,403
    The lean application framework for Python. Build sophisticated user interfaces with a simple Python API. Run your apps in the terminal and a web browser.
    ๐Ÿ”— textual.textualize.io

  5. pallets/click โญ 14,977
    Python composable command line interface toolkit
    ๐Ÿ”— click.palletsprojects.com

  6. tiangolo/typer โญ 14,253
    Typer, build great CLIs. Easy to code. Based on Python type hints.
    ๐Ÿ”— typer.tiangolo.com

  7. saulpw/visidata โญ 7,385
    A terminal spreadsheet multitool for discovering and arranging data
    ๐Ÿ”— visidata.org

  8. manrajgrover/halo โญ 2,847
    ๐Ÿ’ซ Beautiful spinners for terminal, IPython and Jupyter

  9. urwid/urwid โญ 2,718
    Console user interface library for Python (official repo)
    ๐Ÿ”— urwid.org

  10. tconbeer/harlequin โญ 2,414
    The SQL IDE for Your Terminal.
    ๐Ÿ”— harlequin.sh

  11. textualize/trogon โญ 2,331
    Easily turn your Click CLI into a powerful terminal application

  12. tmbo/questionary โญ 1,405
    Python library to build pretty command line user prompts โœจEasy to use multi-select lists, confirmations, free text prompts ...

  13. jazzband/prettytable โญ 1,239
    Display tabular data in a visually appealing ASCII table format
    ๐Ÿ”— pypi.org/project/prettytable

  14. 1j01/textual-paint โญ 908
    ๐ŸŽจ MS Paint in your terminal.
    ๐Ÿ”— pypi.org/project/textual-paint

Testing

Testing libraries: unit testing, load testing, acceptance testing, code coverage, browser automation, plugins.

  1. locustio/locust โญ 23,561
    Write scalable load tests in plain Python ๐Ÿš—๐Ÿ’จ

  2. pytest-dev/pytest โญ 11,327
    The pytest framework makes it easy to write small tests, yet scales to support complex functional testing
    ๐Ÿ”— pytest.org

  3. microsoft/playwright-python โญ 10,579
    Python version of the Playwright testing and automation library.
    ๐Ÿ”— playwright.dev/python

  4. robotframework/robotframework โญ 9,007
    Generic automation framework for acceptance testing and RPA
    ๐Ÿ”— robotframework.org

  5. getmoto/moto โญ 7,363
    A library that allows you to easily mock out tests based on AWS infrastructure.
    ๐Ÿ”— docs.getmoto.org/en/latest

  6. hypothesisworks/hypothesis โญ 7,247
    Hypothesis is a powerful, flexible, and easy to use library for property-based testing.
    ๐Ÿ”— hypothesis.works

  7. newsapps/beeswithmachineguns โญ 6,388
    A utility for arming (creating) many bees (micro EC2 instances) to attack (load test) targets (web applications).
    ๐Ÿ”— apps.chicagotribune.com

  8. seleniumbase/SeleniumBase โญ 4,157
    ๐Ÿ“Š Python's all-in-one framework for web crawling, scraping, testing, and reporting. Supports pytest. UC Mode provides stealth. Includes many tools.
    ๐Ÿ”— seleniumbase.io

  9. getsentry/responses โญ 4,039
    A utility for mocking out the Python Requests library.

  10. spulec/freezegun โญ 3,966
    Let your Python tests travel through time

  11. tox-dev/tox โญ 3,508
    Command line driven CI frontend and development task automation tool.
    ๐Ÿ”— tox.wiki

  12. behave/behave โญ 3,059
    BDD, Python style.
    ๐Ÿ”— behave.readthedocs.io/en/latest

  13. nedbat/coveragepy โญ 2,818
    The code coverage tool for Python
    ๐Ÿ”— coverage.readthedocs.io

  14. cobrateam/splinter โญ 2,683
    splinter - python test framework for web applications
    ๐Ÿ”— splinter.readthedocs.org/en/stable/index.html

  15. kevin1024/vcrpy โญ 2,602
    Automatically mock your HTTP interactions to simplify and speed up testing

  16. pytest-dev/pytest-testinfra โญ 2,318
    With Testinfra you can write unit tests in Python to test actual state of your servers configured by management tools like Salt, Ansible, Puppet, Chef and so on.
    ๐Ÿ”— testinfra.readthedocs.io

  17. pytest-dev/pytest-mock โญ 1,753
    Thin-wrapper around the mock package for easier use with pytest
    ๐Ÿ”— pytest-mock.readthedocs.io/en/latest

  18. pytest-dev/pytest-cov โญ 1,655
    Coverage plugin for pytest.

  19. confident-ai/deepeval โญ 1,629
    The LLM Evaluation Framework
    ๐Ÿ”— docs.confident-ai.com

  20. pytest-dev/pytest-xdist โญ 1,350
    pytest plugin for distributed testing and loop-on-failures testing modes.
    ๐Ÿ”— pytest-xdist.readthedocs.io

  21. pytest-dev/pytest-asyncio โญ 1,313
    Asyncio support for pytest
    ๐Ÿ”— pytest-asyncio.readthedocs.io

  22. taverntesting/tavern โญ 989
    A command-line tool and Python library and Pytest plugin for automated testing of RESTful APIs, with a simple, concise and flexible YAML-based syntax
    ๐Ÿ”— taverntesting.github.io

Machine Learning - Time Series

Machine learning and classical timeseries libraries: forecasting, seasonality, anomaly detection, econometrics.

  1. facebook/prophet โญ 17,706
    Tool for producing high quality forecasts for time series data that has multiple seasonality with linear or non-linear growth.
    ๐Ÿ”— facebook.github.io/prophet

  2. blue-yonder/tsfresh โญ 8,066
    Automatic extraction of relevant features from time series:
    ๐Ÿ”— tsfresh.readthedocs.io

  3. sktime/sktime โญ 7,377
    A unified framework for machine learning with time series
    ๐Ÿ”— www.sktime.net

  4. unit8co/darts โญ 7,213
    A python library for user-friendly forecasting and anomaly detection on time series.
    ๐Ÿ”— unit8co.github.io/darts

  5. facebookresearch/Kats โญ 4,737
    Kats, a kit to analyze time series data, a lightweight, easy-to-use, generalizable, and extendable framework to perform time series analysis, from understanding the key statistics and characteristics, detecting change points and anomalies, to forecasting future trends.

  6. awslabs/gluonts โญ 4,254
    Probabilistic time series modeling in Python
    ๐Ÿ”— ts.gluon.ai

  7. nixtla/statsforecast โญ 3,512
    Lightning โšก๏ธ fast forecasting with statistical and econometric models.
    ๐Ÿ”— nixtlaverse.nixtla.io/statsforecast

  8. salesforce/Merlion โญ 3,249
    Merlion: A Machine Learning Framework for Time Series Intelligence

  9. tdameritrade/stumpy โญ 2,979
    STUMPY is a powerful and scalable Python library for modern time series analysis
    ๐Ÿ”— stumpy.readthedocs.io/en/latest

  10. rjt1990/pyflux โญ 2,085
    Open source time series library for Python

  11. aistream-peelout/flow-forecast โญ 1,872
    Deep learning PyTorch library for time series forecasting, classification, and anomaly detection (originally for flood forecasting).
    ๐Ÿ”— flow-forecast.atlassian.net/wiki/spaces/ff/overview

  12. uber/orbit โญ 1,797
    A Python package for Bayesian forecasting with object-oriented design and probabilistic models under the hood.
    ๐Ÿ”— orbit-ml.readthedocs.io/en/stable

  13. alkaline-ml/pmdarima โญ 1,513
    A statistical library designed to fill the void in Python's time series analysis capabilities, including the equivalent of R's auto.arima function.
    ๐Ÿ”— www.alkaline-ml.com/pmdarima

  14. amazon-science/chronos-forecasting โญ 1,503
    Chronos: Pretrained (Language) Models for Probabilistic Time Series Forecasting

  15. winedarksea/AutoTS โญ 998
    Automated Time Series Forecasting

  16. time-series-foundation-models/lag-llama โญ 913
    Lag-Llama: Towards Foundation Models for Probabilistic Time Series Forecasting

  17. autoviml/Auto_TS โญ 670
    Automatically build ARIMA, SARIMAX, VAR, FB Prophet and XGBoost Models on Time Series data sets with a Single Line of Code. Created by Ram Seshadri. Collaborators welcome.

  18. google/temporian โญ 616
    Temporian is an open-source Python library for preprocessing โšก and feature engineering ๐Ÿ›  temporal data ๐Ÿ“ˆ for machine learning applications ๐Ÿค–
    ๐Ÿ”— temporian.readthedocs.io

Typing

Typing libraries: static and run-time type checking, annotations.

  1. python/mypy โญ 17,489
    Optional static typing for Python
    ๐Ÿ”— www.mypy-lang.org

  2. microsoft/pyright โญ 11,960
    Static Type Checker for Python

  3. facebook/pyre-check โญ 6,673
    Performant type-checking for python.
    ๐Ÿ”— pyre-check.org

  4. python-attrs/attrs โญ 5,062
    Python Classes Without Boilerplate
    ๐Ÿ”— www.attrs.org

  5. google/pytype โญ 4,531
    A static type analyzer for Python code
    ๐Ÿ”— google.github.io/pytype

  6. instagram/MonkeyType โญ 4,524
    A Python library that generates static type annotations by collecting runtime types

  7. python/typeshed โญ 4,048
    Collection of library stubs for Python, with static types

  8. microsoft/pylance-release โญ 1,651
    Fast, feature-rich language support for Python. Documentation and issues for Pylance.

  9. agronholm/typeguard โญ 1,428
    Run-time type checker for Python

  10. mtshiba/pylyzer โญ 1,364
    A fast static code analyzer & language server for Python
    ๐Ÿ”— mtshiba.github.io/pylyzer

  11. patrick-kidger/torchtyping โญ 1,331
    Type annotations and dynamic checking for a tensor's shape, dtype, names, etc.

  12. robertcraigie/pyright-python โญ 135
    Python command line wrapper for pyright, a static type checker
    ๐Ÿ”— pypi.org/project/pyright

Utility

General utility libraries: miscellaneous tools, linters, code formatters, version management, package tools, documentation tools.

  1. yt-dlp/yt-dlp โญ 69,805
    A feature-rich command-line audio/video downloader
    ๐Ÿ”— discord.gg/h5mncfw63r

  2. home-assistant/core โญ 68,363
    ๐Ÿก Open source home automation that puts local control and privacy first.
    ๐Ÿ”— www.home-assistant.io

  3. python/cpython โญ 59,378
    The Python programming language
    ๐Ÿ”— www.python.org

  4. localstack/localstack โญ 51,999
    ๐Ÿ’ป A fully functional local AWS cloud stack. Develop and test your cloud & Serverless apps offline
    ๐Ÿ”— localstack.cloud

  5. faif/python-patterns โญ 39,317
    A collection of design patterns/idioms in Python

  6. mingrammer/diagrams โญ 34,763
    ๐ŸŽจ Diagram as Code for prototyping cloud system architectures
    ๐Ÿ”— diagrams.mingrammer.com

  7. ggerganov/whisper.cpp โญ 29,528
    Port of OpenAI's Whisper model in C/C++

  8. keon/algorithms โญ 23,511
    Minimal examples of data structures and algorithms in Python

  9. norvig/pytudes โญ 22,322
    Python programs, usually short, of considerable difficulty, to perfect particular skills.

  10. modularml/mojo โญ 20,163
    The Mojo Programming Language
    ๐Ÿ”— docs.modular.com/mojo

  11. openai/openai-python โญ 19,631
    The official Python library for the OpenAI API
    ๐Ÿ”— pypi.org/project/openai

  12. facebookresearch/audiocraft โญ 19,485
    Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.

  13. pydantic/pydantic โญ 18,467
    Data validation using Python type hints
    ๐Ÿ”— docs.pydantic.dev

  14. micropython/micropython โญ 18,286
    MicroPython - a lean and efficient Python implementation for microcontrollers and constrained systems
    ๐Ÿ”— micropython.org

  15. mkdocs/mkdocs โญ 18,202
    Project documentation with Markdown.
    ๐Ÿ”— www.mkdocs.org

  16. squidfunk/mkdocs-material โญ 18,127
    Documentation that simply works
    ๐Ÿ”— squidfunk.github.io/mkdocs-material

  17. delgan/loguru โญ 17,968
    Python logging made (stupidly) simple

  18. rustpython/RustPython โญ 17,483
    A Python Interpreter written in Rust
    ๐Ÿ”— rustpython.github.io

  19. kivy/kivy โญ 16,888
    Open source UI framework written in Python, running on Windows, Linux, macOS, Android and iOS
    ๐Ÿ”— kivy.org

  20. ipython/ipython โญ 16,129
    Official repository for IPython itself. Other repos in the IPython organization contain things like the website, documentation builds, etc.
    ๐Ÿ”— ipython.readthedocs.org

  21. alievk/avatarify-python โญ 16,091
    Avatars for Zoom, Skype and other video-conferencing apps.

  22. blakeblackshear/frigate โญ 14,480
    NVR with realtime local object detection for IP cameras
    ๐Ÿ”— frigate.video

  23. zulko/moviepy โญ 11,722
    Video editing with Python
    ๐Ÿ”— zulko.github.io/moviepy

  24. python-pillow/Pillow โญ 11,652
    Python Imaging Library (Fork)
    ๐Ÿ”— python-pillow.org

  25. dbader/schedule โญ 11,463
    Python job scheduling for humans.
    ๐Ÿ”— schedule.readthedocs.io

  26. pyodide/pyodide โญ 11,351
    Pyodide is a Python distribution for the browser and Node.js based on WebAssembly
    ๐Ÿ”— pyodide.org/en/stable

  27. pyo3/pyo3 โญ 10,919
    Rust bindings for the Python interpreter
    ๐Ÿ”— pyo3.rs

  28. openai/triton โญ 10,810
    Development repository for the Triton language and compiler
    ๐Ÿ”— triton-lang.org

  29. nuitka/Nuitka โญ 10,700
    Nuitka is a Python compiler written in Python. It's fully compatible with Python 2.6, 2.7, 3.4, 3.5, 3.6, 3.7, 3.8, 3.9, 3.10, and 3.11. You feed it your Python app, it does a lot of clever things, and spits out an executable or extension module.
    ๐Ÿ”— nuitka.net

  30. ninja-build/ninja โญ 10,463
    Ninja is a small build system with a focus on speed.
    ๐Ÿ”— ninja-build.org

  31. caronc/apprise โญ 10,337
    Apprise - Push Notifications that work with just about every platform!
    ๐Ÿ”— hub.docker.com/r/caronc/apprise

  32. pytube/pytube โญ 10,228
    A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
    ๐Ÿ”— pytube.io

  33. secdev/scapy โญ 10,005
    Scapy: the Python-based interactive packet manipulation program & library. Supports Python 2 & Python 3.
    ๐Ÿ”— scapy.net

  34. magicstack/uvloop โญ 9,978
    Ultra fast asyncio event loop.

  35. pallets/jinja โญ 9,924
    A very fast and expressive template engine.
    ๐Ÿ”— jinja.palletsprojects.com

  36. asweigart/pyautogui โญ 9,522
    A cross-platform GUI automation Python module for human beings. Used to programmatically control the mouse & keyboard.

  37. aws/serverless-application-model โญ 9,231
    The AWS Serverless Application Model (AWS SAM) transform is a AWS CloudFormation macro that transforms SAM templates into CloudFormation templates.
    ๐Ÿ”— aws.amazon.com/serverless/sam

  38. cython/cython โญ 8,868
    The most widely used Python to C compiler
    ๐Ÿ”— cython.org

  39. paramiko/paramiko โญ 8,804
    The leading native Python SSHv2 protocol library.
    ๐Ÿ”— paramiko.org

  40. boto/boto3 โญ 8,674
    AWS SDK for Python
    ๐Ÿ”— aws.amazon.com/sdk-for-python

  41. arrow-py/arrow โญ 8,545
    ๐Ÿน Better dates & times for Python
    ๐Ÿ”— arrow.readthedocs.io

  42. facebookresearch/hydra โญ 8,143
    Hydra is a framework for elegantly configuring complex applications
    ๐Ÿ”— hydra.cc

  43. xonsh/xonsh โญ 7,954
    ๐Ÿš Python-powered, cross-platform, Unix-gazing shell.
    ๐Ÿ”— xon.sh

  44. eternnoir/pyTelegramBotAPI โญ 7,654
    Python Telegram bot api.

  45. kellyjonbrazil/jc โญ 7,497
    CLI tool and python library that converts the output of popular command-line tools, file-types, and common strings to JSON, YAML, or Dictionaries. This allows piping of output to tools like jq and simplifying automation scripts.

  46. googleapis/google-api-python-client โญ 7,363
    ๐Ÿ The official Python client library for Google's discovery based APIs.
    ๐Ÿ”— googleapis.github.io/google-api-python-client/docs

  47. py-pdf/pypdf โญ 7,330
    A pure-python PDF library capable of splitting, merging, cropping, and transforming the pages of PDF files
    ๐Ÿ”— pypdf.readthedocs.io/en/latest

  48. theskumar/python-dotenv โญ 7,054
    Reads key-value pairs from a .env file and can set them as environment variables. It helps in developing applications following the 12-factor principles.
    ๐Ÿ”— saurabh-kumar.com/python-dotenv

  49. google/latexify_py โญ 6,955
    A library to generate LaTeX expression from Python code.

  50. googlecloudplatform/python-docs-samples โญ 6,954
    Code samples used on cloud.google.com

  51. marshmallow-code/marshmallow โญ 6,883
    A lightweight library for converting complex objects to and from simple Python datatypes.
    ๐Ÿ”— marshmallow.readthedocs.io

  52. hugapi/hug โญ 6,816
    Embrace the APIs of the future. Hug aims to make developing APIs as simple as possible, but no simpler.

  53. pygithub/PyGithub โญ 6,655
    Typed interactions with the GitHub API v3
    ๐Ÿ”— pygithub.readthedocs.io

  54. openai/point-e โญ 6,286
    Point cloud diffusion for 3D model synthesis

  55. pyca/cryptography โญ 6,266
    cryptography is a package designed to expose cryptographic primitives and recipes to Python developers.
    ๐Ÿ”— cryptography.io

  56. gorakhargosh/watchdog โญ 6,244
    Python library and shell utilities to monitor filesystem events.
    ๐Ÿ”— packages.python.org/watchdog

  57. sdispater/pendulum โญ 6,040
    Python datetimes made easy
    ๐Ÿ”— pendulum.eustace.io

  58. sphinx-doc/sphinx โญ 6,017
    The Sphinx documentation generator
    ๐Ÿ”— www.sphinx-doc.org

  59. jd/tenacity โญ 5,911
    Retrying library for Python
    ๐Ÿ”— tenacity.readthedocs.io

  60. icloud-photos-downloader/icloud_photos_downloader โญ 5,859
    A command-line tool to download photos from iCloud

  61. scikit-image/scikit-image โญ 5,858
    Image processing in Python
    ๐Ÿ”— scikit-image.org

  62. wireservice/csvkit โญ 5,808
    A suite of utilities for converting to and working with CSV, the king of tabular file formats.
    ๐Ÿ”— csvkit.readthedocs.io

  63. bndr/pipreqs โญ 5,797
    pipreqs - Generate pip requirements.txt file based on imports of any project. Looking for maintainers to move this project forward.

  64. agronholm/apscheduler โญ 5,675
    Task scheduling library for Python

  65. ijl/orjson โญ 5,518
    Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

  66. pdfminer/pdfminer.six โญ 5,400
    Community maintained fork of pdfminer - we fathom PDF
    ๐Ÿ”— pdfminersix.readthedocs.io

  67. pytransitions/transitions โญ 5,331
    A lightweight, object-oriented finite state machine implementation in Python with many extensions

  68. timdettmers/bitsandbytes โญ 5,308
    Accessible large language models via k-bit quantization for PyTorch.
    ๐Ÿ”— huggingface.co/docs/bitsandbytes/main/en/index

  69. buildbot/buildbot โญ 5,163
    Python-based continuous integration testing framework; your pull requests are more than welcome!
    ๐Ÿ”— www.buildbot.net

  70. rsalmei/alive-progress โญ 5,084
    A new kind of Progress Bar, with real-time throughput, ETA, and very cool animations!

  71. prompt-toolkit/ptpython โญ 5,037
    A better Python REPL

  72. pycqa/pycodestyle โญ 4,969
    Simple Python style checker in one Python file
    ๐Ÿ”— pycodestyle.pycqa.org

  73. spotify/pedalboard โญ 4,823
    ๐ŸŽ› ๐Ÿ”Š A Python library for audio.
    ๐Ÿ”— spotify.github.io/pedalboard

  74. jorgebastida/awslogs โญ 4,746
    AWS CloudWatch logs for Humansโ„ข

  75. pywinauto/pywinauto โญ 4,598
    Windows GUI Automation with Python (based on text properties)
    ๐Ÿ”— pywinauto.github.io

  76. hhatto/autopep8 โญ 4,510
    A tool that automatically formats Python code to conform to the PEP 8 style guide.
    ๐Ÿ”— pypi.org/project/autopep8

  77. tebelorg/RPA-Python โญ 4,507
    Python package for doing RPA

  78. pytoolz/toolz โญ 4,506
    A functional standard library for Python.
    ๐Ÿ”— toolz.readthedocs.org

  79. pyinvoke/invoke โญ 4,238
    Pythonic task management & command execution.
    ๐Ÿ”— pyinvoke.org

  80. bogdanp/dramatiq โญ 4,043
    A fast and reliable background task processing library for Python 3.
    ๐Ÿ”— dramatiq.io

  81. evhub/coconut โญ 3,939
    Coconut (coconut-lang.org) is a variant of Python that adds on top of Python syntax new features for simple, elegant, Pythonic functional programming.
    ๐Ÿ”— coconut-lang.org

  82. adafruit/circuitpython โญ 3,883
    CircuitPython - a Python implementation for teaching coding with microcontrollers
    ๐Ÿ”— circuitpython.org

  83. miguelgrinberg/python-socketio โญ 3,755
    Python Socket.IO server and client

  84. rspeer/python-ftfy โญ 3,701
    Fixes mojibake and other glitches in Unicode text, after the fact.
    ๐Ÿ”— ftfy.readthedocs.org

  85. joblib/joblib โญ 3,642
    Computing with Python functions.
    ๐Ÿ”— joblib.readthedocs.org

  86. ashleve/lightning-hydra-template โญ 3,641
    PyTorch Lightning + Hydra. A very user-friendly template for ML experimentation. โšก๐Ÿ”ฅโšก

  87. ets-labs/python-dependency-injector โญ 3,570
    Dependency injection framework for Python
    ๐Ÿ”— python-dependency-injector.ets-labs.org

  88. python-markdown/markdown โญ 3,562
    A Python implementation of John Gruberโ€™s Markdown with Extension support.
    ๐Ÿ”— python-markdown.github.io

  89. zeromq/pyzmq โญ 3,540
    PyZMQ: Python bindings for zeromq
    ๐Ÿ”— zguide.zeromq.org/py:all

  90. pypi/warehouse โญ 3,459
    The Python Package Index
    ๐Ÿ”— pypi.org

  91. more-itertools/more-itertools โญ 3,416
    More routines for operating on iterables, beyond itertools
    ๐Ÿ”— more-itertools.rtfd.io

  92. tartley/colorama โญ 3,416
    Simple cross-platform colored terminal text in Python

  93. pydata/xarray โญ 3,392
    N-D labeled arrays and datasets in Python
    ๐Ÿ”— xarray.dev

  94. osohq/oso โญ 3,376
    Oso is a batteries-included framework for building authorization in your application.
    ๐Ÿ”— docs.osohq.com

  95. jorisschellekens/borb โญ 3,277
    borb is a library for reading, creating and manipulating PDF files in python.
    ๐Ÿ”— borbpdf.com

  96. suor/funcy โญ 3,260
    A fancy and practical functional tools

  97. pyo3/maturin โญ 3,210
    Build and publish crates with pyo3, cffi and uniffi bindings as well as rust binaries as python packages
    ๐Ÿ”— maturin.rs

  98. pyserial/pyserial โญ 3,094
    Python serial port access library

  99. spotify/basic-pitch โญ 2,889
    A lightweight yet powerful audio-to-MIDI converter with pitch bend detection
    ๐Ÿ”— basicpitch.io

  100. tox-dev/pipdeptree โญ 2,670
    A command line utility to display dependency tree of the installed Python packages
    ๐Ÿ”— pypi.python.org/pypi/pipdeptree

  101. legrandin/pycryptodome โญ 2,652
    A self-contained cryptographic library for Python
    ๐Ÿ”— www.pycryptodome.org

  102. camelot-dev/camelot โญ 2,627
    A Python library to extract tabular data from PDFs
    ๐Ÿ”— camelot-py.readthedocs.io

  103. pyinfra-dev/pyinfra โญ 2,626
    pyinfra automates infrastructure using Python. Itโ€™s fast and scales from one server to thousands. Great for ad-hoc command execution, service deployment, configuration management and more.
    ๐Ÿ”— pyinfra.com

  104. liiight/notifiers โญ 2,598
    The easy way to send notifications
    ๐Ÿ”— notifiers.readthedocs.io

  105. lxml/lxml โญ 2,566
    The lxml XML toolkit for Python
    ๐Ÿ”— lxml.de

  106. whylabs/whylogs โญ 2,535
    An open-source data logging library for machine learning models and data pipelines. ๐Ÿ“š Provides visibility into data quality & model performance over time. ๐Ÿ›ก๏ธ Supports privacy-preserving data collection, ensuring safety & robustness. ๐Ÿ“ˆ
    ๐Ÿ”— whylogs.readthedocs.io

  107. pexpect/pexpect โญ 2,524
    A Python module for controlling interactive programs in a pseudo-terminal
    ๐Ÿ”— pexpect.readthedocs.io

  108. pyston/pyston โญ 2,489
    A faster and highly-compatible implementation of the Python programming language.
    ๐Ÿ”— www.pyston.org

  109. scrapinghub/dateparser โญ 2,458
    python parser for human readable dates

  110. dosisod/refurb โญ 2,444
    A tool for refurbishing and modernizing Python codebases

  111. yaml/pyyaml โญ 2,419
    Canonical source repository for PyYAML

  112. cdgriffith/Box โญ 2,352
    Python dictionaries with advanced dot notation access
    ๐Ÿ”— github.com/cdgriffith/box/wiki

  113. nschloe/tikzplotlib โญ 2,300
    ๐Ÿ“Š Save matplotlib figures as TikZ/PGFplots for smooth integration into LaTeX.

  114. pypa/setuptools โญ 2,299
    Official project repository for the Setuptools build system
    ๐Ÿ”— pypi.org/project/setuptools

  115. hgrecco/pint โญ 2,253
    Operate and manipulate physical quantities in Python
    ๐Ÿ”— pint.readthedocs.org

  116. dateutil/dateutil โญ 2,239
    Useful extensions to the standard Python datetime features

  117. ianmiell/shutit โญ 2,147
    Automation framework for programmers
    ๐Ÿ”— ianmiell.github.io/shutit

  118. grantjenks/python-diskcache โญ 2,144
    Python disk-backed cache (Django-compatible). Faster than Redis and Memcached. Pure-Python.
    ๐Ÿ”— www.grantjenks.com/docs/diskcache

  119. pndurette/gTTS โญ 2,132
    Python library and CLI tool to interface with Google Translate's text-to-speech API
    ๐Ÿ”— gtts.readthedocs.org

  120. kiminewt/pyshark โญ 2,113
    Python wrapper for tshark, allowing python packet parsing using wireshark dissectors

  121. pyparsing/pyparsing โญ 2,083
    Python library for creating PEG parsers

  122. libaudioflux/audioFlux โญ 2,038
    A library for audio and music analysis, feature extraction.
    ๐Ÿ”— audioflux.top

  123. google/gin-config โญ 1,989
    Gin provides a lightweight configuration framework for Python

  124. grahamdumpleton/wrapt โญ 1,970
    A Python module for decorators, wrappers and monkey patching.

  125. astanin/python-tabulate โญ 1,958
    Pretty-print tabular data in Python, a library and a command-line utility. Repository migrated from bitbucket.org/astanin/python-tabulate.
    ๐Ÿ”— pypi.org/project/tabulate

  126. pyfilesystem/pyfilesystem2 โญ 1,944
    Python's Filesystem abstraction layer
    ๐Ÿ”— www.pyfilesystem.org

  127. landscapeio/prospector โญ 1,905
    Inspects Python source files and provides information about type and location of classes, methods etc

  128. nateshmbhat/pyttsx3 โญ 1,892
    Offline Text To Speech synthesis for python

  129. julienpalard/Pipe โญ 1,852
    A Python library to use infix notation in Python

  130. jcrist/msgspec โญ 1,839
    A fast serialization and validation library, with builtin support for JSON, MessagePack, YAML, and TOML
    ๐Ÿ”— jcristharif.com/msgspec

  131. python-rope/rope โญ 1,830
    a python refactoring library

  132. chaostoolkit/chaostoolkit โญ 1,825
    Chaos Engineering Toolkit & Orchestration for Developers
    ๐Ÿ”— chaostoolkit.org

  133. numba/llvmlite โญ 1,818
    A lightweight LLVM python binding for writing JIT compilers
    ๐Ÿ”— llvmlite.pydata.org

  134. carpedm20/emoji โญ 1,811
    emoji terminal output for Python

  135. mitmproxy/pdoc โญ 1,804
    API Documentation for Python Projects
    ๐Ÿ”— pdoc.dev

  136. joowani/binarytree โญ 1,801
    Python Library for Studying Binary Trees
    ๐Ÿ”— binarytree.readthedocs.io

  137. omry/omegaconf โญ 1,784
    Flexible Python configuration system. The last one you will ever need.

  138. pydoit/doit โญ 1,780
    task management & automation tool
    ๐Ÿ”— pydoit.org

  139. pygments/pygments โญ 1,699
    Pygments is a generic syntax highlighter written in Python
    ๐Ÿ”— pygments.org

  140. kalliope-project/kalliope โญ 1,693
    Kalliope is a framework that will help you to create your own personal assistant.
    ๐Ÿ”— kalliope-project.github.io

  141. rhettbull/osxphotos โญ 1,672
    Python app to export pictures and associated metadata from Apple Photos on macOS. Also includes a package to provide programmatic access to the Photos library, pictures, and metadata.

  142. konradhalas/dacite โญ 1,648
    Simple creation of data classes from dictionaries.

  143. home-assistant/supervisor โญ 1,639
    ๐Ÿก Home Assistant Supervisor
    ๐Ÿ”— home-assistant.io/hassio

  144. rubik/radon โญ 1,592
    Various code metrics for Python code
    ๐Ÿ”— radon.readthedocs.org

  145. open-telemetry/opentelemetry-python โญ 1,586
    OpenTelemetry Python API and SDK
    ๐Ÿ”— opentelemetry.io

  146. samuelcolvin/watchfiles โญ 1,581
    Simple, modern and fast file watching and code reload in python.
    ๐Ÿ”— watchfiles.helpmanual.io

  147. p0dalirius/Coercer โญ 1,552
    A python script to automatically coerce a Windows server to authenticate on an arbitrary machine through 12 methods.
    ๐Ÿ”— podalirius.net

  148. mkdocstrings/mkdocstrings โญ 1,549
    ๐Ÿ“˜ Automatic documentation from sources, for MkDocs.
    ๐Ÿ”— mkdocstrings.github.io

  149. facebookincubator/Bowler โญ 1,511
    Safe code refactoring for modern Python.
    ๐Ÿ”— pybowler.io

  150. hbldh/bleak โญ 1,506
    A cross platform Bluetooth Low Energy Client for Python using asyncio

  151. nficano/python-lambda โญ 1,476
    A toolkit for developing and deploying serverless Python code in AWS Lambda.

  152. quodlibet/mutagen โญ 1,438
    Python module for handling audio metadata
    ๐Ÿ”— mutagen.readthedocs.io

  153. instagram/LibCST โญ 1,403
    A concrete syntax tree parser and serializer library for Python that preserves many aspects of Python's abstract syntax tree
    ๐Ÿ”— libcst.readthedocs.io

  154. fabiocaccamo/python-benedict โญ 1,400
    ๐Ÿ“˜ dict subclass with keylist/keypath support, built-in I/O operations (base64, csv, html, ini, json, pickle, plist, query-string, toml, xls, xml, yaml), s3 support and many utilities.

  155. aws-samples/aws-glue-samples โญ 1,389
    AWS Glue code samples

  156. pycqa/pyflakes โญ 1,337
    A simple program which checks Python source files for errors
    ๐Ÿ”— pypi.org/project/pyflakes

  157. lidatong/dataclasses-json โญ 1,295
    Easily serialize Data Classes to and from JSON

  158. lcompilers/lpython โญ 1,287
    Python compiler
    ๐Ÿ”— lpython.org

  159. ossf/criticality_score โญ 1,280
    Gives criticality score for an open source project

  160. brandon-rhodes/python-patterns โญ 1,256
    Source code behind the python-patterns.guide site by Brandon Rhodes

  161. aio-libs/yarl โญ 1,218
    Yet another URL library
    ๐Ÿ”— yarl.aio-libs.org

  162. oracle/graalpython โญ 1,103
    A Python 3 implementation built on GraalVM

  163. pdoc3/pdoc โญ 1,087
    ๐Ÿ โžก๏ธ ๐Ÿ“œ Auto-generate API documentation for Python projects
    ๐Ÿ”— pdoc3.github.io/pdoc

  164. c4urself/bump2version โญ 1,035
    Version-bump your software with a single command
    ๐Ÿ”— pypi.python.org/pypi/bump2version

  165. metachris/logzero โญ 1,030
    Robust and effective logging for Python 2 and 3.
    ๐Ÿ”— logzero.readthedocs.io

  166. pyo3/rust-numpy โญ 1,010
    PyO3-based Rust bindings of the NumPy C-API

  167. pyfpdf/fpdf2 โญ 929
    Simple PDF generation for Python
    ๐Ÿ”— py-pdf.github.io/fpdf2

  168. fastai/fastcore โญ 897
    Python supercharged for the fastai library
    ๐Ÿ”— fastcore.fast.ai

  169. anthropics/anthropic-sdk-python โญ 870
    SDK providing access to Anthropic's safety-first language model APIs

  170. fsspec/filesystem_spec โญ 869
    A specification that python filesystems should adhere to.

  171. alex-sherman/unsync โญ 867
    Unsynchronize asyncio

  172. milvus-io/pymilvus โญ 858
    Python SDK for Milvus.

  173. lastmile-ai/aiconfig โญ 825
    AIConfig saves prompts, models and model parameters as source control friendly configs. This allows you to iterate on prompts and model parameters separately from your application code.
    ๐Ÿ”— aiconfig.lastmileai.dev

  174. samuelcolvin/dirty-equals โญ 761
    Doing dirty (but extremely useful) things with equals.
    ๐Ÿ”— dirty-equals.helpmanual.io

  175. barracuda-fsh/pyobd โญ 726
    open source obd2 car diagnostics program - reuploaded

  176. pypy/pypy โญ 700
    PyPy is a very fast and compliant implementation of the Python language.
    ๐Ÿ”— pypy.org

  177. pypa/build โญ 652
    A simple, correct Python build frontend
    ๐Ÿ”— build.pypa.io

  178. instagram/Fixit โญ 649
    Advanced Python linting framework with auto-fixes and hierarchical configuration that makes it easy to write custom in-repo lint rules.
    ๐Ÿ”— fixit.rtfd.io/en/latest

  179. gefyrahq/gefyra โญ 616
    Blazingly-fast ๐Ÿš€, rock-solid, local application development โžก๏ธ with Kubernetes.
    ๐Ÿ”— gefyra.dev

  180. open-telemetry/opentelemetry-python-contrib โญ 604
    OpenTelemetry instrumentation for Python modules
    ๐Ÿ”— opentelemetry.io

  181. qdrant/qdrant-client โญ 600
    Python client for Qdrant vector search engine
    ๐Ÿ”— qdrant.tech

  182. methexis-inc/terminal-copilot โญ 563
    A smart terminal assistant that helps you find the right command.

  183. fastai/ghapi โญ 508
    A delightful and complete interface to GitHub's amazing API
    ๐Ÿ”— ghapi.fast.ai

  184. steamship-core/steamship-langchain โญ 501
    steamship-langchain

  185. google/pyglove โญ 318
    Manipulating Python Programs

Vizualisation

Vizualisation tools and libraries. Application frameworks, 2D/3D plotting, dashboards, WebGL.

  1. apache/superset โญ 58,562
    Apache Superset is a Data Visualization and Data Exploration Platform
    ๐Ÿ”— superset.apache.org

  2. streamlit/streamlit โญ 31,332
    Streamlit โ€” A faster way to build and share data apps.
    ๐Ÿ”— streamlit.io

  3. gradio-app/gradio โญ 28,460
    Build and share delightful machine learning apps, all in Python. ๐ŸŒŸ Star to support our work!
    ๐Ÿ”— www.gradio.app

  4. plotly/dash โญ 20,425
    Data Apps & Dashboards for Python. No JavaScript Required.
    ๐Ÿ”— plotly.com/dash

  5. matplotlib/matplotlib โญ 19,177
    matplotlib: plotting with Python
    ๐Ÿ”— matplotlib.org/stable

  6. bokeh/bokeh โญ 18,792
    Interactive Data Visualization in the browser, from Python
    ๐Ÿ”— bokeh.org

  7. plotly/plotly.py โญ 15,188
    The interactive graphing library for Python โœจ This project now includes Plotly Express!
    ๐Ÿ”— plotly.com/python

  8. mwaskom/seaborn โญ 11,909
    Statistical data visualization in Python
    ๐Ÿ”— seaborn.pydata.org

  9. visgl/deck.gl โญ 11,673
    WebGL2 powered visualization framework
    ๐Ÿ”— deck.gl

  10. marceloprates/prettymaps โญ 10,806
    A small set of Python functions to draw pretty maps from OpenStreetMap data. Based on osmnx, matplotlib and shapely libraries.

  11. altair-viz/altair โญ 8,876
    Declarative statistical visualization library for Python
    ๐Ÿ”— altair-viz.github.io

  12. nvidia/TensorRT-LLM โญ 6,336
    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT...
    ๐Ÿ”— nvidia.github.io/tensorrt-llm

  13. lux-org/lux โญ 4,902
    Automatically visualize your pandas dataframe via a single print! ๐Ÿ“Š ๐Ÿ’ก

  14. man-group/dtale โญ 4,527
    Visualizer for pandas data structures
    ๐Ÿ”— alphatechadmin.pythonanywhere.com

  15. renpy/renpy โญ 4,523
    The Ren'Py Visual Novel Engine
    ๐Ÿ”— www.renpy.org

  16. holoviz/panel โญ 4,154
    Panel: The powerful data exploration & web app framework for Python
    ๐Ÿ”— panel.holoviz.org

  17. has2k1/plotnine โญ 3,803
    A Grammar of Graphics for Python
    ๐Ÿ”— plotnine.org

  18. residentmario/missingno โญ 3,803
    missingno provides a small toolset of flexible and easy-to-use missing data visualizations and utilities that allows you to get a quick visual summary of the completeness (or lack thereof) of your dataset.

  19. pyqtgraph/pyqtgraph โญ 3,660
    Fast data visualization and GUI tools for scientific / engineering applications
    ๐Ÿ”— www.pyqtgraph.org

  20. vispy/vispy โญ 3,212
    Main repository for Vispy
    ๐Ÿ”— vispy.org

  21. ml-tooling/opyrator โญ 3,011
    ๐Ÿช„ Turns your machine learning code into microservices with web API, interactive GUI, and more.
    ๐Ÿ”— opyrator-playground.mltooling.org

  22. netflix/flamescope โญ 2,984
    FlameScope is a visualization tool for exploring different time ranges as Flame Graphs.

  23. facebookresearch/hiplot โญ 2,696
    HiPlot makes understanding high dimensional data easy
    ๐Ÿ”— facebookresearch.github.io/hiplot

  24. holoviz/holoviews โญ 2,610
    With Holoviews, your data visualizes itself.
    ๐Ÿ”— holoviews.org

  25. kozea/pygal โญ 2,599
    pygal is a dynamic SVG charting library written in python.
    ๐Ÿ”— www.pygal.org

  26. mckinsey/vizro โญ 2,351
    Vizro is a toolkit for creating modular data visualization applications.
    ๐Ÿ”— vizro.readthedocs.io/en/stable

  27. pyvista/pyvista โญ 2,333
    3D plotting and mesh analysis through a streamlined interface for the Visualization Toolkit (VTK)
    ๐Ÿ”— docs.pyvista.org

  28. marcomusy/vedo โญ 1,904
    A python module for scientific analysis of 3D data based on VTK and Numpy
    ๐Ÿ”— vedo.embl.es

  29. datapane/datapane โญ 1,346
    Build and share data reports in 100% Python
    ๐Ÿ”— datapane.com

  30. facultyai/dash-bootstrap-components โญ 1,051
    Bootstrap components for Plotly Dash
    ๐Ÿ”— dash-bootstrap-components.opensource.faculty.ai

  31. nomic-ai/deepscatter โญ 964
    Zoomable, animated scatterplots in the browser that scales over a billion points

  32. hazyresearch/meerkat โญ 807
    Creative interactive views of any dataset.

  33. holoviz/holoviz โญ 781
    High-level tools to simplify visualization in Python.
    ๐Ÿ”— holoviz.org

Web

Web related frameworks and libraries: webapp servers, WSGI, ASGI, asyncio, HTTP, REST, user management.

  1. django/django โญ 76,586
    The Web framework for perfectionists with deadlines.
    ๐Ÿ”— www.djangoproject.com

  2. tiangolo/fastapi โญ 70,494
    FastAPI framework, high performance, easy to learn, fast to code, ready for production
    ๐Ÿ”— fastapi.tiangolo.com

  3. pallets/flask โญ 66,245
    The Python micro framework for building web applications.
    ๐Ÿ”— flask.palletsprojects.com

  4. psf/requests โญ 51,298
    A simple, yet elegant, HTTP library.
    ๐Ÿ”— requests.readthedocs.io/en/latest

  5. sherlock-project/sherlock โญ 51,051
    ๐Ÿ”Ž Hunt down social media accounts by username across social networks
    ๐Ÿ”— sherlock-project.github.io

  6. tornadoweb/tornado โญ 21,488
    Tornado is a Python web framework and asynchronous networking library, originally developed at FriendFeed.
    ๐Ÿ”— www.tornadoweb.org

  7. huge-success/sanic โญ 17,707
    Accelerate your web app development | Build fast. Run fast.
    ๐Ÿ”— sanic.dev

  8. pyscript/pyscript โญ 17,422
    A framework that allows users to create rich Python applications in the browser using HTML's interface and the power of Pyodide, WASM, and modern web technologies.
    ๐Ÿ”— pyscript.net

  9. wagtail/wagtail โญ 17,138
    A Django content management system focused on flexibility and user experience
    ๐Ÿ”— wagtail.org

  10. reflex-dev/reflex โญ 15,798
    ๐Ÿ•ธ๏ธ Web apps in pure Python ๐Ÿ
    ๐Ÿ”— reflex.dev

  11. aio-libs/aiohttp โญ 14,525
    Asynchronous HTTP client/server framework for asyncio and Python
    ๐Ÿ”— docs.aiohttp.org

  12. getpelican/pelican โญ 12,205
    Static site generator that supports Markdown and reST syntax. Powered by Python.
    ๐Ÿ”— getpelican.com

  13. encode/httpx โญ 12,194
    A next generation HTTP client for Python. ๐Ÿฆ‹
    ๐Ÿ”— www.python-httpx.org

  14. aws/chalice โญ 10,281
    Python Serverless Microframework for AWS

  15. benoitc/gunicorn โญ 9,480
    gunicorn 'Green Unicorn' is a WSGI HTTP Server for UNIX, fast clients and sleepy applications.
    ๐Ÿ”— www.gunicorn.org

  16. encode/starlette โญ 9,444
    The little ASGI framework that shines. ๐ŸŒŸ
    ๐Ÿ”— www.starlette.io

  17. falconry/falcon โญ 9,379
    The no-magic web data plane API and microservices framework for Python developers, with a focus on reliability, correctness, and performance at scale.
    ๐Ÿ”— falcon.readthedocs.io/en/stable

  18. flet-dev/flet โญ 9,016
    Flet enables developers to easily build realtime web, mobile and desktop apps in Python. No frontend experience required.
    ๐Ÿ”— flet.dev

  19. bottlepy/bottle โญ 8,281
    bottle.py is a fast and simple micro-framework for python web-applications.
    ๐Ÿ”— bottlepy.org

  20. graphql-python/graphene โญ 7,969
    GraphQL framework for Python
    ๐Ÿ”— graphene-python.org

  21. encode/uvicorn โญ 7,774
    An ASGI web server, for Python. ๐Ÿฆ„
    ๐Ÿ”— www.uvicorn.org

  22. reactive-python/reactpy โญ 7,650
    ReactPy is a library for building user interfaces in Python without Javascript
    ๐Ÿ”— reactpy.dev

  23. zauberzeug/nicegui โญ 7,088
    Create web-based user interfaces with Python. The nice way.
    ๐Ÿ”— nicegui.io

  24. pyeve/eve โญ 6,657
    REST API framework designed for human beings
    ๐Ÿ”— python-eve.org

  25. pallets/werkzeug โญ 6,532
    The comprehensive WSGI web application library.
    ๐Ÿ”— werkzeug.palletsprojects.com

  26. vitalik/django-ninja โญ 6,148
    ๐Ÿ’จ Fast, Async-ready, Openapi, type hints based framework for building APIs
    ๐Ÿ”— django-ninja.dev

  27. webpy/webpy โญ 5,870
    web.py is a web framework for python that is as simple as it is powerful.
    ๐Ÿ”— webpy.org

  28. stephenmcd/mezzanine โญ 4,712
    CMS framework for Django
    ๐Ÿ”— mezzanine.jupo.org

  29. nameko/nameko โญ 4,649
    A microservices framework for Python that lets service developers concentrate on application logic and encourages testability.
    ๐Ÿ”— www.nameko.io

  30. starlite-api/litestar โญ 4,369
    Production-ready, Light, Flexible and Extensible ASGI API framework | Effortlessly Build Performant APIs
    ๐Ÿ”— litestar.dev

  31. pywebio/PyWebIO โญ 4,317
    Write interactive web app in script way.
    ๐Ÿ”— pywebio.readthedocs.io

  32. fastapi-users/fastapi-users โญ 4,023
    Ready-to-use and customizable users management for FastAPI
    ๐Ÿ”— fastapi-users.github.io/fastapi-users

  33. pylons/pyramid โญ 3,899
    Pyramid - A Python web framework
    ๐Ÿ”— trypyramid.com

  34. h2oai/wave โญ 3,849
    H2O Wave is a software stack for building beautiful, low-latency, realtime, browser-based applications and dashboards entirely in Python/R without using HTML, Javascript, or CSS.
    ๐Ÿ”— wave.h2o.ai

  35. strawberry-graphql/strawberry โญ 3,748
    A GraphQL library for Python that leverages type annotations ๐Ÿ“
    ๐Ÿ”— strawberry.rocks

  36. websocket-client/websocket-client โญ 3,448
    WebSocket client for Python
    ๐Ÿ”— github.com/websocket-client/websocket-client

  37. unbit/uwsgi โญ 3,405
    uWSGI application server container
    ๐Ÿ”— projects.unbit.it/uwsgi

  38. pallets/quart โญ 2,601
    An async Python micro framework for building web applications.
    ๐Ÿ”— quart.palletsprojects.com

  39. fastapi-admin/fastapi-admin โญ 2,513
    A fast admin dashboard based on FastAPI and TortoiseORM with tabler ui, inspired by Django admin
    ๐Ÿ”— fastapi-admin-docs.long2ice.io

  40. flipkart-incubator/Astra โญ 2,420
    Automated Security Testing For REST API's

  41. masoniteframework/masonite โญ 2,144
    The Modern And Developer Centric Python Web Framework. Be sure to read the documentation and join the Discord channel for questions: https://discord.gg/TwKeFahmPZ
    ๐Ÿ”— docs.masoniteproject.com

  42. dot-agent/nextpy โญ 2,092
    ๐Ÿค–Self-Modifying App Framework from the Future ๐Ÿ”ฎ
    ๐Ÿ”— nextpy.org

  43. python-restx/flask-restx โญ 2,062
    Fork of Flask-RESTPlus: Fully featured framework for fast, easy and documented API development with Flask
    ๐Ÿ”— flask-restx.readthedocs.io/en/latest

  44. cherrypy/cherrypy โญ 1,779
    CherryPy is a pythonic, object-oriented HTTP framework. https://cherrypy.dev
    ๐Ÿ”— docs.cherrypy.dev

  45. dmontagu/fastapi-utils โญ 1,731
    Reusable utilities for FastAPI: a number of utilities to help reduce boilerplate and reuse common functionality across projects

  46. neoteroi/BlackSheep โญ 1,715
    Fast ASGI web framework for Python
    ๐Ÿ”— www.neoteroi.dev/blacksheep

  47. s3rius/FastAPI-template โญ 1,631
    Feature rich robust FastAPI template.

  48. jordaneremieff/mangum โญ 1,583
    AWS Lambda support for ASGI applications
    ๐Ÿ”— mangum.io

  49. wtforms/wtforms โญ 1,458
    A flexible forms validation and rendering library for Python.
    ๐Ÿ”— wtforms.readthedocs.io

  50. awtkns/fastapi-crudrouter โญ 1,301
    A dynamic FastAPI router that automatically creates CRUD routes for your models
    ๐Ÿ”— fastapi-crudrouter.awtkns.com

  51. magicstack/httptools โญ 1,162
    Fast HTTP parser

  52. long2ice/fastapi-cache โญ 1,115
    fastapi-cache is a tool to cache fastapi response and function result, with backends support redis and memcached.
    ๐Ÿ”— github.com/long2ice/fastapi-cache

  53. whitphx/stlite โญ 974
    A port of Streamlit to WebAssembly, powered by Pyodide.
    ๐Ÿ”— edit.share.stlite.net

  54. rstudio/py-shiny โญ 936
    Shiny for Python
    ๐Ÿ”— shiny.posit.co/py

  55. koxudaxi/fastapi-code-generator โญ 901
    This code generator creates FastAPI app from an openapi file.

  56. aeternalis-ingenium/FastAPI-Backend-Template โญ 550
    A backend project template with FastAPI, PostgreSQL with asynchronous SQLAlchemy 2.0, Alembic for asynchronous database migration, and Docker.


Interactive version: www.awesomepython.org, Hugging Face Dataset: awesome-python

Please raise a new issue to suggest a Python repo that you would like to see added.

1,438 hand-picked awesome Python libraries and frameworks, updated 14 Apr 2024

Hits