🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.
This curated list contains 920 awesome open-source projects with a total of 4.2M stars grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!
🧙♂️ Discover other best-of lists or create your own.
📫 Subscribe to our newsletter for updates and trending projects.
Contents
- Machine Learning Frameworks 61 projects
- Data Visualization 54 projects
- Text Data & NLP 102 projects
- Image Data 64 projects
- Graph Data 36 projects
- Audio Data 29 projects
- Geospatial Data 22 projects
- Financial Data 25 projects
- Time Series Data 30 projects
- Medical Data 19 projects
- Tabular Data 5 projects
- Optical Character Recognition 12 projects
- Data Containers & Structures 1 projects
- Data Loading & Extraction 1 projects
- Web Scraping & Crawling 1 projects
- Data Pipelines & Streaming 1 projects
- Distributed Machine Learning 36 projects
- Hyperparameter Optimization & AutoML 52 projects
- Reinforcement Learning 23 projects
- Recommender Systems 17 projects
- Privacy Machine Learning 7 projects
- Workflow & Experiment Tracking 40 projects
- Model Serialization & Deployment 20 projects
- Model Interpretability 54 projects
- Vector Similarity Search (ANN) 13 projects
- Probabilistics & Statistics 23 projects
- Adversarial Robustness 9 projects
- GPU & Accelerator Utilities 20 projects
- Tensorflow Utilities 16 projects
- Jax Utilities 3 projects
- Sklearn Utilities 19 projects
- Pytorch Utilities 32 projects
- Database Clients 1 projects
- Others 66 projects
Explanation
- 🥇🥈🥉 Combined project-quality score
- ⭐️ Star count from GitHub
- 🐣 New project (less than 6 months old)
- 💤 Inactive project (6 months no activity)
- 💀 Dead project (12 months no activity)
- 📈📉 Project is trending up or down
- ➕ Project was recently added
- ❗️ Warning (e.g. missing/risky license)
- 👨💻 Contributors count from GitHub
- 🔀 Fork count from GitHub
- 📋 Issue count from GitHub
- ⏱️ Last update timestamp on package manager
- 📥 Download count from package manager
- 📦 Number of dependent projects
Tensorflow related project
Sklearn related project
PyTorch related project
MxNet related project
Apache Spark related project
Jupyter related project
PaddlePaddle related project
Pandas related project
Jax related project
Machine Learning Frameworks
General-purpose machine learning and deep learning frameworks.
Tensorflow (🥇54 · ⭐ 180K) - An Open Source Machine Learning Framework for Everyone. Apache-2

-
GitHub (👨💻 4.5K · 🔀 89K · 📦 330K · 📋 39K - 5% open · ⏱️ 07.12.2023):
git clone https://github.com/tensorflow/tensorflow
-
PyPi (📥 17M / month · 📦 15K · ⏱️ 05.12.2023):
pip install tensorflow
-
Conda (📥 4.4M · ⏱️ 14.11.2023):
conda install -c conda-forge tensorflow
-
Docker Hub (📥 75M · ⭐ 2.2K · ⏱️ 07.12.2023):
docker pull tensorflow/tensorflow
scikit-learn (🥇52 · ⭐ 57K) - scikit-learn: machine learning in Python. BSD-3

pytorch-lightning (🥇45 · ⭐ 25K) - Deep learning framework to train, finetune and deploy.. Apache-2

PaddlePaddle (🥇45 · ⭐ 21K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2

XGBoost (🥈44 · ⭐ 25K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
LightGBM (🥈43 · ⭐ 16K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
StatsModels (🥈43 · ⭐ 9.1K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
Catboost (🥈41 · ⭐ 7.5K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
Jina (🥈40 · ⭐ 19K) - Build multimodal AI applications with cloud-native stack. Apache-2
-
GitHub (👨💻 180 · 🔀 2.2K · 📦 1.3K · 📋 1.9K - 0% open · ⏱️ 06.12.2023):
git clone https://github.com/jina-ai/jina
-
PyPi (📥 68K / month · 📦 16 · ⏱️ 01.12.2023):
pip install jina
-
Conda (📥 58K · ⏱️ 16.06.2023):
conda install -c conda-forge jina-core
-
Docker Hub (📥 1.3M · ⭐ 8 · ⏱️ 06.12.2023):
docker pull jinaai/jina
Theano (🥈38 · ⭐ 9.8K · 💤) - Theano was a Python library that allows you to define, optimize, and.. BSD-3
Vowpal Wabbit (🥈35 · ⭐ 8.3K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
Thinc (🥈35 · ⭐ 2.8K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
tensorpack (🥉32 · ⭐ 6.3K) - A Neural Net Training Interface on TensorFlow, with focus.. Apache-2

tensorflow-upstream (🥉31 · ⭐ 670) - TensorFlow ROCm port. Apache-2

Neural Network Libraries (🥉30 · ⭐ 2.7K) - Neural Network Libraries. Apache-2
Determined (🥉30 · ⭐ 2.6K) - Determined is an open-source machine learning platform.. Apache-2


Neural Tangents (🥉27 · ⭐ 2.1K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
Torchbearer (🥉25 · ⭐ 630 · 📈) - torchbearer: A model fitting library for PyTorch. MIT

NeuPy (🥉24 · ⭐ 740 · 💤) - NeuPy is a Tensorflow based python library for prototyping and building.. MIT
chefboost (🥉20 · ⭐ 420) - A Lightweight Decision Tree Framework supporting regular algorithms:.. MIT
ThunderSVM (🥉19 · ⭐ 1.5K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
ThunderGBM (🥉15 · ⭐ 680) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
Show 14 hidden projects...
- dlib (🥈39 · ⭐ 12K) - A toolkit for making real world machine learning and data analysis..
❗️BSL-1.0
- MindsDB (🥈34 · ⭐ 19K) - MindsDB connects AI models to real time data.
❗️GPL-3.0
- Turi Create (🥉33 · ⭐ 11K · 💀) - Turi Create simplifies the development of custom machine..
BSD-3
- Chainer (🥉33 · ⭐ 5.8K · 💀) - A flexible framework of neural networks for deep learning.
MIT
- TFlearn (🥉32 · ⭐ 9.6K · 💀) - Deep learning library featuring a higher-level API for TensorFlow.
MIT
- CNTK (🥉30 · ⭐ 17K · 💀) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit.
MIT
- NuPIC (🥉28 · ⭐ 6.3K) - Numenta Platform for Intelligent Computing is an implementation of..
❗️AGPL-3.0
- Lasagne (🥉28 · ⭐ 3.8K · 💀) - Lightweight library to build and train neural networks in Theano.
MIT
- SHOGUN (🥉26 · ⭐ 3K · 💀) - Unified and efficient Machine Learning.
BSD-3
- xLearn (🥉25 · ⭐ 3.1K · 💀) - High performance, easy-to-use, and scalable machine learning (ML)..
Apache-2
- mace (🥉22 · ⭐ 4.8K · 💀) - MACE is a deep learning inference framework optimized for mobile..
Apache-2
- neon (🥉22 · ⭐ 3.9K · 💀) - Intel Nervana reference deep learning framework committed to best..
Apache-2
- elegy (🥉17 · ⭐ 460 · 💀) - A High Level API for Deep Learning in JAX.
MIT
- StarSpace (🥉16 · ⭐ 3.9K · 💀) - Learning embeddings for classification, retrieval and ranking.
MIT
Data Visualization
General-purpose and task-specific data visualization libraries.
Matplotlib (🥇50 · ⭐ 19K) - matplotlib: plotting with Python. Python-2.0
Plotly (🥇44 · ⭐ 15K) - The interactive graphing library for Python This project now includes.. MIT
-
GitHub (👨💻 250 · 🔀 2.4K · 📦 220K · 📋 2.8K - 50% open · ⏱️ 30.11.2023):
git clone https://github.com/plotly/plotly.py
-
PyPi (📥 12M / month · 📦 5.4K · ⏱️ 25.10.2023):
pip install plotly
-
Conda (📥 5.1M · ⏱️ 25.10.2023):
conda install -c conda-forge plotly
-
npm (📥 52K / month · 📦 6 · ⏱️ 12.01.2021):
npm install plotlywidget
pandas-profiling (🥈37 · ⭐ 12K) - 1 Line of code data quality profiling & exploratory.. MIT


VisPy (🥈35 · ⭐ 3.2K) - High-performance interactive 2D/3D data visualization library. BSD-3

-
GitHub (👨💻 190 · 🔀 610 · 📦 1.3K · 📋 1.4K - 22% open · ⏱️ 31.10.2023):
git clone https://github.com/vispy/vispy
-
PyPi (📥 400K / month · 📦 140 · ⏱️ 03.10.2023):
pip install vispy
-
Conda (📥 440K · ⏱️ 04.10.2023):
conda install -c conda-forge vispy
-
npm (📥 11 / month · 📦 1 · ⏱️ 15.03.2020):
npm install vispy
HoloViews (🥈35 · ⭐ 2.5K) - With Holoviews, your data visualizes itself. BSD-3

-
GitHub (👨💻 140 · 🔀 380 · 📋 3.1K - 33% open · ⏱️ 06.12.2023):
git clone https://github.com/holoviz/holoviews
-
PyPi (📥 320K / month · 📦 270 · ⏱️ 06.12.2023):
pip install holoviews
-
Conda (📥 1.5M · ⏱️ 08.11.2023):
conda install -c conda-forge holoviews
-
npm (📥 660 / month · 📦 2 · ⏱️ 16.08.2023):
npm install @pyviz/jupyterlab_pyviz
datashader (🥈33 · ⭐ 3.1K) - Quickly and accurately render even the largest data. BSD-3
Perspective (🥈31 · ⭐ 7K) - A data visualization and analytics component, especially.. Apache-2

-
GitHub (👨💻 93 · 🔀 890 · 📥 1.7K · 📦 100 · 📋 700 - 13% open · ⏱️ 03.12.2023):
git clone https://github.com/finos/perspective
-
PyPi (📥 5.9K / month · 📦 14 · ⏱️ 29.11.2023):
pip install perspective-python
-
Conda (📥 490K · ⏱️ 04.12.2023):
conda install -c conda-forge perspective
-
npm (📥 850 / month · 📦 3 · ⏱️ 29.11.2023):
npm install @finos/perspective-jupyterlab
D-Tale (🥈31 · ⭐ 4.4K) - Visualizer for pandas data structures. ❗️LGPL-2.1


hvPlot (🥈31 · ⭐ 830) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
bqplot (🥉29 · ⭐ 3.5K) - Plotting library for IPython/Jupyter notebooks. Apache-2

-
GitHub (👨💻 64 · 🔀 470 · 📦 51 · 📋 620 - 40% open · ⏱️ 04.10.2023):
git clone https://github.com/bqplot/bqplot
-
PyPi (📥 120K / month · 📦 100 · ⏱️ 17.10.2023):
pip install bqplot
-
Conda (📥 1.2M · ⏱️ 06.11.2023):
conda install -c conda-forge bqplot
-
npm (📥 3.6K / month · 📦 16 · ⏱️ 17.10.2023):
npm install bqplot
Sweetviz (🥉29 · ⭐ 2.7K) - Visualize and compare datasets, target values and associations, with one.. MIT
mpld3 (🥉29 · ⭐ 2.3K · 💤) - D3 Renderings of Matplotlib Graphics. BSD-3
-
GitHub (👨💻 51 · 🔀 360 · 📦 5.3K · 📋 360 - 60% open · ⏱️ 10.12.2022):
git clone https://github.com/mpld3/mpld3
-
PyPi (📥 410K / month · 📦 410 · ⏱️ 10.12.2022):
pip install mpld3
-
Conda (📥 190K · ⏱️ 16.06.2023):
conda install -c conda-forge mpld3
-
npm (📥 430 / month · 📦 8 · ⏱️ 10.12.2022):
npm install mpld3
data-validation (🥉29 · ⭐ 740) - Library for exploring and validating machine learning.. Apache-2


Facets Overview (🥉28 · ⭐ 7.2K · 💤) - Visualizations for machine learning datasets. Apache-2

AutoViz (🥉27 · ⭐ 1.5K) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
pythreejs (🥉27 · ⭐ 910 · 💤) - A Jupyter - Three.js bridge. BSD-3

-
GitHub (👨💻 30 · 🔀 180 · 📦 28 · 📋 240 - 25% open · ⏱️ 20.02.2023):
git clone https://github.com/jupyter-widgets/pythreejs
-
PyPi (📥 60K / month · 📦 57 · ⏱️ 20.02.2023):
pip install pythreejs
-
Conda (📥 520K · ⏱️ 16.06.2023):
conda install -c conda-forge pythreejs
-
npm (📥 3.8K / month · 📦 11 · ⏱️ 20.02.2023):
npm install jupyter-threejs
Plotly-Resampler (🥉26 · ⭐ 860) - Visualize large time series data with plotly.py. MIT
Chartify (🥉25 · ⭐ 3.4K) - Python library that makes it easy for data scientists to create.. Apache-2
Multicore-TSNE (🥉25 · ⭐ 1.8K · 💤) - Parallel t-SNE implementation with Python and Torch.. BSD-3

Pandas-Bokeh (🥉23 · ⭐ 870 · 💤) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT

python-ternary (🥉22 · ⭐ 660) - Ternary plotting library for python with matplotlib. MIT
Show 13 hidden projects...
- cartopy (🥈34 · ⭐ 1.3K) - Cartopy - a cartographic python library with matplotlib support.
❗️LGPL-3.0
- Cufflinks (🥉29 · ⭐ 2.9K · 💀) - Productivity Tools for Plotly + Pandas.
MIT
- HyperTools (🥉25 · ⭐ 1.8K · 💀) - A Python toolbox for gaining geometric insights into high-..
MIT
- PandasGUI (🥉24 · ⭐ 3.1K) - A GUI for Pandas DataFrames.
❗️MIT-0
- pivottablejs (🥉23 · ⭐ 630 · 💀) - Dragndrop Pivot Tables and Charts for Jupyter/IPython..
MIT
- joypy (🥉21 · ⭐ 500 · 💀) - Joyplots in Python with matplotlib & pandas.
MIT
- PyWaffle (🥉20 · ⭐ 560 · 💀) - Make Waffle Charts in Python.
MIT
- vegafusion (🥉20 · ⭐ 280) - Serverside scaling for Vega and Altair visualizations.
BSD-3
- animatplot (🥉18 · ⭐ 400 · 💀) - A python package for animating plots build on matplotlib.
MIT
- data-describe (🥉17 · ⭐ 290 · 💀) - datadescribe: Pythonic EDA Accelerator for Data Science.
Apache-2
- pdvega (🥉16 · ⭐ 340 · 💀) - Interactive plotting for Pandas using Vega-Lite.
MIT
- nx-altair (🥉15 · ⭐ 220 · 💀) - Draw interactive NetworkX graphs with Altair.
MIT
- nptsne (🥉12 · ⭐ 30 · 💀) - nptsne is a numpy compatible python binary package that offers a..
Apache-2
Text Data & NLP
Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.
transformers (🥇50 · ⭐ 120K) - Transformers: State-of-the-art Machine Learning for.. Apache-2


nltk (🥇43 · ⭐ 13K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
gensim (🥇40 · ⭐ 15K) - Topic Modelling for Humans. ❗️LGPL-2.1
flair (🥇39 · ⭐ 13K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT

sentence-transformers (🥇37 · ⭐ 13K) - Multilingual Sentence & Image Embeddings with BERT. Apache-2

Tokenizers (🥇37 · ⭐ 7.9K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
haystack (🥈35 · ⭐ 12K) - LLM orchestration framework to build customizable, production-ready.. Apache-2
sentencepiece (🥈35 · ⭐ 8.6K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
stanza (🥈35 · ⭐ 6.9K) - Stanford NLP Python library for tokenization, sentence segmentation,.. Apache-2
TensorFlow Text (🥈34 · ⭐ 1.2K) - Making text a first-class citizen in TensorFlow. Apache-2

jellyfish (🥈33 · ⭐ 1.9K) - a python library for doing approximate and phonetic matching of strings. MIT
snowballstemmer (🥈33 · ⭐ 700) - Snowball compiler and stemming algorithms. BSD-3
DeepPavlov (🥈32 · ⭐ 6.4K) - An open source library for deep learning end-to-end dialog.. Apache-2

spacy-transformers (🥈31 · ⭐ 1.3K) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT
spacy
SciSpacy (🥈30 · ⭐ 1.5K) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
TextDistance (🥈29 · ⭐ 3.2K) - Compute distance between sequences. 30+ algorithms, pure python.. MIT
qdrant (🥈28 · ⭐ 15K) - Qdrant - High-performance, massive-scale Vector Database for the next.. Apache-2
-
GitHub (👨💻 78 · 🔀 820 · 📥 5.9K · 📋 820 - 20% open · ⏱️ 07.12.2023):
git clone https://github.com/qdrant/qdrant
fastNLP (🥉27 · ⭐ 3K · 💤) - fastNLP: A Modularized and Extensible NLP Framework. Currently.. Apache-2
PyTextRank (🥉27 · ⭐ 2.1K) - Python implementation of TextRank algorithms (textgraphs) for phrase.. MIT
Ciphey (🥉26 · ⭐ 14K) - Automatically decrypt encryptions without knowing the key or cipher, decode.. MIT
-
GitHub (👨💻 48 · 🔀 880 · 📋 320 - 17% open · ⏱️ 12.10.2023):
git clone https://github.com/Ciphey/Ciphey
-
PyPi (📥 45K / month · ⏱️ 06.06.2021):
pip install ciphey
-
Docker Hub (📥 21K · ⭐ 14 · ⏱️ 14.10.2023):
docker pull remnux/ciphey
OpenPrompt (🥉26 · ⭐ 3.9K · 💤) - An Open-Source Framework for Prompt-Learning. Apache-2
promptsource (🥉26 · ⭐ 2.3K) - Toolkit for creating, sharing and using natural language.. Apache-2
pytorch-nlp (🥉25 · ⭐ 2.2K) - Basic Utilities for PyTorch Natural Language Processing (NLP). BSD-3

scattertext (🥉25 · ⭐ 2.2K) - Beautiful visualizations of how language differs among document.. Apache-2
Texthero (🥉23 · ⭐ 2.8K) - Text preprocessing, representation and visualization from zero to hero. MIT
happy-transformer (🥉23 · ⭐ 480) - Happy Transformer makes it easy to fine-tune and.. Apache-2
huggingface
YouTokenToMe (🥉22 · ⭐ 920 · 💤) - Unsupervised text tokenizer focused on computational efficiency. MIT
lightseq (🥉20 · ⭐ 3K · 💤) - LightSeq: A High Performance Library for Sequence Processing and.. Apache-2
small-text (🥉20 · ⭐ 500) - Active Learning for Text Classification in Python. MIT


textaugment (🥉20 · ⭐ 350) - TextAugment: Text Augmentation Library. MIT
OpenNRE (🥉16 · ⭐ 4.1K · 💤) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
-
GitHub (👨💻 12 · 🔀 1K · 📋 360 - 3% open · ⏱️ 03.01.2023):
git clone https://github.com/thunlp/OpenNRE
Show 44 hidden projects...
- ChatterBot (🥈35 · ⭐ 14K · 💀) - ChatterBot is a machine learning, conversational dialog engine..
BSD-3
- AllenNLP (🥈35 · ⭐ 12K · 💀) - An open-source NLP research library, built on PyTorch.
Apache-2
- fuzzywuzzy (🥈33 · ⭐ 9K · 💀) - Fuzzy String Matching in Python.
❗️GPL-2.0
- english-words (🥈29 · ⭐ 9.6K · 💀) - A text file containing 479k English words for all your..
Unlicense
- nlpaug (🥈29 · ⭐ 4.2K · 💀) - Data augmentation for NLP.
MIT
- polyglot (🥈28 · ⭐ 2.2K · 💀) - Multilingual text (NLP) processing toolkit.
❗️GPL-3.0
- langid (🥈28 · ⭐ 2.2K · 💀) - Stand-alone language identification system.
BSD-3
- underthesea (🥈28 · ⭐ 1.3K) - Underthesea - Vietnamese NLP Toolkit.
❗️GPL-3.0
- flashtext (🥉27 · ⭐ 5.5K · 💀) - Extract Keywords from sentence or Replace keywords in sentences.
MIT
- vaderSentiment (🥉27 · ⭐ 4.1K · 💀) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary..
MIT
- textacy (🥉27 · ⭐ 2.1K · 💤) - NLP, before and after spaCy.
❗Unlicensed
- FARM (🥉27 · ⭐ 1.7K · 💀) - Fast & easy transfer learning for NLP. Harvesting language..
Apache-2
- PyText (🥉25 · ⭐ 6.4K · 💀) - A natural language modeling framework based on PyTorch.
BSD-3
- Snips NLU (🥉25 · ⭐ 3.8K · 💀) - Snips Python library to extract meaning from text.
Apache-2
- neuralcoref (🥉25 · ⭐ 2.8K · 💀) - Fast Coreference Resolution in spaCy with Neural Networks.
MIT
- textgenrnn (🥉24 · ⭐ 4.9K · 💀) - Easily train your own text-generating neural network of any..
MIT
- MatchZoo (🥉24 · ⭐ 3.8K · 💀) - Facilitating the design, comparison and sharing of deep..
Apache-2
- pySBD (🥉24 · ⭐ 680 · 💀) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence..
MIT
- whoosh (🥉24 · ⭐ 480 · 💀) - Pure-Python full-text search library.
❗️BSD-1-Clause
- Kashgari (🥉23 · ⭐ 2.4K · 💀) - Kashgari is a production-level NLP Transfer learning..
Apache-2
- DeepMatcher (🥉22 · ⭐ 4.9K · 💀) - Python package for performing Entity and Text Matching using..
BSD-3
- gpt-2-simple (🥉22 · ⭐ 3.3K · 💀) - Python package to easily retrain OpenAIs GPT-2 text-..
MIT
- Texar (🥉22 · ⭐ 2.4K · 💀) - Toolkit for Machine Learning, Natural Language Processing, and..
Apache-2
- jiant (🥉22 · ⭐ 1.6K · 💀) - jiant is an nlp toolkit.
MIT
- stop-words (🥉22 · ⭐ 150 · 💀) - Get list of common stop words in various languages in Python.
BSD-3
- NLP Architect (🥉21 · ⭐ 2.9K · 💀) - A model library for exploring state-of-the-art deep..
Apache-2
- DELTA (🥉21 · ⭐ 1.6K · 💀) - DELTA is a deep learning based natural language and speech..
Apache-2
- anaGo (🥉21 · ⭐ 1.5K · 💀) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,..
MIT
- pyfasttext (🥉20 · ⭐ 230 · 💀) - Yet another Python binding for fastText.
❗️GPL-3.0
- numerizer (🥉20 · ⭐ 210 · 💤) - A Python module to convert natural language numerics into ints and..
MIT
- fastT5 (🥉19 · ⭐ 510 · 💀) - boost inference speed of T5 models by 5x & reduce the model size..
Apache-2
- textpipe (🥉19 · ⭐ 300 · 💀) - Textpipe: clean and extract metadata from text.
MIT
- NeuroNER (🥉18 · ⭐ 1.7K · 💀) - Named-entity recognition using neural networks. Easy-to-use and..
MIT
- Camphr (🥉18 · ⭐ 340 · 💀) - Camphr - NLP libary for creating pipeline components.
Apache-2
spacy
- nboost (🥉17 · ⭐ 670 · 💀) - NBoost is a scalable, search-api-boosting platform for deploying..
Apache-2
- Translate (🥉16 · ⭐ 810 · 💀) - Translate - a PyTorch Language Library.
BSD-3
- skift (🥉16 · ⭐ 230 · 💀) - scikit-learn wrappers for Python fastText.
MIT
- BLINK (🥉14 · ⭐ 1.1K · 💀) - Entity Linker solution.
MIT
- Headliner (🥉14 · ⭐ 230 · 💀) - Easy training and deployment of seq2seq models.
MIT
- NeuralQA (🥉14 · ⭐ 230 · 💀) - NeuralQA: A Usable Library for Question Answering on Large Datasets..
MIT
- TransferNLP (🥉13 · ⭐ 290 · 💀) - NLP library designed for reproducible experimentation..
MIT
- ONNX-T5 (🥉13 · ⭐ 240 · 💀) - Summarization, translation, sentiment-analysis, text-generation..
Apache-2
- textvec (🥉13 · ⭐ 190 · 💀) - Text vectorization tool to outperform TFIDF for classification..
MIT
- spacy-dbpedia-spotlight (🥉12 · ⭐ 93 · 💤) - A spaCy wrapper for DBpedia Spotlight.
MIT
spacy
Image Data
Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.
torchvision (🥇42 · ⭐ 15K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3

PyTorch Image Models (🥇41 · ⭐ 28K) - PyTorch image models, scripts, pretrained weights --.. Apache-2

MMDetection (🥇39 · ⭐ 26K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2

InsightFace (🥈35 · ⭐ 19K) - State-of-the-art 2D and 3D Face Analysis Project. MIT

Albumentations (🥈35 · ⭐ 13K) - Fast image augmentation library and an easy-to-use wrapper.. MIT

detectron2 (🥈33 · ⭐ 27K) - Detectron2 is a platform for object detection, segmentation.. Apache-2

opencv-python (🥈33 · ⭐ 3.9K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
vit-pytorch (🥈30 · ⭐ 16K) - Implementation of Vision Transformer, a simple way to achieve.. MIT

imageai (🥈30 · ⭐ 8.2K) - A python library built to empower developers to build applications and.. MIT
Face Alignment (🥈30 · ⭐ 6.6K) - 2D and 3D Face alignment library build using pytorch. BSD-3

PaddleDetection (🥉29 · ⭐ 12K) - Object Detection toolkit based on PaddlePaddle. It.. Apache-2

sahi (🥉29 · ⭐ 3.2K) - Framework agnostic sliced/tiled inference + interactive ui + error analysis.. MIT
CellProfiler (🥉29 · ⭐ 820) - An open-source application for biological image analysis. BSD-3
vidgear (🥉28 · ⭐ 3K) - A High-performance cross-platform Video Processing Python framework.. Apache-2
facenet-pytorch (🥉26 · ⭐ 3.9K) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT

Image Deduplicator (🥉25 · ⭐ 4.8K · 💤) - Finding duplicate images made easy!. Apache-2

pytorchvideo (🥉25 · ⭐ 3.1K) - A deep learning library for video understanding research. Apache-2

tensorflow-graphics (🥉23 · ⭐ 2.7K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2

DE⫶TR (🥉22 · ⭐ 12K · 💤) - End-to-End Object Detection with Transformers. Apache-2

-
GitHub (👨💻 26 · 🔀 2.1K · 📦 21 · 📋 520 - 45% open · ⏱️ 07.02.2023):
git clone https://github.com/facebookresearch/detr
Classy Vision (🥉21 · ⭐ 1.6K · 💤) - An end-to-end PyTorch framework for image and video.. MIT

PySlowFast (🥉20 · ⭐ 6K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2

image-match (🥉20 · ⭐ 2.9K · 💤) - Quickly search over billions of images. Apache-2
scenic (🥉18 · ⭐ 2.6K) - Scenic: A Jax Library for Computer Vision Research and Beyond. Apache-2

-
GitHub (👨💻 74 · 🔀 350 · 📋 260 - 58% open · ⏱️ 15.11.2023):
git clone https://github.com/google-research/scenic
Caer (🥉18 · ⭐ 730 · 💤) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
Show 19 hidden projects...
- scikit-image (🥇42 · ⭐ 5.7K) - Image processing in Python.
❗Unlicensed
- glfw (🥈36 · ⭐ 11K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input.
❗️Zlib
- imgaug (🥈35 · ⭐ 14K · 💀) - Image augmentation for machine learning experiments.
MIT
- Face Recognition (🥈33 · ⭐ 50K · 💀) - The worlds simplest facial recognition api for Python..
MIT
- imutils (🥈31 · ⭐ 4.4K · 💀) - A series of convenience functions to make basic image processing..
MIT
- PyTorch3D (🥈30 · ⭐ 7.9K) - PyTorch3D is FAIRs library of reusable components for..
❗Unlicensed
- layout-parser (🥉27 · ⭐ 4.1K · 💀) - A Unified Toolkit for Deep Learning Based Document Image..
Apache-2
- chainercv (🥉27 · ⭐ 1.5K · 💀) - ChainerCV: a Library for Deep Learning in Computer Vision.
MIT
- mtcnn (🥉26 · ⭐ 2.1K · 💀) - MTCNN face detection implementation for TensorFlow, as a PIP..
MIT
- Pillow-SIMD (🥉26 · ⭐ 2K) - The friendly PIL fork.
❗️PIL
- segmentation_models (🥉24 · ⭐ 4.5K · 💀) - Segmentation models with pretrained backbones. Keras..
MIT
- Image Super-Resolution (🥉23 · ⭐ 4.4K · 💀) - Super-scale your images and run experiments with..
Apache-2
- Luminoth (🥉23 · ⭐ 2.4K · 💀) - Deep Learning toolkit for Computer Vision.
BSD-3
- deep-daze (🥉22 · ⭐ 4.4K · 💀) - Simple command line tool for text to image generation using..
MIT
- nude.py (🥉21 · ⭐ 920 · 💀) - Nudity detection with Python.
MIT
- detecto (🥉20 · ⭐ 600 · 💀) - Build fully-functioning computer vision models with PyTorch.
MIT
- solt (🥉18 · ⭐ 260 · 💀) - Streaming over lightweight data transformations.
MIT
- Torch Points 3D (🥉16 · ⭐ 170 · 💀) - Pytorch framework for doing deep learning on point..
BSD-3
- HugsVision (🥉15 · ⭐ 180 · 💤) - HugsVision is a easy to use huggingface wrapper for state-of-..
MIT
huggingface
Graph Data
Libraries for graph processing, clustering, embedding, and machine learning tasks.
dgl (🥇39 · ⭐ 13K) - Python package built to ease deep learning on graph, on top of existing DL.. Apache-2
PyTorch Geometric (🥇38 · ⭐ 19K) - Graph Neural Network Library for PyTorch. MIT

pygraphistry (🥈30 · ⭐ 1.9K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3

ogb (🥈28 · ⭐ 1.8K) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
Paddle Graph Learning (🥈26 · ⭐ 1.5K) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2

PyKEEN (🥈26 · ⭐ 1.4K) - A Python library for learning and evaluating knowledge graph embeddings. MIT
pytorch_geometric_temporal (🥈25 · ⭐ 2.3K) - PyTorch Geometric Temporal: Spatiotemporal Signal.. MIT

AmpliGraph (🥈25 · ⭐ 2K) - Python library for Representation Learning on Knowledge.. Apache-2

PyTorch-BigGraph (🥈24 · ⭐ 3.3K) - Generate embeddings from large-scale graph-structured.. BSD-3

torch-cluster (🥈24 · ⭐ 720) - PyTorch Extension Library of Optimized Graph Cluster.. MIT

graph-nets (🥉22 · ⭐ 5.3K · 💤) - Build Graph Nets in Tensorflow. Apache-2

GraphGym (🥉19 · ⭐ 1.5K · 💤) - Platform for designing and evaluating Graph Neural Networks (GNN). MIT
Show 17 hidden projects...
- igraph (🥇34 · ⭐ 1.2K · 📈) - Python interface for igraph.
❗️GPL-2.0
- pygal (🥈30 · ⭐ 2.6K) - PYthon svg GrAph plotting Library.
❗️LGPL-3.0
- StellarGraph (🥈27 · ⭐ 2.8K · 💀) - StellarGraph - Machine Learning on Graphs.
Apache-2
- Karate Club (🥈24 · ⭐ 2K) - Karate Club: An API Oriented Open-source Python Framework for..
❗️GPL-3.0
- DIG (🥉22 · ⭐ 1.7K) - A library for graph deep learning research.
❗️GPL-3.0
- graph4nlp (🥉21 · ⭐ 1.6K · 💀) - Graph4nlp is the library for the easy use of Graph..
Apache-2
- jraph (🥉21 · ⭐ 1.3K · 💀) - A Graph Neural Network Library in Jax.
Apache-2
- DeepWalk (🥉20 · ⭐ 2.6K · 💀) - DeepWalk - Deep Learning for Graphs.
❗️GPL-3.0
- pyRDF2Vec (🥉20 · ⭐ 230) - Python Implementation and Extension of RDF2Vec.
MIT
- DeepGraph (🥉17 · ⭐ 280) - Analyze Data with Pandas-based Networks. Documentation:.
BSD-3
- OpenKE (🥉16 · ⭐ 3.6K · 💀) - An Open-Source Package for Knowledge Embedding (KE).
MIT
- GraphEmbedding (🥉16 · ⭐ 3.5K · 💀) - Implementation and experiments of graph embedding..
MIT
- Euler (🥉15 · ⭐ 2.9K · 💀) - A distributed graph deep learning framework.
Apache-2
- ptgnn (🥉15 · ⭐ 370 · 💀) - A PyTorch Graph Neural Network Library.
MIT
- GraphSAGE (🥉14 · ⭐ 3.2K · 💀) - Representation learning on large graphs using stochastic..
MIT
- OpenNE (🥉14 · ⭐ 1.7K · 💀) - An Open-Source Package for Network Embedding (NE).
MIT
- GraphVite (🥉13 · ⭐ 1.2K · 💀) - GraphVite: A General and High-performance Graph Embedding..
Apache-2
Audio Data
Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.
speechbrain (🥇37 · ⭐ 6.9K) - A PyTorch-based Speech Toolkit. Apache-2

torchaudio (🥈35 · ⭐ 2.3K) - Data manipulation and transformation for audio signal.. BSD-2

SpeechRecognition (🥈34 · ⭐ 7.7K · 📈) - Speech recognition module for Python, supporting.. BSD-3
audioread (🥈31 · ⭐ 460) - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.. MIT
audiomentations (🥈30 · ⭐ 1.6K) - A Python library for audio data augmentation. Inspired by.. MIT
pyAudioAnalysis (🥈29 · ⭐ 5.5K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
tinytag (🥉25 · ⭐ 630) - Read audio and music meta data and duration of MP3, OGG, OPUS, MP4, M4A,.. MIT
python-soundfile (🥉25 · ⭐ 610) - SoundFile is an audio library based on libsndfile, CFFI, and.. BSD-3
DeepSpeech (🥉22 · ⭐ 23K · 📉) - DeepSpeech is an open source embedded (offline, on-.. MPL-2.0

textlesslib (🥉10 · ⭐ 470) - Library for Textless Spoken Language Processing. MIT

-
GitHub (👨💻 8 · 🔀 48 · 📋 17 - 47% open · ⏱️ 29.08.2023):
git clone https://github.com/facebookresearch/textlesslib
Show 9 hidden projects...
- TTS (🥉27 · ⭐ 8.3K · 💀) - Deep learning for Text to Speech (Discussion forum:..
MPL-2.0
- aubio (🥉27 · ⭐ 3.1K · 💀) - a library for audio and music analysis.
❗️GPL-3.0
- Essentia (🥉27 · ⭐ 2.6K) - C++ library for audio and music analysis, description and..
❗️AGPL-3.0
- python_speech_features (🥉25 · ⭐ 2.3K · 💀) - This library provides common speech features for ASR..
MIT
- kapre (🥉23 · ⭐ 900 · 💀) - kapre: Keras Audio Preprocessors.
MIT
- Dejavu (🥉22 · ⭐ 6.2K · 💀) - Audio fingerprinting and recognition in Python.
MIT
- TimeSide (🥉21 · ⭐ 360 · 💤) - scalable audio processing framework and server written in..
❗️AGPL-3.0
- Julius (🥉20 · ⭐ 380 · 💀) - Fast PyTorch based DSP for audio and 1D signals.
MIT
- Muda (🥉17 · ⭐ 220 · 💀) - A library for augmenting annotated audio data.
ISC
Geospatial Data
Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.
pydeck (🥇42 · ⭐ 11K) - WebGL2 powered visualization framework. MIT

-
GitHub (👨💻 240 · 🔀 2.1K · 📦 6.8K · 📋 2.8K - 9% open · ⏱️ 01.12.2023):
git clone https://github.com/visgl/deck.gl
-
PyPi (📥 1.9M / month · 📦 52 · ⏱️ 15.04.2023):
pip install pydeck
-
Conda (📥 460K · ⏱️ 16.06.2023):
conda install -c conda-forge pydeck
-
npm (📥 440K / month · 📦 480 · ⏱️ 05.12.2023):
npm install deck.gl
ipyleaflet (🥈33 · ⭐ 1.4K) - A Jupyter - Leaflet.js bridge. MIT

-
GitHub (👨💻 86 · 🔀 360 · 📦 6.3K · 📋 600 - 42% open · ⏱️ 27.11.2023):
git clone https://github.com/jupyter-widgets/ipyleaflet
-
PyPi (📥 140K / month · 📦 160 · ⏱️ 27.11.2023):
pip install ipyleaflet
-
Conda (📥 1.1M · ⏱️ 27.11.2023):
conda install -c conda-forge ipyleaflet
-
npm (📥 55K / month · 📦 5 · ⏱️ 27.11.2023):
npm install jupyter-leaflet
ArcGIS API (🥉32 · ⭐ 1.7K) - Documentation and samples for ArcGIS API for Python. Apache-2
EarthPy (🥉26 · ⭐ 470) - A package built to support working with spatial data using open source.. BSD-3
pymap3d (🥉25 · ⭐ 350) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
Show 7 hidden projects...
- Geocoder (🥉32 · ⭐ 1.6K · 💀) - Python Geocoder.
MIT
- Satpy (🥉31 · ⭐ 970) - Python package for earth-observing satellite data processing.
❗️GPL-3.0
- Sentinelsat (🥉26 · ⭐ 940) - Search and download Copernicus Sentinel satellite images.
❗️GPL-3.0
- gmaps (🥉24 · ⭐ 760 · 💀) - Google maps for Jupyter notebooks.
BSD-3
- Mapbox GL (🥉24 · ⭐ 640 · 💀) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook.
MIT
- prettymaps (🥉22 · ⭐ 11K · 💤) - A small set of Python functions to draw pretty maps from..
❗️AGPL-3.0
- geoplotlib (🥉21 · ⭐ 1K · 💀) - python toolbox for visualizing geographical data and making maps.
MIT
Financial Data
Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.
Alpha Vantage (🥈28 · ⭐ 4.1K) - A python wrapper for Alpha Vantage API for financial data. MIT
stockstats (🥉26 · ⭐ 1.2K) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
PyAlgoTrade (🥉25 · ⭐ 4.2K · 💤) - Python Algorithmic Trading Library. Apache-2
FinQuant (🥉24 · ⭐ 1.1K) - A program for financial portfolio management, analysis and optimisation. MIT
tf-quant-finance (🥉21 · ⭐ 4.1K) - High-performance TensorFlow library for quantitative.. Apache-2

finmarketpy (🥉21 · ⭐ 3.2K · 💤) - Python library for backtesting trading strategies &.. Apache-2
Show 13 hidden projects...
- zipline (🥇32 · ⭐ 17K · 💀) - Zipline, a Pythonic Algorithmic Trading Library.
Apache-2
- pyfolio (🥇32 · ⭐ 5.2K · 💀) - Portfolio and risk analytics in Python.
Apache-2
- arch (🥈31 · ⭐ 1.2K) - ARCH models in Python.
❗Unlicensed
- backtrader (🥈29 · ⭐ 12K · 💤) - Python Backtesting library for trading strategies.
❗️GPL-3.0
- empyrical (🥈28 · ⭐ 1.2K · 💀) - Common financial risk and performance metrics. Used by..
Apache-2
- Alphalens (🥉26 · ⭐ 2.9K · 💀) - Performance analysis of predictive (alpha) stock factors.
Apache-2
- TensorTrade (🥉25 · ⭐ 4.3K · 💀) - An open source reinforcement learning framework for..
Apache-2
- Enigma Catalyst (🥉25 · ⭐ 2.4K · 💀) - An Algorithmic Trading Library for Crypto-Assets in..
Apache-2
- Crypto Signals (🥉23 · ⭐ 4.6K · 💀) - Github.com/CryptoSignal - Trading & Technical Analysis Bot -..
MIT
- FinTA (🥉23 · ⭐ 2K · 💀) - Common financial technical indicators implemented in Pandas.
❗️LGPL-3.0
- Backtesting.py (🥉22 · ⭐ 4.3K · 💤) - Backtest trading strategies in Python.
❗️AGPL-3.0
- surpriver (🥉12 · ⭐ 1.7K · 💀) - Find big moving stocks before they move using machine..
❗️GPL-3.0
- pyrtfolio (🥉9 · ⭐ 140 · 💀) - Python package to generate stock portfolios.
❗️GPL-3.0
Time Series Data
Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.
Prophet (🥇36 · ⭐ 17K) - Tool for producing high quality forecasts for time series data that has.. MIT
StatsForecast (🥇33 · ⭐ 3.2K) - Lightning fast forecasting with statistical and econometric.. Apache-2
pmdarima (🥇33 · ⭐ 1.5K) - A statistical library designed to fill the void in Pythons time series.. MIT
Darts (🥈31 · ⭐ 6.7K) - A python library for user-friendly forecasting and anomaly detection.. Apache-2
-
GitHub (👨💻 110 · 🔀 740 · 📦 440 · 📋 1.2K - 18% open · ⏱️ 01.12.2023):
git clone https://github.com/unit8co/darts
-
PyPi (📥 14K / month · 📦 5 · ⏱️ 18.11.2023):
pip install u8darts
-
Conda (📥 38K · ⏱️ 19.11.2023):
conda install -c conda-forge u8darts-all
-
Docker Hub (📥 450 · ⏱️ 18.11.2023):
docker pull unit8/darts
NeuralProphet (🥈31 · ⭐ 3.4K) - NeuralProphet: A simple forecasting package. MIT

STUMPY (🥈31 · ⭐ 2.8K) - STUMPY is a powerful and scalable Python library for modern time series.. BSD-3
pytorch-forecasting (🥈30 · ⭐ 3.4K) - Time series forecasting with PyTorch. MIT
NeuralForecast (🥈29 · ⭐ 2K) - Scalable and user friendly neural forecasting algorithms. Apache-2
skforecast (🥉28 · ⭐ 740) - Time series forecasting with scikit-learn models. BSD-3

uber/orbit (🥉22 · ⭐ 1.8K) - A Python package for Bayesian forecasting with object-oriented.. Apache-2
tick (🥉22 · ⭐ 440 · 💤) - Module for statistical learning, with a particular emphasis on time-.. BSD-3
Show 7 hidden projects...
- PyFlux (🥉25 · ⭐ 2.1K · 💀) - Open source time series library for Python.
BSD-3
- seglearn (🥉21 · ⭐ 560 · 💀) - Python module for machine learning time series:.
BSD-3
- ADTK (🥉19 · ⭐ 980 · 💀) - A Python toolkit for rule-based/unsupervised anomaly detection in time..
MPL-2.0
- matrixprofile-ts (🥉19 · ⭐ 720 · 💀) - A Python library for detecting patterns and anomalies..
Apache-2
- atspy (🥉15 · ⭐ 500 · 💀) - AtsPy: Automated Time Series Models in Python (by @firmai).
MIT
- tsaug (🥉13 · ⭐ 320 · 💀) - A Python package for time series augmentation.
Apache-2
- tslumen (🥉8 · ⭐ 54 · 💀) - A library for Time Series EDA (exploratory data analysis).
Apache-2
Medical Data
Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.
MNE (🥇38 · ⭐ 2.4K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
DeepVariant (🥉27 · ⭐ 3K) - DeepVariant is an analysis pipeline that uses a deep neural.. BSD-3

Brainiak (🥉19 · ⭐ 320) - Brain Imaging Analysis Kit. Apache-2
-
GitHub (👨💻 35 · 🔀 130 · 📋 210 - 38% open · ⏱️ 27.11.2023):
git clone https://github.com/brainiak/brainiak
-
PyPi (📥 130 / month · 📦 1 · ⏱️ 15.10.2020):
pip install brainiak
-
Docker Hub (📥 1.8K · ⭐ 1 · ⏱️ 15.10.2020):
docker pull brainiak/brainiak
Show 8 hidden projects...
- NiftyNet (🥉24 · ⭐ 1.3K · 💀) - [unmaintained] An open-source convolutional neural..
Apache-2
- MedPy (🥉22 · ⭐ 520) - Medical image processing in Python.
❗️GPL-3.0
- DLTK (🥉20 · ⭐ 1.4K · 💀) - Deep Learning Toolkit for Medical Image Analysis.
Apache-2
- Glow (🥉20 · ⭐ 250 · 💤) - An open-source toolkit for large-scale genomic analysis.
Apache-2
- Medical Detection Toolkit (🥉15 · ⭐ 1.2K · 💀) - The Medical Detection Toolkit contains 2D + 3D..
Apache-2
- MedicalTorch (🥉15 · ⭐ 820 · 💀) - A medical imaging framework for Pytorch.
Apache-2
- DeepNeuro (🥉13 · ⭐ 120 · 💀) - A deep learning python package for neuroimaging data. Made by:.
MIT
- MedicalNet (🥉12 · ⭐ 1.7K · 💀) - Many studies have shown that the performance on deep learning is..
MIT
Tabular Data
Libraries for processing tabular and structured data.
carefree-learn (🥈21 · ⭐ 400) - Deep Learning PyTorch. MIT

pytorch_tabular (🥉20 · ⭐ 1.1K) - A standard framework for modelling Deep Learning Models.. MIT

Show 3 hidden projects...
Optical Character Recognition
Libraries for optical character recognition (OCR) and text extraction from images or videos.
EasyOCR (🥇35 · ⭐ 20K) - Ready-to-use OCR with 80+ supported languages and all popular writing.. Apache-2
OCRmyPDF (🥈34 · ⭐ 11K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them.. MPL-2.0
Tesseract (🥈33 · ⭐ 5.2K) - Python-tesseract is an optical character recognition (OCR) tool.. Apache-2
attention-ocr (🥉21 · ⭐ 960) - A Tensorflow model for text recognition (CNN + seq2seq with.. MIT

Show 3 hidden projects...
Data Containers & Structures
General-purpose data containers & structures as well as utilities & extensions for pandas.
🔗 best-of-python - Data Containers ( ⭐ 3.2K) - Collection of data-container, dataframe, and pandas-..
Data Loading & Extraction
Libraries for loading, collecting, and extracting data from a variety of data sources and formats.
🔗 best-of-python - Data Extraction ( ⭐ 3.2K) - Collection of data-loading and -extraction libraries.
Web Scraping & Crawling
Libraries for web scraping, crawling, downloading, and mining as well as libraries.
🔗 best-of-web-python - Web Scraping ( ⭐ 2.1K) - Collection of web-scraping and crawling libraries.
Data Pipelines & Streaming
Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.
🔗 best-of-python - Data Pipelines ( ⭐ 3.2K) - Libraries for data batch- and stream-processing,..
Distributed Machine Learning
Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.
Ray (🥇45 · ⭐ 29K) - Ray is a unified framework for scaling AI and Python applications. Ray.. Apache-2
DeepSpeed (🥇40 · ⭐ 30K · 📉) - DeepSpeed is a deep learning optimization library that.. Apache-2

-
GitHub (👨💻 270 · 🔀 3.6K · 📦 5K · 📋 2.3K - 37% open · ⏱️ 05.12.2023):
git clone https://github.com/microsoft/DeepSpeed
-
PyPi (📥 400K / month · 📦 87 · ⏱️ 01.12.2023):
pip install deepspeed
-
Docker Hub (📥 17K · ⭐ 4 · ⏱️ 02.09.2022):
docker pull deepspeed/deepspeed
dask.distributed (🥇40 · ⭐ 1.5K) - A distributed task scheduler for Dask. BSD-3
H2O-3 (🥈36 · ⭐ 6.6K) - H2O is an Open Source, Distributed, Fast & Scalable Machine Learning.. Apache-2
BigDL (🥈36 · ⭐ 4.6K) - Accelerate LLM with low-bit (FP4 / INT4 / FP8 / INT8) optimizations.. Apache-2
-
GitHub (👨💻 190 · 🔀 1.2K · 📥 57 · 📦 46 · 📋 2.2K - 40% open · ⏱️ 07.12.2023):
git clone https://github.com/intel-analytics/BigDL
-
PyPi (📥 5.7K / month · 📦 1 · ⏱️ 07.12.2023):
pip install bigdl
-
Maven (📦 4 · ⏱️ 20.04.2021):
<dependency> <groupId>com.intel.analytics.bigdl</groupId> <artifactId>bigdl-SPARK_2.4</artifactId> <version>[VERSION]</version> </dependency>
horovod (🥈35 · ⭐ 14K) - Distributed training framework for TensorFlow, Keras, PyTorch, and.. Apache-2
ColossalAI (🥈33 · ⭐ 36K) - Making large AI models cheaper, faster and more accessible. Apache-2
-
GitHub (👨💻 160 · 🔀 4K · 📦 280 · 📋 1.5K - 21% open · ⏱️ 07.12.2023):
git clone https://github.com/hpcaitech/colossalai
TensorFlowOnSpark (🥉27 · ⭐ 3.9K · 💤) - TensorFlowOnSpark brings TensorFlow programs to.. Apache-2


petastorm (🥉26 · ⭐ 1.7K) - Petastorm library enables single machine or distributed training.. Apache-2
Apache Singa (🥉23 · ⭐ 3K) - a distributed deep learning platform. Apache-2
-
GitHub (👨💻 85 · 🔀 1K · 📦 2 · 📋 120 - 42% open · ⏱️ 05.11.2023):
git clone https://github.com/apache/singa
-
Conda (📥 730 · ⏱️ 16.06.2023):
conda install -c nusdbsystem singa
-
Docker Hub (📥 4.8K · ⭐ 4 · ⏱️ 31.05.2022):
docker pull apache/singa
analytics-zoo (🥉23 · ⭐ 2.6K) - Distributed Tensorflow, Keras and PyTorch on Apache.. Apache-2

mesh-transformer-jax (🥉18 · ⭐ 6.2K · 💤) - Model parallel transformers in JAX and Haiku. Apache-2

-
GitHub (👨💻 23 · 🔀 880 · 📦 20 · 📋 200 - 22% open · ⏱️ 12.01.2023):
git clone https://github.com/kingoflolz/mesh-transformer-jax
moolib (🥉10 · ⭐ 360 · 💤) - A library for distributed ML training with PyTorch. MIT

-
GitHub (👨💻 6 · 🔀 20 · 📦 3 · 📋 19 - 36% open · ⏱️ 12.12.2022):
git clone https://github.com/facebookresearch/moolib
Show 12 hidden projects...
- DEAP (🥈33 · ⭐ 5.4K) - Distributed Evolutionary Algorithms in Python.
❗️LGPL-3.0
- ipyparallel (🥈29 · ⭐ 2.5K) - IPython Parallel: Interactive Parallel Computing in..
❗Unlicensed
- Elephas (🥈29 · ⭐ 1.6K · 💀) - Distributed Deep learning with Keras & Spark.
MIT
keras
- BytePS (🥉21 · ⭐ 3.5K · 💀) - A high performance and generic framework for distributed DNN..
Apache-2
- somoclu (🥉21 · ⭐ 260 · 💀) - Massively parallel self-organizing maps: accelerate training on..
MIT
- TensorFrames (🥉20 · ⭐ 750 · 💀) - [DEPRECATED] Tensorflow wrapper for DataFrames on..
Apache-2
- sk-dist (🥉20 · ⭐ 290 · 💤) - Distributed scikit-learn meta-estimators in PySpark.
Apache-2
- bluefog (🥉18 · ⭐ 290 · 💤) - Distributed and decentralized training framework for PyTorch..
Apache-2
- Fiber (🥉17 · ⭐ 1K · 💀) - Distributed Computing for AI Made Simple.
Apache-2
- parallelformers (🥉16 · ⭐ 730 · 💀) - Parallelformers: An Efficient Model Parallelization..
Apache-2
- LazyCluster (🥉13 · ⭐ 49 · 💀) - Distributed machine learning made simple.
Apache-2
- autodist (🥉11 · ⭐ 130 · 💀) - Simple Distributed Deep Learning on TensorFlow.
Apache-2
Hyperparameter Optimization & AutoML
Libraries for hyperparameter optimization, automl and neural architecture search.
Keras Tuner (🥇33 · ⭐ 2.8K) - A Hyperparameter Tuning Library for Keras. Apache-2

featuretools (🥈32 · ⭐ 6.9K) - An open source python library for automated feature engineering. BSD-3
auto-sklearn (🥈31 · ⭐ 7.2K · 💤) - Automated Machine Learning with scikit-learn. BSD-3

Bayesian Optimization (🥈31 · ⭐ 7.2K) - A Python implementation of global optimization with.. MIT
AutoGluon (🥈29 · ⭐ 6.5K) - AutoGluon: AutoML for Image, Text, Time Series, and.. Apache-2


-
GitHub (👨💻 110 · 🔀 800 · 📦 2 · 📋 1.1K - 23% open · ⏱️ 01.12.2023):
git clone https://github.com/autogluon/autogluon
-
PyPi (📥 57K / month · 📦 7 · ⏱️ 07.12.2023):
pip install autogluon
-
Docker Hub (📥 5K · ⭐ 17 · ⏱️ 26.07.2023):
docker pull autogluon/autogluon
mljar-supervised (🥈29 · ⭐ 2.8K) - Python package for AutoML on Tabular Data with Feature.. MIT
featurewiz (🥈24 · ⭐ 500) - Use advanced feature engineering strategies and select best.. Apache-2
Hyperactive (🥉23 · ⭐ 470) - An optimization and data collection toolbox for convenient and fast.. MIT
igel (🥉20 · ⭐ 3.1K · 💤) - a delightful machine learning tool that allows you to train, test, and.. MIT
AlphaPy (🥉19 · ⭐ 980) - Automated Machine Learning [AutoML] with Python, scikit-learn, Keras,.. Apache-2
Auto ViML (🥉19 · ⭐ 470) - Automatically Build Multiple ML Models with a Single Line of Code... Apache-2
opytimizer (🥉18 · ⭐ 580) - Opytimizer is a Python library consisting of meta-heuristic.. Apache-2
Show 31 hidden projects...
- Optuna (🥇39 · ⭐ 9K) - A hyperparameter optimization framework.
❗Unlicensed
- TPOT (🥇33 · ⭐ 9.3K) - A Python Automated Machine Learning tool that optimizes machine..
❗️LGPL-3.0
- scikit-optimize (🥇33 · ⭐ 2.7K · 💀) - Sequential model-based optimization with a..
BSD-3
- SMAC3 (🥈27 · ⭐ 940) - SMAC3: A Versatile Bayesian Optimization Package for..
❗️BSD-1-Clause
- lazypredict (🥈26 · ⭐ 2.3K · 💀) - Lazy Predict help build a lot of basic models without much..
MIT
- Talos (🥈25 · ⭐ 1.6K · 💀) - Hyperparameter Optimization for TensorFlow, Keras and PyTorch.
MIT
- AdaNet (🥈24 · ⭐ 3.5K · 💀) - Fast and flexible AutoML with learning guarantees.
Apache-2
- Orion (🥈24 · ⭐ 270) - Asynchronous Distributed Hyperparameter Optimization.
BSD-3
- auto_ml (🥉22 · ⭐ 1.6K · 💀) - [UNMAINTAINED] Automated machine learning for analytics & production.
MIT
- Test Tube (🥉22 · ⭐ 730 · 💀) - Python library to easily log experiments and parallelize..
MIT
- HpBandSter (🥉22 · ⭐ 590 · 💀) - a distributed Hyperband implementation on Steroids.
BSD-3
- optunity (🥉22 · ⭐ 410 · 💀) - optimization routines for hyperparameter tuning.
BSD-3
- lightwood (🥉22 · ⭐ 390) - Lightwood is Legos for Machine Learning.
❗️GPL-3.0
- MLBox (🥉21 · ⭐ 1.5K · 💀) - MLBox is a powerful Automated Machine Learning python library.
❗️BSD-1-Clause
- Neuraxle (🥉21 · ⭐ 590 · 💀) - The worlds cleanest AutoML library - Do hyperparameter tuning..
Apache-2
- Dragonfly (🥉20 · ⭐ 810 · 💀) - An open source python library for scalable Bayesian optimisation.
MIT
- sklearn-deap (🥉20 · ⭐ 740 · 💀) - Use evolutionary algorithms instead of gridsearch in..
MIT
- Auto Tune Models (🥉18 · ⭐ 520 · 💀) - Auto Tune Models - A multi-tenant, multi-data system for..
MIT
- shap-hypetune (🥉18 · ⭐ 500 · 💀) - A python package for simultaneous Hyperparameters Tuning and..
MIT
- Sherpa (🥉18 · ⭐ 320 · 💀) - Hyperparameter optimization that enables researchers to..
❗️GPL-3.0
- Advisor (🥉17 · ⭐ 1.5K · 💀) - Open-source implementation of Google Vizier for hyper parameters..
Apache-2
- automl-gs (🥉16 · ⭐ 1.8K · 💀) - Provide an input CSV and a target field to predict, generate a..
MIT
- Xcessiv (🥉16 · ⭐ 1.3K · 💀) - A web-based application for quick, scalable, and automated..
Apache-2
- HyperparameterHunter (🥉16 · ⭐ 700 · 💀) - Easy hyperparameter optimization and automatic result..
MIT
- Parfit (🥉16 · ⭐ 200 · 💀) - A package for parallelizing the fit and flexibly scoring of..
MIT
- ENAS (🥉13 · ⭐ 2.7K · 💀) - PyTorch implementation of Efficient Neural Architecture Search via..
Apache-2
- Auptimizer (🥉12 · ⭐ 200 · 💀) - An automatic ML model optimization tool.
❗️GPL-3.0
- model_search (🥉11 · ⭐ 3.3K · 💀) - AutoML algorithms for model architecture search at scale.
Apache-2
- Devol (🥉11 · ⭐ 950 · 💀) - Genetic neural architecture search with Keras.
MIT
- Hypermax (🥉11 · ⭐ 110) - Better, faster hyper-parameter optimization.
BSD-3
- Hypertunity (🥉9 · ⭐ 140 · 💀) - A toolset for black-box hyperparameter optimisation.
Apache-2
Reinforcement Learning
Libraries for building and evaluating reinforcement learning & agent-based systems.
OpenAI Gym (🥇41 · ⭐ 33K · 💤) - A toolkit for developing and comparing reinforcement learning.. MIT
TensorLayer (🥈28 · ⭐ 7.3K · 💤) - Deep Learning and Reinforcement Learning Library for.. Apache-2

TensorForce (🥉26 · ⭐ 3.3K) - Tensorforce: a TensorFlow library for applied.. Apache-2

rliable (🥉14 · ⭐ 650 · 💤) - [NeurIPS21 Outstanding Paper] Library for reliable evaluation on.. Apache-2
Show 8 hidden projects...
- baselines (🥈30 · ⭐ 15K · 💀) - OpenAI Baselines: high-quality implementations of reinforcement..
MIT
- keras-rl (🥈28 · ⭐ 5.5K · 💀) - Deep Reinforcement Learning for Keras.
MIT
- Stable Baselines (🥉24 · ⭐ 4K · 💀) - A fork of OpenAI Baselines, implementations of..
MIT
- ChainerRL (🥉24 · ⭐ 1.1K · 💀) - ChainerRL is a deep reinforcement learning library built on top of..
MIT
- TRFL (🥉22 · ⭐ 3.1K · 💀) - TensorFlow Reinforcement Learning.
Apache-2
- SerpentAI (🥉18 · ⭐ 6.6K · 💀) - Game Agent Framework. Helping you create AIs / Bots that learn to..
MIT
- DeepMind Lab (🥉17 · ⭐ 6.9K · 💤) - A customisable 3D platform for agent-based AI research.
❗Unlicensed
- Maze (🥉12 · ⭐ 250 · 💀) - Maze Applied Reinforcement Learning Framework.
❗️Custom
Recommender Systems
Libraries for building and evaluating recommendation systems.
Recommenders (🥇35 · ⭐ 17K) - Best Practices on Recommendation Systems. MIT
TF Recommenders (🥈29 · ⭐ 1.7K) - TensorFlow Recommenders is a library for building.. Apache-2

scikit-surprise (🥈28 · ⭐ 6.1K · 💤) - A Python scikit for building and analyzing recommender.. BSD-3
TF Ranking (🥉27 · ⭐ 2.7K) - Learning to Rank in TensorFlow. Apache-2

recmetrics (🥉21 · ⭐ 540) - A library of metrics for evaluating recommender systems. MIT
Show 7 hidden projects...
- lkpy (🥉24 · ⭐ 250) - Python recommendation toolkit.
MIT
- tensorrec (🥉21 · ⭐ 1.2K · 💀) - A TensorFlow recommendation algorithm and framework in..
Apache-2
- fastFM (🥉21 · ⭐ 1.1K · 💀) - fastFM: A Library for Factorization Machines.
BSD-3
- Spotlight (🥉18 · ⭐ 2.9K · 💀) - Deep recommender models using PyTorch.
MIT
- Case Recommender (🥉18 · ⭐ 450 · 💀) - Case Recommender: A Flexible and Extensible Python..
MIT
- Collie (🥉17 · ⭐ 100 · 💤) - A library for preparing, training, and evaluating scalable deep..
BSD-3
- OpenRec (🥉16 · ⭐ 400 · 💀) - OpenRec is an open-source and modular library for neural network-..
Apache-2
Privacy Machine Learning
Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.
TensorFlow Privacy (🥈27 · ⭐ 1.8K) - Library for training machine learning models with.. Apache-2

TFEncrypted (🥉25 · ⭐ 1.2K · 💤) - A Framework for Encrypted Machine Learning in.. Apache-2

Show 1 hidden projects...
- PipelineDP (🥉21 · ⭐ 260) - PipelineDP is a Python framework for applying differentially..
Apache-2
Workflow & Experiment Tracking
Libraries to organize, track, and visualize machine learning experiments.
Tensorboard (🥇43 · ⭐ 6.4K) - TensorFlows Visualization Toolkit. Apache-2

wandb client (🥇41 · ⭐ 7.6K) - A tool for visualizing and tracking your machine learning.. MIT
SageMaker SDK (🥈38 · ⭐ 2K) - A library for training and deploying machine learning.. Apache-2


AzureML SDK (🥈34 · ⭐ 3.8K) - Python notebooks with ML and deep learning examples with Azure.. MIT
tensorboardX (🥈33 · ⭐ 7.7K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
ClearML (🥈33 · ⭐ 4.9K) - ClearML - Auto-Magical CI/CD to streamline your ML workflow... Apache-2
-
GitHub (👨💻 88 · 🔀 600 · 📥 1.8K · 📦 830 · 📋 920 - 43% open · ⏱️ 07.12.2023):
git clone https://github.com/allegroai/clearml
-
PyPi (📥 230K / month · 📦 16 · ⏱️ 08.11.2023):
pip install clearml
-
Docker Hub (📥 30K · ⏱️ 05.10.2020):
docker pull allegroai/trains
Neptune.ai (🥈31 · ⭐ 430) - The MLOps stack component for experiment tracking. Apache-2
Labml (🥉27 · ⭐ 1.7K) - Monitor deep learning model training and hardware usage from your mobile.. MIT
ml-metadata (🥉27 · ⭐ 570) - For recording and retrieving metadata associated with ML.. Apache-2
TensorWatch (🥉20 · ⭐ 3.3K) - Debugging, monitoring and visualization for Python Machine Learning.. MIT
Show 16 hidden projects...
- Catalyst (🥈29 · ⭐ 3.2K · 💀) - Accelerated deep learning R&D.
Apache-2
- knockknock (🥉25 · ⭐ 2.7K · 💀) - Knock Knock: Get notified when your training ends with only two..
MIT
- livelossplot (🥉25 · ⭐ 1.3K · 💀) - Live training loss plot in Jupyter Notebook for Keras,..
MIT
- SKLL (🥉25 · ⭐ 550) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine..
❗️BSD-1-Clause
- gokart (🥉25 · ⭐ 290) - Gokart solves reproducibility, task dependencies, constraints of good code,..
MIT
- hiddenlayer (🥉22 · ⭐ 1.7K · 💀) - Neural network graphs and training metrics for..
MIT
- lore (🥉20 · ⭐ 1.6K · 💀) - Lore makes machine learning approachable for Software Engineers and..
MIT
- TensorBoard Logger (🥉20 · ⭐ 630 · 💀) - Log TensorBoard events without touching TensorFlow.
MIT
- MXBoard (🥉20 · ⭐ 320 · 💀) - Logging MXNet data for visualization in TensorBoard.
Apache-2
- keepsake (🥉18 · ⭐ 1.6K · 💀) - Version control for machine learning.
Apache-2
- datmo (🥉17 · ⭐ 340 · 💀) - Open source production model management tool for data scientists.
MIT
- chitra (🥉16 · ⭐ 220) - A multi-functional library for full-stack Deep Learning. Simplifies..
Apache-2
- steppy (🥉16 · ⭐ 140 · 💀) - Lightweight, Python library for fast and reproducible experimentation.
MIT
- ModelChimp (🥉13 · ⭐ 120 · 💀) - Experiment tracking for machine and deep learning projects.
BSD-2
- CometML (🥉13) - Supercharging Machine Learning.
❗Unlicensed
- traintool (🥉8 · ⭐ 12 · 💀) - Train off-the-shelf machine learning models in one..
Apache-2
Model Serialization & Deployment
Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.
TorchServe (🥈35 · ⭐ 3.8K) - Serve, optimize and scale PyTorch models in production. Apache-2

-
GitHub (👨💻 180 · 🔀 780 · 📥 5.4K · 📦 560 · 📋 1.5K - 22% open · ⏱️ 06.12.2023):
git clone https://github.com/pytorch/serve
-
PyPi (📥 41K / month · 📦 15 · ⏱️ 12.10.2023):
pip install torchserve
-
Conda (📥 140K · ⏱️ 12.10.2023):
conda install -c pytorch torchserve
-
Docker Hub (📥 1.2M · ⭐ 22 · ⏱️ 12.10.2023):
docker pull pytorch/torchserve
huggingface_hub (🥈35 · ⭐ 1.4K) - All the open source things related to the Hugging Face Hub. Apache-2
Core ML Tools (🥈33 · ⭐ 3.8K) - Core ML tools contain supporting tools for Core ML model.. BSD-3
Hummingbird (🥈26 · ⭐ 3.3K) - Hummingbird compiles trained ML models into tensor computation for.. MIT
Show 10 hidden projects...
- mmdnn (🥈26 · ⭐ 5.8K · 💀) - MMdnn is a set of tools to help users inter-operate among different deep..
MIT
- m2cgen (🥉25 · ⭐ 2.7K · 💀) - Transform ML models into a native code (Java, C, Python, Go,..
MIT
- sklearn-porter (🥉23 · ⭐ 1.3K · 💀) - Transpile trained scikit-learn estimators to C, Java,..
BSD-3
- Larq Compute Engine (🥉21 · ⭐ 230) - Highly optimized inference engine for Binarized..
Apache-2
- OMLT (🥉20 · ⭐ 220) - Represent trained machine learning models as Pyomo optimization formulations.
BSD-3
- pytorch2keras (🥉19 · ⭐ 840 · 💀) - PyTorch to Keras model convertor.
MIT
- modelkit (🥉17 · ⭐ 150) - Toolkit for developing and maintaining ML models.
MIT
- tfdeploy (🥉16 · ⭐ 350 · 💀) - Deploy tensorflow graphs for fast evaluation and export to..
BSD-3
- backprop (🥉14 · ⭐ 240 · 💀) - Backprop makes it simple to use, finetune, and deploy state-of-..
Apache-2
- ml-ane-transformers (🥉13 · ⭐ 2.4K · 💀) - Reference implementation of the Transformer..
❗Unlicensed
Model Interpretability
Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.
shap (🥇41 · ⭐ 21K) - A game theoretic approach to explain the output of any machine learning model. MIT
InterpretML (🥇33 · ⭐ 5.8K) - Fit interpretable models. Explain blackbox machine learning. MIT

evaluate (🥈32 · ⭐ 1.6K) - Evaluate: A library for easily evaluating machine learning models.. Apache-2
explainerdashboard (🥈30 · ⭐ 2.1K) - Quickly build Explainable AI dashboards that show the inner.. MIT
dtreeviz (🥈29 · ⭐ 2.7K) - A python library for decision tree visualization and model interpretation. MIT
Model Analysis (🥈29 · ⭐ 1.2K) - Model analysis tools for TensorFlow. Apache-2


responsible-ai-widgets (🥈29 · ⭐ 1K) - Responsible AI Toolbox is a suite of tools providing.. MIT



yellowbrick (🥈27 · ⭐ 4.1K) - Visual analysis and diagnostic tools to facilitate machine.. Apache-2

LIT (🥈27 · ⭐ 3.2K) - The Learning Interpretability Tool: Interactively analyze ML models to.. Apache-2
Fairness 360 (🥈27 · ⭐ 2.2K) - A comprehensive set of fairness metrics for datasets and.. Apache-2
iNNvestigate (🥉26 · ⭐ 1.2K) - A toolbox to iNNvestigate neural networks predictions!. BSD-2

Explainability 360 (🥉25 · ⭐ 1.4K) - Interpretability and explainability of data and.. Apache-2
What-If Tool (🥉22 · ⭐ 860 · 💤) - Source code/webpage/demos for the What-If Tool. Apache-2
-
GitHub (👨💻 20 · 🔀 160 · 📋 140 - 58% open · ⏱️ 27.04.2023):
git clone https://github.com/PAIR-code/what-if-tool
-
PyPi (📥 4.9K / month · 📦 4 · ⏱️ 12.10.2021):
pip install witwidget
-
Conda (📥 2.1M · ⏱️ 16.06.2023):
conda install -c conda-forge tensorboard-plugin-wit
-
npm (📥 920 / month · 📦 1 · ⏱️ 12.10.2021):
npm install wit-widget
model-card-toolkit (🥉20 · ⭐ 390) - A toolkit that streamlines and automates the generation.. Apache-2
fairness-indicators (🥉19 · ⭐ 320) - Tensorflows Fairness Evaluation and Visualization.. Apache-2


sklearn-evaluation (🥉17 · ⭐ 360 · 💤) - Machine learning model evaluation made easy: plots,.. MIT

FlashTorch (🥉15 · ⭐ 710) - Visualization toolkit for neural networks in PyTorch! Demo --. MIT

interpret-text (🥉13 · ⭐ 380) - A library that incorporates state-of-the-art explainers for.. MIT

Show 20 hidden projects...
- Lime (🥇33 · ⭐ 11K · 💀) - Lime: Explaining the predictions of any machine learning classifier.
BSD-2
- Deep Checks (🥈29 · ⭐ 3.1K) - Deepchecks: Tests for Continuous Validation of ML Models &..
❗️AGPL-3.0
- scikit-plot (🥈28 · ⭐ 2.4K · 💀) - An intuitive library to add plotting functionality to..
MIT
- eli5 (🥈27 · ⭐ 2.7K · 💀) - A library for debugging/inspecting machine learning classifiers and..
MIT
- Lucid (🥉25 · ⭐ 4.6K · 💀) - A collection of infrastructure and tools for research in..
Apache-2
- keras-vis (🥉25 · ⭐ 3K · 💀) - Neural network visualization toolkit for keras.
MIT
- DALEX (🥉24 · ⭐ 1.3K) - moDel Agnostic Language for Exploration and eXplanation.
❗️GPL-3.0
- random-forest-importances (🥉23 · ⭐ 580 · 💀) - Code to compute permutation and drop-column..
MIT
- tf-explain (🥉22 · ⭐ 1K · 💀) - Interpretability Methods for tf.keras models with Tensorflow..
MIT
- TreeInterpreter (🥉22 · ⭐ 730 · 💀) - Package for interpreting scikit-learns decision tree..
BSD-3
- Quantus (🥉22 · ⭐ 440) - Quantus is an eXplainable AI toolkit for responsible evaluation of..
❗️GPL-3.0
- deeplift (🥉21 · ⭐ 770 · 💀) - Public facing deeplift repo.
MIT
- tcav (🥉19 · ⭐ 600 · 💀) - Code for the TCAV ML interpretability project.
Apache-2
- XAI (🥉17 · ⭐ 960 · 💀) - XAI - An eXplainability toolbox for machine learning.
MIT
- Anchor (🥉16 · ⭐ 760 · 💀) - Code for High-Precision Model-Agnostic Explanations paper.
BSD-2
- ExplainX.ai (🥉15 · ⭐ 370 · 💀) - Explainable AI framework for data scientists. Explain & debug any..
MIT
- Skater (🥉14 · ⭐ 1.1K) - Python Library for Model Interpretation/Explanations.
❗️UPL-1.0
- Attribution Priors (🥉12 · ⭐ 120 · 💀) - Tools for training explainable models using..
MIT
- contextual-ai (🥉12 · ⭐ 84 · 💀) - Contextual AI adds explainability to different stages of..
Apache-2
- bias-detector (🥉11 · ⭐ 42 · 💀) - Bias Detector is a python package for detecting bias in machine..
MIT
Vector Similarity Search (ANN)
Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search.
🔗 ANN Benchmarks ( ⭐ 4.2K) - Benchmarks of approximate nearest neighbor libraries in Python.
Milvus (🥇41 · ⭐ 24K) - A cloud-native vector database, storage for next generation AI.. Apache-2
-
GitHub (👨💻 260 · 🔀 2.5K · 📥 57K · 📋 9.7K - 6% open · ⏱️ 07.12.2023):
git clone https://github.com/milvus-io/milvus
-
PyPi (📥 520K / month · 📦 76 · ⏱️ 04.12.2023):
pip install pymilvus
-
Docker Hub (📥 9M · ⭐ 38 · ⏱️ 07.12.2023):
docker pull milvusdb/milvus