Skip to content

ml-tooling/best-of-ml-python

main
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Latest commit

 

Git stats

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
 
 
 
 
 
 
 
 
 
 
 
 

Best-of Machine Learning with Python

🏆  A ranked list of awesome machine learning Python libraries. Updated weekly.

This curated list contains 910 awesome open-source projects with a total of 3.7M stars grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!


🧙‍♂️  Discover other best-of lists or create your own.
📫  Subscribe to our newsletter for updates and trending projects.


Contents

Explanation

  • 🥇🥈🥉  Combined project-quality score
  • ⭐️  Star count from GitHub
  • 🐣  New project (less than 6 months old)
  • 💤  Inactive project (6 months no activity)
  • 💀  Dead project (12 months no activity)
  • 📈📉  Project is trending up or down
  •   Project was recently added
  • ❗️  Warning (e.g. missing/risky license)
  • 👨‍💻  Contributors count from GitHub
  • 🔀  Fork count from GitHub
  • 📋  Issue count from GitHub
  • ⏱️  Last update timestamp on package manager
  • 📥  Download count from package manager
  • 📦  Number of dependent projects
  •   Tensorflow related project
  •   Sklearn related project
  •   PyTorch related project
  •   MxNet related project
  •   Apache Spark related project
  •   Jupyter related project
  •   PaddlePaddle related project
  •   Pandas related project
  •   Jax related project

Machine Learning Frameworks

Back to top

General-purpose machine learning and deep learning frameworks.

Tensorflow (🥇55 · 170K) - An Open Source Machine Learning Framework for Everyone. Apache-2
  • GitHub (👨‍💻 4.3K · 🔀 88K · 📦 260K · 📋 37K - 6% open · ⏱️ 23.03.2023):

     git clone https://github.com/tensorflow/tensorflow
    
  • PyPi (📥 17M / month · 📦 15K · ⏱️ 18.11.2022):

     pip install tensorflow
    
  • Conda (📥 4M · ⏱️ 23.03.2023):

     conda install -c conda-forge tensorflow
    
  • Docker Hub (📥 72M · 2.1K · ⏱️ 23.03.2023):

     docker pull tensorflow/tensorflow
    
scikit-learn (🥇52 · 53K) - scikit-learn: machine learning in Python. BSD-3
  • GitHub (👨‍💻 2.9K · 🔀 24K · 📥 850 · 📦 490K · 📋 11K - 20% open · ⏱️ 23.03.2023):

     git clone https://github.com/scikit-learn/scikit-learn
    
  • PyPi (📥 39M / month · 📦 28K · ⏱️ 24.01.2023):

     pip install scikit-learn
    
  • Conda (📥 20M · ⏱️ 09.03.2023):

     conda install -c conda-forge scikit-learn
    
PyTorch (🥇50 · 64K) - Tensors and Dynamic neural networks in Python with strong GPU.. BSD-3
  • GitHub (👨‍💻 3.9K · 🔀 18K · 📥 12K · 📋 33K - 34% open · ⏱️ 23.03.2023):

     git clone https://github.com/pytorch/pytorch
    
  • PyPi (📥 9.4M / month · 📦 9.1K · ⏱️ 15.12.2022):

     pip install torch
    
  • Conda (📥 15M · ⏱️ 14.03.2023):

     conda install -c pytorch pytorch
    
Keras (🥇45 · 58K) - Deep Learning for humans. Apache-2
  • GitHub (👨‍💻 1.2K · 🔀 19K · 📋 12K - 3% open · ⏱️ 22.03.2023):

     git clone https://github.com/keras-team/keras
    
  • PyPi (📥 11M / month · 📦 550 · ⏱️ 23.02.2023):

     pip install keras
    
  • Conda (📥 2.9M · ⏱️ 21.11.2022):

     conda install -c conda-forge keras
    
PySpark (🥇45 · 35K) - Apache Spark Python API. Apache-2
  • GitHub (👨‍💻 2.8K · 🔀 27K · ⏱️ 23.03.2023):

     git clone https://github.com/apache/spark
    
  • PyPi (📥 26M / month · 📦 1K · ⏱️ 28.11.2022):

     pip install pyspark
    
  • Conda (📥 2.4M · ⏱️ 19.02.2023):

     conda install -c conda-forge pyspark
    
StatsModels (🥇45 · 8.3K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
  • GitHub (👨‍💻 410 · 🔀 2.7K · 📥 27 · 📦 88K · 📋 5.2K - 49% open · ⏱️ 19.03.2023):

     git clone https://github.com/statsmodels/statsmodels
    
  • PyPi (📥 9.3M / month · 📦 5K · ⏱️ 02.11.2022):

     pip install statsmodels
    
  • Conda (📥 9.1M · ⏱️ 04.11.2022):

     conda install -c conda-forge statsmodels
    
XGBoost (🥈44 · 24K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
  • GitHub (👨‍💻 600 · 🔀 8.6K · 📥 6.2K · 📦 50K · 📋 4.8K - 7% open · ⏱️ 23.03.2023):

     git clone https://github.com/dmlc/xgboost
    
  • PyPi (📥 9.1M / month · 📦 1.5K · ⏱️ 06.01.2023):

     pip install xgboost
    
  • Conda (📥 3.7M · ⏱️ 17.03.2023):

     conda install -c conda-forge xgboost
    
jax (🥈43 · 22K) - Composable transformations of Python+NumPy programs: differentiate,.. Apache-2
  • GitHub (👨‍💻 520 · 🔀 2.1K · 📦 9K · 📋 4.3K - 31% open · ⏱️ 23.03.2023):

     git clone https://github.com/google/jax
    
  • PyPi (📥 1.9M / month · 📦 560 · ⏱️ 25.01.2023):

     pip install jax
    
  • Conda (📥 650K · ⏱️ 12.03.2023):

     conda install -c conda-forge jaxlib
    
LightGBM (🥈43 · 15K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
  • GitHub (👨‍💻 280 · 🔀 3.7K · 📥 180K · 📦 20K · 📋 3K - 8% open · ⏱️ 23.03.2023):

     git clone https://github.com/microsoft/LightGBM
    
  • PyPi (📥 6.2M / month · 📦 730 · ⏱️ 24.01.2023):

     pip install lightgbm
    
  • Conda (📥 1.6M · ⏱️ 24.01.2023):

     conda install -c conda-forge lightgbm
    
pytorch-lightning (🥈42 · 22K) - Deep learning framework to train, deploy, and ship AI.. Apache-2
  • GitHub (👨‍💻 840 · 🔀 2.8K · 📥 11K · 📋 6K - 11% open · ⏱️ 21.03.2023):

     git clone https://github.com/Lightning-AI/lightning
    
  • PyPi (📥 4.2M / month · 📦 590 · ⏱️ 18.01.2023):

     pip install pytorch-lightning
    
  • Conda (📥 740K · ⏱️ 16.03.2023):

     conda install -c conda-forge pytorch-lightning
    
PaddlePaddle (🥈42 · 20K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2
  • GitHub (👨‍💻 980 · 🔀 5K · 📥 15K · 📦 210 · 📋 17K - 11% open · ⏱️ 23.03.2023):

     git clone https://github.com/PaddlePaddle/Paddle
    
  • PyPi (📥 110K / month · 📦 70 · ⏱️ 17.02.2023):

     pip install paddlepaddle
    
Catboost (🥈42 · 7K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
  • GitHub (👨‍💻 1.1K · 🔀 1.1K · 📥 130K · 📋 2K - 24% open · ⏱️ 23.03.2023):

     git clone https://github.com/catboost/catboost
    
  • PyPi (📥 1.9M / month · 📦 290 · ⏱️ 01.11.2022):

     pip install catboost
    
  • Conda (📥 1.2M · ⏱️ 30.01.2023):

     conda install -c conda-forge catboost
    
Fastai (🥈40 · 24K) - The fastai deep learning library. Apache-2
  • GitHub (👨‍💻 650 · 🔀 7.4K · 📦 13K · 📋 1.7K - 8% open · ⏱️ 21.03.2023):

     git clone https://github.com/fastai/fastai
    
  • PyPi (📥 400K / month · 📦 330 · ⏱️ 15.02.2023):

     pip install fastai
    
MXNet (🥈40 · 20K) - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning.. Apache-2
  • GitHub (👨‍💻 980 · 🔀 6.9K · 📥 26K · 📦 6.2K · 📋 9.6K - 18% open · ⏱️ 26.01.2023):

     git clone https://github.com/apache/incubator-mxnet
    
  • PyPi (📥 530K / month · 📦 290 · ⏱️ 17.05.2022):

     pip install mxnet
    
  • Conda (📥 9.1K · 📦 5 · ⏱️ 24.10.2022):

     conda install -c anaconda mxnet
    
Jina (🥈39 · 18K) - Build multimodal AI services via cloud native technologies Neural Search.. Apache-2
  • GitHub (👨‍💻 160 · 🔀 2.1K · 📦 460 · 📋 1.8K - 2% open · ⏱️ 20.03.2023):

     git clone https://github.com/jina-ai/jina
    
  • PyPi (📥 57K / month · 📦 6 · ⏱️ 06.07.2022):

     pip install jina
    
  • Conda (📥 39K · ⏱️ 16.08.2022):

     conda install -c conda-forge jina-core
    
  • Docker Hub (📥 1.2M · 8 · ⏱️ 20.03.2023):

     docker pull jinaai/jina
    
PyFlink (🥈37 · 21K) - Apache Flink Python API. Apache-2
  • GitHub (👨‍💻 1.7K · 🔀 12K · ⏱️ 23.03.2023):

     git clone https://github.com/apache/flink
    
  • PyPi (📥 69K / month · 📦 22 · ⏱️ 30.01.2023):

     pip install apache-flink
    
Theano (🥈37 · 9.7K) - Theano was a Python library that allows you to define, optimize, and.. BSD-3
  • GitHub (👨‍💻 390 · 🔀 2.5K · 📦 14K · 📋 2.8K - 24% open · ⏱️ 20.12.2022):

     git clone https://github.com/Theano/Theano
    
  • PyPi (📥 240K / month · 📦 2.8K · ⏱️ 27.07.2020):

     pip install theano
    
  • Conda (📥 2.3M · ⏱️ 16.03.2022):

     conda install -c conda-forge theano
    
Vowpal Wabbit (🥈37 · 8.2K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
  • GitHub (👨‍💻 330 · 🔀 1.9K · 📋 1.2K - 10% open · ⏱️ 20.03.2023):

     git clone https://github.com/VowpalWabbit/vowpal_wabbit
    
  • PyPi (📥 110K / month · 📦 32 · ⏱️ 09.11.2022):

     pip install vowpalwabbit
    
  • Conda (📥 100K · ⏱️ 15.03.2023):

     conda install -c conda-forge vowpalwabbit
    
Flax (🥈37 · 4.1K) - Flax is a neural network library for JAX that is designed for.. Apache-2
  • GitHub (👨‍💻 190 · 🔀 490 · 📥 42 · 📦 2.6K · 📋 720 - 18% open · ⏱️ 22.03.2023):

     git clone https://github.com/google/flax
    
  • PyPi (📥 570K / month · 📦 140 · ⏱️ 26.01.2023):

     pip install flax
    
  • Conda (📥 21K · ⏱️ 16.03.2023):

     conda install -c conda-forge flax
    
Thinc (🥈36 · 2.7K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
  • GitHub (👨‍💻 61 · 🔀 270 · 📦 29K · 📋 140 - 17% open · ⏱️ 21.03.2023):

     git clone https://github.com/explosion/thinc
    
  • PyPi (📥 4.6M / month · 📦 620 · ⏱️ 13.01.2023):

     pip install thinc
    
  • Conda (📥 2.4M · ⏱️ 08.03.2023):

     conda install -c conda-forge thinc
    
einops (🥈35 · 6.5K) - Deep learning operations reinvented (for pytorch, tensorflow, jax and.. MIT
  • GitHub (👨‍💻 23 · 🔀 290 · 📦 8.5K · 📋 140 - 26% open · ⏱️ 16.03.2023):

     git clone https://github.com/arogozhnikov/einops
    
  • PyPi (📥 3.3M / month · 📦 290 · ⏱️ 09.11.2022):

     pip install einops
    
  • Conda (📥 82K · ⏱️ 09.12.2022):

     conda install -c conda-forge einops
    
ivy (🥉33 · 9.7K) - The Unified Machine Learning Framework. Apache-2
  • GitHub (👨‍💻 850 · 🔀 3.5K · 📋 7K - 21% open · ⏱️ 23.03.2023):

     git clone https://github.com/unifyai/ivy
    
  • PyPi (📥 510 / month · 📦 6 · ⏱️ 01.06.2022):

     pip install ivy-core
    
Sonnet (🥉33 · 9.5K) - TensorFlow-based neural network library. Apache-2
  • GitHub (👨‍💻 56 · 🔀 1.4K · 📦 1K · 📋 190 - 17% open · ⏱️ 23.02.2023):

     git clone https://github.com/deepmind/sonnet
    
  • PyPi (📥 25K / month · 📦 53 · ⏱️ 15.12.2022):

     pip install dm-sonnet
    
  • Conda (📥 21K · ⏱️ 14.11.2020):

     conda install -c conda-forge sonnet
    
Ludwig (🥉33 · 8.8K) - Data-centric declarative deep learning framework. Apache-2
  • GitHub (👨‍💻 130 · 🔀 1K · 📦 160 · 📋 950 - 27% open · ⏱️ 22.03.2023):

     git clone https://github.com/ludwig-ai/ludwig
    
  • PyPi (📥 3.1K / month · 📦 9 · ⏱️ 28.10.2022):

     pip install ludwig
    
Chainer (🥉33 · 5.8K) - A flexible framework of neural networks for deep learning. MIT
  • GitHub (👨‍💻 320 · 🔀 1.4K · 📦 3K · 📋 2K - 0% open · ⏱️ 17.10.2022):

     git clone https://github.com/chainer/chainer
    
  • PyPi (📥 22K / month · 📦 400 · ⏱️ 05.01.2022):

     pip install chainer
    
  • Conda (📥 13K · ⏱️ 21.01.2022):

     conda install -c conda-forge chainer
    
mlpack (🥉33 · 4.3K) - mlpack: a fast, header-only C++ machine learning library. BSD-3
  • GitHub (👨‍💻 300 · 🔀 1.5K · 📋 1.5K - 3% open · ⏱️ 19.03.2023):

     git clone https://github.com/mlpack/mlpack
    
  • PyPi (📥 2.3K / month · 📦 1 · ⏱️ 29.12.2022):

     pip install mlpack
    
  • Conda (📥 130K · ⏱️ 30.12.2022):

     conda install -c conda-forge mlpack
    
tensorflow-upstream (🥉33 · 630) - TensorFlow ROCm port. Apache-2
  • GitHub (👨‍💻 4.3K · 🔀 74 · 📥 21 · 📋 350 - 21% open · ⏱️ 23.03.2023):

     git clone https://github.com/ROCmSoftwarePlatform/tensorflow-upstream
    
  • PyPi (📥 3.8K / month · 📦 5 · ⏱️ 06.12.2022):

     pip install tensorflow-rocm
    
tensorpack (🥉32 · 6.3K) - A Neural Net Training Interface on TensorFlow, with focus.. Apache-2
  • GitHub (👨‍💻 58 · 🔀 1.9K · 📥 150 · 📦 1.2K · 📋 1.4K - 0% open · ⏱️ 26.11.2022):

     git clone https://github.com/tensorpack/tensorpack
    
  • PyPi (📥 15K / month · 📦 47 · ⏱️ 22.01.2021):

     pip install tensorpack
    
  • Conda (📥 8.3K · ⏱️ 06.02.2022):

     conda install -c conda-forge tensorpack
    
Ignite (🥉32 · 4.2K) - High-level library to help with training and evaluating neural.. BSD-3
  • GitHub (👨‍💻 190 · 🔀 590 · 📋 1.2K - 12% open · ⏱️ 21.03.2023):

     git clone https://github.com/pytorch/ignite
    
  • PyPi (📥 98K / month · 📦 39 · ⏱️ 08.11.2022):

     pip install pytorch-ignite
    
  • Conda (📥 130K · ⏱️ 18.02.2023):

     conda install -c pytorch ignite
    
Haiku (🥉32 · 2.4K) - JAX-based neural network library. Apache-2
  • GitHub (👨‍💻 73 · 🔀 200 · 📦 1.1K · 📋 240 - 36% open · ⏱️ 22.03.2023):

     git clone https://github.com/deepmind/dm-haiku
    
  • PyPi (📥 140K / month · 📦 44 · ⏱️ 16.11.2022):

     pip install dm-haiku
    
  • Conda (📥 9.5K · ⏱️ 21.09.2022):

     conda install -c conda-forge dm-haiku
    
CNTK (🥉31 · 17K) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit. MIT
  • GitHub (👨‍💻 270 · 🔀 4.4K · 📥 14K · 📋 3.4K - 24% open · ⏱️ 23.09.2022):

     git clone https://github.com/microsoft/CNTK
    
  • PyPi (📥 760 / month · 📦 18 · ⏱️ 09.12.2020):

     pip install cntk
    
skorch (🥉30 · 5.1K) - A scikit-learn compatible neural network library that wraps.. BSD-3
  • GitHub (👨‍💻 55 · 🔀 340 · 📦 790 · 📋 470 - 12% open · ⏱️ 21.03.2023):

     git clone https://github.com/skorch-dev/skorch
    
  • PyPi (📥 65K / month · 📦 45 · ⏱️ 18.11.2022):

     pip install skorch
    
  • Conda (📥 730K · ⏱️ 14.03.2023):

     conda install -c conda-forge skorch
    
Neural Network Libraries (🥉30 · 2.6K) - Neural Network Libraries. Apache-2
  • GitHub (👨‍💻 72 · 🔀 330 · 📥 560 · 📋 89 - 35% open · ⏱️ 23.03.2023):

     git clone https://github.com/sony/nnabla
    
  • PyPi (📥 4.4K / month · 📦 53 · ⏱️ 14.02.2023):

     pip install nnabla
    
ktrain (🥉30 · 1.1K) - ktrain is a Python library that makes deep learning and AI more.. Apache-2
  • GitHub (👨‍💻 15 · 🔀 260 · 📦 400 · 📋 450 - 0% open · ⏱️ 22.03.2023):

     git clone https://github.com/amaiya/ktrain
    
  • PyPi (📥 21K / month · 📦 3 · ⏱️ 03.02.2023):

     pip install ktrain
    
dyNET (🥉29 · 3.4K · 💤) - DyNet: The Dynamic Neural Network Toolkit. Apache-2
  • GitHub (👨‍💻 160 · 🔀 700 · 📥 9.7K · 📦 240 · 📋 930 - 28% open · ⏱️ 14.08.2022):

     git clone https://github.com/clab/dynet
    
  • PyPi (📥 3.2K / month · 📦 28 · ⏱️ 21.10.2020):

     pip install dyNET
    
Neural Tangents (🥉27 · 2K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
  • GitHub (👨‍💻 24 · 🔀 220 · 📥 270 · 📦 64 · 📋 140 - 40% open · ⏱️ 09.03.2023):

     git clone https://github.com/google/neural-tangents
    
  • PyPi (📥 4K / month · 📦 1 · ⏱️ 19.09.2022):

     pip install neural-tangents
    
Towhee (🥉27 · 1.9K) - Towhee is a framework that is dedicated to making neural data.. Apache-2
  • GitHub (👨‍💻 29 · 🔀 190 · 📥 1.1K · 📋 590 - 1% open · ⏱️ 23.03.2023):

     git clone https://github.com/towhee-io/towhee
    
  • PyPi (📥 1.7K / month · ⏱️ 02.12.2022):

     pip install towhee
    
Geomstats (🥉26 · 910) - Computations and statistics on manifolds with geometric structures. MIT
  • GitHub (👨‍💻 76 · 🔀 210 · 📋 540 - 42% open · ⏱️ 13.03.2023):

     git clone https://github.com/geomstats/geomstats
    
  • PyPi (📥 1.7K / month · 📦 2 · ⏱️ 22.04.2022):

     pip install geomstats
    
  • Conda (📥 1.1K · ⏱️ 01.06.2022):

     conda install -c conda-forge geomstats
    
xLearn (🥉25 · 3K · 💤) - High performance, easy-to-use, and scalable machine learning (ML).. Apache-2
  • GitHub (👨‍💻 30 · 🔀 530 · 📥 3.8K · 📦 120 · 📋 310 - 62% open · ⏱️ 05.06.2022):

     git clone https://github.com/aksnzhy/xlearn
    
  • PyPi (📥 3.1K / month · 📦 12 · ⏱️ 04.12.2018):

     pip install xlearn
    
NeuPy (🥉24 · 740) - NeuPy is a Tensorflow based python library for prototyping and building.. MIT
  • GitHub (👨‍💻 8 · 🔀 160 · 📦 140 · 📋 270 - 12% open · ⏱️ 03.01.2023):

     git clone https://github.com/itdxer/neupy
    
  • PyPi (📥 5K / month · 📦 12 · ⏱️ 04.04.2019):

     pip install neupy
    
fklearn (🥉23 · 1.4K) - fklearn: Functional Machine Learning. Apache-2
  • GitHub (👨‍💻 51 · 🔀 160 · 📦 13 · 📋 62 - 61% open · ⏱️ 21.03.2023):

     git clone https://github.com/nubank/fklearn
    
  • PyPi (📥 3.2K / month · ⏱️ 06.09.2022):

     pip install fklearn
    
mace (🥉22 · 4.8K · 💤) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
  • GitHub (👨‍💻 67 · 🔀 810 · 📥 1.5K · 📋 670 - 8% open · ⏱️ 30.05.2022):

     git clone https://github.com/XiaoMi/mace
    
Objax (🥉21 · 740 · 💤) - Objax is a machine learning framework that provides an Object.. Apache-2
  • GitHub (👨‍💻 24 · 🔀 68 · 📦 33 · 📋 110 - 43% open · ⏱️ 30.08.2022):

     git clone https://github.com/google/objax
    
  • PyPi (📥 410 / month · 📦 3 · ⏱️ 31.01.2022):

     pip install objax
    
ThunderSVM (🥉20 · 1.5K · 💤) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
  • GitHub (👨‍💻 34 · 🔀 200 · 📥 2.6K · 📋 210 - 29% open · ⏱️ 09.04.2022):

     git clone https://github.com/Xtra-Computing/thundersvm
    
  • PyPi (📥 460 / month · ⏱️ 13.03.2020):

     pip install thundersvm
    
NeoML (🥉20 · 720) - Machine learning framework for both deep learning and traditional.. Apache-2
  • GitHub (👨‍💻 33 · 🔀 110 · 📋 83 - 40% open · ⏱️ 22.03.2023):

     git clone https://github.com/neoml-lib/neoml
    
  • PyPi (📥 94 / month · ⏱️ 07.09.2022):

     pip install neoml
    
chefboost (🥉20 · 390) - A Lightweight Decision Tree Framework supporting regular algorithms:.. MIT
  • GitHub (👨‍💻 6 · 🔀 93 · 📦 35 · 📋 32 - 15% open · ⏱️ 06.02.2023):

     git clone https://github.com/serengil/chefboost
    
  • PyPi (📥 1.4K / month · ⏱️ 16.02.2022):

     pip install chefboost
    
elegy (🥉18 · 440 · 💤) - A High Level API for Deep Learning in JAX. MIT
  • GitHub (👨‍💻 18 · 🔀 32 · 📦 38 · 📋 100 - 37% open · ⏱️ 23.05.2022):

     git clone https://github.com/poets-ai/elegy
    
  • PyPi (📥 1K / month · ⏱️ 22.04.2022):

     pip install elegy
    
ThunderGBM (🥉17 · 660) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
  • GitHub (👨‍💻 10 · 🔀 87 · 📦 2 · 📋 76 - 44% open · ⏱️ 13.09.2022):

     git clone https://github.com/Xtra-Computing/thundergbm
    
  • PyPi (📥 170 / month · ⏱️ 19.09.2022):

     pip install thundergbm
    
Show 10 hidden projects...
  • dlib (🥈40 · 12K · 📈) - A toolkit for making real world machine learning and data analysis.. ❗️BSL-1.0
  • MindsDB (🥈35 · 14K) - A low-code Machine Learning platform to help developers build.. ❗️GPL-3.0
  • Turi Create (🥉33 · 11K · 💀) - Turi Create simplifies the development of custom machine.. BSD-3
  • TFlearn (🥉32 · 9.6K · 💀) - Deep learning library featuring a higher-level API for TensorFlow. MIT
  • NuPIC (🥉28 · 6.3K · 💀) - Numenta Platform for Intelligent Computing is an implementation.. ❗️AGPL-3.0
  • Lasagne (🥉28 · 3.8K · 💀) - Lightweight library to build and train neural networks in Theano. MIT
  • SHOGUN (🥉26 · 2.9K · 💀) - Unified and efficient Machine Learning. BSD-3
  • neon (🥉23 · 3.9K · 💀) - Intel Nervana reference deep learning framework committed to best.. Apache-2
  • Torchbearer (🥉21 · 630 · 💀) - torchbearer: A model fitting library for PyTorch. MIT
  • StarSpace (🥉16 · 3.8K · 💀) - Learning embeddings for classification, retrieval and ranking. MIT

Data Visualization

Back to top

General-purpose and task-specific data visualization libraries.

Matplotlib (🥇50 · 17K) - matplotlib: plotting with Python. Python-2.0
  • GitHub (👨‍💻 1.5K · 🔀 6.8K · 📦 780K · 📋 9.7K - 19% open · ⏱️ 23.03.2023):

     git clone https://github.com/matplotlib/matplotlib
    
  • PyPi (📥 35M / month · 📦 59K · ⏱️ 25.01.2023):

     pip install matplotlib
    
  • Conda (📥 17M · ⏱️ 06.03.2023):

     conda install -c conda-forge matplotlib
    
Bokeh (🥇43 · 17K) - Interactive Data Visualization in the browser, from Python. BSD-3
  • GitHub (👨‍💻 640 · 🔀 4.1K · 📦 200 · 📋 7.3K - 9% open · ⏱️ 23.03.2023):

     git clone https://github.com/bokeh/bokeh
    
  • PyPi (📥 3.5M / month · 📦 3.7K · ⏱️ 20.12.2022):

     pip install bokeh
    
  • Conda (📥 10M · ⏱️ 14.03.2023):

     conda install -c conda-forge bokeh
    
Seaborn (🥇43 · 10K) - Statistical data visualization in Python. BSD-3
  • GitHub (👨‍💻 190 · 🔀 1.7K · 📥 240 · 📦 240K · 📋 2.3K - 5% open · ⏱️ 20.03.2023):

     git clone https://github.com/mwaskom/seaborn
    
  • PyPi (📥 9.4M / month · 📦 10K · ⏱️ 02.01.2023):

     pip install seaborn
    
  • Conda (📥 6.2M · ⏱️ 31.12.2022):

     conda install -c conda-forge seaborn
    
Plotly (🥇42 · 13K) - The interactive graphing library for Python This project now includes.. MIT
  • GitHub (👨‍💻 220 · 🔀 2.3K · 📦 18 · 📋 2.6K - 50% open · ⏱️ 17.03.2023):

     git clone https://github.com/plotly/plotly.py
    
  • PyPi (📥 8.2M / month · 📦 4.6K · ⏱️ 23.01.2023):

     pip install plotly
    
  • Conda (📥 3.9M · ⏱️ 24.02.2023):

     conda install -c conda-forge plotly
    
  • npm (📥 47K / month · 📦 6 · ⏱️ 12.01.2021):

     npm install plotlywidget
    
Altair (🥇42 · 8.1K) - Declarative statistical visualization library for Python. BSD-3
  • GitHub (👨‍💻 150 · 🔀 700 · 📥 21 · 📦 49K · 📋 1.8K - 13% open · ⏱️ 22.03.2023):

     git clone https://github.com/altair-viz/altair
    
  • PyPi (📥 11M / month · 📦 540 · ⏱️ 01.03.2023):

     pip install altair
    
  • Conda (📥 1.7M · ⏱️ 31.01.2023):

     conda install -c conda-forge altair
    
dash (🥇40 · 18K) - Data Apps & Dashboards for Python. No JavaScript Required. MIT
  • GitHub (👨‍💻 130 · 🔀 1.9K · 📦 43K · 📋 1.5K - 48% open · ⏱️ 17.03.2023):

     git clone https://github.com/plotly/dash
    
  • PyPi (📥 1.4M / month · 📦 1.3K · ⏱️ 30.01.2023):

     pip install dash
    
  • Conda (📥 870K · ⏱️ 18.03.2023):

     conda install -c conda-forge dash
    
pyecharts (🥈37 · 13K) - Python Echarts Plotting Library. MIT
  • GitHub (👨‍💻 38 · 🔀 2.8K · 📦 2.9K · 📋 1.7K - 0% open · ⏱️ 28.02.2023):

     git clone https://github.com/pyecharts/pyecharts
    
  • PyPi (📥 100K / month · 📦 230 · ⏱️ 09.01.2023):

     pip install pyecharts
    
PyQtGraph (🥈36 · 3.2K) - Fast data visualization and GUI tools for scientific / engineering.. MIT
  • GitHub (👨‍💻 250 · 🔀 990 · 📋 1.2K - 30% open · ⏱️ 18.03.2023):

     git clone https://github.com/pyqtgraph/pyqtgraph
    
  • PyPi (📥 120K / month · 📦 900 · ⏱️ 29.09.2022):

     pip install pyqtgraph
    
  • Conda (📥 370K · ⏱️ 06.03.2023):

     conda install -c conda-forge pyqtgraph
    
FiftyOne (🥈36 · 2.7K) - Visualize, create, and debug image and video datasets.. Apache-2
  • GitHub (👨‍💻 76 · 🔀 320 · 📦 270 · 📋 1.2K - 35% open · ⏱️ 22.03.2023):

     git clone https://github.com/voxel51/fiftyone
    
  • PyPi (📥 290K / month · 📦 5 · ⏱️ 04.01.2023):

     pip install fiftyone
    
HoloViews (🥈36 · 2.4K) - With Holoviews, your data visualizes itself. BSD-3
  • GitHub (👨‍💻 130 · 🔀 370 · 📋 3K - 33% open · ⏱️ 16.03.2023):

     git clone https://github.com/holoviz/holoviews
    
  • PyPi (📥 490K / month · 📦 240 · ⏱️ 16.01.2023):

     pip install holoviews
    
  • Conda (📥 1.1M · ⏱️ 17.01.2023):

     conda install -c conda-forge holoviews
    
  • npm (📥 910 / month · 📦 2 · ⏱️ 24.05.2020):

     npm install @pyviz/jupyterlab_pyviz
    
PyVista (🥈35 · 1.7K) - 3D plotting and mesh analysis through a streamlined interface for.. MIT
  • GitHub (👨‍💻 120 · 🔀 320 · 📥 710 · 📦 1.7K · 📋 1.2K - 29% open · ⏱️ 23.03.2023):

     git clone https://github.com/pyvista/pyvista
    
  • PyPi (📥 87K / month · 📦 180 · ⏱️ 02.11.2022):

     pip install pyvista
    
  • Conda (📥 310K · ⏱️ 17.03.2023):

     conda install -c conda-forge pyvista
    
UMAP (🥈34 · 6.1K) - Uniform Manifold Approximation and Projection. BSD-3
  • GitHub (👨‍💻 110 · 🔀 710 · 📦 8.2K · 📋 700 - 54% open · ⏱️ 23.02.2023):

     git clone https://github.com/lmcinnes/umap
    
  • PyPi (📥 930K / month · 📦 420 · ⏱️ 13.04.2022):

     pip install umap-learn
    
  • Conda (📥 1.8M · ⏱️ 14.04.2022):

     conda install -c conda-forge umap-learn
    
plotnine (🥈34 · 3.4K) - A grammar of graphics for Python. MIT
  • GitHub (👨‍💻 99 · 🔀 190 · 📦 4.6K · 📋 560 - 15% open · ⏱️ 15.03.2023):

     git clone https://github.com/has2k1/plotnine
    
  • PyPi (📥 280K / month · 📦 240 · ⏱️ 29.09.2022):

     pip install plotnine
    
  • Conda (📥 250K · ⏱️ 28.02.2023):

     conda install -c conda-forge plotnine
    
VisPy (🥈34 · 3K) - High-performance interactive 2D/3D data visualization library. BSD-3
  • GitHub (👨‍💻 180 · 🔀 610 · 📦 1.1K · 📋 1.4K - 23% open · ⏱️ 20.03.2023):

     git clone https://github.com/vispy/vispy
    
  • PyPi (📥 54K / month · 📦 120 · ⏱️ 14.11.2022):

     pip install vispy
    
  • Conda (📥 360K · ⏱️ 20.03.2023):

     conda install -c conda-forge vispy
    
  • npm (📥 16 / month · 📦 1 · ⏱️ 15.03.2020):

     npm install vispy
    
datashader (🥈33 · 2.9K) - Quickly and accurately render even the largest data. BSD-3
  • GitHub (👨‍💻 53 · 🔀 360 · 📦 2K · 📋 540 - 24% open · ⏱️ 20.03.2023):

     git clone https://github.com/holoviz/datashader
    
  • PyPi (📥 160K / month · 📦 110 · ⏱️ 02.02.2023):

     pip install datashader
    
  • Conda (📥 520K · ⏱️ 02.02.2023):

     conda install -c conda-forge datashader
    
Graphviz (🥈33 · 1.4K) - Simple Python interface for Graphviz. MIT
  • GitHub (👨‍💻 19 · 🔀 190 · 📦 45K · 📋 160 - 5% open · ⏱️ 30.01.2023):

     git clone https://github.com/xflr6/graphviz
    
  • PyPi (📥 8.3M / month · 📦 3.2K · ⏱️ 23.07.2022):

     pip install graphviz
    
  • Conda (📥 35K · ⏱️ 16.03.2023):

     conda install -c anaconda python-graphviz
    
wordcloud (🥈32 · 9.3K) - A little word cloud generator in Python. MIT
  • GitHub (👨‍💻 67 · 🔀 2.3K · 📋 510 - 26% open · ⏱️ 04.01.2023):

     git clone https://github.com/amueller/word_cloud
    
  • PyPi (📥 830K / month · 📦 790 · ⏱️ 27.06.2022):

     pip install wordcloud
    
  • Conda (📥 380K · ⏱️ 25.08.2022):

     conda install -c conda-forge wordcloud
    
missingno (🥈31 · 3.5K) - Missing data visualization module for Python. MIT
  • GitHub (👨‍💻 18 · 🔀 440 · 📦 11K · 📋 130 - 6% open · ⏱️ 26.02.2023):

     git clone https://github.com/ResidentMario/missingno
    
  • PyPi (📥 480K / month · 📦 130 · ⏱️ 27.02.2022):

     pip install missingno
    
  • Conda (📥 260K · ⏱️ 15.02.2020):

     conda install -c conda-forge missingno
    
bqplot (🥈31 · 3.4K) - Plotting library for IPython/Jupyter notebooks. Apache-2
  • GitHub (👨‍💻 60 · 🔀 470 · 📦 38 · 📋 610 - 40% open · ⏱️ 21.03.2023):

     git clone https://github.com/bqplot/bqplot
    
  • PyPi (📥 150K / month · 📦 100 · ⏱️ 02.09.2022):

     pip install bqplot
    
  • Conda (📥 1.1M · ⏱️ 02.09.2022):

     conda install -c conda-forge bqplot
    
  • npm (📥 3.3K / month · 📦 14 · ⏱️ 02.09.2022):

     npm install bqplot
    
mpld3 (🥈31 · 2.2K) - D3 Renderings of Matplotlib Graphics. BSD-3
  • GitHub (👨‍💻 51 · 🔀 350 · 📦 4.4K · 📋 360 - 59% open · ⏱️ 10.12.2022):

     git clone https://github.com/mpld3/mpld3
    
  • PyPi (📥 280K / month · 📦 400 · ⏱️ 10.12.2022):

     pip install mpld3
    
  • Conda (📥 170K · ⏱️ 10.12.2022):

     conda install -c conda-forge mpld3
    
  • npm (📥 820 / month · 📦 7 · ⏱️ 10.12.2022):

     npm install mpld3
    
Facets Overview (🥈30 · 7.1K) - Visualizations for machine learning datasets. Apache-2
  • GitHub (👨‍💻 30 · 🔀 910 · 📦 170 · 📋 160 - 50% open · ⏱️ 18.02.2023):

     git clone https://github.com/pair-code/facets
    
  • PyPi (📥 340K / month · 📦 8 · ⏱️ 30.01.2023):

     pip install facets-overview
    
Perspective (🥈30 · 5.3K) - A data visualization and analytics component, especially.. Apache-2
  • GitHub (👨‍💻 82 · 🔀 610 · 📦 8 · 📋 620 - 16% open · ⏱️ 21.03.2023):

     git clone https://github.com/finos/perspective
    
  • PyPi (📥 3.3K / month · 📦 10 · ⏱️ 20.01.2023):

     pip install perspective-python
    
  • Conda (📥 280K · ⏱️ 27.02.2023):

     conda install -c conda-forge perspective
    
  • npm (📥 1.3K / month · 📦 2 · ⏱️ 27.02.2023):

     npm install @finos/perspective-jupyterlab
    
D-Tale (🥈30 · 3.9K) - Visualizer for pandas data structures. ❗️LGPL-2.1
  • GitHub (👨‍💻 29 · 🔀 330 · 📦 650 · 📋 500 - 7% open · ⏱️ 06.03.2023):

     git clone https://github.com/man-group/dtale
    
  • PyPi (📥 210K / month · 📦 14 · ⏱️ 17.06.2022):

     pip install dtale
    
  • Conda (📥 200K · ⏱️ 03.03.2023):

     conda install -c conda-forge dtale
    
hvPlot (🥈30 · 730) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
  • GitHub (👨‍💻 39 · 🔀 83 · 📦 2.6K · 📋 610 - 41% open · ⏱️ 17.03.2023):

     git clone https://github.com/holoviz/hvplot
    
  • PyPi (📥 130K / month · 📦 80 · ⏱️ 24.11.2022):

     pip install hvplot
    
  • Conda (📥 340K · ⏱️ 17.03.2023):

     conda install -c conda-forge hvplot
    
pythreejs (🥉29 · 860) - A Jupyter - Three.js bridge. BSD-3
  • GitHub (👨‍💻 30 · 🔀 180 · 📦 24 · 📋 230 - 25% open · ⏱️ 20.02.2023):

     git clone https://github.com/jupyter-widgets/pythreejs
    
  • PyPi (📥 84K / month · 📦 52 · ⏱️ 20.02.2023):

     pip install pythreejs
    
  • Conda (📥 470K · ⏱️ 16.03.2023):

     conda install -c conda-forge pythreejs
    
  • npm (📥 5.1K / month · 📦 10 · ⏱️ 20.02.2023):

     npm install jupyter-threejs
    
data-validation (🥉29 · 700) - Library for exploring and validating machine learning.. Apache-2
  • GitHub (👨‍💻 25 · 🔀 140 · 📥 390 · 📦 640 · 📋 170 - 21% open · ⏱️ 22.03.2023):

     git clone https://github.com/tensorflow/data-validation
    
  • PyPi (📥 840K / month · 📦 28 · ⏱️ 08.12.2022):

     pip install tensorflow-data-validation
    
pandas-profiling (🥉27 · 10K) - Deprecated pandas-profiling package, use ydata-.. MIT
  • GitHub (👨‍💻 110 · 🔀 1.5K · 📦 160):

     git clone https://github.com/ydataai/pandas-profiling
    
  • PyPi (📥 1.2M / month · 📦 180 · ⏱️ 31.01.2023):

     pip install pandas-profiling
    
  • Conda (📥 350K · ⏱️ 25.01.2023):

     conda install -c conda-forge pandas-profiling
    
openTSNE (🥉27 · 1.2K) - Extensible, parallel implementations of t-SNE. BSD-3
  • GitHub (👨‍💻 11 · 🔀 140 · 📦 520 · 📋 120 - 1% open · ⏱️ 20.02.2023):

     git clone https://github.com/pavlin-policar/openTSNE
    
  • PyPi (📥 48K / month · 📦 13 · ⏱️ 18.03.2022):

     pip install opentsne
    
  • Conda (📥 190K · ⏱️ 20.02.2023):

     conda install -c conda-forge opentsne
    
AutoViz (🥉26 · 1.3K) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
  • GitHub (👨‍💻 14 · 🔀 160 · 📦 380 · 📋 67 - 5% open · ⏱️ 30.12.2022):

     git clone https://github.com/AutoViML/AutoViz
    
  • PyPi (📥 270K / month · 📦 7 · ⏱️ 03.10.2022):

     pip install autoviz
    
  • Conda (📥 29K · ⏱️ 03.10.2022):

     conda install -c conda-forge autoviz
    
lets-plot (🥉26 · 850) - An open-source plotting library for statistical data. MIT
  • GitHub (👨‍💻 19 · 🔀 43 · 📥 400 · 📦 29 · 📋 340 - 25% open · ⏱️ 23.03.2023):

     git clone https://github.com/JetBrains/lets-plot
    
  • PyPi (📥 5.3K / month · 📦 1 · ⏱️ 15.12.2022):

     pip install lets-plot
    
Chartify (🥉25 · 3.3K) - Python library that makes it easy for data scientists to create.. Apache-2
  • GitHub (👨‍💻 25 · 🔀 300 · 📦 71 · 📋 77 - 59% open · ⏱️ 22.03.2023):

     git clone https://github.com/spotify/chartify
    
  • PyPi (📥 4.2K / month · 📦 10 · ⏱️ 13.12.2022):

     pip install chartify
    
  • Conda (📥 25K · ⏱️ 07.11.2020):

     conda install -c conda-forge chartify
    
Pandas-Bokeh (🥉25 · 840) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
  • GitHub (👨‍💻 15 · 🔀 100 · 📦 440 · 📋 100 - 31% open · ⏱️ 06.03.2023):

     git clone https://github.com/PatrikHlobil/Pandas-Bokeh
    
  • PyPi (📥 29K / month · 📦 11 · ⏱️ 11.04.2021):

     pip install pandas-bokeh
    
HiPlot (🥉24 · 2.5K) - HiPlot makes understanding high dimensional data easy. MIT
  • GitHub (👨‍💻 8 · 🔀 120 · 📦 260 · 📋 87 - 18% open · ⏱️ 03.03.2023):

     git clone https://github.com/facebookresearch/hiplot
    
  • PyPi (📥 36K / month · 📦 12 · ⏱️ 27.10.2022):

     pip install hiplot
    
  • Conda (📥 130K · ⏱️ 31.05.2022):

     conda install -c conda-forge hiplot
    
Plotly-Resampler (🥉24 · 680) - Visualize large time series data with plotly.py. MIT
  • GitHub (👨‍💻 10 · 🔀 44 · 📦 150 · 📋 95 - 31% open · ⏱️ 08.03.2023):

     git clone https://github.com/predict-idlab/plotly-resampler
    
  • PyPi (📥 200K / month · 📦 4 · ⏱️ 29.06.2022):

     pip install plotly-resampler
    
  • Conda (📥 13K · ⏱️ 09.03.2023):

     conda install -c conda-forge plotly-resampler
    
vega (🥉24 · 340) - IPython/Jupyter notebook module for Vega and Vega-Lite. BSD-3
  • GitHub (👨‍💻 13 · 🔀 59 · 📦 2 · 📋 100 - 11% open · ⏱️ 28.02.2023):

     git clone https://github.com/vega/ipyvega
    
  • PyPi (📥 12K / month · 📦 84 · ⏱️ 10.02.2022):

     pip install vega
    
  • Conda (📥 540K · ⏱️ 05.12.2022):

     conda install -c conda-forge vega
    
Sweetviz (🥉23 · 2.4K · 💤) - Visualize and compare datasets, target values and associations, with.. MIT
  • GitHub (👨‍💻 6 · 🔀 230 · 📋 110 - 32% open · ⏱️ 08.06.2022):

     git clone https://github.com/fbdesignpro/sweetviz
    
  • PyPi (📥 80K / month · 📦 8 · ⏱️ 14.06.2022):

     pip install sweetviz
    
  • Conda (📥 19K · ⏱️ 15.06.2022):

     conda install -c conda-forge sweetviz
    
python-ternary (🥉22 · 620) - Ternary plotting library for python with matplotlib. MIT
  • GitHub (👨‍💻 27 · 🔀 140 · 📥 19 · 📦 120 · 📋 140 - 27% open · ⏱️ 31.12.2022):

     git clone https://github.com/marcharper/python-ternary
    
  • PyPi (📥 61K / month · 📦 21 · ⏱️ 17.02.2021):

     pip install python-ternary
    
  • Conda (📥 72K · ⏱️ 17.02.2021):

     conda install -c conda-forge python-ternary
    
Popmon (🥉22 · 430) - Monitor the stability of a Pandas or Spark dataframe. MIT
  • GitHub (👨‍💻 16 · 🔀 32 · 📥 45 · 📦 19 · 📋 49 - 30% open · ⏱️ 15.02.2023):

     git clone https://github.com/ing-bank/popmon
    
  • PyPi (📥 17K / month · 📦 2 · ⏱️ 19.10.2022):

     pip install popmon
    
PyWaffle (🥉20 · 540 · 💤) - Make Waffle Charts in Python. MIT
  • GitHub (👨‍💻 6 · 🔀 99 · 📦 220 · 📋 21 - 23% open · ⏱️ 08.06.2022):

     git clone https://github.com/gyli/PyWaffle
    
  • PyPi (📥 2.8K / month · 📦 2 · ⏱️ 08.06.2022):

     pip install pywaffle
    
  • Conda (📥 8.7K · ⏱️ 05.06.2022):

     conda install -c conda-forge pywaffle
    
Show 15 hidden projects...
  • cartopy (🥈33 · 1.2K) - Cartopy - a cartographic python library with matplotlib support. ❗️LGPL-3.0
  • Cufflinks (🥉29 · 2.8K · 💀) - Productivity Tools for Plotly + Pandas. MIT
  • PandasGUI (🥉25 · 2.9K · 💤) - A GUI for Pandas DataFrames. ❗️MIT-0
  • HyperTools (🥉25 · 1.8K · 💀) - A Python toolbox for gaining geometric insights into high-.. MIT
  • PDPbox (🥉24 · 730 · 💀) - python partial dependence plot toolbox. MIT
  • Multicore-TSNE (🥉23 · 1.8K · 💀) - Parallel t-SNE implementation with Python and Torch.. BSD-3
  • pivottablejs (🥉23 · 540 · 💀) - Dragndrop Pivot Tables and Charts for Jupyter/IPython.. MIT
  • joypy (🥉21 · 480 · 💀) - Joyplots in Python with matplotlib & pandas. MIT
  • ivis (🥉20 · 300) - Dimensionality reduction in very large datasets using Siamese.. Apache-2
  • vegafusion (🥉20 · 170) - Serverside scaling for the Vega and Altair visualizations. BSD-3
  • animatplot (🥉18 · 400 · 💀) - A python package for animating plots build on matplotlib. MIT
  • data-describe (🥉17 · 290 · 💀) - datadescribe: Pythonic EDA Accelerator for Data Science. Apache-2
  • pdvega (🥉16 · 340 · 💀) - Interactive plotting for Pandas using Vega-Lite. MIT
  • nx-altair (🥉16 · 210 · 💀) - Draw interactive NetworkX graphs with Altair. MIT
  • nptsne (🥉12 · 30 · 💀) - nptsne is a numpy compatible python binary package that offers a.. Apache-2

Text Data & NLP

Back to top

Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.

transformers (🥇49 · 87K) - Transformers: State-of-the-art Machine Learning for.. Apache-2
  • GitHub (👨‍💻 1.8K · 🔀 19K · 📥 270 · 📦 60K · 📋 11K - 5% open · ⏱️ 23.03.2023):

     git clone https://github.com/huggingface/transformers
    
  • PyPi (📥 12M / month · 📦 1.5K · ⏱️ 24.01.2023):

     pip install transformers
    
  • Conda (📥 970K · ⏱️ 22.03.2023):

     conda install -c conda-forge transformers
    
spaCy (🥇44 · 26K · 📈) - Industrial-strength Natural Language Processing (NLP) in Python. MIT
  • GitHub (👨‍💻 720 · 🔀 4K · 📦 56K · 📋 5.4K - 1% open · ⏱️ 22.03.2023):

     git clone https://github.com/explosion/spaCy
    
  • PyPi (📥 5.2M / month · 📦 2.6K · ⏱️ 20.01.2023):

     pip install spacy
    
  • Conda (📥 3M · ⏱️ 13.03.2023):

     conda install -c conda-forge spacy
    
nltk (🥇44 · 12K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
  • GitHub (👨‍💻 440 · 🔀 2.7K · 📦 180K · 📋 1.7K - 13% open · ⏱️ 13.03.2023):

     git clone https://github.com/nltk/nltk
    
  • PyPi (📥 11M / month · 📦 12K · ⏱️ 02.01.2023):

     pip install nltk
    
  • Conda (📥 1.8M · ⏱️ 02.01.2023):

     conda install -c conda-forge nltk
    
Rasa (🥇41 · 16K) - Open source machine learning framework to automate text- and voice-.. Apache-2
  • GitHub (👨‍💻 580 · 🔀 4.3K · 📦 3.1K · 📋 6.8K - 1% open · ⏱️ 23.03.2023):

     git clone https://github.com/RasaHQ/rasa
    
  • PyPi (📥 150K / month · 📦 65 · ⏱️ 27.01.2023):

     pip install rasa
    
gensim (🥇41 · 14K) - Topic Modelling for Humans. ❗️LGPL-2.1
  • GitHub (👨‍💻 440 · 🔀 4.3K · 📥 4.3K · 📦 43K · 📋 1.8K - 21% open · ⏱️ 13.03.2023):

     git clone https://github.com/RaRe-Technologies/gensim
    
  • PyPi (📥 4.5M / month · 📦 3.1K · ⏱️ 21.12.2022):

     pip install gensim
    
  • Conda (📥 1M · ⏱️ 21.12.2022):

     conda install -c conda-forge gensim
    
flair (🥇40 · 13K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT
  • GitHub (👨‍💻 240 · 🔀 2K · 📦 2.3K · 📋 2.1K - 3% open · ⏱️ 22.03.2023):

     git clone https://github.com/flairNLP/flair
    
  • PyPi (📥 93K / month · 📦 90 · ⏱️ 20.05.2022):

     pip install flair
    
  • Conda (📥 16K · ⏱️ 21.05.2022):

     conda install -c conda-forge python-flair
    
fairseq (🥇36 · 21K) - Facebook AI Research Sequence-to-Sequence Toolkit written in Python. MIT
  • GitHub (👨‍💻 410 · 🔀 5.3K · 📥 300 · 📦 1.3K · 📋 3.9K - 24% open · ⏱️ 23.02.2023):

     git clone https://github.com/facebookresearch/fairseq
    
  • PyPi (📥 86K / month · 📦 50 · ⏱️ 27.06.2022):

     pip install fairseq
    
  • Conda (📥 25K · ⏱️ 13.07.2022):

     conda install -c conda-forge fairseq
    
AllenNLP (🥇36 · 11K) - An open-source NLP research library, built on PyTorch. Apache-2
  • GitHub (👨‍💻 260 · 🔀 2.2K · 📥 54 · 📦 3.3K · 📋 2.6K - 3% open · ⏱️ 22.11.2022):

     git clone https://github.com/allenai/allennlp
    
  • PyPi (📥 63K / month · 📦 180 · ⏱️ 18.10.2022):

     pip install allennlp
    
  • Conda (📥 100K · ⏱️ 15.07.2022):

     conda install -c conda-forge allennlp
    
spark-nlp (🥇36 · 3.2K) - State of the Art Natural Language Processing. Apache-2
  • GitHub (👨‍💻 140 · 🔀 650 · 📦 260 · 📋 780 - 4% open · ⏱️ 23.03.2023):

     git clone https://github.com/JohnSnowLabs/spark-nlp
    
  • PyPi (📥 2.9M / month · 📦 19 · ⏱️ 24.01.2023):

     pip install spark-nlp
    
TextBlob (🥈35 · 8.5K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
  • GitHub (👨‍💻 36 · 🔀 1.1K · 📥 100 · 📦 28K · 📋 270 - 41% open · ⏱️ 11.03.2023):

     git clone https://github.com/sloria/TextBlob
    
  • PyPi (📥 980K / month · 📦 1.4K · ⏱️ 22.10.2021):

     pip install textblob
    
  • Conda (📥 210K · ⏱️ 24.02.2019):

     conda install -c conda-forge textblob
    
torchtext (🥈35 · 3.2K) - Models, data loaders and abstractions for language processing,.. BSD-3
  • GitHub (👨‍💻 140 · 🔀 790 · 📋 790 - 37% open · ⏱️ 21.03.2023):

     git clone https://github.com/pytorch/text
    
  • PyPi (📥 480K / month · 📦 460 · ⏱️ 15.12.2022):

     pip install torchtext
    
sentence-transformers (🥈34 · 9.8K) - Multilingual Sentence & Image Embeddings with BERT. Apache-2
  • GitHub (👨‍💻 110 · 🔀 1.9K · 📦 6.7K · 📋 1.7K - 54% open · ⏱️ 26.01.2023):

     git clone https://github.com/UKPLab/sentence-transformers
    
  • PyPi (📥 1.9M / month · 📦 230 · ⏱️ 26.06.2022):

     pip install sentence-transformers
    
  • Conda (📥 93K · ⏱️ 27.06.2022):

     conda install -c conda-forge sentence-transformers
    
haystack (🥈34 · 7.5K) - Haystack is an open source NLP framework to interact with your data.. Apache-2
  • GitHub (👨‍💻 160 · 🔀 1.1K · 📥 20 · 📦 590 · 📋 2.1K - 16% open · ⏱️ 23.03.2023):

     git clone https://github.com/deepset-ai/haystack
    
  • PyPi (📥 2.1K / month · 📦 85 · ⏱️ 03.07.2017):

     pip install haystack
    
sentencepiece (🥈34 · 6.9K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
  • GitHub (👨‍💻 74 · 🔀 890 · 📥 25K · 📦 25K · 📋 590 - 4% open · ⏱️ 21.02.2023):

     git clone https://github.com/google/sentencepiece
    
  • PyPi (📥 9.3M / month · 📦 580 · ⏱️ 07.08.2022):

     pip install sentencepiece
    
  • Conda (📥 370K · ⏱️ 15.02.2023):

     conda install -c conda-forge sentencepiece
    
Tokenizers (🥈34 · 6.6K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
  • GitHub (👨‍💻 66 · 🔀 560 · 📦 56 · 📋 750 - 32% open · ⏱️ 21.03.2023):

     git clone https://github.com/huggingface/tokenizers
    
  • PyPi (📥 11M / month · 📦 190 · ⏱️ 07.11.2022):

     pip install tokenizers
    
  • Conda (📥 930K · ⏱️ 26.01.2023):

     conda install -c conda-forge tokenizers
    
stanza (🥈34 · 6.5K) - Official Stanford NLP Python Library for Many Human Languages. Apache-2
  • GitHub (👨‍💻 58 · 🔀 850 · 📦 1.7K · 📋 790 - 11% open · ⏱️ 14.03.2023):

     git clone https://github.com/stanfordnlp/stanza
    
  • PyPi (📥 180K / month · 📦 84 · ⏱️ 20.09.2022):

     pip install stanza
    
  • Conda (📥 6.4K · ⏱️ 14.09.2022):

     conda install -c stanfordnlp stanza
    
NeMo (🥈34 · 6K) - NeMo: a toolkit for conversational AI. Apache-2
  • GitHub (👨‍💻 210 · 🔀 1.4K · 📥 30K · 📋 1.5K - 4% open · ⏱️ 23.03.2023):

     git clone https://github.com/NVIDIA/NeMo
    
  • PyPi (📥 28K / month · 📦 11 · ⏱️ 01.07.2022):

     pip install nemo-toolkit
    
TensorFlow Text (🥈34 · 1.1K) - Making text a first-class citizen in TensorFlow. Apache-2
  • GitHub (👨‍💻 100 · 🔀 280 · 📦 4.9K · 📋 260 - 42% open · ⏱️ 22.03.2023):

     git clone https://github.com/tensorflow/text
    
  • PyPi (📥 3.4M / month · 📦 100 · ⏱️ 18.11.2022):

     pip install tensorflow-text
    
ParlAI (🥈33 · 9.9K) - A framework for training and evaluating AI models on a variety of.. MIT
  • GitHub (👨‍💻 210 · 🔀 2K · 📦 170 · 📋 1.5K - 4% open · ⏱️ 22.03.2023):

     git clone https://github.com/facebookresearch/ParlAI
    
  • PyPi (📥 3K / month · 📦 3 · ⏱️ 20.09.2022):

     pip install parlai
    
OpenNMT (🥈33 · 5.9K) - Open Source Neural Machine Translation in PyTorch. MIT
  • GitHub (👨‍💻 190 · 🔀 2.2K · 📦 180 · 📋 1.4K - 1% open · ⏱️ 23.03.2023):

     git clone https://github.com/OpenNMT/OpenNMT-py
    
  • PyPi (📥 5.3K / month · 📦 9 · ⏱️ 19.12.2022):

     pip install OpenNMT-py
    
Dedupe (🥈33 · 3.7K) - A python library for accurate and scalable fuzzy matching, record.. MIT
  • GitHub (👨‍💻 69 · 🔀 500 · 📦 270 · 📋 800 - 8% open · ⏱️ 17.02.2023):

     git clone https://github.com/dedupeio/dedupe
    
  • PyPi (📥 140K / month · 📦 49 · ⏱️ 18.01.2023):

     pip install dedupe
    
  • Conda (📥 25K · ⏱️ 12.12.2022):

     conda install -c conda-forge dedupe
    
fastText (🥈32 · 24K · 💤) - Library for fast text representation and classification. MIT
  • GitHub (👨‍💻 59 · 🔀 4.5K · 📋 1.1K - 46% open · ⏱️ 04.03.2022):

     git clone https://github.com/facebookresearch/fastText
    
  • PyPi (📥 950K / month · 📦 190 · ⏱️ 28.04.2020):

     pip install fasttext
    
  • Conda (📥 50K · ⏱️ 01.11.2022):

     conda install -c conda-forge fasttext
    
DeepPavlov (🥈32 · 6.1K) - An open source library for deep learning end-to-end dialog.. Apache-2
  • GitHub (👨‍💻 73 · 🔀 1.1K · 📦 330 · 📋 620 - 8% open · ⏱️ 14.03.2023):

     git clone https://github.com/deepmipt/DeepPavlov
    
  • PyPi (📥 8.7K / month · 📦 6 · ⏱️ 17.02.2023):

     pip install deeppavlov
    
jellyfish (🥈31 · 1.8K) - a python library for doing approximate and phonetic matching of.. BSD-2
  • GitHub (👨‍💻 28 · 🔀 150 · 📦 5.5K · 📋 120 - 8% open · ⏱️ 03.02.2023):

     git clone https://github.com/jamesturk/jellyfish
    
  • PyPi (📥 2.5M / month · 📦 430 · ⏱️ 07.01.2022):

     pip install jellyfish
    
  • Conda (📥 500K · ⏱️ 28.10.2022):

     conda install -c conda-forge jellyfish
    
spacy-transformers (🥈31 · 1.2K) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT spacy
  • GitHub (👨‍💻 21 · 🔀 160 · 📦 900 · ⏱️ 22.03.2023):

     git clone https://github.com/explosion/spacy-transformers
    
  • PyPi (📥 220K / month · 📦 30 · ⏱️ 26.01.2023):

     pip install spacy-transformers
    
  • Conda (📥 11K · ⏱️ 28.02.2023):

     conda install -c conda-forge spacy-transformers
    
snowballstemmer (🥈31 · 650) - Snowball compiler and stemming algorithms. BSD-3
  • GitHub (👨‍💻 30 · 🔀 170 · 📦 4 · 📋 80 - 37% open · ⏱️ 16.11.2022):

     git clone https://github.com/snowballstem/snowball
    
  • PyPi (📥 9M / month · 📦 6.8K · ⏱️ 16.11.2021):

     pip install snowballstemmer
    
  • Conda (📥 6.2M · ⏱️ 17.11.2021):

     conda install -c conda-forge snowballstemmer
    
rubrix (🥈30 · 1.7K) - Argilla: Open-source platform empowering teams to make better LLM and.. Apache-2
  • GitHub (👨‍💻 43 · 🔀 160 · 📦 67 · 📋 970 - 19% open · ⏱️ 23.03.2023):

     git clone https://github.com/recognai/rubrix
    
  • PyPi (📥 1.1K / month · ⏱️ 24.10.2022):

     pip install rubrix
    
  • Conda (📥 18K · ⏱️ 06.10.2022):

     conda install -c conda-forge rubrix
    
nlpaug (🥈29 · 3.9K · 💤) - Data augmentation for NLP. MIT
  • GitHub (👨‍💻 33 · 🔀 430 · 📦 640 · 📋 210 - 27% open · ⏱️ 07.07.2022):

     git clone https://github.com/makcedward/nlpaug
    
  • PyPi (📥 180K / month · 📦 21 · ⏱️ 07.07.2022):

     pip install nlpaug
    
  • Conda (📥 7.7K · ⏱️ 30.01.2023):

     conda install -c conda-forge nlpaug
    
ftfy (🥈29 · 3.5K) - Fixes mojibake and other glitches in Unicode text, after the fact. MIT
  • GitHub (👨‍💻 18 · 🔀 120 · 📦 9.7K · 📋 130 - 11% open · ⏱️ 25.10.2022):

     git clone https://github.com/rspeer/python-ftfy
    
  • PyPi (📥 5.3M / month · 📦 570 · ⏱️ 09.02.2022):

     pip install ftfy
    
  • Conda (📥 240K · ⏱️ 13.03.2022):

     conda install -c conda-forge ftfy
    
Sumy (🥈29 · 3.1K) - Module for automatic summarization of text documents and HTML pages. Apache-2
  • GitHub (👨‍💻 26 · 🔀 500 · 📦 1.8K · 📋 110 - 15% open · ⏱️ 21.02.2023):

     git clone https://github.com/miso-belica/sumy
    
  • PyPi (📥 25K / month · 📦 100 · ⏱️ 23.10.2022):

     pip install sumy
    
  • Conda (📥 4.2K · ⏱️ 25.10.2022):

     conda install -c conda-forge sumy
    
textacy (🥈29 · 2K · 💤) - NLP, before and after spaCy. Apache-2
  • GitHub (👨‍💻 32 · 🔀 240 · 📋 260 - 11% open · ⏱️ 06.03.2022):

     git clone https://github.com/chartbeat-labs/textacy
    
  • PyPi (📥 130K / month · 📦 100 · ⏱️ 06.12.2021):

     pip install textacy
    
  • Conda (📥 120K · ⏱️ 09.03.2023):

     conda install -c conda-forge textacy
    
english-words (🥈28 · 8.8K) - A text file containing 479k English words for all your.. Unlicense
  • GitHub (👨‍💻 30 · 🔀 1.6K · 📋 120 - 73% open · ⏱️ 08.11.2022):

     git clone https://github.com/dwyl/english-words
    
  • PyPi (📥 320K / month · 📦 7 · ⏱️ 06.01.2023):

     pip install english-words
    
fastNLP (🥈28 · 2.9K) - fastNLP: A Modularized and Extensible NLP Framework. Currently still.. Apache-2
  • GitHub (👨‍💻 62 · 🔀 440 · 📥 67 · 📦 130 · 📋 210 - 25% open · ⏱️ 13.12.2022):

     git clone https://github.com/fastnlp/fastNLP
    
  • PyPi (📥 9.9K / month · 📦 3 · ⏱️ 04.02.2019):

     pip install fastnlp
    
GluonNLP (🥈28 · 2.5K) - Toolkit that enables easy text preprocessing, datasets loading.. Apache-2
  • GitHub (👨‍💻 84 · 🔀 520 · 📦 1.2K · 📋 560 - 46% open · ⏱️ 25.12.2022):

     git clone https://github.com/dmlc/gluon-nlp
    
  • PyPi (📥 190K / month · 📦 22 · ⏱️ 13.08.2020):

     pip install gluonnlp
    
scattertext (🥈28 · 2.1K) - Beautiful visualizations of how language differs among document.. Apache-2
  • GitHub (👨‍💻 13 · 🔀 280 · 📦 360 · 📋 95 - 20% open · ⏱️ 28.02.2023):

     git clone https://github.com/JasonKessler/scattertext
    
  • PyPi (📥 11K / month · 📦 10 · ⏱️ 26.03.2022):

     pip install scattertext
    
  • Conda (📥 75K · ⏱️ 08.12.2022):

     conda install -c conda-forge scattertext
    
Ciphey (🥈27 · 13K) - Automatically decrypt encryptions without knowing the key or cipher, decode.. MIT
  • GitHub (👨‍💻 46 · 🔀 740 · 📋 310 - 16% open · ⏱️ 05.12.2022):

     git clone https://github.com/Ciphey/Ciphey
    
  • PyPi (📥 37K / month · ⏱️ 06.06.2021):

     pip install ciphey
    
  • Docker Hub (📥 18K · 11 · ⏱️ 10.03.2023):

     docker pull remnux/ciphey
    
T5 (🥈27 · 4.9K) - Code for the paper Exploring the Limits of Transfer Learning with a.. Apache-2
  • GitHub (👨‍💻 54 · 🔀 660 · 📦 160 · 📋 440 - 22% open · ⏱️ 20.03.2023):

     git clone https://github.com/google-research/text-to-text-transfer-transformer
    
  • PyPi (📥 19K / month · 📦 3 · ⏱️ 18.10.2021):

     pip install t5
    
vaderSentiment (🥈27 · 3.9K · 💤) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary.. MIT
  • GitHub (👨‍💻 11 · 🔀 930 · 📦 6.3K · 📋 120 - 37% open · ⏱️ 01.04.2022):

     git clone https://github.com/cjhutto/vaderSentiment
    
  • PyPi (📥 220K / month · 📦 180 · ⏱️ 22.05.2020):

     pip install vadersentiment
    
  • Conda (📥 12K · ⏱️ 22.03.2021):

     conda install -c conda-forge vadersentiment
    
TextDistance (🥈27 · 3.1K) - Compute distance between sequences. 30+ algorithms, pure python.. MIT
  • GitHub (👨‍💻 13 · 🔀 240 · 📥 890 · 📦 3.7K · ⏱️ 18.09.2022):

     git clone https://github.com/life4/textdistance
    
  • PyPi (📥 440K / month · 📦 56 · ⏱️ 20.09.2022):

     pip install textdistance
    
  • Conda (📥 340K · ⏱️ 18.09.2022):

     conda install -c conda-forge textdistance
    
DeepKE (🥈27 · 1.7K) - An Open Toolkit for Knowledge Graph Extraction and Construction.. MIT
  • GitHub (👨‍💻 17 · 🔀 460 · 📦 14 · 📋 220 - 0% open · ⏱️ 21.03.2023):

     git clone https://github.com/zjunlp/deepke
    
  • PyPi (📥 2.7K / month · ⏱️ 28.12.2022):

     pip install deepke
    
SciSpacy (🥈27 · 1.3K) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
  • GitHub (👨‍💻 27 · 🔀 180 · 📦 640 · 📋 280 - 11% open · ⏱️ 06.11.2022):

     git clone https://github.com/allenai/scispacy
    
  • PyPi (📥 46K / month · 📦 21 · ⏱️ 07.09.2022):

     pip install scispacy
    
CLTK (🥈27 · 780) - The Classical Language Toolkit. MIT
  • GitHub (👨‍💻 120 · 🔀 310 · 📥 25 · 📦 230 · 📋 540 - 5% open · ⏱️ 06.03.2023):

     git clone https://github.com/cltk/cltk
    
  • PyPi (📥 2.2K / month · 📦 42 · ⏱️ 13.10.2022):

     pip install cltk
    
PyText (🥉26 · 6.4K) - A natural language modeling framework based on PyTorch. BSD-3
  • GitHub (👨‍💻 230 · 🔀 820 · 📥 320 · 📦 110 · 📋 220 - 66% open · ⏱️ 17.10.2022):

     git clone https://github.com/facebookresearch/pytext
    
  • PyPi (📥 220 / month · 📦 1 · ⏱️ 08.06.2020):

     pip install pytext-nlp
    
PyTextRank (🥉26 · 2K · 💤) - Python implementation of TextRank algorithms (textgraphs) for.. MIT
  • GitHub (👨‍💻 18 · 🔀 330 · 📦 360 · 📋 93 - 22% open · ⏱️ 27.07.2022):

     git clone https://github.com/DerwenAI/pytextrank
    
  • PyPi (📥 50K / month · 📦 16 · ⏱️ 27.07.2022):

     pip install pytextrank
    
qdrant (🥉25 · 5.1K) - Qdrant - Vector Search Engine and Database for the next generation of.. Apache-2
  • GitHub (👨‍💻 35 · 🔀 230 · 📋 420 - 11% open · ⏱️ 22.03.2023):

     git clone https://github.com/qdrant/qdrant
    
sense2vec (🥉25 · 1.5K) - Contextually-keyed word vectors. MIT
  • GitHub (👨‍💻 18 · 🔀 240 · 📥 49K · 📦 240 · 📋 110 - 18% open · ⏱️ 08.12.2022):

     git clone https://github.com/explosion/sense2vec
    
  • PyPi (📥 4.6K / month · 📦 11 · ⏱️ 08.12.2022):

     pip install sense2vec
    
  • Conda (📥 31K · ⏱️ 14.07.2021):

     conda install -c conda-forge sense2vec
    
detoxify (🥉25 · 580) - Trained models & code to predict toxic comments on all 3 Jigsaw.. Apache-2
  • GitHub (👨‍💻 9 · 🔀 78 · 📥 160K · 📦 250 · 📋 51 - 58% open · ⏱️ 19.12.2022):

     git clone https://github.com/unitaryai/detoxify
    
  • PyPi (📥 70K / month · 📦 3 · ⏱️ 19.12.2022):

     pip install detoxify
    
OpenPrompt (🥉24 · 2.7K) - An Open-Source Framework for Prompt-Learning. Apache-2
  • GitHub (👨‍💻 19 · 🔀 310 · 📦 36 · 📋 210 - 26% open · ⏱️ 05.02.2023):

     git clone https://github.com/thunlp/OpenPrompt
    
  • PyPi (📥 2K / month · 📦 1 · ⏱️ 06.07.2022):

     pip install openprompt
    
FARM (🥉24 · 1.6K · 💤) - Fast & easy transfer learning for NLP. Harvesting language.. Apache-2
  • GitHub (👨‍💻 37 · 🔀 230 · 📋 400 - 0% open · ⏱️ 31.08.2022):

     git clone https://github.com/deepset-ai/FARM
    
  • PyPi (📥 4.6K / month · 📦 3 · ⏱️ 10.06.2021):

     pip install farm
    
  • Conda (📥 2.2K · ⏱️ 14.06.2021):

     conda install -c conda-forge farm
    
promptsource (🥉23 · 1.4K) - Toolkit for creating, sharing and using natural language.. Apache-2
  • GitHub (👨‍💻 59 · 🔀 220 · 📦 34 · 📋 160 - 14% open · ⏱️ 20.01.2023):

     git clone https://github.com/bigscience-workshop/promptsource
    
  • PyPi (📥 4K / month · ⏱️ 18.04.2022):

     pip install promptsource
    
Sockeye (🥉23 · 1.2K) - Sequence-to-sequence framework with a focus on Neural Machine.. Apache-2
  • GitHub (👨‍💻 59 · 🔀 330 · 📥 16 · 📋 300 - 0% open · ⏱️ 02.03.2023):

     git clone https://github.com/awslabs/sockeye
    
  • PyPi (📥 840 / month · 📦 2 · ⏱️ 03.03.2023):

     pip install sockeye
    
small-text (🥉23 · 430) - Active Learning for Text Classification in Python. MIT
  • GitHub (👨‍💻 3 · 🔀 44 · 📦 18 · 📋 30 - 26% open · ⏱️ 21.02.2023):

     git clone https://github.com/webis-de/small-text
    
  • PyPi (📥 720 / month · ⏱️ 14.10.2022):

     pip install small-text
    
  • Conda (📥 2K · ⏱️ 21.02.2023):

     conda install -c conda-forge small-text
    
gpt-2-simple (🥉22 · 3.2K · 💤) - Python package to easily retrain OpenAIs GPT-2 text-.. MIT
  • GitHub (👨‍💻 21 · 🔀 650 · 📥 410 · 📋 280 - 64% open · ⏱️ 22.05.2022):

     git clone https://github.com/minimaxir/gpt-2-simple
    
  • PyPi (📥 7.1K / month · 📦 6 · ⏱️ 18.10.2021):

     pip install gpt-2-simple
    
Texthero (🥉22 · 2.7K) - Text preprocessing, representation and visualization from zero to hero. MIT
  • GitHub (👨‍💻 20 · 🔀 230 · 📥 100 · 📋 140 - 55% open · ⏱️ 28.10.2022):

     git clone https://github.com/jbesomi/texthero
    
  • PyPi (📥 23K / month · 📦 6 · ⏱️ 01.07.2021):

     pip install texthero
    
lightseq (🥉22 · 2.6K) - LightSeq: A High Performance Library for Sequence Processing and.. Apache-2
  • GitHub (👨‍💻 17 · 🔀 290 · 📥 650 · 📋 260 - 58% open · ⏱️ 22.03.2023):

     git clone https://github.com/bytedance/lightseq
    
  • PyPi (📥 940 / month · 📦 1 · ⏱️ 03.11.2022):

     pip install lightseq
    
jiant (🥉22 · 1.5K) - jiant is an nlp toolkit. MIT
  • GitHub (👨‍💻 59 · 🔀 280 · 📦 2 · 📋 550 - 12% open · ⏱️ 17.10.2022):

     git clone https://github.com/nyu-mll/jiant
    
  • PyPi (📥 160 / month · ⏱️ 10.05.2021):

     pip install jiant
    
YouTokenToMe (🥉22 · 860) - Unsupervised text tokenizer focused on computational efficiency. MIT
  • GitHub (👨‍💻 8 · 🔀 69 · 📦 390 · 📋 60 - 60% open · ⏱️ 10.03.2023):

     git clone https://github.com/vkcom/youtokentome
    
  • PyPi (📥 25K / month · 📦 22 · ⏱️ 12.02.2020):

     pip install youtokentome
    
  • Conda (📥 26K · ⏱️ 30.10.2022):

     conda install -c conda-forge youtokentome
    
happy-transformer (🥉22 · 400) - A package built on top of Hugging Faces transformers.. Apache-2 huggingface
  • GitHub (👨‍💻 14 · 🔀 51 · 📦 130 · 📋 120 - 20% open · ⏱️ 31.10.2022):

     git clone https://github.com/EricFillion/happy-transformer
    
  • PyPi (📥 12K / month · 📦 2 · ⏱️ 06.02.2022):

     pip install happytransformer
    
NLP Architect (🥉21 · 2.9K) - A model library for exploring state-of-the-art deep learning.. Apache-2
  • GitHub (👨‍💻 38 · 🔀 450 · 📦 9 · 📋 130 - 16% open · ⏱️ 07.11.2022):

     git clone https://github.com/IntelLabs/nlp-architect
    
  • PyPi (📥 160 / month · ⏱️ 12.04.2020):

     pip install nlp-architect
    
fast-bert (🥉21 · 1.8K) - Super easy library for BERT based NLP models. Apache-2
  • GitHub (👨‍💻 36 · 🔀 340 · 📋 250 - 62% open · ⏱️ 27.09.2022):

     git clone https://github.com/utterworks/fast-bert
    
  • PyPi (📥 2.9K / month · 📦 2 · ⏱️ 22.09.2022):

     pip install fast-bert
    
finetune (🥉20 · 680) - Scikit-learn style model finetuning for NLP. MPL-2.0
  • GitHub (👨‍💻 20 · 🔀 75 · 📦 9 · 📋 140 - 16% open · ⏱️ 02.03.2023):

     git clone https://github.com/IndicoDataSolutions/finetune
    
  • PyPi (📥 160 / month · 📦 2 · ⏱️ 20.12.2021):

     pip install finetune
    
textaugment (🥉19 · 300 · 💤) - TextAugment: Text Augmentation Library. MIT
  • GitHub (👨‍💻 6 · 🔀 55 · 📥 54 · 📦 43 · 📋 20 - 35% open · ⏱️ 17.05.2022):

     git clone https://github.com/dsfsi/textaugment
    
  • PyPi (📥 3.9K / month · 📦 1 · ⏱️ 05.11.2020):

     pip install textaugment
    
TextBox (🥉18 · 930) - TextBox 2.0 is a text generation library with pre-trained language models. MIT
  • GitHub (👨‍💻 18 · 🔀 97 · 📦 5 · 📋 45 - 6% open · ⏱️ 25.02.2023):

     git clone https://github.com/RUCAIBox/TextBox
    
  • PyPi (📥 1 / month · ⏱️ 15.04.2021):

     pip install textbox
    
fastT5 (🥉18 · 440 · 💤) - boost inference speed of T5 models by 5x & reduce the model size.. Apache-2
  • GitHub (👨‍💻 5 · 🔀 52 · 📦 28 · 📋 57 - 29% open · ⏱️ 05.04.2022):

     git clone https://github.com/Ki6an/fastT5
    
  • PyPi (📥 1.4K / month · ⏱️ 05.04.2022):

     pip install fastt5
    
OpenNRE (🥉16 · 3.9K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
  • GitHub (👨‍💻 12 · 🔀 1K · 📋 360 - 1% open · ⏱️ 03.01.2023):

     git clone https://github.com/thunlp/OpenNRE
    
Translate (🥉15 · 780 · 💤) - Translate - a PyTorch Language Library. BSD-3
  • GitHub (👨‍💻 88 · 🔀 200 · 📋 55 - 50% open · ⏱️ 10.06.2022):

     git clone https://github.com/pytorch/translate
    
  • PyPi (📥 18 / month · ⏱️ 01.05.2018):

     pip install pytorch-translate
    
VizSeq (🥉14 · 410) - An Analysis Toolkit for Natural Language Generation (Translation,.. MIT
  • GitHub (👨‍💻 3 · 🔀 52 · 📦 6 · 📋 16 - 43% open · ⏱️ 02.01.2023):

     git clone https://github.com/facebookresearch/vizseq
    
  • PyPi (📥 57 / month · ⏱️ 07.08.2020):

     pip install vizseq
    
Show 33 hidden projects...
  • ChatterBot (🥇36 · 13K · 💀) - ChatterBot is a machine learning, conversational dialog engine.. BSD-3
  • fuzzywuzzy (🥈34 · 8.9K · 💀) - Fuzzy String Matching in Python. ❗️GPL-2.0
  • polyglot (🥈28 · 2.1K · 💀) - Multilingual text (NLP) processing toolkit. ❗️GPL-3.0
  • langid (🥈27 · 2.1K · 💀) - Stand-alone language identification system. BSD-3
  • underthesea (🥈27 · 1.1K) - Underthesea - Vietnamese NLP Toolkit. ❗️GPL-3.0
  • flashtext (🥉26 · 5.4K · 💀) - Extract Keywords from sentence or Replace keywords in sentences. MIT
  • neuralcoref (🥉26 · 2.7K · 💀) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
  • textgenrnn (🥉25 · 4.9K · 💀) - Easily train your own text-generating neural network of any.. MIT
  • Snips NLU (🥉25 · 3.8K · 💀) - Snips Python library to extract meaning from text. Apache-2
  • pytorch-nlp (🥉25 · 2.2K · 💀) - Basic Utilities for PyTorch Natural Language Processing.. BSD-3
  • MatchZoo (🥉24 · 3.8K · 💀) - Facilitating the design, comparison and sharing of deep.. Apache-2
  • whoosh (🥉24 · 380 · 💀) - Pure-Python full-text search library. ❗️BSD-1-Clause
  • DeepMatcher (🥉23 · 4.6K · 💀) - Python package for performing Entity and Text Matching using.. BSD-3
  • Kashgari (🥉23 · 2.4K · 💀) - Kashgari is a production-level NLP Transfer learning.. Apache-2
  • pySBD (🥉23 · 590 · 💀) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
  • Texar (🥉22 · 2.4K · 💀) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2
  • DELTA (🥉21 · 1.5K · 💀) - DELTA is a deep learning based natural language and speech.. Apache-2
  • anaGo (🥉21 · 1.5K · 💀) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,.. MIT
  • stop-words (🥉21 · 150 · 💀) - Get list of common stop words in various languages in Python. BSD-3
  • pyfasttext (🥉20 · 230 · 💀) - Yet another Python binding for fastText. ❗️GPL-3.0
  • numerizer (🥉19 · 200) - A Python module to convert natural language numerics into ints and.. MIT
  • Camphr (🥉18 · 340 · 💀) - Camphr - NLP libary for creating pipeline components. Apache-2 spacy
  • textpipe (🥉18 · 300 · 💀) - Textpipe: clean and extract metadata from text. MIT
  • NeuroNER (🥉17 · 1.6K · 💀) - Named-entity recognition using neural networks. Easy-to-use and.. MIT
  • nboost (🥉17 · 650 · 💀) - NBoost is a scalable, search-api-boosting platform for deploying.. Apache-2
  • skift (🥉17 · 230 · 💤) - scikit-learn wrappers for Python fastText. MIT
  • NeuralQA (🥉15 · 220 · 💀) - NeuralQA: A Usable Library for Question Answering on Large Datasets.. MIT
  • spacy-dbpedia-spotlight (🥉15 · 85) - A spaCy wrapper for DBpedia Spotlight. MIT spacy
  • BLINK (🥉14 · 1K · 💀) - Entity Linker solution. MIT
  • TransferNLP (🥉14 · 290 · 💀) - NLP library designed for reproducible experimentation.. MIT
  • Headliner (🥉14 · 230 · 💀) - Easy training and deployment of seq2seq models. MIT
  • ONNX-T5 (🥉13 · 220 · 💀) - Summarization, translation, sentiment-analysis, text-generation.. Apache-2
  • textvec (🥉13 · 190 · 💤) - Text vectorization tool to outperform TFIDF for classification.. MIT

Image Data

Back to top

Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.

Pillow (🥇47 · 11K) - Python Imaging Library (Fork). ❗️PIL
  • GitHub (👨‍💻 430 · 🔀 2K · 📦 1.1M · 📋 2.8K - 4% open · ⏱️ 23.03.2023):

     git clone https://github.com/python-pillow/Pillow
    
  • PyPi (📥 55M / month · 📦 65K · ⏱️ 02.01.2023):

     pip install Pillow
    
  • Conda (📥 25M · ⏱️ 08.03.2023):

     conda install -c conda-forge pillow
    
scikit-image (🥇44 · 5.3K) - Image processing in Python. BSD-2
  • GitHub (👨‍💻 600 · 🔀 2.1K · 📦 130K · 📋 2.6K - 26% open · ⏱️ 17.03.2023):

     git clone https://github.com/scikit-image/scikit-image
    
  • PyPi (📥 7M / month · 📦 9.9K · ⏱️ 12.06.2022):

     pip install scikit-image
    
  • Conda (📥 4.7M · ⏱️ 30.10.2022):

     conda install -c conda-forge scikit-image
    
torchvision (🥇42 · 14K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
  • GitHub (👨‍💻 520 · 🔀 6.6K · 📥 21K · 📋 3K - 29% open · ⏱️ 23.03.2023):

     git clone https://github.com/pytorch/vision
    
  • PyPi (📥 6.1M / month · 📦 4.1K · ⏱️ 15.12.2022):

     pip install torchvision
    
  • Conda (📥 570K · ⏱️ 09.03.2023):

     conda install -c conda-forge torchvision
    
PyTorch Image Models (🥇39 · 24K) - PyTorch image models, scripts, pretrained weights --.. Apache-2
  • GitHub (👨‍💻 96 · 🔀 4K · 📥 3.5M · 📦 8.9K · 📋 690 - 9% open · ⏱️ 22.03.2023):

     git clone https://github.com/rwightman/pytorch-image-models
    
  • PyPi (📥 3.1M / month · 📦 200 · ⏱️ 12.01.2023):

     pip install timm
    
  • Conda (📥 55K · ⏱️ 24.11.2022):

     conda install -c conda-forge timm
    
MMDetection (🥇39 · 23K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2
  • GitHub (👨‍💻 390 · 🔀 8.5K · 📦 1K · 📋 6.9K - 8% open · ⏱️ 28.02.2023):

     git clone https://github.com/open-mmlab/mmdetection
    
  • PyPi (📥 130K / month · 📦 23 · ⏱️ 01.06.2022):

     pip install mmdet
    
MoviePy (🥇38 · 10K) - Video editing with Python. MIT
  • GitHub (👨‍💻 160 · 🔀 1.3K · 📦 23K · 📋 1.3K - 23% open · ⏱️ 22.02.2023):

     git clone https://github.com/Zulko/moviepy
    
  • PyPi (📥 4.9M / month · 📦 860 · ⏱️ 05.10.2020):

     pip install moviepy
    
  • Conda (📥 160K · ⏱️ 07.10.2022):

     conda install -c conda-forge moviepy
    
imageio (🥇38 · 1.2K) - Python library for reading and writing image data. BSD-2
  • GitHub (👨‍💻 98 · 🔀 250 · 📥 530 · 📦 84K · 📋 530 - 14% open · ⏱️ 20.03.2023):

     git clone https://github.com/imageio/imageio
    
  • PyPi (📥 19M / month · 📦 2.9K · ⏱️ 23.01.2023):

     pip install imageio
    
  • Conda (📥 4.6M · ⏱️ 27.02.2023):

     conda install -c conda-forge imageio
    
Kornia (🥈36 · 7.9K) - Open Source Differentiable Computer Vision Library. Apache-2
  • GitHub (👨‍💻 210 · 🔀 790 · 📥 570 · 📦 3.4K · 📋 740 - 30% open · ⏱️ 23.03.2023):

     git clone https://github.com/kornia/kornia
    
  • PyPi (📥 2.4M / month · 📦 80 · ⏱️ 21.12.2022):

     pip install kornia
    
  • Conda (📥 68K · ⏱️ 28.02.2023):

     conda install -c conda-forge kornia
    
InsightFace (🥈35 · 14K) - State-of-the-art 2D and 3D Face Analysis Project. MIT
  • GitHub (👨‍💻 53 · 🔀 4.2K · 📦 270 · 📋 2.2K - 56% open · ⏱️ 20.03.2023):

     git clone https://github.com/deepinsight/insightface
    
  • PyPi (📥 30K / month · 📦 7 · ⏱️ 17.12.2022):

     pip install insightface
    
Wand (🥈35 · 1.3K) - The ctypes-based simple ImageMagick binding for Python. MIT
  • GitHub (👨‍💻 100 · 🔀 200 · 📥 11K · 📦 15K · 📋 400 - 6% open · ⏱️ 05.03.2023):

     git clone https://github.com/emcconville/wand
    
  • PyPi (📥 640K / month · 📦 700 · ⏱️ 05.01.2023):

     pip install wand
    
  • Conda (📥 22K · ⏱️ 22.08.2022):

     conda install -c conda-forge wand
    
detectron2 (🥈34 · 24K) - Detectron2 is a platform for object detection, segmentation.. Apache-2
  • GitHub (👨‍💻 240 · 🔀 6.4K · 📦 960 · 📋 3.3K - 10% open · ⏱️ 16.03.2023):

     git clone https://github.com/facebookresearch/detectron2
    
  • PyPi (📦 6 · ⏱️ 06.02.2020):

     pip install detectron2
    
  • Conda (📥 130K · ⏱️ 22.01.2023):

     conda install -c conda-forge detectron2
    
Albumentations (🥈34 · 12K · 📉) - Fast image augmentation library and an easy-to-use.. MIT
  • GitHub (👨‍💻 130 · 🔀 1.5K · 📦 13K · 📋 770 - 46% open · ⏱️ 08.03.2023):

     git clone https://github.com/albumentations-team/albumentations
    
  • PyPi (📥 660K / month · 📦 280 · ⏱️ 20.09.2022):

     pip install albumentations
    
  • Conda (📥 99K · ⏱️ 20.09.2022):

     conda install -c conda-forge albumentations
    
PaddleDetection (🥈34 · 9.8K) - Object Detection toolkit based on PaddlePaddle. It.. Apache-2
  • GitHub (👨‍💻 140 · 🔀 2.5K · 📦 59 · 📋 4.7K - 20% open · ⏱️ 16.03.2023):

     git clone https://github.com/PaddlePaddle/PaddleDetection
    
  • PyPi (📥 1.8K / month · 📦 1 · ⏱️ 19.09.2022):

     pip install paddledet
    
deepface (🥈34 · 5.9K) - A Lightweight Face Recognition and Facial Attribute Analysis (Age,.. MIT
  • GitHub (👨‍💻 37 · 🔀 1.2K · 📦 1.2K · 📋 640 - 0% open · ⏱️ 09.03.2023):

     git clone https://github.com/serengil/deepface
    
  • PyPi (📥 98K / month · 📦 9 · ⏱️ 24.01.2023):

     pip install deepface
    
opencv-python (🥈34 · 3.3K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
  • GitHub (👨‍💻 43 · 🔀 650 · 📋 650 - 11% open · ⏱️ 16.03.2023):

     git clone https://github.com/opencv/opencv-python
    
  • PyPi (📥 9.2M / month · 📦 10K · ⏱️ 02.01.2023):

     pip install opencv-python
    
Face Recognition (🥈32 · 48K · 💤) - The worlds simplest facial recognition api for Python.. MIT
  • GitHub (👨‍💻 54 · 🔀 13K · 📥 1.1K · 📋 1.3K - 55% open · ⏱️ 10.06.2022):

     git clone https://github.com/ageitgey/face_recognition
    
  • PyPi (📥 59K / month · 📦 55 · ⏱️ 20.02.2020):

     pip install face_recognition
    
  • Conda (📥 14K · ⏱️ 30.04.2021):

     conda install -c conda-forge face_recognition
    
imageai (🥈32 · 7.6K) - A python library built to empower developers to build applications and.. MIT
  • GitHub (👨‍💻 17 · 🔀 2.1K · 📥 840K · 📦 1.3K · 📋 740 - 40% open · ⏱️ 03.03.2023):

     git clone https://github.com/OlafenwaMoses/ImageAI
    
  • PyPi (📥 10K / month · 📦 18 · ⏱️ 02.01.2023):

     pip install imageai
    
  • Conda (📥 5.1K · ⏱️ 30.04.2021):

     conda install -c conda-forge imageai
    
PaddleSeg (🥈32 · 6.6K) - Easy-to-use image segmentation library with awesome pre-.. Apache-2
  • GitHub (👨‍💻 96 · 🔀 1.4K · 📦 840 · 📋 1.6K - 10% open · ⏱️ 27.02.2023):

     git clone https://github.com/PaddlePaddle/PaddleSeg
    
  • PyPi (📥 4.8K / month · 📦 3 · ⏱️ 30.11.2022):

     pip install paddleseg
    
GluonCV (🥈32 · 5.5K) - Gluon CV Toolkit. Apache-2
  • GitHub (👨‍💻 120 · 🔀 1.2K · 📦 1.1K · 📋 830 - 7% open · ⏱️ 19.01.2023):

     git clone https://github.com/dmlc/gluon-cv
    
  • PyPi (📥 560K / month · 📦 59 · ⏱️ 03.10.2022):

     pip install gluoncv
    
ImageHash (🥈31 · 2.7K) - A Python Perceptual Image Hashing Module. BSD-2
  • GitHub (👨‍💻 25 · 🔀 320 · 📦 8.3K · 📋 120 - 7% open · ⏱️ 07.02.2023):

     git clone https://github.com/JohannesBuchner/imagehash
    
  • PyPi (📥 1.3M / month · 📦 360 · ⏱️ 28.09.2022):

     pip install ImageHash
    
  • Conda (📥 290K · ⏱️ 28.09.2022):

     conda install -c conda-forge imagehash
    
vit-pytorch (🥈29 · 13K) - Implementation of Vision Transformer, a simple way to achieve.. MIT
  • GitHub (👨‍💻 17 · 🔀 2.2K · 📦 220 · 📋 220 - 47% open · ⏱️ 19.03.2023):

     git clone https://github.com/lucidrains/vit-pytorch
    
  • PyPi (📥 30K / month · 📦 6 · ⏱️ 30.06.2022):

     pip install vit-pytorch
    
vidgear (🥈29 · 2.7K) - A High-performance cross-platform Video Processing Python framework.. Apache-2
  • GitHub (👨‍💻 13 · 🔀 220 · 📥 760 · 📦 360 · 📋 250 - 1% open · ⏱️ 26.01.2023):

     git clone https://github.com/abhiTronix/vidgear
    
  • PyPi (📥 5K / month · 📦 7 · ⏱️ 26.01.2023):

     pip install vidgear
    
sahi (🥈29 · 2.5K) - Framework agnostic sliced/tiled inference + interactive ui + error analysis.. MIT
  • GitHub (👨‍💻 26 · 🔀 380 · 📥 13K · 📦 340 · ⏱️ 21.03.2023):

     git clone https://github.com/obss/sahi
    
  • PyPi (📥 97K / month · 📦 10 · ⏱️ 15.01.2023):

     pip install sahi
    
  • Conda (📥 30K · ⏱️ 07.03.2023):

     conda install -c conda-forge sahi
    
lightly (🥈29 · 2.2K) - A python library for self-supervised learning on images. MIT
  • GitHub (👨‍💻 26 · 🔀 180 · 📦 99 · 📋 360 - 12% open · ⏱️ 23.03.2023):

     git clone https://github.com/lightly-ai/lightly
    
  • PyPi (📥 5.7K / month · 📦 3 · ⏱️ 31.01.2023):

     pip install lightly
    
Norfair (🥉28 · 1.8K) - Lightweight Python library for adding real-time multi-object tracking.. BSD-3
  • GitHub (👨‍💻 24 · 🔀 180 · 📥 250 · 📦 99 · 📋 120 - 2% open · ⏱️ 09.02.2023):

     git clone https://github.com/tryolabs/norfair
    
  • PyPi (📥 18K / month · 📦 1 · ⏱️ 04.01.2023):

     pip install norfair
    
CellProfiler (🥉28 · 750) - An open-source application for biological image analysis. BSD-3
  • GitHub (👨‍💻 130 · 🔀 350 · 📥 4.7K · 📦 16 · 📋 3.2K - 7% open · ⏱️ 20.03.2023):

     git clone https://github.com/CellProfiler/CellProfiler
    
  • PyPi (📥 460 / month · ⏱️ 15.12.2022):

     pip install cellprofiler
    
doctr (🥉27 · 1.7K) - docTR (Document Text Recognition) - a seamless, high-.. Apache-2
  • GitHub (👨‍💻 29 · 🔀 220 · 📥 1.1M · 📦 120 · 📋 240 - 21% open · ⏱️ 15.02.2023):

     git clone https://github.com/mindee/doctr
    
  • PyPi (📥 6.3K / month · 📦 2 · ⏱️ 29.09.2022):

     pip install python-doctr
    
mahotas (🥉27 · 800) - Computer Vision in Python. MIT
  • GitHub (👨‍💻 33 · 🔀 150 · 📦 990 · 📋 82 - 23% open · ⏱️ 26.02.2023):

     git clone https://github.com/luispedro/mahotas
    
  • PyPi (📥 10K / month · 📦 120 · ⏱️ 28.06.2022):

     pip install mahotas
    
  • Conda (📥 360K · ⏱️ 05.11.2022):

     conda install -c conda-forge mahotas
    
Augmentor (🥉26 · 4.9K) - Image augmentation library in Python for machine learning. MIT
  • GitHub (👨‍💻 23 · 🔀 850 · 📦 570 · 📋 200 - 63% open · ⏱️ 24.01.2023):

     git clone https://github.com/mdbloice/Augmentor
    
  • PyPi (📥 17K / month · 📦 29 · ⏱️ 27.04.2022):

     pip install Augmentor
    
ffcv (🥉26 · 2.4K) - FFCV: Fast Forward Computer Vision (and other ML workloads!). Apache-2
  • GitHub (👨‍💻 26 · 🔀 150 · 📦 16 · 📋 220 - 31% open · ⏱️ 05.03.2023):

     git clone https://github.com/libffcv/ffcv
    
  • PyPi (📥 13K / month · ⏱️ 28.01.2022):

     pip install ffcv
    
MMF (🥉25 · 5.2K) - A modular framework for vision & language multimodal research from.. BSD-3
  • GitHub (👨‍💻 110 · 🔀 880 · 📦 14 · 📋 720 - 32% open · ⏱️ 02.02.2023):

     git clone https://github.com/facebookresearch/mmf
    
  • PyPi (📥 350 / month · 📦 1 · ⏱️ 12.06.2020):

     pip install mmf
    
layout-parser (🥉25 · 3.5K · 💤) - A Unified Toolkit for Deep Learning Based Document Image.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 350 · 📦 160 · 📋 130 - 58% open · ⏱️ 06.08.2022):

     git clone https://github.com/Layout-Parser/layout-parser
    
  • PyPi (📥 42K / month · 📦 5 · ⏱️ 06.04.2022):

     pip install layoutparser
    
pyvips (🥉25 · 490) - python binding for libvips using cffi. MIT
  • GitHub (👨‍💻 15 · 🔀 46 · 📦 490 · 📋 340 - 38% open · ⏱️ 28.02.2023):

     git clone https://github.com/libvips/pyvips
    
  • PyPi (📥 32K / month · 📦 40 · ⏱️ 12.06.2022):

     pip install pyvips
    
  • Conda (📥 49K · ⏱️ 29.10.2022):

     conda install -c conda-forge pyvips
    
Image Deduplicator (🥉24 · 4.5K) - Finding duplicate images made easy!. Apache-2
  • GitHub (👨‍💻 13 · 🔀 400 · 📦 36 · 📋 130 - 37% open · ⏱️ 10.01.2023):

     git clone https://github.com/idealo/imagededup
    
  • PyPi (📥 1.7K / month · 📦 4 · ⏱️ 16.01.2023):

     pip install imagededup
    
segmentation_models (🥉24 · 4.3K · 💤) - Segmentation models with pretrained backbones. Keras.. MIT
  • GitHub (👨‍💻 14 · 🔀 960 · 📋 510 - 47% open · ⏱️ 29.07.2022):

     git clone https://github.com/qubvel/segmentation_models
    
  • PyPi (📥 22K / month · 📦 24 · ⏱️ 10.01.2020):

     pip install segmentation_models
    
pytorchvideo (🥉24 · 2.8K) - A deep learning library for video understanding research. Apache-2
  • GitHub (👨‍💻 50 · 🔀 340 · 📋 180 - 45% open · ⏱️ 27.02.2023):

     git clone https://github.com/facebookresearch/pytorchvideo
    
  • PyPi (📥 13K / month · 📦 7 · ⏱️ 20.01.2022):

     pip install pytorchvideo
    
deep-daze (🥉23 · 4.4K · 💤) - Simple command line tool for text to image generation using.. MIT
  • GitHub (👨‍💻 14 · 🔀 320 · 📦 51 · 📋 170 - 56% open · ⏱️ 13.03.2022):

     git clone https://github.com/lucidrains/deep-daze
    
  • PyPi (📥 1.5K / month · ⏱️ 13.03.2022):

     pip install deep-daze
    
kubric (🥉23 · 1.8K) - A data generation pipeline for creating semi-realistic synthetic.. Apache-2
  • GitHub (👨‍💻 23 · 🔀 160 · 📦 2 · 📋 170 - 30% open · ⏱️ 09.03.2023):

     git clone https://github.com/google-research/kubric
    
  • PyPi (📥 5.1K / month · ⏱️ 06.07.2022):

     pip install kubric-nightly
    
Classy Vision (🥉23 · 1.5K) - An end-to-end PyTorch framework for image and video.. MIT
  • GitHub (👨‍💻 77 · 🔀 280 · 📋 120 - 46% open · ⏱️ 21.03.2023):

     git clone https://github.com/facebookresearch/ClassyVision
    
  • PyPi (📥 1.8K / month · 📦 4 · ⏱️ 09.07.2021):

     pip install classy_vision
    
  • Conda (📥 16K · ⏱️ 22.03.2022):

     conda install -c conda-forge classy_vision
    
icevision (🥉23 · 810) - An Agnostic Computer Vision Framework - Pluggable to any Training.. Apache-2
  • GitHub (👨‍💻 41 · 🔀 130 · 📋 570 - 10% open · ⏱️ 07.12.2022):

     git clone https://github.com/airctic/icevision
    
  • PyPi (📥 3.4K / month · 📦 6 · ⏱️ 10.02.2022):

     pip install icevision
    
vissl (🥉22 · 3K) - VISSL is FAIRs library of extensible, modular and scalable components.. MIT
  • GitHub (👨‍💻 34 · 🔀 310 · 📦 15 · 📋 180 - 42% open · ⏱️ 28.12.2022):

     git clone https://github.com/facebookresearch/vissl
    
  • PyPi (📥 380 / month · 📦 1 · ⏱️ 02.11.2021):

     pip install vissl
    
tensorflow-graphics (🥉22 · 2.7K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2
  • GitHub (👨‍💻 39 · 🔀 360 · 📋 240 - 61% open · ⏱️ 14.02.2023):

     git clone https://github.com/tensorflow/graphics
    
  • PyPi (📥 3.9K / month · 📦 5 · ⏱️ 03.12.2021):

     pip install tensorflow-graphics
    
pycls (🥉21 · 2K · 💤) - Codebase for Image Classification Research, written in PyTorch. MIT
  • GitHub (👨‍💻 17 · 🔀 230 · 📦 10 · 📋 81 - 30% open · ⏱️ 12.07.2022):

     git clone https://github.com/facebookresearch/pycls
    
  • PyPi (📥 52K / month · ⏱️ 05.09.2020):

     pip install pycls
    
image-match (🥉20 · 2.8K) - Quickly search over billions of images. Apache-2
  • GitHub (👨‍💻 19 · 🔀 400 · 📋 110 - 57% open · ⏱️ 06.12.2022):

     git clone https://github.com/ProvenanceLabs/image-match
    
  • PyPi (📥 730 / month · 📦 4 · ⏱️ 13.02.2017):

     pip install image_match
    
DE⫶TR (🥉19 · 11K) - End-to-End Object Detection with Transformers. Apache-2
  • GitHub (👨‍💻 26 · 🔀 1.9K · 📋 490 - 42% open · ⏱️ 07.02.2023):

     git clone https://github.com/facebookresearch/detr
    
PySlowFast (🥉19 · 5.5K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2
  • GitHub (👨‍💻 29 · 🔀 1.1K · 📦 10 · 📋 600 - 54% open · ⏱️ 12.01.2023):

     git clone https://github.com/facebookresearch/SlowFast
    
  • PyPi (📥 16 / month · ⏱️ 15.01.2020):

     pip install pyslowfast
    
scenic (🥉19 · 2K) - Scenic: A Jax Library for Computer Vision Research and Beyond. Apache-2
  • GitHub (👨‍💻 62 · 🔀 270 · 📋 190 - 66% open · ⏱️ 22.03.2023):

     git clone https://github.com/google-research/scenic
    
Show 17 hidden projects...
  • imgaug (🥈34 · 13K · 💀) - Image augmentation for machine learning experiments. MIT
  • glfw (🥈34 · 10K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input. ❗️Zlib
  • imutils (🥈31 · 4.3K · 💀) - A series of convenience functions to make basic image processing.. MIT
  • PyTorch3D (🥈29 · 7.1K) - PyTorch3D is FAIRs library of reusable components for.. ❗Unlicensed
  • Pillow-SIMD (🥉28 · 1.9K · 💤) - The friendly PIL fork. ❗️PIL
  • Face Alignment (🥉27 · 6.2K · 💀) - 2D and 3D Face alignment library build using pytorch. BSD-3
  • chainercv (🥉27 · 1.5K · 💀) - ChainerCV: a Library for Deep Learning in Computer Vision. MIT
  • facenet-pytorch (🥉26 · 3.4K · 💀) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT
  • mtcnn (🥉26 · 1.9K · 💀) - MTCNN face detection implementation for TensorFlow, as a PIP.. MIT
  • Image Super-Resolution (🥉23 · 4.1K · 💀) - Super-scale your images and run experiments with.. Apache-2
  • Luminoth (🥉22 · 2.4K · 💀) - Deep Learning toolkit for Computer Vision. BSD-3
  • nude.py (🥉21 · 890 · 💀) - Nudity detection with Python. MIT
  • detecto (🥉20 · 580 · 💀) - Build fully-functioning computer vision models with PyTorch. MIT
  • solt (🥉18 · 260 · 💤) - Streaming over lightweight data transformations. MIT
  • Caer (🥉17 · 680 · 💀) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
  • HugsVision (🥉16 · 180) - HugsVision is a easy to use huggingface wrapper for state-of-the-.. MIT huggingface
  • Torch Points 3D (🥉16 · 140 · 💀) - Pytorch framework for doing deep learning on point.. BSD-3

Graph Data

Back to top

Libraries for graph processing, clustering, embedding, and machine learning tasks.

networkx (🥇43 · 12K) - Network Analysis in Python. BSD-3
  • GitHub (👨‍💻 640 · 🔀 2.9K · 📥 64 · 📦 150K · 📋 3.1K - 12% open · ⏱️ 21.03.2023):

     git clone https://github.com/networkx/networkx
    
  • PyPi (📥 23M / month · 📦 14K · ⏱️ 08.01.2023):

     pip install networkx
    
  • Conda (📥 10M · ⏱️ 08.01.2023):

     conda install -c conda-forge networkx
    
PyTorch Geometric (🥇38 · 17K) - Graph Neural Network Library for PyTorch. MIT
  • GitHub (👨‍💻 390 · 🔀 3.2K · 📋 3K - 24% open · ⏱️ 23.03.2023):

     git clone https://github.com/pyg-team/pytorch_geometric
    
  • PyPi (📥 160K / month · 📦 69 · ⏱️ 01.12.2022):

     pip install torch-geometric
    
  • Conda (📥 18K · ⏱️ 04.01.2023):

     conda install -c conda-forge pytorch_geometric
    
dgl (🥇37 · 11K) - Python package built to ease deep learning on graph, on top of existing DL.. Apache-2
  • GitHub (👨‍💻 260 · 🔀 2.7K · 📦 68 · 📋 2.2K - 16% open · ⏱️ 23.03.2023):

     git clone https://github.com/dmlc/dgl
    
  • PyPi (📥 50K / month · 📦 64 · ⏱️ 20.09.2022):

     pip install dgl
    
Spektral (🥈28 · 2.2K) - Graph Neural Networks with Keras and Tensorflow 2. MIT
  • GitHub (👨‍💻 26 · 🔀 330 · 📦 190 · 📋 260 - 23% open · ⏱️ 11.02.2023):

     git clone https://github.com/danielegrattarola/spektral
    
  • PyPi (📥 6.6K / month · 📦 3 · ⏱️ 22.07.2022):

     pip install spektral
    
AmpliGraph (🥈28 · 1.9K) - Python library for Representation Learning on Knowledge.. Apache-2
  • GitHub (👨‍💻 20 · 🔀 230 · 📦 28 · 📋 220 - 14% open · ⏱️ 08.03.2023):

     git clone https://github.com/Accenture/AmpliGraph
    
  • PyPi (📥 670 / month · ⏱️ 25.05.2021):

     pip install ampligraph
    
ogb (🥈28 · 1.6K) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
  • GitHub (👨‍💻 24 · 🔀 360 · 📦 690 · 📋 250 - 3% open · ⏱️ 27.01.2023):

     git clone https://github.com/snap-stanford/ogb
    
  • PyPi (📥 39K / month · 📦 19 · ⏱️ 02.11.2022):

     pip install ogb
    
  • Conda (📥 21K · ⏱️ 03.11.2022):

     conda install -c conda-forge ogb
    
PyKEEN (🥈27 · 1.2K) - A Python library for learning and evaluating knowledge graph embeddings. MIT
  • GitHub (👨‍💻 35 · 🔀 160 · 📥 150 · 📋 500 - 19% open · ⏱️ 23.03.2023):

     git clone https://github.com/pykeen/pykeen
    
  • PyPi (📥 2.2K / month · 📦 5 · ⏱️ 22.02.2023):

     pip install pykeen
    
pygraphistry (🥈26 · 1.8K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3
  • GitHub (👨‍💻 31 · 🔀 190 · 📦 85 · 📋 260 - 49% open · ⏱️ 23.12.2022):

     git clone https://github.com/graphistry/pygraphistry
    
  • PyPi (📥 2.2K / month · 📦 5 · ⏱️ 23.12.2022):

     pip install graphistry
    
Paddle Graph Learning (🥈26 · 1.5K) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2
  • GitHub (👨‍💻 29 · 🔀 300 · 📦 38 · 📋 190 - 40% open · ⏱️ 23.03.2023):

     git clone https://github.com/PaddlePaddle/PGL
    
  • PyPi (📥 2.3K / month · 📦 2 · ⏱️ 03.08.2022):

     pip install pgl
    
PyTorch-BigGraph (🥈24 · 3.2K) - Generate embeddings from large-scale graph-structured.. BSD-3
  • GitHub (👨‍💻 29 · 🔀 430 · 📥 150 · 📋 190 - 28% open · ⏱️ 08.12.2022):

     git clone https://github.com/facebookresearch/PyTorch-BigGraph
    
  • PyPi (📥 170K / month · 📦 3 · ⏱️ 01.05.2019):

     pip install torchbiggraph
    
Node2Vec (🥈24 · 1K) - Implementation of the node2vec algorithm. MIT
  • GitHub (👨‍💻 11 · 🔀 230 · 📦 370 · 📋 84 - 1% open · ⏱️ 19.10.2022):

     git clone https://github.com/eliorc/node2vec
    
  • PyPi (📥 93K / month · 📦 16 · ⏱️ 01.08.2022):

     pip install node2vec
    
  • Conda (📥 24K · ⏱️ 16.02.2023):

     conda install -c conda-forge node2vec
    
pytorch_geometric_temporal (🥉23 · 2K) - PyTorch Geometric Temporal: Spatiotemporal Signal.. MIT
  • GitHub (👨‍💻 26 · 🔀 280 · 📋 140 - 15% open · ⏱️ 18.02.2023):

     git clone https://github.com/benedekrozemberczki/pytorch_geometric_temporal
    
  • PyPi (📥 2.8K / month · 📦 2 · ⏱️ 04.09.2022):

     pip install torch-geometric-temporal
    
torch-cluster (🥉23 · 620 · 📈) - PyTorch Extension Library of Optimized Graph Cluster.. MIT
  • GitHub (👨‍💻 28 · 🔀 110 · 📋 130 - 19% open · ⏱️ 16.03.2023):

     git clone https://github.com/rusty1s/pytorch_cluster
    
  • PyPi (📥 14K / month · 📦 36 · ⏱️ 11.03.2022):

     pip install torch-cluster
    
  • Conda (📥 53K · ⏱️ 23.03.2023):

     conda install -c conda-forge pytorch_cluster
    
graph4nlp (🥉22 · 1.6K) - Graph4nlp is the library for the easy use of Graph Neural.. Apache-2
  • GitHub (👨‍💻 27 · 🔀 190 · 📋 170 - 6% open · ⏱️ 13.11.2022):

     git clone https://github.com/graph4ai/graph4nlp
    
  • PyPi (📥 120 / month · ⏱️ 20.01.2022):

     pip install graph4nlp
    
graph-nets (🥉21 · 5.2K) - Build Graph Nets in Tensorflow. Apache-2
  • GitHub (👨‍💻 11 · 🔀 780 · 📋 130 - 3% open · ⏱️ 12.12.2022):

     git clone https://github.com/deepmind/graph_nets
    
  • PyPi (📥 1.2K / month · 📦 8 · ⏱️ 29.01.2020):

     pip install graph-nets
    
jraph (🥉21 · 1.1K · 💤) - A Graph Neural Network Library in Jax. Apache-2
  • GitHub (👨‍💻 17 · 🔀 72 · 📦 63 · 📋 35 - 25% open · ⏱️ 31.08.2022):

     git clone https://github.com/deepmind/jraph
    
  • PyPi (📥 5.7K / month · 📦 5 · ⏱️ 12.08.2022):

     pip install jraph
    
  • Conda (📥 1.5K · ⏱️ 31.10.2021):

     conda install -c conda-forge jraph
    
deepsnap (🥉20 · 460) - Python library assists deep learning on graphs. MIT
  • GitHub (👨‍💻 16 · 🔀 49 · 📥 8 · 📦 44 · 📋 42 - 42% open · ⏱️ 05.03.2023):

     git clone https://github.com/snap-stanford/deepsnap
    
  • PyPi (📥 800 / month · 📦 1 · ⏱️ 05.09.2021):

     pip install deepsnap
    
GraphGym (🥉18 · 1.3K) - Platform for designing and evaluating Graph Neural Networks (GNN). MIT
  • GitHub (👨‍💻 6 · 🔀 160 · 📥 21 · 📦 2 · 📋 39 - 30% open · ⏱️ 14.03.2023):

     git clone https://github.com/snap-stanford/GraphGym
    
  • PyPi (📥 84 / month · ⏱️ 24.03.2022):

     pip install graphgym
    
AutoGL (🥉18 · 910) - An autoML framework & toolkit for machine learning on graphs. Apache-2
  • GitHub (👨‍💻 15 · 🔀 110 · 📋 28 - 25% open · ⏱️ 30.12.2022):

     git clone https://github.com/THUMNLab/AutoGL
    
  • PyPi (⏱️ 23.12.2020):

     pip install auto-graph-learning
    
GraphEmbedding (🥉16 · 3.2K · 💤) - Implementation and experiments of graph embedding.. MIT
  • GitHub (👨‍💻 9 · 🔀 910 · 📦 23 · 📋 64 - 60% open · ⏱️ 21.06.2022):

     git clone https://github.com/shenweichen/GraphEmbedding
    
kglib (🥉16 · 540) - TypeDB-ML is the Machine Learning integrations library for TypeDB. Apache-2
  • GitHub (👨‍💻 11 · 🔀 98 · 📥 210 · 📋 62 - 17% open · ⏱️ 09.11.2022):

     git clone https://github.com/vaticle/kglib
    
  • PyPi (📥 97 / month · ⏱️ 19.08.2020):

     pip install grakn-kglib
    
OpenKE (🥉15 · 3.4K) - An Open-Source Package for Knowledge Embedding (KE). MIT
  • GitHub (👨‍💻 11 · 🔀 920 · 📋 360 - 2% open · ⏱️ 03.11.2022):

     git clone https://github.com/thunlp/OpenKE
    
OpenNE (🥉14 · 1.6K) - An Open-Source Package for Network Embedding (NE). MIT
  • GitHub (👨‍💻 11 · 🔀 480 · 📋 100 - 4% open · ⏱️ 02.11.2022):

     git clone https://github.com/thunlp/OpenNE
    
Show 13 hidden projects...
  • igraph (🥇32 · 1.1K) - Python interface for igraph. ❗️GPL-2.0
  • StellarGraph (🥈29 · 2.6K · 💀) - StellarGraph - Machine Learning on Graphs. Apache-2
  • pygal (🥈28 · 2.5K · 💀) - PYthon svg GrAph plotting Library. ❗️LGPL-3.0
  • Karate Club (🥈24 · 1.8K) - Karate Club: An API Oriented Open-source Python Framework for.. ❗️GPL-3.0
  • DIG (🥉23 · 1.4K) - A library for graph deep learning research. ❗️GPL-3.0
  • DeepWalk (🥉21 · 2.5K · 💀) - DeepWalk - Deep Learning for Graphs. ❗️GPL-3.0
  • pyRDF2Vec (🥉20 · 180) - Python Implementation and Extension of RDF2Vec. MIT
  • Sematch (🥉18 · 400 · 💀) - semantic similarity framework for knowledge graph. Apache-2
  • DeepGraph (🥉16 · 270 · 💀) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
  • Euler (🥉15 · 2.8K · 💀) - A distributed graph deep learning framework. Apache-2
  • GraphSAGE (🥉14 · 3K · 💀) - Representation learning on large graphs using stochastic graph.. MIT
  • ptgnn (🥉14 · 360 · 💀) - A PyTorch Graph Neural Network Library. MIT
  • GraphVite (🥉13 · 1.1K · 💀) - GraphVite: A General and High-performance Graph Embedding.. Apache-2

Audio Data

Back to top

Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.

espnet (🥇37 · 6.2K) - End-to-End Speech Processing Toolkit. Apache-2
  • GitHub (👨‍💻 340 · 🔀 1.9K · 📥 77 · 📦 130 · 📋 2.1K - 20% open · ⏱️ 23.03.2023):

     git clone https://github.com/espnet/espnet
    
  • PyPi (📥 18K / month · 📦 5 · ⏱️ 28.05.2022):

     pip install espnet
    
torchaudio (🥇35 · 2K) - Data manipulation and transformation for audio signal.. BSD-2
  • GitHub (👨‍💻 200 · 🔀 530 · 📋 820 - 27% open · ⏱️ 22.03.2023):

     git clone https://github.com/pytorch/audio
    
  • PyPi (📥 780K / month · 📦 270 · ⏱️ 15.12.2022):

     pip install torchaudio
    
Pydub (🥇34 · 7K) - Manipulate audio with a simple and easy high level interface. MIT
  • GitHub (👨‍💻 95 · 🔀 900 · 📦 20K · 📋 540 - 51% open · ⏱️ 08.12.2022):

     git clone https://github.com/jiaaro/pydub
    
  • PyPi (📥 6M / month · 📦 1K · ⏱️ 10.03.2021):

     pip install pydub
    
  • Conda (📥 42K · ⏱️ 13.03.2021):

     conda install -c conda-forge pydub
    
SpeechRecognition (🥇34 · 6.9K) - Speech recognition module for Python, supporting several.. BSD-3
  • GitHub (👨‍💻 49 · 🔀 2.2K · 📋 580 - 48% open · ⏱️ 13.03.2023):

     git clone https://github.com/Uberi/speech_recognition
    
  • PyPi (📥 440K / month · 📦 760 · ⏱️ 04.12.2022):

     pip install SpeechRecognition
    
  • Conda (📥 160K · ⏱️ 14.03.2023):

     conda install -c conda-forge speechrecognition
    
librosa (🥈33 · 5.8K · 📉) - Python library for audio and music analysis. ISC
  • GitHub (👨‍💻 110 · 🔀 880 · 📋 1.1K - 4% open · ⏱️ 17.03.2023):

     git clone https://github.com/librosa/librosa
    
  • PyPi (📥 1.6M / month · 📦 1.3K · ⏱️ 27.06.2022):

     pip install librosa
    
  • Conda (📥 600K · ⏱️ 17.03.2023):

     conda install -c conda-forge librosa
    
speechbrain (🥈33 · 5.5K) - A PyTorch-based Speech Toolkit. Apache-2
  • GitHub (👨‍💻 200 · 🔀 1K · 📦 540 · 📋 910 - 18% open · ⏱️ 22.03.2023):

     git clone https://github.com/speechbrain/speechbrain
    
  • PyPi (📥 58K / month · 📦 10 · ⏱️ 29.08.2022):

     pip install speechbrain
    
spleeter (🥈31 · 22K) - Deezer source separation library including pretrained models. MIT
  • GitHub (👨‍💻 19 · 🔀 2.4K · 📥 2.2M · 📦 470 · 📋 740 - 25% open · ⏱️ 25.11.2022):

     git clone https://github.com/deezer/spleeter
    
  • PyPi (📥 15K / month · 📦 5 · ⏱️ 07.09.2022):

     pip install spleeter
    
  • Conda (📥 73K · ⏱️ 30.06.2020):

     conda install -c conda-forge spleeter
    
Magenta (🥈31 · 18K) - Magenta: Music and Art Generation with Machine Intelligence. Apache-2
  • GitHub (👨‍💻 160 · 🔀 3.7K · 📦 410 · 📋 950 - 38% open · ⏱️ 18.01.2023):

     git clone https://github.com/magenta/magenta
    
  • PyPi (📥 7.4K / month · 📦 38 · ⏱️ 01.08.2022):

     pip install magenta
    
python-soundfile (🥈31 · 530) - SoundFile is an audio library based on libsndfile, CFFI, and.. BSD-3
  • GitHub (👨‍💻 30 · 🔀 87 · 📥 14K · 📦 19K · 📋 200 - 41% open · ⏱️ 24.02.2023):

     git clone https://github.com/bastibe/python-soundfile
    
  • PyPi (📥 1.2M / month · 📦 95 · ⏱️ 27.09.2022):

     pip install soundfile
    
  • Conda:

     conda install -c anaconda pysoundfile
    
Coqui TTS (🥈30 · 9.4K) - - a deep learning toolkit for Text-to-Speech, battle-.. MPL-2.0
  • GitHub (👨‍💻 120 · 🔀 1.1K · 📥 630K · 📋 530 - 5% open · ⏱️ 21.03.2023):

     git clone https://github.com/coqui-ai/TTS
    
  • PyPi (📥 20K / month · 📦 12 · ⏱️ 11.01.2023):

     pip install tts
    
  • Conda (📥 6.8K · ⏱️ 15.12.2021):

     conda install -c conda-forge tts
    
pyAudioAnalysis (🥈28 · 5.2K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
  • GitHub (👨‍💻 27 · 🔀 1.1K · 📦 340 · 📋 310 - 60% open · ⏱️ 18.09.2022):

     git clone https://github.com/tyiannak/pyAudioAnalysis
    
  • PyPi (📥 14K / month · 📦 19 · ⏱️ 07.02.2022):

     pip install pyAudioAnalysis
    
audiomentations (🥈28 · 1.3K) - A Python library for audio data augmentation. Inspired by.. MIT
  • GitHub (👨‍💻 23 · 🔀 160 · 📦 220 · 📋 150 - 28% open · ⏱️ 15.03.2023):

     git clone https://github.com/iver56/audiomentations
    
  • PyPi (📥 8.5K / month · 📦 2 · ⏱️ 12.01.2023):

     pip install audiomentations
    
audioread (🥈28 · 430) - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.. MIT
  • GitHub (👨‍💻 23 · 🔀 100 · 📦 12K · 📋 87 - 40% open · ⏱️ 18.11.2022):

     git clone https://github.com/beetbox/audioread
    
  • PyPi (📥 1.5M / month · 📦 330 · ⏱️ 12.08.2022):

     pip install audioread
    
  • Conda (📥 600K · ⏱️ 29.10.2022):

     conda install -c conda-forge audioread
    
Porcupine (🥉27 · 2.9K) - On-device wake word detection powered by deep learning. Apache-2
  • GitHub (👨‍💻 34 · 🔀 420 · 📦 15 · 📋 420 - 0% open · ⏱️ 15.03.2023):

     git clone https://github.com/Picovoice/Porcupine
    
  • PyPi (📥 2.3K / month · 📦 11 · ⏱️ 28.06.2022):

     pip install pvporcupine
    
tinytag (🥉26 · 590) - Read audio and music meta data and duration of MP3, OGG, OPUS, MP4, M4A,.. MIT
  • GitHub (👨‍💻 25 · 🔀 92 · 📦 720 · 📋 99 - 13% open · ⏱️ 20.03.2023):

     git clone https://github.com/devsnd/tinytag
    
  • PyPi (📥 18K / month · 📦 70 · ⏱️ 12.03.2022):

     pip install tinytag
    
DDSP (🥉25 · 2.4K) - DDSP: Differentiable Digital Signal Processing. Apache-2
  • GitHub (👨‍💻 32 · 🔀 290 · 📦 38 · 📋 150 - 22% open · ⏱️ 22.12.2022):

     git clone https://github.com/magenta/ddsp
    
  • PyPi (📥 3.7K / month · 📦 1 · ⏱️ 04.10.2022):

     pip install ddsp
    
  • Conda (📥 13K · ⏱️ 08.06.2020):

     conda install -c conda-forge ddsp
    
DeepSpeech (🥉23 · 21K · 📉) - DeepSpeech is an open source embedded (offline, on-.. MPL-2.0
  • GitHub (👨‍💻 140 · 🔀 3.7K):

     git clone https://github.com/mozilla/DeepSpeech
    
  • PyPi (📥 14K / month · 📦 45 · ⏱️ 19.12.2020):

     pip install deepspeech
    
  • Conda (📥 1.5K · ⏱️ 29.07.2021):

     conda install -c conda-forge deepspeech
    
nnAudio (🥉23 · 810) - Audio processing by using pytorch 1D convolution network. MIT
  • GitHub (👨‍💻 14 · 🔀 81 · 📦 82 · 📋 56 - 25% open · ⏱️ 16.03.2023):

     git clone https://github.com/KinWaiCheuk/nnAudio
    
  • PyPi (📥 2.1K / month · 📦 1 · ⏱️ 09.10.2022):

     pip install nnAudio
    
kapre (🥉22 · 880 · 💤) - kapre: Keras Audio Preprocessors. MIT
  • GitHub (👨‍💻 13 · 🔀 140 · 📥 22 · 📦 2.2K · 📋 96 - 14% open · ⏱️ 04.07.2022):

     git clone https://github.com/keunwoochoi/kapre
    
  • PyPi (📥 3.1K / month · 📦 14 · ⏱️ 21.01.2022):

     pip install kapre
    
Julius (🥉19 · 320) - Fast PyTorch based DSP for audio and 1D signals. MIT
  • GitHub (👨‍💻 2 · 🔀 20 · 📦 270 · ⏱️ 19.09.2022):

     git clone https://github.com/adefossez/julius
    
  • PyPi (📥 76K / month · 📦 5 · ⏱️ 20.09.2022):

     pip install julius
    
Show 9 hidden projects...
  • Essentia (🥈28 · 2.3K) - C++ library for audio and music analysis, description and.. ❗️AGPL-3.0
  • aubio (🥉27 · 2.9K · 💀) - a library for audio and music analysis. ❗️GPL-3.0
  • Madmom (🥉25 · 1K · 💀) - Python audio and music signal processing library. BSD-3
  • TimeSide (🥉24 · 350) - scalable audio processing framework and server written in Python. ❗️AGPL-3.0
  • TTS (🥉23 · 6.9K · 💀) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
  • python_speech_features (🥉23 · 2.2K · 💀) - This library provides common speech features for ASR.. MIT
  • Dejavu (🥉22 · 6K · 💀) - Audio fingerprinting and recognition in Python. MIT
  • Muda (🥉17 · 220 · 💀) - A library for augmenting annotated audio data. ISC
  • textlesslib (🥉9 · 400 · 💤) - Library for Textless Spoken Language Processing. MIT

Geospatial Data

Back to top

Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.

pydeck (🥇42 · 11K) - WebGL2 powered visualization framework. MIT
  • GitHub (👨‍💻 220 · 🔀 2K · 📦 5.6K · 📋 2.7K - 7% open · ⏱️ 22.03.2023):

     git clone https://github.com/visgl/deck.gl
    
  • PyPi (📥 1.2M / month · 📦 38 · ⏱️ 04.11.2022):

     pip install pydeck
    
  • Conda (📥 320K · ⏱️ 04.11.2022):

     conda install -c conda-forge pydeck
    
  • npm (📥 440K / month · 📦 440 · ⏱️ 22.03.2023):

     npm install deck.gl
    
folium (🥇39 · 6.1K) - Python Data. Leaflet.js Maps. MIT
  • GitHub (👨‍💻 150 · 🔀 2.2K · 📦 23K · 📋 1K - 5% open · ⏱️ 23.03.2023):

     git clone https://github.com/python-visualization/folium
    
  • PyPi (📥 870K / month · 📦 720 · ⏱️ 12.12.2022):

     pip install folium
    
  • Conda (📥 1.8M · ⏱️ 13.12.2022):

     conda install -c conda-forge folium
    
GeoPandas (🥇39 · 3.6K) - Python tools for geographic data. BSD-3
  • GitHub (👨‍💻 190 · 🔀 800 · 📥 1.8K · 📦 20K · 📋 1.5K - 27% open · ⏱️ 20.03.2023):

     git clone https://github.com/geopandas/geopandas
    
  • PyPi (📥 3.4M / month · 📦 1.4K · ⏱️ 10.12.2022):

     pip install geopandas
    
  • Conda (📥 2.6M · ⏱️ 10.12.2022):

     conda install -c conda-forge geopandas
    
Rasterio (🥈38 · 1.9K) - Rasterio reads and writes geospatial raster datasets. BSD-3
  • GitHub (👨‍💻 140 · 🔀 510 · 📥 770 · 📦 7.1K · 📋 1.7K - 8% open ·