ml-tooling/best-of-ml-python

Best-of Machine Learning with Python

🏆 A ranked list of awesome machine learning Python libraries. Updated weekly.

This curated list contains 910 awesome open-source projects with a total of 3.9M stars, grouped into 34 categories. All projects are ranked by a project-quality score, which is calculated from various metrics automatically collected from GitHub and different package managers. If you would like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!


🧙‍♂️ Discover other best-of lists or create your own.
📫 Subscribe to our newsletter for updates and trending projects.


Contents

Explanation

  • 🥇🥈🥉 Combined project-quality score
  • ⭐️ Star count from GitHub
  • 🐣 New project (less than 6 months old)
  • 💤 Inactive project (6 months no activity)
  • 💀 Dead project (12 months no activity)
  • 📈📉 Project is trending up or down
  • ➕ Project was recently added
  • ❗️ Warning (e.g. missing/risky license)
  • 👨‍💻 Contributors count from GitHub
  • 🔀 Fork count from GitHub
  • 📋 Issue count from GitHub
  • ⏱️ Last update timestamp on package manager
  • 📥 Download count from package manager
  • 📦 Number of dependent projects
  • Tensorflow related project
  • Sklearn related project
  • PyTorch related project
  • MxNet related project
  • Apache Spark related project
  • Jupyter related project
  • PaddlePaddle related project
  • Pandas related project
  • Jax related project
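
The notation in the legend above can be read programmatically. The following is a minimal sketch, not the list generator's actual logic: it assumes that `K` and `M` stand for thousands and millions, that timestamps use the `DD.MM.YYYY` form seen in the entries, and that the 6- and 12-month thresholds are counted in whole calendar months. The function names `parse_count`, `parse_date`, and `activity_status` are hypothetical helpers introduced only for illustration.

```python
from datetime import date

# Assumed meaning of the suffixes used in the list (e.g. "180K", "3.9M").
_SUFFIXES = {"K": 1_000, "M": 1_000_000}


def parse_count(text: str) -> int:
    """Convert an abbreviated count such as '180K' or '3.9M' to an integer."""
    text = text.strip()
    factor = _SUFFIXES.get(text[-1].upper())
    if factor is None:
        return int(text)  # plain number such as "870"
    return round(float(text[:-1]) * factor)


def parse_date(text: str) -> date:
    """Parse a DD.MM.YYYY timestamp such as '17.10.2022'."""
    day, month, year = (int(part) for part in text.split("."))
    return date(year, month, day)


def activity_status(last_update: date, today: date) -> str:
    """Classify a project using the legend's 6- and 12-month thresholds."""
    months = (today.year - last_update.year) * 12 + (today.month - last_update.month)
    if today.day < last_update.day:
        months -= 1  # the current month is not complete yet
    if months >= 12:
        return "dead"
    if months >= 6:
        return "inactive"
    return "active"
```

For example, `activity_status(parse_date("17.10.2022"), date(2023, 6, 1))` classifies a project last updated on 17.10.2022 relative to an assumed reference date of 01.06.2023. The abbreviated counts are rounded in the list, so the parsed integers are approximations.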

Machine Learning Frameworks


General-purpose machine learning and deep learning frameworks.

Tensorflow (🥇55 · ⭐ 180K) - An Open Source Machine Learning Framework for Everyone. Apache-2
  • GitHub (👨‍💻 4.4K · 🔀 71K · 📦 280K · 📋 37K - 5% open · ⏱️ 01.06.2023):

     git clone https://github.com/tensorflow/tensorflow

  • PyPi (📥 15M / month):

     pip install tensorflow

  • Conda (📥 4.2M · ⏱️ 27.03.2023):

     conda install -c conda-forge tensorflow

  • Docker Hub (📥 73M · ⭐ 2.2K · ⏱️ 01.06.2023):

     docker pull tensorflow/tensorflow
    
scikit-learn (🥇52 · ⭐ 54K) - scikit-learn: machine learning in Python. BSD-3
  • GitHub (👨‍💻 2.9K · 🔀 23K · 📥 870 · 📦 540K · 📋 10K - 15% open · ⏱️ 01.06.2023):

     git clone https://github.com/scikit-learn/scikit-learn

  • PyPi (📥 36M / month):

     pip install scikit-learn

  • Conda (📥 22M · ⏱️ 25.05.2023):

     conda install -c conda-forge scikit-learn
    
PyTorch (🥇45 · ⭐ 67K · 📉) - Tensors and Dynamic neural networks in Python with strong GPU.. BSD-3
  • GitHub (👨‍💻 4.1K · 🔀 18K · 📥 17K · 📋 34K - 32% open · ⏱️ 01.06.2023):

     git clone https://github.com/pytorch/pytorch

  • PyPi (📥 10M / month):

     pip install torch

  • Conda (📥 16M · ⏱️ 05.05.2023):

     conda install -c pytorch pytorch
    
StatsModels (🥇45 · ⭐ 8.5K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
  • GitHub (👨‍💻 410 · 🔀 2.5K · 📥 27 · 📦 94K · 📋 5.1K - 47% open · ⏱️ 17.05.2023):

     git clone https://github.com/statsmodels/statsmodels

  • PyPi (📥 11M / month):

     pip install statsmodels

  • Conda (📥 10M · ⏱️ 05.05.2023):

     conda install -c conda-forge statsmodels
    
jax (🥇44 · ⭐ 23K) - Composable transformations of Python+NumPy programs: differentiate,.. Apache-2
  • GitHub (👨‍💻 540 · 🔀 2.1K · 📦 13K · 📋 4.2K - 27% open · ⏱️ 01.06.2023):

     git clone https://github.com/google/jax

  • PyPi (📥 4.4M / month):

     pip install jax

  • Conda (📥 760K · ⏱️ 30.05.2023):

     conda install -c conda-forge jaxlib
    
XGBoost (🥇43 · ⭐ 24K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
  • GitHub (👨‍💻 610 · 🔀 8.1K · 📥 6.6K · 📦 58K · 📋 4.8K - 6% open · ⏱️ 31.05.2023):

     git clone https://github.com/dmlc/xgboost

  • PyPi (📥 8.7M / month):

     pip install xgboost

  • Conda (📥 3.9M · ⏱️ 17.03.2023):

     conda install -c conda-forge xgboost
    
LightGBM (🥇43 · ⭐ 15K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
  • GitHub (👨‍💻 280 · 🔀 3.7K · 📥 190K · 📦 22K · 📋 3K - 8% open · ⏱️ 01.06.2023):

     git clone https://github.com/microsoft/LightGBM

  • PyPi (📥 5.9M / month · 📦 770 · ⏱️ 24.01.2023):

     pip install lightgbm

  • Conda (📥 1.7M · ⏱️ 24.01.2023):

     conda install -c conda-forge lightgbm
    
Keras (🥈42 · ⭐ 58K · 📉) - Deep Learning for humans. Apache-2
  • GitHub (👨‍💻 1.2K · 🔀 18K · 📋 12K - 2% open · ⏱️ 01.06.2023):

     git clone https://github.com/keras-team/keras

  • PyPi (📥 10M / month):

     pip install keras

  • Conda (📥 3.1M · ⏱️ 25.03.2023):

     conda install -c conda-forge keras
    
pytorch-lightning (🥈42 · ⭐ 24K) - Deep learning framework to train, deploy, and ship AI.. Apache-2
  • GitHub (👨‍💻 880 · 🔀 2.9K · 📥 12K · 📋 6.2K - 11% open · ⏱️ 31.05.2023):

     git clone https://github.com/Lightning-AI/lightning

  • PyPi (📥 4.2M / month · 📦 640 · ⏱️ 18.01.2023):

     pip install pytorch-lightning

  • Conda (📥 820K · ⏱️ 24.04.2023):

     conda install -c conda-forge pytorch-lightning
    
PaddlePaddle (🥈42 · ⭐ 20K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2
  • GitHub (👨‍💻 1K · 🔀 5K · 📥 15K · 📦 250 · 📋 17K - 5% open · ⏱️ 01.06.2023):

     git clone https://github.com/PaddlePaddle/Paddle

  • PyPi (📥 120K / month):

     pip install paddlepaddle
    
Catboost (🥈41 · ⭐ 7.2K · 📈) - A fast, scalable, high performance Gradient Boosting on.. Apache-2
  • GitHub (👨‍💻 1.1K · 🔀 1.1K · 📥 150K · 📋 2.1K - 24% open · ⏱️ 01.06.2023):

     git clone https://github.com/catboost/catboost

  • PyPi (📥 5.4M / month · 📦 310 · ⏱️ 01.11.2022):

     pip install catboost

  • Conda (📥 1.3M · ⏱️ 02.05.2023):

     conda install -c conda-forge catboost
    
Fastai (🥈39 · ⭐ 24K) - The fastai deep learning library. Apache-2
  • GitHub (👨‍💻 650 · 🔀 7.4K · 📦 14K · 📋 1.7K - 8% open · ⏱️ 26.05.2023):

     git clone https://github.com/fastai/fastai

  • PyPi (📥 300K / month · 📦 330 · ⏱️ 15.02.2023):

     pip install fastai
    
Jina (🥈39 · ⭐ 18K) - Build multimodal AI services via cloud native technologies. Apache-2
  • GitHub (👨‍💻 170 · 🔀 2.1K · 📦 600 · 📋 1.9K - 1% open · ⏱️ 31.05.2023):

     git clone https://github.com/jina-ai/jina

  • PyPi (📥 400K / month):

     pip install jina

  • Conda (📥 46K · ⏱️ 16.08.2022):

     conda install -c conda-forge jina-core

  • Docker Hub (📥 1.2M · ⭐ 8 · ⏱️ 29.05.2023):

     docker pull jinaai/jina
    
PySpark (🥈38 · ⭐ 36K) - Apache Spark Python API. Apache-2
  • GitHub (👨‍💻 2.9K · 🔀 26K · ⏱️ 01.06.2023):

     git clone https://github.com/apache/spark

  • PyPi (📥 25M / month):

     pip install pyspark

  • Conda (📥 2.6M · ⏱️ 16.04.2023):

     conda install -c conda-forge pyspark
    
MXNet (🥈38 · ⭐ 20K) - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning.. Apache-2
  • GitHub (👨‍💻 980 · 🔀 6.9K · 📥 26K · 📦 6.5K · 📋 9.6K - 18% open · ⏱️ 26.01.2023):

     git clone https://github.com/apache/incubator-mxnet

  • PyPi (📥 340K / month):

     pip install mxnet

  • Conda (📥 9.5K · 📦 5 · ⏱️ 24.10.2022):

     conda install -c anaconda mxnet
    
Flax (🥈36 · ⭐ 4.4K) - Flax is a neural network library for JAX that is designed for.. Apache-2
  • GitHub (👨‍💻 190 · 🔀 480 · 📥 45 · 📦 3.3K · 📋 690 - 12% open · ⏱️ 01.06.2023):

     git clone https://github.com/google/flax

  • PyPi (📥 640K / month):

     pip install flax

  • Conda (📥 27K · ⏱️ 16.03.2023):

     conda install -c conda-forge flax
    
Thinc (🥈36 · ⭐ 2.7K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
  • GitHub (👨‍💻 61 · 🔀 260 · 📦 32K · 📋 130 - 12% open · ⏱️ 30.05.2023):

     git clone https://github.com/explosion/thinc

  • PyPi (📥 4.4M / month):

     pip install thinc

  • Conda (📥 2.5M · ⏱️ 03.05.2023):

     conda install -c conda-forge thinc
    
Vowpal Wabbit (🥈35 · ⭐ 8.2K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
  • GitHub (👨‍💻 330 · 🔀 1.9K · 📋 1.2K - 9% open · ⏱️ 31.05.2023):

     git clone https://github.com/VowpalWabbit/vowpal_wabbit

  • PyPi (📥 81K / month):

     pip install vowpalwabbit

  • Conda (📥 120K · ⏱️ 15.03.2023):

     conda install -c conda-forge vowpalwabbit
    
einops (🥈35 · ⭐ 6.8K) - Deep learning operations reinvented (for pytorch, tensorflow, jax and.. MIT
  • GitHub (👨‍💻 23 · 🔀 300 · 📦 11K · 📋 140 - 22% open · ⏱️ 15.05.2023):

     git clone https://github.com/arogozhnikov/einops

  • PyPi (📥 3.9M / month):

     pip install einops

  • Conda (📥 100K · ⏱️ 19.04.2023):

     conda install -c conda-forge einops
    
Chainer (🥈33 · ⭐ 5.8K · 💤) - A flexible framework of neural networks for deep learning. MIT
  • GitHub (👨‍💻 320 · 🔀 1.3K · 📦 3K · 📋 2K - 0% open · ⏱️ 17.10.2022):

     git clone https://github.com/chainer/chainer

  • PyPi (📥 18K / month):

     pip install chainer

  • Conda (📥 14K · ⏱️ 21.01.2022):

     conda install -c conda-forge chainer
    
Ludwig (🥉32 · ⭐ 8.9K) - Data-centric declarative deep learning framework. Apache-2
  • GitHub (👨‍💻 140 · 🔀 990 · 📦 170 · 📋 920 - 23% open · ⏱️ 01.06.2023):

     git clone https://github.com/ludwig-ai/ludwig

  • PyPi (📥 2.4K / month):

     pip install ludwig
    
mlpack (🥉32 · ⭐ 4.4K) - mlpack: a fast, header-only C++ machine learning library. BSD-3
  • GitHub (👨‍💻 300 · 🔀 1.5K · 📋 1.5K - 3% open · ⏱️ 01.06.2023):

     git clone https://github.com/mlpack/mlpack

  • PyPi (📥 1.2K / month · 📦 1 · ⏱️ 29.12.2022):

     pip install mlpack

  • Conda (📥 150K · ⏱️ 28.04.2023):

     conda install -c conda-forge mlpack
    
Ignite (🥉32 · ⭐ 4.3K) - High-level library to help with training and evaluating neural.. BSD-3
  • GitHub (👨‍💻 190 · 🔀 600 · 📋 1.3K - 12% open · ⏱️ 01.06.2023):

     git clone https://github.com/pytorch/ignite

  • PyPi (📥 130K / month · 📦 39 · ⏱️ 08.11.2022):

     pip install pytorch-ignite

  • Conda (📥 140K · ⏱️ 01.05.2023):

     conda install -c pytorch ignite
    
tensorflow-upstream (🥉32 · ⭐ 650) - TensorFlow ROCm port. Apache-2
  • GitHub (👨‍💻 4.4K · 🔀 77 · 📥 21 · 📋 360 - 23% open · ⏱️ 01.06.2023):

     git clone https://github.com/ROCmSoftwarePlatform/tensorflow-upstream

  • PyPi (📥 3.4K / month · 📦 5 · ⏱️ 06.12.2022):

     pip install tensorflow-rocm
    
PyFlink (🥉31 · ⭐ 21K) - Apache Flink Python API. Apache-2
  • GitHub (👨‍💻 1.7K · 🔀 12K · ⏱️ 01.06.2023):

     git clone https://github.com/apache/flink

  • PyPi (📥 55K / month):

     pip install apache-flink
    
Sonnet (🥉31 · ⭐ 9.6K) - TensorFlow-based neural network library. Apache-2
  • GitHub (👨‍💻 56 · 🔀 1.3K · 📦 1.1K · 📋 190 - 16% open · ⏱️ 23.02.2023):

     git clone https://github.com/deepmind/sonnet

  • PyPi (📥 22K / month):

     pip install dm-sonnet

  • Conda (📥 23K · ⏱️ 14.11.2020):

     conda install -c conda-forge sonnet
    
tensorpack (🥉31 · ⭐ 6.3K) - A Neural Net Training Interface on TensorFlow, with focus.. Apache-2
  • GitHub (👨‍💻 58 · 🔀 1.8K · 📥 150 · 📦 1.3K · 📋 1.4K - 0% open · ⏱️ 31.03.2023):

     git clone https://github.com/tensorpack/tensorpack

  • PyPi (📥 16K / month):

     pip install tensorpack

  • Conda (📥 9.4K · ⏱️ 06.02.2022):

     conda install -c conda-forge tensorpack
    
skorch (🥉31 · ⭐ 5.2K) - A scikit-learn compatible neural network library that wraps.. BSD-3
  • GitHub (👨‍💻 58 · 🔀 350 · 📦 860 · 📋 480 - 10% open · ⏱️ 22.05.2023):

     git clone https://github.com/skorch-dev/skorch

  • PyPi (📥 44K / month · 📦 49 · ⏱️ 18.11.2022):

     pip install skorch

  • Conda (📥 770K · ⏱️ 19.05.2023):

     conda install -c conda-forge skorch
    
Haiku (🥉31 · ⭐ 2.5K) - JAX-based neural network library. Apache-2
  • GitHub (👨‍💻 74 · 🔀 210 · 📦 1.3K · 📋 220 - 28% open · ⏱️ 31.05.2023):

     git clone https://github.com/deepmind/dm-haiku

  • PyPi (📥 120K / month):

     pip install dm-haiku

  • Conda (📥 11K · ⏱️ 21.09.2022):

     conda install -c conda-forge dm-haiku
    
ktrain (🥉31 · ⭐ 1.1K) - ktrain is a Python library that makes deep learning and AI more.. Apache-2
  • GitHub (👨‍💻 16 · 🔀 260 · 📦 430 · 📋 460 - 0% open · ⏱️ 12.05.2023):

     git clone https://github.com/amaiya/ktrain

  • PyPi (📥 22K / month):

     pip install ktrain
    
Neural Network Libraries (🥉30 · ⭐ 2.6K) - Neural Network Libraries. Apache-2
  • GitHub (👨‍💻 72 · 🔀 330 · 📥 640 · 📋 88 - 35% open · ⏱️ 24.05.2023):

     git clone https://github.com/sony/nnabla

  • PyPi (📥 4K / month · 📦 53 · ⏱️ 14.02.2023):

     pip install nnabla
    
Geomstats (🥉30 · ⭐ 950) - Computations and statistics on manifolds with geometric structures. MIT
  • GitHub (👨‍💻 81 · 🔀 210 · 📦 67 · 📋 550 - 42% open · ⏱️ 01.06.2023):

     git clone https://github.com/geomstats/geomstats

  • PyPi (📥 1.8K / month · 📦 2 · ⏱️ 22.04.2022):

     pip install geomstats

  • Conda (📥 1.3K · ⏱️ 01.06.2022):

     conda install -c conda-forge geomstats
    
dyNET (🥉29 · ⭐ 3.4K · 💤) - DyNet: The Dynamic Neural Network Toolkit. Apache-2
  • GitHub (👨‍💻 160 · 🔀 680 · 📥 11K · 📦 250 · 📋 930 - 28% open · ⏱️ 14.08.2022):

     git clone https://github.com/clab/dynet

  • PyPi (📥 3.3K / month):

     pip install dyNET
    
Towhee (🥉27 · ⭐ 2.3K) - Towhee is a framework that is dedicated to making neural data.. Apache-2
  • GitHub (👨‍💻 30 · 🔀 200 · 📥 1.1K · 📋 600 - 0% open · ⏱️ 01.06.2023):

     git clone https://github.com/towhee-io/towhee

  • PyPi (📥 2.2K / month · ⏱️ 02.12.2022):

     pip install towhee
    
Neural Tangents (🥉27 · ⭐ 2K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
  • GitHub (👨‍💻 25 · 🔀 220 · 📥 300 · 📦 73 · 📋 140 - 39% open · ⏱️ 08.05.2023):

     git clone https://github.com/google/neural-tangents

  • PyPi (📥 5.3K / month):

     pip install neural-tangents
    
pyRiemann (🥉26 · ⭐ 500) - Machine learning for multivariate data analysis through the.. BSD-3
  • GitHub (👨‍💻 25 · 🔀 150 · 📦 250 · 📋 94 - 6% open · ⏱️ 28.05.2023):

     git clone https://github.com/pyRiemann/pyRiemann

  • PyPi (📥 44K / month):

     pip install pyriemann

  • Conda (📥 3K · ⏱️ 15.02.2023):

     conda install -c conda-forge pyriemann
    
xLearn (🥉25 · ⭐ 3K · 💤) - High performance, easy-to-use, and scalable machine learning (ML).. Apache-2
  • GitHub (👨‍💻 30 · 🔀 530 · 📥 3.9K · 📦 120 · 📋 310 - 62% open · ⏱️ 05.06.2022):

     git clone https://github.com/aksnzhy/xlearn

  • PyPi (📥 3.2K / month · 📦 12 · ⏱️ 04.12.2018):

     pip install xlearn
    
fklearn (🥉25 · ⭐ 1.4K) - fklearn: Functional Machine Learning. Apache-2
  • GitHub (👨‍💻 53 · 🔀 160 · 📦 13 · 📋 60 - 60% open · ⏱️ 11.04.2023):

     git clone https://github.com/nubank/fklearn

  • PyPi (📥 3.2K / month · ⏱️ 06.09.2022):

     pip install fklearn
    
NeuPy (🥉24 · ⭐ 740) - NeuPy is a Tensorflow based python library for prototyping and building.. MIT
  • GitHub (👨‍💻 8 · 🔀 160 · 📦 150 · 📋 270 - 12% open · ⏱️ 03.01.2023):

     git clone https://github.com/itdxer/neupy

  • PyPi (📥 5K / month):

     pip install neupy
    
ThunderSVM (🥉20 · ⭐ 1.5K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
  • GitHub (👨‍💻 35 · 🔀 200 · 📥 2.6K · 📋 220 - 31% open · ⏱️ 13.05.2023):

     git clone https://github.com/Xtra-Computing/thundersvm

  • PyPi (📥 390 / month · ⏱️ 13.03.2020):

     pip install thundersvm
    
NeoML (🥉19 · ⭐ 730) - Machine learning framework for both deep learning and traditional.. Apache-2
  • GitHub (👨‍💻 33 · 🔀 120 · 📋 64 - 21% open · ⏱️ 31.05.2023):

     git clone https://github.com/neoml-lib/neoml

  • PyPi (📥 47 / month):

     pip install neoml
    
chefboost (🥉18 · ⭐ 400) - A Lightweight Decision Tree Framework supporting regular algorithms:.. MIT
  • GitHub (👨‍💻 6 · 🔀 95 · 📦 38 · 📋 32 - 15% open · ⏱️ 30.03.2023):

     git clone https://github.com/serengil/chefboost

  • PyPi (📥 670 / month · ⏱️ 16.02.2022):

     pip install chefboost
    
ThunderGBM (🥉16 · ⭐ 660 · 💤) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
  • GitHub (👨‍💻 10 · 🔀 87 · 📦 2 · 📋 77 - 45% open · ⏱️ 13.09.2022):

     git clone https://github.com/Xtra-Computing/thundergbm

  • PyPi (📥 34 / month · ⏱️ 19.09.2022):

     pip install thundergbm
    
Show 16 hidden projects...
  • dlib (🥈41 · ⭐ 12K) - A toolkit for making real world machine learning and data analysis.. ❗️BSL-1.0
  • MindsDB (🥈34 · ⭐ 16K) - MindsDB is a Server for Artificial Intelligence Logic. Enabling.. ❗️GPL-3.0
  • Theano (🥈34 · ⭐ 9.7K) - Theano was a Python library that allows you to define, optimize,.. ❗Unlicensed
  • Turi Create (🥈33 · ⭐ 11K · 💀) - Turi Create simplifies the development of custom machine.. BSD-3
  • ivy (🥉31 · ⭐ 11K) - The Unified Machine Learning Framework. ❗Unlicensed
  • TFlearn (🥉30 · ⭐ 9.6K · 💀) - Deep learning library featuring a higher-level API for.. ❗Unlicensed
  • NuPIC (🥉28 · ⭐ 6.3K · 💀) - Numenta Platform for Intelligent Computing is an implementation.. ❗️AGPL-3.0
  • Lasagne (🥉28 · ⭐ 3.8K · 💀) - Lightweight library to build and train neural networks in Theano. MIT
  • CNTK (🥉26 · ⭐ 17K · 💀) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning.. ❗Unlicensed
  • SHOGUN (🥉26 · ⭐ 2.9K · 💀) - Unified and efficient Machine Learning. BSD-3
  • mace (🥉23 · ⭐ 4.8K · 💀) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
  • neon (🥉22 · ⭐ 3.9K · 💀) - Intel Nervana reference deep learning framework committed to best.. Apache-2
  • Torchbearer (🥉21 · ⭐ 630 · 💀) - torchbearer: A model fitting library for PyTorch. MIT
  • Objax (🥉20 · ⭐ 740) - Apache-2
  • elegy (🥉18 · ⭐ 450 · 💀) - A High Level API for Deep Learning in JAX. MIT
  • StarSpace (🥉16 · ⭐ 3.9K · 💀) - Learning embeddings for classification, retrieval and ranking. MIT

Data Visualization


General-purpose and task-specific data visualization libraries.

Matplotlib (🥇48 · ⭐ 17K) - matplotlib: plotting with Python. ❗Unlicensed
  • GitHub (👨‍💻 1.5K · 🔀 6.7K · 📦 850K · 📋 9.5K - 13% open · ⏱️ 01.06.2023):

     git clone https://github.com/matplotlib/matplotlib

  • PyPi (📥 33M / month):

     pip install matplotlib

  • Conda (📥 19M · ⏱️ 06.03.2023):

     conda install -c conda-forge matplotlib
    
Bokeh (🥇42 · ⭐ 18K) - Interactive Data Visualization in the browser, from Python. BSD-3
  • GitHub (👨‍💻 660 · 🔀 4K · 📦 70K · 📋 7.3K - 9% open · ⏱️ 31.05.2023):

     git clone https://github.com/bokeh/bokeh

  • PyPi (📥 3.4M / month):

     pip install bokeh

  • Conda (📥 11M · ⏱️ 10.05.2023):

     conda install -c conda-forge bokeh
    
Seaborn (🥇42 · ⭐ 11K) - Statistical data visualization in Python. BSD-3
  • GitHub (👨‍💻 200 · 🔀 1.6K · 📥 250 · 📦 260K · 📋 2.3K - 5% open · ⏱️ 21.05.2023):

     git clone https://github.com/mwaskom/seaborn

  • PyPi (📥 8.9M / month):

     pip install seaborn

  • Conda (📥 6.7M · ⏱️ 31.12.2022):

     conda install -c conda-forge seaborn
    
dash (🥇41 · ⭐ 19K) - Data Apps & Dashboards for Python. No JavaScript Required. MIT
  • GitHub (👨‍💻 140 · 🔀 1.9K · 📦 47K · 📋 1.5K - 47% open · ⏱️ 31.05.2023):

     git clone https://github.com/plotly/dash

  • PyPi (📥 1.5M / month · 📦 1.4K · ⏱️ 30.01.2023):

     pip install dash

  • Conda (📥 970K · ⏱️ 31.05.2023):

     conda install -c conda-forge dash
    
Altair (🥇41 · ⭐ 8.3K) - Declarative statistical visualization library for Python. BSD-3
  • GitHub (👨‍💻 160 · 🔀 730 · 📥 45 · 📦 58K · 📋 1.8K - 10% open · ⏱️ 26.05.2023):

     git clone https://github.com/altair-viz/altair

  • PyPi (📥 12M / month · 📦 570 · ⏱️ 01.03.2023):

     pip install altair

  • Conda (📥 1.8M · ⏱️ 29.05.2023):

     conda install -c conda-forge altair
    
pyecharts (🥇37 · ⭐ 14K) - Python Echarts Plotting Library. MIT
  • GitHub (👨‍💻 39 · 🔀 2.7K · 📦 3.2K · 📋 1.7K - 0% open · ⏱️ 10.04.2023):

     git clone https://github.com/pyecharts/pyecharts

  • PyPi (📥 99K / month):

     pip install pyecharts
    
Plotly (🥈36 · ⭐ 14K · 📉) - The interactive graphing library for Python This project now includes.. MIT
  • GitHub (👨‍💻 230 · 🔀 2.3K · 📋 2.6K - 49% open · ⏱️ 19.05.2023):

     git clone https://github.com/plotly/plotly.py

  • PyPi (📥 7.9M / month):

     pip install plotly

  • Conda (📥 4.3M · ⏱️ 05.04.2023):

     conda install -c conda-forge plotly

  • npm (📥 28K / month):

     npm install plotlywidget
    
plotnine (🥈36 · ⭐ 3.5K) - A grammar of graphics for Python. MIT
  • GitHub (👨‍💻 100 · 🔀 200 · 📦 5.1K · 📋 560 - 13% open · ⏱️ 10.05.2023):

     git clone https://github.com/has2k1/plotnine

  • PyPi (📥 890K / month · 📦 250 · ⏱️ 29.09.2022):

     pip install plotnine

  • Conda (📥 270K · ⏱️ 09.05.2023):

     conda install -c conda-forge plotnine
    
PyQtGraph (🥈36 · ⭐ 3.3K) - Fast data visualization and GUI tools for scientific / engineering.. MIT
  • GitHub (👨‍💻 250 · 🔀 1K · 📋 1.2K - 30% open · ⏱️ 29.05.2023):

     git clone https://github.com/pyqtgraph/pyqtgraph

  • PyPi (📥 110K / month · 📦 940 · ⏱️ 29.09.2022):

     pip install pyqtgraph

  • Conda (📥 410K · ⏱️ 14.04.2023):

     conda install -c conda-forge pyqtgraph
    
FiftyOne (🥈36 · ⭐ 3.1K) - Visualize, create, and debug image and video datasets.. Apache-2
  • GitHub (👨‍💻 81 · 🔀 350 · 📦 310 · 📋 1.3K - 37% open · ⏱️ 28.05.2023):

     git clone https://github.com/voxel51/fiftyone

  • PyPi (📥 67K / month · 📦 7 · ⏱️ 04.01.2023):

     pip install fiftyone
    
VisPy (🥈35 · ⭐ 3.1K) - High-performance interactive 2D/3D data visualization library. BSD-3
  • GitHub (👨‍💻 190 · 🔀 610 · 📦 1.2K · 📋 1.4K - 22% open · ⏱️ 29.05.2023):

     git clone https://github.com/vispy/vispy

  • PyPi (📥 68K / month · 📦 130 · ⏱️ 14.11.2022):

     pip install vispy

  • Conda (📥 390K · ⏱️ 13.05.2023):

     conda install -c conda-forge vispy

  • npm (📥 6 / month · 📦 1 · ⏱️ 15.03.2020):

     npm install vispy
    
PyVista (🥈35 · ⭐ 1.8K) - 3D plotting and mesh analysis through a streamlined interface for.. MIT
  • GitHub (👨‍💻 120 · 🔀 340 · 📥 710 · 📦 1.9K · 📋 1.2K - 28% open · ⏱️ 01.06.2023):

     git clone https://github.com/pyvista/pyvista

  • PyPi (📥 96K / month · 📦 200 · ⏱️ 02.11.2022):

     pip install pyvista

  • Conda (📥 340K · ⏱️ 22.05.2023):

     conda install -c conda-forge pyvista
    
wordcloud (🥈34 · ⭐ 9.5K · 📈) - A little word cloud generator in Python. MIT
  • GitHub (👨‍💻 68 · 🔀 2.3K · 📋 520 - 22% open · ⏱️ 18.05.2023):

     git clone https://github.com/amueller/word_cloud

  • PyPi (📥 800K / month · 📦 800 · ⏱️ 27.06.2022):

     pip install wordcloud

  • Conda (📥 410K · ⏱️ 22.05.2023):

     conda install -c conda-forge wordcloud
    
UMAP (🥈34 · ⭐ 6.2K) - Uniform Manifold Approximation and Projection. BSD-3
  • GitHub (👨‍💻 120 · 🔀 730 · 📦 9.2K · 📋 720 - 55% open · ⏱️ 27.05.2023):

     git clone https://github.com/lmcinnes/umap

  • PyPi (📥 810K / month · 📦 460 · ⏱️ 13.04.2022):

     pip install umap-learn

  • Conda (📥 1.9M · ⏱️ 19.05.2023):

     conda install -c conda-forge umap-learn
    
datashader (🥈33 · ⭐ 3K) - Quickly and accurately render even the largest data. BSD-3
  • GitHub (👨‍💻 53 · 🔀 360 · 📦 2.4K · 📋 550 - 24% open · ⏱️ 30.05.2023):

     git clone https://github.com/holoviz/datashader

  • PyPi (📥 52K / month · 📦 120 · ⏱️ 02.02.2023):

     pip install datashader

  • Conda (📥 580K · ⏱️ 31.05.2023):

     conda install -c conda-forge datashader
    
Graphviz (🥈33 · ⭐ 1.4K) - Simple Python interface for Graphviz. MIT
  • GitHub (👨‍💻 19 · 🔀 190 · 📦 49K · 📋 160 - 4% open · ⏱️ 30.01.2023):

     git clone https://github.com/xflr6/graphviz

  • PyPi (📥 7.8M / month):

     pip install graphviz

  • Conda (📥 37K · ⏱️ 16.03.2023):

     conda install -c anaconda python-graphviz
    
Perspective (🥈30 · ⭐ 6.3K) - A data visualization and analytics component, especially.. Apache-2
  • GitHub (👨‍💻 88 · 🔀 690 · 📦 9 · 📋 640 - 14% open · ⏱️ 31.05.2023):

     git clone https://github.com/finos/perspective

  • PyPi (📥 5.3K / month · 📦 11 · ⏱️ 20.01.2023):

     pip install perspective-python

  • Conda (📥 350K · ⏱️ 31.05.2023):

     conda install -c conda-forge perspective

  • npm (📥 1.7K / month):

     npm install @finos/perspective-jupyterlab
    
D-Tale (🥈30 · ⭐ 4.1K) - Visualizer for pandas data structures. ❗️LGPL-2.1
  • GitHub (👨‍💻 29 · 🔀 340 · 📦 730 · 📋 510 - 8% open · ⏱️ 16.05.2023):

     git clone https://github.com/man-group/dtale

  • PyPi (📥 220K / month · 📦 16 · ⏱️ 17.06.2022):

     pip install dtale

  • Conda (📥 220K · ⏱️ 16.05.2023):

     conda install -c conda-forge dtale
    
bqplot (🥈30 · ⭐ 3.4K) - Plotting library for IPython/Jupyter notebooks. Apache-2
  • GitHub (👨‍💻 62 · 🔀 470 · 📦 43 · 📋 610 - 40% open · ⏱️ 11.04.2023):

     git clone https://github.com/bqplot/bqplot

  • PyPi (📥 140K / month · 📦 100 · ⏱️ 02.09.2022):

     pip install bqplot

  • Conda (📥 1.2M · ⏱️ 12.04.2023):

     conda install -c conda-forge bqplot

  • npm (📥 3.9K / month · 📦 14 · ⏱️ 11.04.2023):

     npm install bqplot
    
HoloViews (🥈30 · ⭐ 2.4K · 📉) - With Holoviews, your data visualizes itself. BSD-3
  • GitHub (👨‍💻 130 · 🔀 360 · 📋 3K - 32% open · ⏱️ 09.05.2023):

     git clone https://github.com/holoviz/holoviews

  • PyPi (📥 340K / month):

     pip install holoviews

  • Conda (📥 1.2M · ⏱️ 11.05.2023):

     conda install -c conda-forge holoviews

  • npm (📥 680 / month):

     npm install @pyviz/jupyterlab_pyviz
    
hvPlot (🥈30 · ⭐ 760) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
  • GitHub (👨‍💻 40 · 🔀 85 · 📦 3K · 📋 620 - 41% open · ⏱️ 23.05.2023):

     git clone https://github.com/holoviz/hvplot

  • PyPi (📥 140K / month · 📦 82 · ⏱️ 24.11.2022):

     pip install hvplot

  • Conda (📥 390K · ⏱️ 17.03.2023):

     conda install -c conda-forge hvplot
    
Facets Overview (🥉29 · ⭐ 7.1K) - Visualizations for machine learning datasets. Apache-2
  • GitHub (👨‍💻 31 · 🔀 910 · 📦 180 · 📋 160 - 50% open · ⏱️ 24.05.2023):

     git clone https://github.com/pair-code/facets

  • PyPi (📥 130K / month · 📦 9 · ⏱️ 30.01.2023):

     pip install facets-overview
    
missingno (🥉29 · ⭐ 3.5K) - Missing data visualization module for Python. MIT
  • GitHub (👨‍💻 18 · 🔀 450 · 📦 12K · 📋 130 - 7% open · ⏱️ 26.02.2023):

     git clone https://github.com/ResidentMario/missingno

  • PyPi (📥 410K / month · 📦 130 · ⏱️ 27.02.2022):

     pip install missingno

  • Conda (📥 280K · ⏱️ 15.02.2020):

     conda install -c conda-forge missingno
    
mpld3 (🥉29 · ⭐ 2.3K) - D3 Renderings of Matplotlib Graphics. BSD-3
  • GitHub (👨‍💻 51 · 🔀 350 · 📦 4.7K · 📋 360 - 59% open · ⏱️ 10.12.2022):

     git clone https://github.com/mpld3/mpld3

  • PyPi (📥 240K / month · 📦 410 · ⏱️ 10.12.2022):

     pip install mpld3

  • Conda (📥 180K · ⏱️ 10.12.2022):

     conda install -c conda-forge mpld3

  • npm (📥 880 / month · 📦 8 · ⏱️ 10.12.2022):

     npm install mpld3
    
pythreejs (🥉28 · ⭐ 870) - A Jupyter - Three.js bridge. BSD-3
  • GitHub (👨‍💻 30 · 🔀 180 · 📦 26 · 📋 230 - 25% open · ⏱️ 20.02.2023):

     git clone https://github.com/jupyter-widgets/pythreejs

  • PyPi (📥 86K / month · 📦 56 · ⏱️ 20.02.2023):

     pip install pythreejs

  • Conda (📥 480K · ⏱️ 16.03.2023):

     conda install -c conda-forge pythreejs

  • npm (📥 4.2K / month · 📦 11 · ⏱️ 20.02.2023):

     npm install jupyter-threejs
    
data-validation (🥉28 · ⭐ 720) - Library for exploring and validating machine learning.. Apache-2
  • GitHub (👨‍💻 25 · 🔀 150 · 📥 420 · 📦 690 · 📋 170 - 21% open · ⏱️ 30.05.2023):

     git clone https://github.com/tensorflow/data-validation

  • PyPi (📥 420K / month · 📦 30 · ⏱️ 08.12.2022):

     pip install tensorflow-data-validation
    
openTSNE (🥉27 · ⭐ 1.2K) - Extensible, parallel implementations of t-SNE. BSD-3
  • GitHub (👨‍💻 11 · 🔀 140 · 📦 580 · 📋 120 - 0% open · ⏱️ 26.05.2023):

     git clone https://github.com/pavlin-policar/openTSNE

  • PyPi (📥 21K / month):

     pip install opentsne

  • Conda (📥 210K · ⏱️ 24.05.2023):

     conda install -c conda-forge opentsne
    
lets-plot (🥉27 · ⭐ 880) - An open-source plotting library for statistical data. MIT
  • GitHub (👨‍💻 19 · 🔀 45 · 📥 460 · 📦 33 · 📋 380 - 25% open · ⏱️ 31.05.2023):

     git clone https://github.com/JetBrains/lets-plot

  • PyPi (📥 11K / month · 📦 1 · ⏱️ 15.12.2022):

     pip install lets-plot
    
pandas-profiling (🥉26 · ⭐ 11K · 📈) - Deprecated pandas-profiling package, use ydata-.. MIT
  • GitHub (👨‍💻 110 · 🔀 1.5K · 📦 550):

     git clone https://github.com/ydataai/pandas-profiling

  • PyPi (📥 970K / month · 📦 190 · ⏱️ 31.01.2023):

     pip install pandas-profiling

  • Conda (📥 370K · ⏱️ 25.01.2023):

     conda install -c conda-forge pandas-profiling
    
Chartify (πŸ₯‰26 Β· ⭐ 3.3K) - Python library that makes it easy for data scientists to create.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 25 Β· πŸ”€ 300 Β· πŸ“¦ 71 Β· πŸ“‹ 74 - 58% open Β· ⏱️ 21.04.2023):

     git clone https://github.com/spotify/chartify
    
  • PyPi (πŸ“₯ 3.8K / month):

     pip install chartify
    
  • Conda (πŸ“₯ 26K Β· ⏱️ 07.11.2020):

     conda install -c conda-forge chartify
    
Plotly-Resampler (πŸ₯‰26 Β· ⭐ 730) - Visualize large time series data with plotly.py. MIT
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 48 Β· πŸ“¦ 310 Β· πŸ“‹ 120 - 29% open Β· ⏱️ 30.05.2023):

     git clone https://github.com/predict-idlab/plotly-resampler
    
  • PyPi (πŸ“₯ 290K / month):

     pip install plotly-resampler
    
  • Conda (πŸ“₯ 19K Β· ⏱️ 10.04.2023):

     conda install -c conda-forge plotly-resampler
    
HiPlot (πŸ₯‰25 Β· ⭐ 2.5K) - HiPlot makes understanding high dimensional data easy. MIT
  • GitHub (πŸ‘¨β€πŸ’» 8 Β· πŸ”€ 120 Β· πŸ“¦ 280 Β· πŸ“‹ 84 - 15% open Β· ⏱️ 03.03.2023):

     git clone https://github.com/facebookresearch/hiplot
    
  • PyPi (πŸ“₯ 34K / month):

     pip install hiplot
    
  • Conda (πŸ“₯ 140K Β· ⏱️ 31.05.2022):

     conda install -c conda-forge hiplot
    
Multicore-TSNE (πŸ₯‰25 Β· ⭐ 1.8K) - Parallel t-SNE implementation with Python and Torch.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 200 Β· πŸ“¦ 380 Β· πŸ“‹ 59 - 64% open Β· ⏱️ 26.05.2023):

     git clone https://github.com/DmitryUlyanov/Multicore-TSNE
    
  • PyPi (πŸ“₯ 12K / month Β· πŸ“¦ 23 Β· ⏱️ 09.01.2019):

     pip install MulticoreTSNE
    
  • Conda (πŸ“₯ 25K Β· ⏱️ 09.11.2021):

     conda install -c conda-forge multicore-tsne
    
AutoViz (πŸ₯‰25 Β· ⭐ 1.3K) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 170 Β· πŸ“¦ 450 Β· πŸ“‹ 70 - 1% open Β· ⏱️ 28.05.2023):

     git clone https://github.com/AutoViML/AutoViz
    
  • PyPi (πŸ“₯ 25K / month):

     pip install autoviz
    
  • Conda (πŸ“₯ 35K Β· ⏱️ 16.05.2023):

     conda install -c conda-forge autoviz
    
vega (πŸ₯‰25 Β· ⭐ 350 Β· πŸ“ˆ) - IPython/Jupyter notebook module for Vega and Vega-Lite. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 62 Β· πŸ“¦ 2 Β· πŸ“‹ 100 - 12% open Β· ⏱️ 01.06.2023):

     git clone https://github.com/vega/ipyvega
    
  • PyPi (πŸ“₯ 9.4K / month Β· πŸ“¦ 84 Β· ⏱️ 10.02.2022):

     pip install vega
    
  • Conda (πŸ“₯ 550K Β· ⏱️ 12.04.2023):

     conda install -c conda-forge vega
    
Sweetviz (πŸ₯‰24 Β· ⭐ 2.4K Β· πŸ’€) - Visualize and compare datasets, target values and associations, with.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 240 Β· πŸ“‹ 120 - 38% open Β· ⏱️ 08.06.2022):

     git clone https://github.com/fbdesignpro/sweetviz
    
  • PyPi (πŸ“₯ 67K / month Β· πŸ“¦ 10 Β· ⏱️ 14.06.2022):

     pip install sweetviz
    
  • Conda (πŸ“₯ 22K Β· ⏱️ 15.06.2022):

     conda install -c conda-forge sweetviz
    
Pandas-Bokeh (πŸ₯‰24 Β· ⭐ 840) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
  • GitHub (πŸ‘¨β€πŸ’» 15 Β· πŸ”€ 110 Β· πŸ“¦ 460 Β· πŸ“‹ 100 - 32% open Β· ⏱️ 06.03.2023):

     git clone https://github.com/PatrikHlobil/Pandas-Bokeh
    
  • PyPi (πŸ“₯ 23K / month Β· πŸ“¦ 12 Β· ⏱️ 11.04.2021):

     pip install pandas-bokeh
    
Popmon (πŸ₯‰23 Β· ⭐ 450) - Monitor the stability of a Pandas or Spark dataframe. MIT
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 34 Β· πŸ“₯ 81 Β· πŸ“¦ 20 Β· πŸ“‹ 52 - 28% open Β· ⏱️ 26.05.2023):

     git clone https://github.com/ing-bank/popmon
    
  • PyPi (πŸ“₯ 10K / month Β· πŸ“¦ 2 Β· ⏱️ 19.10.2022):

     pip install popmon
    
python-ternary (πŸ₯‰22 Β· ⭐ 640) - Ternary plotting library for python with matplotlib. MIT
  • GitHub (πŸ‘¨β€πŸ’» 27 Β· πŸ”€ 140 Β· πŸ“₯ 20 Β· πŸ“¦ 140 Β· πŸ“‹ 140 - 27% open Β· ⏱️ 31.12.2022):

     git clone https://github.com/marcharper/python-ternary
    
  • PyPi (πŸ“₯ 57K / month Β· πŸ“¦ 26 Β· ⏱️ 17.02.2021):

     pip install python-ternary
    
  • Conda (πŸ“₯ 75K Β· ⏱️ 17.02.2021):

     conda install -c conda-forge python-ternary
    
PyWaffle (πŸ₯‰19 Β· ⭐ 550 Β· πŸ’€) - Make Waffle Charts in Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 99 Β· πŸ“¦ 230 Β· πŸ“‹ 21 - 23% open Β· ⏱️ 08.06.2022):

     git clone https://github.com/gyli/PyWaffle
    
  • PyPi (πŸ“₯ 3.9K / month):

     pip install pywaffle
    
  • Conda (πŸ“₯ 9.6K Β· ⏱️ 05.06.2022):

     conda install -c conda-forge pywaffle
    
Show 14 hidden projects...
  • cartopy (πŸ₯ˆ32 Β· ⭐ 1.2K) - Cartopy - a cartographic python library with matplotlib support. ❗️LGPL-3.0
  • Cufflinks (πŸ₯‰29 Β· ⭐ 2.8K Β· πŸ’€) - Productivity Tools for Plotly + Pandas. MIT
  • HyperTools (πŸ₯‰25 Β· ⭐ 1.8K Β· πŸ’€) - A Python toolbox for gaining geometric insights into high-.. MIT
  • PandasGUI (πŸ₯‰24 Β· ⭐ 2.9K Β· πŸ’€) - A GUI for Pandas DataFrames. ❗️MIT-0
  • PDPbox (πŸ₯‰24 Β· ⭐ 750 Β· πŸ’€) - python partial dependence plot toolbox. MIT
  • pivottablejs (πŸ₯‰22 Β· ⭐ 550 Β· πŸ’€) - Drag'n'drop Pivot Tables and Charts for Jupyter/IPython.. MIT
  • joypy (πŸ₯‰21 Β· ⭐ 480 Β· πŸ’€) - Joyplots in Python with matplotlib & pandas. MIT
  • vegafusion (πŸ₯‰21 Β· ⭐ 240) - Serverside scaling for Vega and Altair visualizations. BSD-3
  • ivis (πŸ₯‰19 Β· ⭐ 300) - Dimensionality reduction in very large datasets using Siamese.. Apache-2
  • animatplot (πŸ₯‰18 Β· ⭐ 400 Β· πŸ’€) - A Python package for animating plots built on matplotlib. MIT
  • data-describe (πŸ₯‰17 Β· ⭐ 290 Β· πŸ’€) - datadescribe: Pythonic EDA Accelerator for Data Science. Apache-2
  • pdvega (πŸ₯‰16 Β· ⭐ 340 Β· πŸ’€) - Interactive plotting for Pandas using Vega-Lite. MIT
  • nx-altair (πŸ₯‰16 Β· ⭐ 210 Β· πŸ’€) - Draw interactive NetworkX graphs with Altair. MIT
  • nptsne (πŸ₯‰9 Β· ⭐ 30 Β· πŸ’€) - nptsne is a numpy compatible python binary package that offers a.. Apache-2

Text Data & NLP

Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.
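
As a rough illustration of one task named above, fuzzy matching, the sketch below scores string similarity with Python's standard-library `difflib`. It does not use the API of any project in this list; the helper name `similarity` and the sample strings are made up for the example, and the dedicated libraries listed in this section typically offer more algorithms and better performance.

```python
from difflib import SequenceMatcher, get_close_matches


def similarity(a: str, b: str) -> float:
    """Return a similarity ratio in [0, 1] for two strings, ignoring case."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


# Hypothetical candidate names, used only for this illustration.
candidates = ["transformers", "tokenizers", "textacy", "textblob"]

# Find the candidates closest to a misspelled query, best match first.
matches = get_close_matches("tranformers", candidates, n=2, cutoff=0.6)

print(matches)
print(round(similarity("spaCy", "spacy"), 2))
```

`SequenceMatcher.ratio()` returns a value between 0 and 1, so the cutoff of 0.6 (the `get_close_matches` default) is a tunable assumption rather than a recommended setting.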

transformers (πŸ₯‡49 Β· ⭐ 100K) - Transformers: State-of-the-art Machine Learning for.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 1.9K Β· πŸ”€ 20K Β· πŸ“₯ 140 Β· πŸ“¦ 75K Β· πŸ“‹ 12K - 4% open Β· ⏱️ 01.06.2023):

     git clone https://github.com/huggingface/transformers
    
  • PyPi (πŸ“₯ 13M / month):

     pip install transformers
    
  • Conda (πŸ“₯ 1.1M Β· ⏱️ 17.05.2023):

     conda install -c conda-forge transformers
    
spaCy (πŸ₯‡44 Β· ⭐ 26K) - Industrial-strength Natural Language Processing (NLP) in Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 720 Β· πŸ”€ 4K Β· πŸ“¦ 61K Β· πŸ“‹ 5.4K - 1% open Β· ⏱️ 31.05.2023):

     git clone https://github.com/explosion/spaCy
    
  • PyPi (πŸ“₯ 5.2M / month):

     pip install spacy
    
  • Conda (πŸ“₯ 3.1M Β· ⏱️ 16.05.2023):

     conda install -c conda-forge spacy
    
Rasa (πŸ₯‡41 Β· ⭐ 16K) - Open source machine learning framework to automate text- and voice-.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 580 Β· πŸ”€ 4.2K Β· πŸ“¦ 3.4K Β· πŸ“‹ 6.6K - 0% open Β· ⏱️ 26.05.2023):

     git clone https://github.com/RasaHQ/rasa
    
  • PyPi (πŸ“₯ 120K / month):

     pip install rasa
    
nltk (πŸ₯‡41 Β· ⭐ 12K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 440 Β· πŸ”€ 2.6K Β· πŸ“¦ 200K Β· πŸ“‹ 1.7K - 13% open Β· ⏱️ 17.05.2023):

     git clone https://github.com/nltk/nltk
    
  • PyPi (πŸ“₯ 11M / month):

     pip install nltk
    
  • Conda (πŸ“₯ 2M Β· ⏱️ 02.01.2023):

     conda install -c conda-forge nltk
    
gensim (πŸ₯‡40 Β· ⭐ 14K) - Topic Modelling for Humans. ❗️LGPL-2.1
  • GitHub (πŸ‘¨β€πŸ’» 450 Β· πŸ”€ 4.1K Β· πŸ“₯ 4.4K Β· πŸ“¦ 46K Β· πŸ“‹ 1.8K - 20% open Β· ⏱️ 25.05.2023):

     git clone https://github.com/RaRe-Technologies/gensim
    
  • PyPi (πŸ“₯ 3.8M / month):

     pip install gensim
    
  • Conda (πŸ“₯ 1.1M Β· ⏱️ 28.03.2023):

     conda install -c conda-forge gensim
    
flair (πŸ₯‡40 Β· ⭐ 13K) - A very simple framework for state-of-the-art Natural Language Processing.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 240 Β· πŸ”€ 2K Β· πŸ“¦ 2.5K Β· πŸ“‹ 2.1K - 3% open Β· ⏱️ 12.05.2023):

     git clone https://github.com/flairNLP/flair
    
  • PyPi (πŸ“₯ 75K / month Β· πŸ“¦ 90 Β· ⏱️ 20.05.2022):

     pip install flair
    
  • Conda (πŸ“₯ 18K Β· ⏱️ 13.04.2023):

     conda install -c conda-forge python-flair
    
fairseq (πŸ₯‡38 Β· ⭐ 26K) - Facebook AI Research Sequence-to-Sequence Toolkit written in Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 420 Β· πŸ”€ 5.6K Β· πŸ“₯ 310 Β· πŸ“¦ 1.6K Β· πŸ“‹ 3.9K - 23% open Β· ⏱️ 31.05.2023):

     git clone https://github.com/facebookresearch/fairseq
    
  • PyPi (πŸ“₯ 550K / month):

     pip install fairseq
    
  • Conda (πŸ“₯ 34K Β· ⏱️ 12.05.2023):

     conda install -c conda-forge fairseq
    
sentencepiece (πŸ₯‡37 Β· ⭐ 7.5K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 78 Β· πŸ”€ 960 Β· πŸ“₯ 27K Β· πŸ“¦ 31K Β· πŸ“‹ 620 - 2% open Β· ⏱️ 25.05.2023):

     git clone https://github.com/google/sentencepiece
    
  • PyPi (πŸ“₯ 11M / month Β· πŸ“¦ 670 Β· ⏱️ 07.08.2022):

     pip install sentencepiece
    
  • Conda (πŸ“₯ 460K Β· ⏱️ 27.05.2023):

     conda install -c conda-forge sentencepiece
    
haystack (πŸ₯‡36 Β· ⭐ 9K) - Haystack is an open source NLP framework to interact with your data.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 180 Β· πŸ”€ 1.2K Β· πŸ“₯ 22 Β· πŸ“¦ 720 Β· πŸ“‹ 2.2K - 15% open Β· ⏱️ 01.06.2023):

     git clone https://github.com/deepset-ai/haystack
    
  • PyPi:

     pip install farm-haystack
    
sentence-transformers (πŸ₯‡35 Β· ⭐ 11K) - Multilingual Sentence & Image Embeddings with BERT. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 110 Β· πŸ”€ 2K Β· πŸ“¦ 9.1K Β· πŸ“‹ 1.8K - 55% open Β· ⏱️ 23.05.2023):

     git clone https://github.com/UKPLab/sentence-transformers
    
  • PyPi (πŸ“₯ 2.3M / month Β· πŸ“¦ 300 Β· ⏱️ 26.06.2022):

     pip install sentence-transformers
    
  • Conda (πŸ“₯ 120K Β· ⏱️ 27.06.2022):

     conda install -c conda-forge sentence-transformers
    
NeMo (πŸ₯‡35 Β· ⭐ 6.8K) - NeMo: a toolkit for conversational AI. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 220 Β· πŸ”€ 1.6K Β· πŸ“₯ 41K Β· πŸ“‹ 1.6K - 4% open Β· ⏱️ 31.05.2023):

     git clone https://github.com/NVIDIA/NeMo
    
  • PyPi (πŸ“₯ 42K / month Β· πŸ“¦ 12 Β· ⏱️ 01.07.2022):

     pip install nemo-toolkit
    
OpenNMT (πŸ₯‡35 Β· ⭐ 6.1K) - Open Source Neural Machine Translation in PyTorch. MIT
  • GitHub (πŸ‘¨β€πŸ’» 190 Β· πŸ”€ 2K Β· πŸ“¦ 190 Β· πŸ“‹ 1.4K - 0% open Β· ⏱️ 01.06.2023):

     git clone https://github.com/OpenNMT/OpenNMT-py
    
  • PyPi (πŸ“₯ 3.9K / month):

     pip install OpenNMT-py
    
torchtext (πŸ₯‡35 Β· ⭐ 3.3K) - Models, data loaders and abstractions for language processing,.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 150 Β· πŸ”€ 800 Β· πŸ“‹ 800 - 37% open Β· ⏱️ 30.05.2023):

     git clone https://github.com/pytorch/text
    
  • PyPi (πŸ“₯ 1.2M / month Β· πŸ“¦ 470 Β· ⏱️ 15.12.2022):

     pip install torchtext
    
spark-nlp (πŸ₯‡35 Β· ⭐ 3.3K) - State of the Art Natural Language Processing. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 110 Β· πŸ”€ 660 Β· πŸ“¦ 300 Β· πŸ“‹ 790 - 4% open Β· ⏱️ 29.05.2023):

     git clone https://github.com/JohnSnowLabs/spark-nlp
    
  • PyPi (πŸ“₯ 2.8M / month Β· πŸ“¦ 19 Β· ⏱️ 24.01.2023):

     pip install spark-nlp
    
TensorFlow Text (πŸ₯‡35 Β· ⭐ 1.1K) - Making text a first-class citizen in TensorFlow. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 110 Β· πŸ”€ 280 Β· πŸ“¦ 5.3K Β· πŸ“‹ 210 - 26% open Β· ⏱️ 12.05.2023):

     git clone https://github.com/tensorflow/text
    
  • PyPi (πŸ“₯ 4M / month):

     pip install tensorflow-text
    
fastText (πŸ₯ˆ34 Β· ⭐ 25K) - Library for fast text representation and classification. MIT
  • GitHub (πŸ‘¨β€πŸ’» 60 Β· πŸ”€ 4.4K Β· πŸ“¦ 4.5K Β· πŸ“‹ 1.1K - 42% open Β· ⏱️ 17.04.2023):

     git clone https://github.com/facebookresearch/fastText
    
  • PyPi (πŸ“₯ 980K / month):

     pip install fasttext
    
  • Conda (πŸ“₯ 56K Β· ⏱️ 01.11.2022):

     conda install -c conda-forge fasttext
    
AllenNLP (πŸ₯ˆ34 Β· ⭐ 12K Β· πŸ’€) - An open-source NLP research library, built on PyTorch. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 260 Β· πŸ”€ 2.2K Β· πŸ“₯ 55 Β· πŸ“¦ 3.6K Β· πŸ“‹ 2.6K - 3% open Β· ⏱️ 22.11.2022):

     git clone https://github.com/allenai/allennlp
    
  • PyPi (πŸ“₯ 48K / month):

     pip install allennlp
    
  • Conda (πŸ“₯ 110K Β· ⏱️ 15.07.2022):

     conda install -c conda-forge allennlp
    
ParlAI (πŸ₯ˆ34 Β· ⭐ 10K) - A framework for training and evaluating AI models on a variety of.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 210 Β· πŸ”€ 2K Β· πŸ“¦ 190 Β· πŸ“‹ 1.5K - 3% open Β· ⏱️ 31.05.2023):

     git clone https://github.com/facebookresearch/ParlAI
    
  • PyPi (πŸ“₯ 2.9K / month):

     pip install parlai
    
TextBlob (πŸ₯ˆ34 Β· ⭐ 8.6K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 36 Β· πŸ”€ 1.1K Β· πŸ“₯ 100 Β· πŸ“¦ 30K Β· πŸ“‹ 270 - 41% open Β· ⏱️ 11.03.2023):

     git clone https://github.com/sloria/TextBlob
    
  • PyPi (πŸ“₯ 830K / month Β· πŸ“¦ 1.4K Β· ⏱️ 22.10.2021):

     pip install textblob
    
  • Conda (πŸ“₯ 220K Β· ⏱️ 24.02.2019):

     conda install -c conda-forge textblob
    
Tokenizers (πŸ₯ˆ34 Β· ⭐ 7.1K) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 69 Β· πŸ”€ 580 Β· πŸ“₯ 4 Β· πŸ“¦ 61 Β· πŸ“‹ 790 - 32% open Β· ⏱️ 25.05.2023):

     git clone https://github.com/huggingface/tokenizers
    
  • PyPi (πŸ“₯ 11M / month Β· πŸ“¦ 200 Β· ⏱️ 07.11.2022):

     pip install tokenizers
    
  • Conda (πŸ“₯ 1.1M Β· ⏱️ 05.04.2023):

     conda install -c conda-forge tokenizers
    
textacy (πŸ₯ˆ34 Β· ⭐ 2.1K) - NLP, before and after spaCy. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 35 Β· πŸ”€ 250 Β· πŸ“¦ 1.5K Β· πŸ“‹ 260 - 11% open Β· ⏱️ 03.04.2023):

     git clone https://github.com/chartbeat-labs/textacy
    
  • PyPi (πŸ“₯ 150K / month Β· πŸ“¦ 110 Β· ⏱️ 06.12.2021):

     pip install textacy
    
  • Conda (πŸ“₯ 130K Β· ⏱️ 09.03.2023):

     conda install -c conda-forge textacy
    
jellyfish (πŸ₯ˆ32 Β· ⭐ 1.9K) - a python library for doing approximate and phonetic matching of strings. MIT
  • GitHub (πŸ‘¨β€πŸ’» 29 Β· πŸ”€ 150 Β· πŸ“¦ 6.1K Β· πŸ“‹ 120 - 6% open Β· ⏱️ 10.04.2023):

     git clone https://github.com/jamesturk/jellyfish
    
  • PyPi (πŸ“₯ 1.8M / month):

     pip install jellyfish
    
  • Conda (πŸ“₯ 580K Β· ⏱️ 28.10.2022):

     conda install -c conda-forge jellyfish
    
Dedupe (πŸ₯ˆ31 Β· ⭐ 3.7K) - A python library for accurate and scalable fuzzy matching, record.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 71 Β· πŸ”€ 500 Β· πŸ“¦ 280 Β· πŸ“‹ 800 - 8% open Β· ⏱️ 17.02.2023):

     git clone https://github.com/dedupeio/dedupe
    
  • PyPi (πŸ“₯ 68K / month Β· πŸ“¦ 49 Β· ⏱️ 18.01.2023):

     pip install dedupe
    
  • Conda (πŸ“₯ 33K Β· ⏱️ 12.12.2022):

     conda install -c conda-forge dedupe
    
rubrix (πŸ₯ˆ31 Β· ⭐ 2K) - Argilla: the open-source data curation platform for LLMs. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 48 Β· πŸ”€ 180 Β· πŸ“¦ 570 Β· πŸ“‹ 1.1K - 18% open Β· ⏱️ 01.06.2023):

     git clone https://github.com/recognai/rubrix
    
  • PyPi (πŸ“₯ 680 / month):

     pip install rubrix
    
  • Conda (πŸ“₯ 22K Β· ⏱️ 06.10.2022):

     conda install -c conda-forge rubrix
    
spacy-transformers (πŸ₯ˆ31 Β· ⭐ 1.3K) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT spacy
  • GitHub (πŸ‘¨β€πŸ’» 21 Β· πŸ”€ 150 Β· πŸ“¦ 1K Β· ⏱️ 22.05.2023):

     git clone https://github.com/explosion/spacy-transformers
    
  • PyPi (πŸ“₯ 260K / month):

     pip install spacy-transformers
    
  • Conda (πŸ“₯ 15K Β· ⏱️ 23.05.2023):

     conda install -c conda-forge spacy-transformers
    
snowballstemmer (πŸ₯ˆ31 Β· ⭐ 660) - Snowball compiler and stemming algorithms. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 32 Β· πŸ”€ 170 Β· πŸ“¦ 4 Β· πŸ“‹ 81 - 37% open Β· ⏱️ 24.04.2023):

     git clone https://github.com/snowballstem/snowball
    
  • PyPi (πŸ“₯ 8.6M / month Β· πŸ“¦ 6.8K Β· ⏱️ 16.11.2021):

     pip install snowballstemmer
    
  • Conda (πŸ“₯ 6.6M Β· ⏱️ 17.11.2021):

     conda install -c conda-forge snowballstemmer
    
DeepPavlov (πŸ₯ˆ30 Β· ⭐ 6.2K) - An open source library for deep learning end-to-end dialog.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 73 Β· πŸ”€ 1.1K Β· πŸ“¦ 350 Β· πŸ“‹ 630 - 8% open Β· ⏱️ 14.03.2023):

     git clone https://github.com/deepmipt/DeepPavlov
    
  • PyPi (πŸ“₯ 8.9K / month):

     pip install deeppavlov
    
DeepKE (πŸ₯ˆ30 Β· ⭐ 1.9K) - An Open Toolkit for Knowledge Graph Extraction and Construction.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 22 Β· πŸ”€ 490 Β· πŸ“¦ 16 Β· ⏱️ 01.06.2023):

     git clone https://github.com/zjunlp/deepke
    
  • PyPi (πŸ“₯ 1.3K / month):

     pip install deepke
    
SciSpacy (πŸ₯ˆ30 Β· ⭐ 1.4K) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 29 Β· πŸ”€ 190 Β· πŸ“¦ 710 Β· πŸ“‹ 290 - 9% open Β· ⏱️ 16.05.2023):

     git clone https://github.com/allenai/scispacy
    
  • PyPi (πŸ“₯ 39K / month):

     pip install scispacy
    
ftfy (πŸ₯ˆ29 Β· ⭐ 3.5K Β· πŸ’€) - Fixes mojibake and other glitches in Unicode text, after the fact. MIT
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 120 Β· πŸ“¦ 11K Β· πŸ“‹ 130 - 11% open Β· ⏱️ 25.10.2022):

     git clone https://github.com/rspeer/python-ftfy
    
  • PyPi (πŸ“₯ 4.1M / month Β· πŸ“¦ 580 Β· ⏱️ 09.02.2022):

     pip install ftfy
    
  • Conda (πŸ“₯ 250K Β· ⏱️ 13.03.2022):

     conda install -c conda-forge ftfy
    
GluonNLP (πŸ₯ˆ29 Β· ⭐ 2.5K) - Toolkit that enables easy text preprocessing, datasets loading.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 84 Β· πŸ”€ 500 Β· πŸ“¦ 1.3K Β· πŸ“‹ 530 - 44% open Β· ⏱️ 25.12.2022):

     git clone https://github.com/dmlc/gluon-nlp
    
  • PyPi (πŸ“₯ 100K / month):

     pip install gluonnlp
    
english-words (πŸ₯ˆ28 Β· ⭐ 9K Β· πŸ’€) - A text file containing 479k English words for all your.. Unlicense
  • GitHub (πŸ‘¨β€πŸ’» 30 Β· πŸ”€ 1.6K Β· πŸ“‹ 120 - 72% open Β· ⏱️ 08.11.2022):

     git clone https://github.com/dwyl/english-words
    
  • PyPi (πŸ“₯ 360K / month Β· πŸ“¦ 8 Β· ⏱️ 06.01.2023):

     pip install english-words
    
nlpaug (πŸ₯ˆ28 Β· ⭐ 4K Β· πŸ’€) - Data augmentation for NLP. MIT
  • GitHub (πŸ‘¨β€πŸ’» 33 Β· πŸ”€ 430 Β· πŸ“¦ 740 Β· πŸ“‹ 210 - 27% open Β· ⏱️ 07.07.2022):

     git clone https://github.com/makcedward/nlpaug
    
  • PyPi (πŸ“₯ 120K / month Β· πŸ“¦ 26 Β· ⏱️ 07.07.2022):

     pip install nlpaug
    
  • Conda (πŸ“₯ 9.7K Β· ⏱️ 30.01.2023):

     conda install -c conda-forge nlpaug
    
Sumy (πŸ₯ˆ28 Β· ⭐ 3.2K) - Module for automatic summarization of text documents and HTML pages. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 26 Β· πŸ”€ 500 Β· πŸ“¦ 2K Β· πŸ“‹ 110 - 15% open Β· ⏱️ 21.02.2023):

     git clone https://github.com/miso-belica/sumy
    
  • PyPi (πŸ“₯ 31K / month Β· πŸ“¦ 110 Β· ⏱️ 23.10.2022):

     pip install sumy
    
  • Conda (πŸ“₯ 5.1K Β· ⏱️ 25.10.2022):

     conda install -c conda-forge sumy
    
fastNLP (πŸ₯ˆ28 Β· ⭐ 2.9K) - fastNLP: A Modularized and Extensible NLP Framework. Currently still.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 62 Β· πŸ”€ 460 Β· πŸ“₯ 69 Β· πŸ“¦ 140 Β· πŸ“‹ 220 - 28% open Β· ⏱️ 13.12.2022):

     git clone https://github.com/fastnlp/fastNLP
    
  • PyPi (πŸ“₯ 14K / month Β· πŸ“¦ 4 Β· ⏱️ 04.02.2019):

     pip install fastnlp
    
Ciphey (πŸ₯ˆ27 Β· ⭐ 13K) - Automatically decrypt encryptions without knowing the key or cipher, decode.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 47 Β· πŸ”€ 790 Β· πŸ“‹ 300 - 15% open Β· ⏱️ 05.12.2022):

     git clone https://github.com/Ciphey/Ciphey
    
  • PyPi (πŸ“₯ 40K / month):

     pip install ciphey
    
  • Docker Hub (πŸ“₯ 19K Β· ⭐ 14 Β· ⏱️ 10.03.2023):

     docker pull remnux/ciphey
    
TextDistance (πŸ₯ˆ27 Β· ⭐ 3.1K Β· πŸ’€) - Compute distance between sequences. 30+ algorithms, pure.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 13 Β· πŸ”€ 240 Β· πŸ“₯ 900 Β· πŸ“¦ 4.2K Β· ⏱️ 18.09.2022):

     git clone https://github.com/life4/textdistance
    
  • PyPi (πŸ“₯ 360K / month Β· πŸ“¦ 60 Β· ⏱️ 20.09.2022):

     pip install textdistance
    
  • Conda (πŸ“₯ 390K Β· ⏱️ 18.09.2022):

     conda install -c conda-forge textdistance
    
scattertext (πŸ₯ˆ27 Β· ⭐ 2.1K) - Beautiful visualizations of how language differs among document.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 280 Β· πŸ“¦ 400 Β· πŸ“‹ 97 - 19% open Β· ⏱️ 06.05.2023):

     git clone https://github.com/JasonKessler/scattertext
    
  • PyPi (πŸ“₯ 8.8K / month Β· πŸ“¦ 10 Β· ⏱️ 26.03.2022):

     pip install scattertext
    
  • Conda (πŸ“₯ 79K Β· ⏱️ 08.12.2022):

     conda install -c conda-forge scattertext
    
qdrant (πŸ₯‰26 Β· ⭐ 11K) - Qdrant - Vector Database for the next generation of AI applications... Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 50 Β· πŸ”€ 520 Β· πŸ“₯ 120 Β· πŸ“‹ 530 - 17% open Β· ⏱️ 31.05.2023):

     git clone https://github.com/qdrant/qdrant
    
T5 (πŸ₯‰26 Β· ⭐ 5.2K) - Code for the paper Exploring the Limits of Transfer Learning with a.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 56 Β· πŸ”€ 680 Β· πŸ“¦ 170 Β· πŸ“‹ 400 - 14% open Β· ⏱️ 22.05.2023):

     git clone https://github.com/google-research/text-to-text-transfer-transformer
    
  • PyPi (πŸ“₯ 21K / month):

     pip install t5
    
OpenPrompt (πŸ₯‰26 Β· ⭐ 3.3K) - An Open-Source Framework for Prompt-Learning. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 20 Β· πŸ”€ 360 Β· πŸ“¦ 53 Β· πŸ“‹ 230 - 25% open Β· ⏱️ 06.05.2023):

     git clone https://github.com/thunlp/OpenPrompt
    
  • PyPi (πŸ“₯ 3.1K / month Β· πŸ“¦ 2 Β· ⏱️ 06.07.2022):

     pip install openprompt
    
PyTextRank (πŸ₯‰26 Β· ⭐ 2K Β· πŸ’€) - Python implementation of TextRank algorithms (textgraphs) for.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 340 Β· πŸ“¦ 410 Β· πŸ“‹ 96 - 25% open Β· ⏱️ 27.07.2022):

     git clone https://github.com/DerwenAI/pytextrank
    
  • PyPi (πŸ“₯ 45K / month Β· πŸ“¦ 16 Β· ⏱️ 27.07.2022):

     pip install pytextrank
    
sense2vec (πŸ₯‰26 Β· ⭐ 1.5K) - Contextually-keyed word vectors. MIT
  • GitHub (πŸ‘¨β€πŸ’» 19 Β· πŸ”€ 240 Β· πŸ“₯ 54K Β· πŸ“¦ 260 Β· πŸ“‹ 110 - 18% open Β· ⏱️ 20.04.2023):

     git clone https://github.com/explosion/sense2vec
    
  • PyPi (πŸ“₯ 5.3K / month Β· πŸ“¦ 13 Β· ⏱️ 08.12.2022):

     pip install sense2vec
    
  • Conda (πŸ“₯ 33K Β· ⏱️ 14.07.2021):

     conda install -c conda-forge sense2vec
    
CLTK (πŸ₯‰25 Β· ⭐ 790) - The Classical Language Toolkit. MIT
  • GitHub (πŸ‘¨β€πŸ’» 120 Β· πŸ”€ 310 Β· πŸ“₯ 34 Β· πŸ“¦ 240 Β· πŸ“‹ 550 - 6% open Β· ⏱️ 06.03.2023):

     git clone https://github.com/cltk/cltk
    
  • PyPi (πŸ“₯ 1.5K / month):

     pip install cltk
    
FARM (πŸ₯‰23 Β· ⭐ 1.7K Β· πŸ’€) - Fast & easy transfer learning for NLP. Harvesting language.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 37 Β· πŸ”€ 230 Β· πŸ“‹ 400 - 0% open Β· ⏱️ 31.08.2022):

     git clone https://github.com/deepset-ai/FARM
    
  • PyPi (πŸ“₯ 4.1K / month Β· πŸ“¦ 3 Β· ⏱️ 10.06.2021):

     pip install farm
    
  • Conda (πŸ“₯ 2.6K Β· ⏱️ 14.06.2021):

     conda install -c conda-forge farm
    
jiant (πŸ₯‰23 Β· ⭐ 1.5K Β· πŸ’€) - jiant is an nlp toolkit. MIT
  • GitHub (πŸ‘¨β€πŸ’» 60 Β· πŸ”€ 280 Β· πŸ“¦ 3 Β· πŸ“‹ 550 - 12% open Β· ⏱️ 17.10.2022):

     git clone https://github.com/nyu-mll/jiant
    
  • PyPi (πŸ“₯ 230 / month Β· ⏱️ 10.05.2021):

     pip install jiant
    
YouTokenToMe (πŸ₯‰23 Β· ⭐ 880) - Unsupervised text tokenizer focused on computational efficiency. MIT
  • GitHub (πŸ‘¨β€πŸ’» 8 Β· πŸ”€ 72 Β· πŸ“¦ 460 Β· πŸ“‹ 56 - 51% open Β· ⏱️ 29.03.2023):

     git clone https://github.com/vkcom/youtokentome
    
  • PyPi (πŸ“₯ 29K / month):

     pip install youtokentome
    
  • Conda (πŸ“₯ 31K Β· ⏱️ 30.10.2022):

     conda install -c conda-forge youtokentome
    
lightseq (πŸ₯‰22 Β· ⭐ 2.8K) - LightSeq: A High Performance Library for Sequence Processing and.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 300 Β· πŸ“₯ 660 Β· πŸ“‹ 270 - 59% open Β· ⏱️ 10.05.2023):

     git clone https://github.com/bytedance/lightseq
    
  • PyPi (πŸ“₯ 3.5K / month Β· πŸ“¦ 2 Β· ⏱️ 03.11.2022):

     pip install lightseq
    
promptsource (πŸ₯‰22 Β· ⭐ 1.8K) - Toolkit for creating, sharing and using natural language.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 62 Β· πŸ”€ 240 Β· πŸ“‹ 190 - 27% open Β· ⏱️ 31.05.2023):

     git clone https://github.com/bigscience-workshop/promptsource
    
  • PyPi (πŸ“₯ 3.2K / month Β· πŸ“¦ 1 Β· ⏱️ 18.04.2022):

     pip install promptsource
    
Sockeye (πŸ₯‰22 Β· ⭐ 1.2K) - Sequence-to-sequence framework with a focus on Neural Machine.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 59 Β· πŸ”€ 300 Β· πŸ“₯ 16 Β· πŸ“‹ 300 - 0% open Β· ⏱️ 02.03.2023):

     git clone https://github.com/awslabs/sockeye
    
  • PyPi (πŸ“₯ 900 / month):

     pip install sockeye
    
detoxify (πŸ₯‰22 Β· ⭐ 630) - Trained models & code to predict toxic comments on all 3 Jigsaw.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 9 Β· πŸ”€ 83 Β· πŸ“₯ 230K Β· πŸ“¦ 310 Β· πŸ“‹ 50 - 54% open Β· ⏱️ 15.05.2023):

     git clone https://github.com/unitaryai/detoxify
    
  • PyPi (πŸ“₯ 11K / month):

     pip install detoxify
    
NLP Architect (πŸ₯‰21 Β· ⭐ 2.9K Β· πŸ’€) - A model library for exploring state-of-the-art deep.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 38 Β· πŸ”€ 440 Β· πŸ“¦ 9 Β· πŸ“‹ 130 - 11% open Β· ⏱️ 07.11.2022):

     git clone https://github.com/IntelLabs/nlp-architect
    
  • PyPi (πŸ“₯ 200 / month):

     pip install nlp-architect
    
Texthero (πŸ₯‰21 Β· ⭐ 2.7K Β· πŸ’€) - Text preprocessing, representation and visualization from zero to.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 20 Β· πŸ”€ 230 Β· πŸ“₯ 110 Β· πŸ“‹ 120 - 45% open Β· ⏱️ 28.10.2022):

     git clone https://github.com/jbesomi/texthero
    
  • PyPi (πŸ“₯ 14K / month):

     pip install texthero
    
finetune (πŸ₯‰21 Β· ⭐ 680) - Scikit-learn style model finetuning for NLP. MPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 21 Β· πŸ”€ 75 Β· πŸ“¦ 10 Β· πŸ“‹ 140 - 16% open Β· ⏱️ 18.05.2023):

     git clone https://github.com/IndicoDataSolutions/finetune
    
  • PyPi (πŸ“₯ 92 / month):

     pip install finetune
    
small-text (πŸ₯‰21 Β· ⭐ 460) - Active Learning for Text Classification in Python. MIT
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 47 Β· πŸ“¦ 19 Β· πŸ“‹ 31 - 22% open Β· ⏱️ 27.05.2023):

     git clone https://github.com/webis-de/small-text
    
  • PyPi (πŸ“₯ 620 / month Β· ⏱️ 14.10.2022):

     pip install small-text
    
  • Conda (πŸ“₯ 2.9K Β· ⏱️ 21.02.2023):

     conda install -c conda-forge small-text
    
fast-bert (πŸ₯‰20 Β· ⭐ 1.8K) - Super easy library for BERT based NLP models. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 36 Β· πŸ”€ 330 Β· πŸ“‹ 250 - 61% open Β· ⏱️ 31.03.2023):

     git clone https://github.com/utterworks/fast-bert
    
  • PyPi (πŸ“₯ 2K / month):

     pip install fast-bert
    
happy-transformer (πŸ₯‰20 Β· ⭐ 420) - A package built on top of Hugging Face's transformers.. Apache-2 huggingface
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 55 Β· πŸ“¦ 160 Β· πŸ“‹ 120 - 22% open Β· ⏱️ 06.04.2023):

     git clone https://github.com/EricFillion/happy-transformer
    
  • PyPi (πŸ“₯ 4.5K / month Β· πŸ“¦ 5 Β· ⏱️ 06.02.2022):

     pip install happytransformer
    
textaugment (πŸ₯‰18 Β· ⭐ 320) - TextAugment: Text Augmentation Library. MIT
  • GitHub (πŸ‘¨β€πŸ’» 7 Β· πŸ”€ 55 Β· πŸ“₯ 57 Β· πŸ“¦ 51 Β· πŸ“‹ 20 - 35% open Β· ⏱️ 03.04.2023):

     git clone https://github.com/dsfsi/textaugment
    
  • PyPi (πŸ“₯ 3.4K / month Β· πŸ“¦ 4 Β· ⏱️ 05.11.2020):

     pip install textaugment
    
TextBox (πŸ₯‰17 Β· ⭐ 970) - TextBox 2.0 is a text generation library with pre-trained language models. MIT
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 110 Β· πŸ“¦ 5 Β· πŸ“‹ 63 - 4% open Β· ⏱️ 18.05.2023):

     git clone https://github.com/RUCAIBox/TextBox
    
  • PyPi (πŸ“₯ 1 / month Β· ⏱️ 15.04.2021):

     pip install textbox
    
UForm (πŸ₯‰17 Β· ⭐ 360 Β· 🐣) - Multi-Modal Transformers library for Semantic Search and other.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 8 Β· πŸ”€ 14 Β· πŸ“¦ 4 Β· πŸ“‹ 5 - 20% open Β· ⏱️ 02.05.2023):

     git clone https://github.com/unum-cloud/uform
    
  • PyPi (πŸ“₯ 350 / month Β· ⏱️ 02.05.2023):

     pip install uform
    
OpenNRE (πŸ₯‰16 Β· ⭐ 4K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
  • GitHub (πŸ‘¨β€πŸ’» 12 Β· πŸ”€ 1K Β· πŸ“‹ 360 - 2% open Β· ⏱️ 03.01.2023):

     git clone https://github.com/thunlp/OpenNRE
    
Translate (πŸ₯‰16 Β· ⭐ 790 Β· πŸ’€) - Translate - a PyTorch Language Library. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 88 Β· πŸ”€ 200 Β· πŸ“‹ 55 - 50% open Β· ⏱️ 10.06.2022):

     git clone https://github.com/pytorch/translate
    
  • PyPi (πŸ“₯ 3 / month Β· ⏱️ 01.05.2018):

     pip install pytorch-translate
    
VizSeq (πŸ₯‰12 Β· ⭐ 420) - An Analysis Toolkit for Natural Language Generation (Translation,.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 54 Β· πŸ“¦ 6 Β· πŸ“‹ 15 - 40% open Β· ⏱️ 08.05.2023):

     git clone https://github.com/facebookresearch/vizseq
    
  • PyPi (πŸ“₯ 62 / month):

     pip install vizseq
    
Show 38 hidden projects...
  • ChatterBot (πŸ₯‡35 Β· ⭐ 13K Β· πŸ’€) - ChatterBot is a machine learning, conversational dialog engine.. BSD-3
  • fuzzywuzzy (πŸ₯ˆ33 Β· ⭐ 8.9K Β· πŸ’€) - Fuzzy String Matching in Python. ❗️GPL-2.0
  • stanza (πŸ₯ˆ31 Β· ⭐ 6.7K) - Official Stanford NLP Python Library for Many Human Languages. ❗Unlicensed
  • langid (πŸ₯ˆ28 Β· ⭐ 2.1K Β· πŸ’€) - Stand-alone language identification system. BSD-3
  • vaderSentiment (πŸ₯ˆ27 Β· ⭐ 4K Β· πŸ’€) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary and.. MIT
  • polyglot (πŸ₯ˆ27 Β· ⭐ 2.2K Β· πŸ’€) - Multilingual text (NLP) processing toolkit. ❗️GPL-3.0
  • flashtext (πŸ₯‰26 Β· ⭐ 5.4K Β· πŸ’€) - Extract Keywords from sentence or Replace keywords in sentences. MIT
  • neuralcoref (πŸ₯‰26 Β· ⭐ 2.7K Β· πŸ’€) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
  • underthesea (πŸ₯‰26 Β· ⭐ 1.2K) - Underthesea - Vietnamese NLP Toolkit. ❗️GPL-3.0
  • pytorch-nlp (πŸ₯‰25 Β· ⭐ 2.2K Β· πŸ’€) - Basic Utilities for PyTorch Natural Language Processing.. BSD-3
  • textgenrnn (πŸ₯‰24 Β· ⭐ 4.9K Β· πŸ’€) - Easily train your own text-generating neural network of any.. MIT
  • Snips NLU (πŸ₯‰24 Β· ⭐ 3.8K Β· πŸ’€) - Snips Python library to extract meaning from text. Apache-2
  • MatchZoo (πŸ₯‰24 Β· ⭐ 3.8K Β· πŸ’€) - Facilitating the design, comparison and sharing of deep.. Apache-2
  • Kashgari (πŸ₯‰23 Β· ⭐ 2.4K Β· πŸ’€) - Kashgari is a production-level NLP Transfer learning.. Apache-2
  • pySBD (πŸ₯‰23 Β· ⭐ 620 Β· πŸ’€) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
  • gpt-2-simple (πŸ₯‰22 Β· ⭐ 3.3K Β· πŸ’€) - Python package to easily retrain OpenAI's GPT-2 text-.. MIT
  • Texar (πŸ₯‰22 Β· ⭐ 2.4K Β· πŸ’€) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2
  • stop-words (πŸ₯‰22 Β· ⭐ 150 Β· πŸ’€) - Get list of common stop words in various languages in Python. BSD-3
  • DELTA (πŸ₯‰21 Β· ⭐ 1.5K Β· πŸ’€) - DELTA is a deep learning based natural language and speech.. Apache-2
  • anaGo (πŸ₯‰21 Β· ⭐ 1.5K Β· πŸ’€) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,.. MIT
  • PyText (πŸ₯‰20 Β· ⭐ 6.4K Β· πŸ’€) - A natural language modeling framework based on PyTorch. ❗Unlicensed
  • pyfasttext (πŸ₯‰20 Β· ⭐ 230 Β· πŸ’€) - Yet another Python binding for fastText. ❗️GPL-3.0
  • fastT5 (πŸ₯‰19 Β· ⭐ 470 Β· πŸ’€) - boost inference speed of T5 models by 5x & reduce the model size.. Apache-2
  • numerizer (πŸ₯‰19 Β· ⭐ 200) - A Python module to convert natural language numerics into ints and.. MIT
  • DeepMatcher (πŸ₯‰18 Β· ⭐ 490 Β· πŸ’€) - Python package for performing Entity and Text Matching using.. BSD-3
  • NeuroNER (πŸ₯‰17 Β· ⭐ 1.7K Β· πŸ’€) - Named-entity recognition using neural networks. Easy-to-use and.. MIT
  • Camphr (πŸ₯‰17 Β· ⭐ 340 Β· πŸ’€) - Camphr - NLP library for creating pipeline components. Apache-2 spacy
  • skift (πŸ₯‰17 Β· ⭐ 230 Β· πŸ’€) - scikit-learn wrappers for Python fastText. MIT
  • nboost (πŸ₯‰16 Β· ⭐ 660 Β· πŸ’€) - NBoost is a scalable, search-api-boosting platform for deploying.. Apache-2
  • whoosh (πŸ₯‰16 Β· ⭐ 410 Β· πŸ’€) - Pure-Python full-text search library. ❗Unlicensed
  • textpipe (πŸ₯‰16 Β· ⭐ 300 Β· πŸ’€) - Textpipe: clean and extract metadata from text. MIT
  • Headliner (πŸ₯‰15 Β· ⭐ 230 Β· πŸ’€) - Easy training and deployment of seq2seq models. MIT
  • BLINK (πŸ₯‰14 Β· ⭐ 1K Β· πŸ’€) - Entity Linker solution. MIT
  • spacy-dbpedia-spotlight (πŸ₯‰14 Β· ⭐ 88) - A spaCy wrapper for DBpedia Spotlight. MIT spacy
  • TransferNLP (πŸ₯‰13 Β· ⭐ 290 Β· πŸ’€) - NLP library designed for reproducible experimentation.. MIT
  • ONNX-T5 (πŸ₯‰13 Β· ⭐ 230 Β· πŸ’€) - Summarization, translation, sentiment-analysis, text-generation.. Apache-2
  • NeuralQA (πŸ₯‰13 Β· ⭐ 220 Β· πŸ’€) - NeuralQA: A Usable Library for Question Answering on Large Datasets.. MIT
  • textvec (πŸ₯‰11 Β· ⭐ 190 Β· πŸ’€) - Text vectorization tool to outperform TFIDF for classification.. MIT

Image Data

Back to top

Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.

Pillow (🥇47 · ⭐ 11K) - Python Imaging Library (Fork). ❗️PIL
  • GitHub (👨‍💻 440 · 🔀 1.9K · 📦 1.2M · 📋 2.8K - 3% open · ⏱️ 01.06.2023):

     git clone https://github.com/python-pillow/Pillow

  • PyPi (📥 55M / month):

     pip install Pillow

  • Conda (📥 29M · ⏱️ 21.05.2023):

     conda install -c conda-forge pillow
    
Kornia (🥇38 · ⭐ 8.2K) - Differentiable Computer Vision Library. Apache-2
  • GitHub (👨‍💻 220 · 🔀 840 · 📥 650 · 📦 4.5K · 📋 790 - 31% open · ⏱️ 30.05.2023):

     git clone https://github.com/kornia/kornia

  • PyPi (📥 2.7M / month · 📦 100 · ⏱️ 21.12.2022):

     pip install kornia

  • Conda (📥 82K · ⏱️ 21.04.2023):

     conda install -c conda-forge kornia
    
imageio (🥇38 · ⭐ 1.3K) - Python library for reading and writing image data. BSD-2
  • GitHub (👨‍💻 100 · 🔀 260 · 📥 690 · 📦 92K · 📋 540 - 13% open · ⏱️ 31.05.2023):

     git clone https://github.com/imageio/imageio

  • PyPi (📥 15M / month):

     pip install imageio

  • Conda (📥 5M · ⏱️ 02.05.2023):

     conda install -c conda-forge imageio
    
MoviePy (🥇37 · ⭐ 11K) - Video editing with Python. MIT
  • GitHub (👨‍💻 160 · 🔀 1.3K · 📦 25K · 📋 1.3K - 23% open · ⏱️ 26.05.2023):

     git clone https://github.com/Zulko/moviepy

  • PyPi (📥 650K / month):

     pip install moviepy

  • Conda (📥 180K · ⏱️ 07.10.2022):

     conda install -c conda-forge moviepy
    
MMDetection (🥇36 · ⭐ 24K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2
  • GitHub (👨‍💻 390 · 🔀 8.7K · 📋 7.2K - 10% open · ⏱️ 06.04.2023):

     git clone https://github.com/open-mmlab/mmdetection

  • PyPi (📥 170K / month · 📦 29 · ⏱️ 01.06.2022):

     pip install mmdet
    
torchvision (🥇36 · ⭐ 14K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
  • GitHub (👨‍💻 530 · 🔀 6.5K · 📥 25K · 📋 2.8K - 25% open · ⏱️ 31.05.2023):

     git clone https://github.com/pytorch/vision

  • PyPi (📥 6.3M / month):

     pip install torchvision

  • Conda (📥 680K · ⏱️ 08.05.2023):

     conda install -c conda-forge torchvision
    
InsightFace (🥈35 · ⭐ 15K) - State-of-the-art 2D and 3D Face Analysis Project. MIT
  • GitHub (👨‍💻 53 · 🔀 4.3K · 📥 22K · 📦 330 · 📋 2.2K - 43% open · ⏱️ 22.05.2023):

     git clone https://github.com/deepinsight/insightface

  • PyPi (📥 130K / month):

     pip install insightface
    
PyTorch Image Models (🥈34 · ⭐ 25K) - PyTorch image models, scripts, pretrained weights --.. Apache-2
  • GitHub (👨‍💻 100 · 🔀 4.1K · 📥 4.6M · 📋 740 - 8% open · ⏱️ 25.05.2023):

     git clone https://github.com/rwightman/pytorch-image-models

  • PyPi (📥 3.2M / month):

     pip install timm

  • Conda (📥 69K · ⏱️ 14.05.2023):

     conda install -c conda-forge timm
    
detectron2 (🥈34 · ⭐ 25K) - Detectron2 is a platform for object detection, segmentation.. Apache-2
  • GitHub (👨‍💻 250 · 🔀 6.7K · 📦 1.2K · 📋 3.4K - 10% open · ⏱️ 01.06.2023):

     git clone https://github.com/facebookresearch/detectron2

  • PyPi (📦 6 · ⏱️ 06.02.2020):

     pip install detectron2

  • Conda (📥 170K · ⏱️ 15.05.2023):

     conda install -c conda-forge detectron2
    
Albumentations (🥈34 · ⭐ 12K) - Fast image augmentation library and an easy-to-use wrapper.. MIT
  • GitHub (👨‍💻 130 · 🔀 1.5K · 📦 15K · 📋 780 - 45% open · ⏱️ 23.03.2023):

     git clone https://github.com/albumentations-team/albumentations

  • PyPi (📥 640K / month · 📦 290 · ⏱️ 20.09.2022):

     pip install albumentations

  • Conda (📥 110K · ⏱️ 20.09.2022):

     conda install -c conda-forge albumentations
    
PaddleSeg (🥈34 · ⭐ 7.2K) - Easy-to-use image segmentation library with awesome pre-.. Apache-2
  • GitHub (👨‍💻 110 · 🔀 1.5K · 📦 930 · 📋 1.8K - 14% open · ⏱️ 29.05.2023):

     git clone https://github.com/PaddlePaddle/PaddleSeg

  • PyPi (📥 3.4K / month · 📦 3 · ⏱️ 30.11.2022):

     pip install paddleseg
    
PaddleDetection (🥈32 · ⭐ 11K) - Object Detection toolkit based on PaddlePaddle. It.. Apache-2
  • GitHub (👨‍💻 140 · 🔀 2.6K · 📦 84 · 📋 4.8K - 21% open · ⏱️ 30.05.2023):

     git clone https://github.com/PaddlePaddle/PaddleDetection

  • PyPi (📥 730 / month):

     pip install paddledet
    
deepface (🥈32 · ⭐ 6.5K) - A Lightweight Face Recognition and Facial Attribute Analysis (Age,.. MIT
  • GitHub (👨‍💻 41 · 🔀 1.3K · 📦 1.5K · 📋 700 - 2% open · ⏱️ 11.05.2023):

     git clone https://github.com/serengil/deepface

  • PyPi (📥 54K / month · 📦 12 · ⏱️ 24.01.2023):

     pip install deepface
    
Wand (🥈32 · ⭐ 1.3K) - The ctypes-based simple ImageMagick binding for Python. MIT
  • GitHub (👨‍💻 100 · 🔀 200 · 📥 15K · 📦 16K · 📋 400 - 6% open · ⏱️ 05.03.2023):

     git clone https://github.com/emcconville/wand

  • PyPi (📥 700K / month · 📦 710 · ⏱️ 05.01.2023):

     pip install wand

  • Conda (📥 25K · ⏱️ 22.08.2022):

     conda install -c conda-forge wand
    
Face Recognition (🥈30 · ⭐ 48K · 💤) - The world's simplest facial recognition api for Python.. MIT
  • GitHub (👨‍💻 54 · 🔀 13K · 📥 1.1K · 📋 1.3K - 55% open · ⏱️ 10.06.2022):

     git clone https://github.com/ageitgey/face_recognition

  • PyPi (📥 63K / month):

     pip install face_recognition

  • Conda (📥 16K · ⏱️ 30.04.2021):

     conda install -c conda-forge face_recognition
    
GluonCV (🥈30 · ⭐ 5.6K) - Gluon CV Toolkit. Apache-2
  • GitHub (👨‍💻 120 · 🔀 1.2K · 📋 840 - 7% open · ⏱️ 19.01.2023):

     git clone https://github.com/dmlc/gluon-cv

  • PyPi (📥 550K / month · 📦 61 · ⏱️ 03.10.2022):

     pip install gluoncv
    
ImageHash (🥈30 · ⭐ 2.8K) - A Python Perceptual Image Hashing Module. BSD-2
  • GitHub (👨‍💻 25 · 🔀 320 · 📦 9.3K · 📋 120 - 4% open · ⏱️ 07.02.2023):

     git clone https://github.com/JohannesBuchner/imagehash

  • PyPi (📥 1.3M / month):

     pip install ImageHash

  • Conda (📥 310K · ⏱️ 28.09.2022):

     conda install -c conda-forge imagehash
    
vit-pytorch (🥈29 · ⭐ 14K) - Implementation of Vision Transformer, a simple way to achieve.. MIT
  • GitHub (👨‍💻 17 · 🔀 2.3K · 📦 260 · 📋 220 - 46% open · ⏱️ 20.05.2023):

     git clone https://github.com/lucidrains/vit-pytorch

  • PyPi (📥 21K / month):

     pip install vit-pytorch
    
facenet-pytorch (🥈29 · ⭐ 3.5K) - Pretrained Pytorch face detection (MTCNN) and facial.. MIT
  • GitHub (👨‍💻 15 · 🔀 780 · 📥 620K · 📦 1.1K · 📋 170 - 39% open · ⏱️ 06.04.2023):

     git clone https://github.com/timesler/facenet-pytorch

  • PyPi (📥 28K / month · 📦 18 · ⏱️ 10.03.2021):

     pip install facenet-pytorch
    
sahi (🥈29 · ⭐ 2.7K) - Framework agnostic sliced/tiled inference + interactive ui + error analysis.. MIT
  • GitHub (👨‍💻 29 · 🔀 390 · 📥 15K · 📦 460 · ⏱️ 15.05.2023):

     git clone https://github.com/obss/sahi

  • PyPi (📥 93K / month):

     pip install sahi

  • Conda (📥 38K · ⏱️ 16.05.2023):

     conda install -c conda-forge sahi
    
lightly (🥈29 · ⭐ 2.3K) - A python library for self-supervised learning on images. MIT
  • GitHub (👨‍💻 27 · 🔀 200 · 📦 120 · 📋 410 - 19% open · ⏱️ 01.06.2023):

     git clone https://github.com/lightly-ai/lightly

  • PyPi (📥 7K / month · 📦 4 · ⏱️ 31.01.2023):

     pip install lightly
    
opencv-python (🥉28 · ⭐ 3.5K · 📉) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
  • GitHub (👨‍💻 46 · 🔀 670 · 📋 660 - 11% open · ⏱️ 01.06.2023):

     git clone https://github.com/opencv/opencv-python

  • PyPi (📥 7.8M / month):

     pip install opencv-python
    
doctr (🥉28 · ⭐ 1.8K) - docTR (Document Text Recognition) - a seamless, high-.. Apache-2
  • GitHub (👨‍💻 33 · 🔀 240 · 📥 1.4M · 📦 150 · 📋 250 - 20% open · ⏱️ 01.06.2023):

     git clone https://github.com/mindee/doctr

  • PyPi (📥 7.1K / month · 📦 4 · ⏱️ 29.09.2022):

     pip install python-doctr
    
imageai (🥉27 · ⭐ 7.8K · 📉) - A python library built to empower developers to build applications.. MIT
  • GitHub (👨‍💻 17 · 🔀 2K · 📥 860K · 📦 1.4K · 📋 720 - 39% open · ⏱️ 03.03.2023):

     git clone https://github.com/OlafenwaMoses/ImageAI

  • PyPi (📥 11K / month):

     pip install imageai

  • Conda (📥 5.7K · ⏱️ 30.04.2021):

     conda install -c conda-forge imageai
    
Face Alignment (🥉26 · ⭐ 6.3K) - 2D and 3D Face alignment library built using pytorch. BSD-3
  • GitHub (👨‍💻 25 · 🔀 1.3K · 📋 290 - 21% open · ⏱️ 15.04.2023):

     git clone https://github.com/1adrianb/face-alignment

  • PyPi (📥 55K / month):

     pip install face-alignment
    
layout-parser (🥉26 · ⭐ 3.7K · 💤) - A Unified Toolkit for Deep Learning Based Document Image.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 360 · 📦 310 · 📋 130 - 58% open · ⏱️ 06.08.2022):

     git clone https://github.com/Layout-Parser/layout-parser

  • PyPi (📥 67K / month · 📦 6 · ⏱️ 06.04.2022):

     pip install layoutparser
    
vidgear (🥉26 · ⭐ 2.8K) - A High-performance cross-platform Video Processing Python framework.. Apache-2
  • GitHub (👨‍💻 13 · 🔀 220 · 📥 810 · 📦 400 · 📋 250 - 1% open · ⏱️ 26.01.2023):

     git clone https://github.com/abhiTronix/vidgear

  • PyPi (📥 8.5K / month):

     pip install vidgear
    
Augmentor (🥉25 · ⭐ 4.9K) - Image augmentation library in Python for machine learning. MIT
  • GitHub (👨‍💻 23 · 🔀 840 · 📦 600 · 📋 190 - 61% open · ⏱️ 29.03.2023):

     git clone https://github.com/mdbloice/Augmentor

  • PyPi (📥 11K / month):

     pip install Augmentor
    
Image Deduplicator (🥉25 · ⭐ 4.5K) - Finding duplicate images made easy! Apache-2
  • GitHub (👨‍💻 15 · 🔀 410 · 📦 45 · 📋 130 - 37% open · ⏱️ 28.04.2023):

     git clone https://github.com/idealo/imagededup

  • PyPi (📥 1.5K / month · 📦 4 · ⏱️ 16.01.2023):

     pip install imagededup
    
segmentation_models (🥉25 · ⭐ 4.3K · 💤) - Segmentation models with pretrained backbones. Keras.. MIT
  • GitHub (👨‍💻 14 · 🔀 970 · 📋 510 - 47% open · ⏱️ 29.07.2022):

     git clone https://github.com/qubvel/segmentation_models

  • PyPi (📥 25K / month · 📦 28 · ⏱️ 10.01.2020):

     pip install segmentation_models
    
Norfair (🥉25 · ⭐ 1.8K) - Lightweight Python library for adding real-time multi-object tracking.. BSD-3
  • GitHub (👨‍💻 24 · 🔀 180 · 📥 270 · 📦 120 · 📋 120 - 2% open · ⏱️ 15.05.2023):

     git clone https://github.com/tryolabs/norfair

  • PyPi (📥 7K / month · 📦 2 · ⏱️ 04.01.2023):

     pip install norfair
    
MMF (🥉24 · ⭐ 5.2K) - A modular framework for vision & language multimodal research from.. BSD-3
  • GitHub (👨‍💻 110 · 🔀 880 · 📦 14 · 📋 780 - 30% open · ⏱️ 19.05.2023):

     git clone https://github.com/facebookresearch/mmf

  • PyPi (📥 240 / month · 📦 1 · ⏱️ 12.06.2020):

     pip install mmf
    
ffcv (🥉24 · ⭐ 2.5K) - FFCV: Fast Forward Computer Vision (and other ML workloads!). Apache-2
  • GitHub (👨‍💻 26 · 🔀 150 · 📦 23 · 📋 230 - 31% open · ⏱️ 18.04.2023):

     git clone https://github.com/libffcv/ffcv

  • PyPi (📥 950 / month · 📦 1 · ⏱️ 28.01.2022):

     pip install ffcv
    
pytorchvideo (🥉23 · ⭐ 2.9K) - A deep learning library for video understanding research. Apache-2
  • GitHub (👨‍💻 52 · 🔀 340 · 📋 160 - 40% open · ⏱️ 05.05.2023):

     git clone https://github.com/facebookresearch/pytorchvideo

  • PyPi (📥 19K / month):

     pip install pytorchvideo
    
kubric (🥉23 · ⭐ 1.9K) - A data generation pipeline for creating semi-realistic synthetic.. Apache-2
  • GitHub (👨‍💻 23 · 🔀 170 · 📦 2 · 📋 180 - 29% open · ⏱️ 31.03.2023):

     git clone https://github.com/google-research/kubric

  • PyPi (📥 15K / month · ⏱️ 06.07.2022):

     pip install kubric-nightly
    
Classy Vision (🥉23 · ⭐ 1.6K) - An end-to-end PyTorch framework for image and video.. MIT
  • GitHub (👨‍💻 77 · 🔀 280 · 📋 77 - 16% open · ⏱️ 23.03.2023):

     git clone https://github.com/facebookresearch/ClassyVision

  • PyPi (📥 2.2K / month):

     pip install classy_vision

  • Conda (📥 17K · ⏱️ 22.03.2022):

     conda install -c conda-forge classy_vision
    
pyvips (🥉23 · ⭐ 520) - python binding for libvips using cffi. MIT
  • GitHub (👨‍💻 15 · 🔀 47 · 📦 520 · 📋 350 - 38% open · ⏱️ 11.05.2023):

     git clone https://github.com/libvips/pyvips

  • PyPi (📥 27K / month · 📦 46 · ⏱️ 12.06.2022):

     pip install pyvips

  • Conda (📥 57K · ⏱️ 29.10.2022):

     conda install -c conda-forge pyvips
    
vissl (🥉22 · ⭐ 3K) - VISSL is FAIR's library of extensible, modular and scalable components.. MIT
  • GitHub (👨‍💻 35 · 🔀 310 · 📦 18 · 📋 180 - 42% open · ⏱️ 03.05.2023):

     git clone https://github.com/facebookresearch/vissl

  • PyPi (📥 520 / month · 📦 1 · ⏱️ 02.11.2021):

     pip install vissl
    
tensorflow-graphics (🥉22 · ⭐ 2.7K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2
  • GitHub (👨‍💻 39 · 🔀 360 · 📋 240 - 61% open · ⏱️ 27.04.2023):

     git clone https://github.com/tensorflow/graphics

  • PyPi (📥 4.4K / month · 📦 8 · ⏱️ 03.12.2021):

     pip install tensorflow-graphics
    
pycls (🥉22 · ⭐ 2.1K · 💤) - Codebase for Image Classification Research, written in PyTorch. MIT
  • GitHub (👨‍💻 17 · 🔀 230 · 📦 14 · 📋 79 - 29% open · ⏱️ 12.07.2022):

     git clone https://github.com/facebookresearch/pycls

  • PyPi (📥 130K / month):

     pip install pycls
    
icevision (🥉22 · ⭐ 820) - An Agnostic Computer Vision Framework - Pluggable to any Training.. Apache-2
  • GitHub (👨‍💻 41 · 🔀 140 · 📋 570 - 10% open · ⏱️ 07.12.2022):

     git clone https://github.com/airctic/icevision

  • PyPi (📥 9.4K / month):

     pip install icevision
    
image-match (🥉20 · ⭐ 2.9K) - Quickly search over billions of images. Apache-2
  • GitHub (👨‍💻 19 · 🔀 400 · 📋 110 - 57% open · ⏱️ 06.12.2022):

     git clone https://github.com/ProvenanceLabs/image-match

  • PyPi (📥 920 / month · 📦 4 · ⏱️ 13.02.2017):

     pip install image_match
    
DE⫶TR (🥉19 · ⭐ 11K) - End-to-End Object Detection with Transformers. Apache-2
  • GitHub (👨‍💻 26 · 🔀 2K · 📋 490 - 42% open · ⏱️ 07.02.2023):

     git clone https://github.com/facebookresearch/detr
    
PySlowFast (🥉19 · ⭐ 5.7K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2
  • GitHub (👨‍💻 30 · 🔀 1.1K · 📦 10 · 📋 610 - 54% open · ⏱️ 24.04.2023):

     git clone https://github.com/facebookresearch/SlowFast

  • PyPi (📥 13 / month):

     pip install pyslowfast
    
scenic (🥉19 · ⭐ 2.2K) - Scenic: A Jax Library for Computer Vision Research and Beyond. Apache-2
  • GitHub (👨‍💻 63 · 🔀 310 · 📋 150 - 52% open · ⏱️ 01.06.2023):

     git clone https://github.com/google-research/scenic
    
Show 19 hidden projects...
  • scikit-image (🥇42 · ⭐ 5.4K) - Image processing in Python. ❗Unlicensed
  • imgaug (🥈35 · ⭐ 14K · 💀) - Image augmentation for machine learning experiments. MIT
  • glfw (🥈35 · ⭐ 11K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input. ❗️Zlib
  • PyTorch3D (🥈30 · ⭐ 7.3K) - PyTorch3D is FAIR's library of reusable components for.. ❗Unlicensed
  • imutils (🥈29 · ⭐ 4.3K · 💀) - A series of convenience functions to make basic image processing.. MIT
  • chainercv (🥉27 · ⭐ 1.5K · 💀) - ChainerCV: a Library for Deep Learning in Computer Vision. MIT
  • mtcnn (🥉26 · ⭐ 2K · 💀) - MTCNN face detection implementation for TensorFlow, as a PIP package. MIT
  • Pillow-SIMD (🥉26 · ⭐ 2K · 💀) - The friendly PIL fork. ❗️PIL
  • mahotas (🥉25 · ⭐ 800) - Computer Vision in Python. ❗Unlicensed
  • CellProfiler (🥉25 · ⭐ 770) - An open-source application for biological image analysis. ❗Unlicensed
  • deep-daze (🥉22 · ⭐ 4.4K · 💀) - Simple command line tool for text to image generation using.. MIT
  • Image Super-Resolution (🥉22 · ⭐ 4.2K · 💀) - Super-scale your images and run experiments with.. Apache-2
  • Luminoth (🥉22 · ⭐ 2.4K · 💀) - Deep Learning toolkit for Computer Vision. BSD-3
  • nude.py (🥉21 · ⭐ 910 · 💀) - Nudity detection with Python. MIT
  • detecto (🥉20 · ⭐ 590 · 💀) - Build fully-functioning computer vision models with PyTorch. MIT
  • Caer (🥉18 · ⭐ 690 · 💀) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
  • solt (🥉18 · ⭐ 260 · 💀) - Streaming over lightweight data transformations. MIT
  • Torch Points 3D (🥉14 · ⭐ 140 · 💀) - Pytorch framework for doing deep learning on point.. BSD-3
  • HugsVision (🥉13 · ⭐ 180) - HugsVision is an easy-to-use huggingface wrapper for state-of-the-.. MIT huggingface
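
The last-update timestamps in the entries use a DD.MM.YYYY format, and the legend marks a project as inactive after 6 months and dead after 12 months without activity. As a rough illustration of how those two pieces relate, the sketch below classifies a timestamp against a reference date. The function names, the whole-calendar-month rule, and the reference date of 01.06.2023 (the most recent timestamp that appears in the entries) are assumptions made for this example; the list generator's actual logic may differ.

```python
from datetime import date, datetime


def parse_stamp(stamp: str) -> date:
    """Parse a DD.MM.YYYY timestamp as it appears in the list entries."""
    return datetime.strptime(stamp, "%d.%m.%Y").date()


def months_between(earlier: date, later: date) -> int:
    """Whole calendar months elapsed from `earlier` to `later` (never negative)."""
    months = (later.year - earlier.year) * 12 + (later.month - earlier.month)
    if later.day < earlier.day:
        months -= 1  # the current month has not fully elapsed yet
    return max(months, 0)


def activity_status(last_update: str, today: date) -> str:
    """Classify a project with the legend's thresholds (assumed: 6 and 12 months)."""
    months = months_between(parse_stamp(last_update), today)
    if months >= 12:
        return "dead"
    if months >= 6:
        return "inactive"
    return "active"


reference = date(2023, 6, 1)  # assumed snapshot date, not stated by the list
print(activity_status("10.06.2022", reference))  # inactive
print(activity_status("01.06.2023", reference))  # active
print(activity_status("13.02.2017", reference))  # dead
```

Under these assumptions, a GitHub timestamp of 10.06.2022 falls just short of twelve whole months, so it counts as inactive and not yet dead.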

Graph Data

Back to top

Libraries for graph processing, clustering, embedding, and machine learning tasks.

dgl (🥇37 · ⭐ 12K) - Python package built to ease deep learning on graph, on top of existing DL.. Apache-2
  • GitHub (👨‍💻 260 · 🔀 2.7K · 📦 95 · 📋 2.2K - 14% open · ⏱️ 01.06.2023):

     git clone https://github.com/dmlc/dgl

  • PyPi (📥 64K / month):

     pip install dgl
    
PyTorch Geometric (🥇35 · ⭐ 18K · 📉) - Graph Neural Network Library for PyTorch. MIT
  • GitHub (👨‍💻 420 · 🔀 3.2K · 📋 3.1K - 22% open · ⏱️ 01.06.2023):

     git clone https://github.com/pyg-team/pytorch_geometric

  • PyPi (📥 180K / month):

     pip install torch-geometric

  • Conda (📥 25K · ⏱️ 28.04.2023):

     conda install -c conda-forge pytorch_geometric
    
ogb (🥈31 · ⭐ 1.7K) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
  • GitHub (👨‍💻 30 · 🔀 380 · 📦 860 · 📋 270 - 5% open · ⏱️ 25.05.2023):

     git clone https://github.com/snap-stanford/ogb

  • PyPi (📥 29K / month · 📦 22 · ⏱️ 02.11.2022):

     pip install ogb

  • Conda (📥 24K · ⏱️ 07.04.2023):

     conda install -c conda-forge ogb
    
pygraphistry (🥈28 · ⭐ 1.9K) - PyGraphistry is a Python library to quickly load, shape,.. BSD-3
  • GitHub (👨‍💻 37 · 🔀 190 · 📦 92 · 📋 280 - 48% open · ⏱️ 04.05.2023):

     git clone https://github.com/graphistry/pygraphistry

  • PyPi (📥 3.3K / month):

     pip install graphistry
    
Paddle Graph Learning (🥈28 · ⭐ 1.5K) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2
  • GitHub (👨‍💻 31 · 🔀 300 · 📦 43 · 📋 190 - 36% open · ⏱️ 17.05.2023):

     git clone https://github.com/PaddlePaddle/PGL

  • PyPi (📥 1.2K / month):

     pip install pgl
    
Spektral (🥈27 · ⭐ 2.3K) - Graph Neural Networks with Keras and Tensorflow 2. MIT
  • GitHub (👨‍💻 26 · 🔀 330 · 📦 210 · 📋 260 - 23% open · ⏱️ 11.02.2023):

     git clone https://github.com/danielegrattarola/spektral

  • PyPi (📥 7.5K / month · 📦 7 · ⏱️ 22.07.2022):

     pip install spektral
    
AmpliGraph (🥈26 · ⭐ 1.9K) - Python library for Representation Learning on Knowledge.. Apache-2
  • GitHub (👨‍💻 20 · 🔀 230 · 📦 37 · 📋 220 - 14% open · ⏱️ 08.03.2023):

     git clone https://github.com/Accenture/AmpliGraph

  • PyPi (📥 880 / month · ⏱️ 25.05.2021):

     pip install ampligraph
    
PyKEEN (🥈26 · ⭐ 1.2K) - A Python library for learning and evaluating knowledge graph embeddings. MIT
  • GitHub (👨‍💻 35 · 🔀 160 · 📥 160 · 📋 490 - 16% open · ⏱️ 19.05.2023):

     git clone https://github.com/pykeen/pykeen

  • PyPi (📥 2.4K / month):

     pip install pykeen
    
pytorch_geometric_temporal (🥈24 · ⭐ 2.1K) - PyTorch Geometric Temporal: Spatiotemporal Signal.. MIT
  • GitHub (👨‍💻 26 · 🔀 300 · 📋 150 - 20% open · ⏱️ 18.02.2023):

     git clone https://github.com/benedekrozemberczki/pytorch_geometric_temporal

  • PyPi (📥 2.1K / month · 📦 4 · ⏱️ 04.09.2022):

     pip install torch-geometric-temporal
    
Node2Vec (🥈23 · ⭐ 1.1K · 💤) - Implementation of the node2vec algorithm. MIT
  • GitHub (👨‍💻 11 · 🔀 230 · 📦 410 · 📋 86 - 2% open · ⏱️ 19.10.2022):

     git clone https://github.com/eliorc/node2vec

  • PyPi (📥 71K / month · 📦 18 · ⏱️ 01.08.2022):

     pip install node2vec

  • Conda (📥 25K · ⏱️ 16.02.2023):

     conda install -c conda-forge node2vec
    
graph4nlp (🥉22 · ⭐ 1.6K · 💤) - Graph4nlp is the library for the easy use of Graph.. Apache-2
  • GitHub (👨‍💻 28 · 🔀 190 · 📋 170 - 3% open · ⏱️ 13.11.2022):

     git clone https://github.com/graph4ai/graph4nlp

  • PyPi (📥 200 / month):

     pip install graph4nlp
    
jraph (🥉21 · ⭐ 1.2K · 💤) - A Graph Neural Network Library in Jax. Apache-2
  • GitHub (👨‍💻 17 · 🔀 73 · 📦 82 · 📋 35 - 25% open · ⏱️ 31.08.2022):

     git clone https://github.com/deepmind/jraph

  • PyPi (📥 4.3K / month · 📦 6 · ⏱️ 12.08.2022):

     pip install jraph

  • Conda (📥 2.5K · ⏱️ 31.03.2023):

     conda install -c conda-forge jraph
    
torch-cluster (🥉21 · ⭐ 650) - PyTorch Extension Library of Optimized Graph Cluster.. MIT
  • GitHub (👨‍💻 30 · 🔀 120 · 📋 130 - 17% open · ⏱️ 27.05.2023):

     git clone https://github.com/rusty1s/pytorch_cluster

  • PyPi (📥 12K / month):

     pip install torch-cluster

  • Conda (📥 67K · ⏱️ 01.05.2023):

     conda install -c conda-forge pytorch_cluster
    
deepsnap (🥉20 · ⭐ 480) - Python library assists deep learning on graphs. MIT
  • GitHub (👨‍💻 17 · 🔀 51 · 📥 9 · 📦 50 · 📋 42 - 42% open · ⏱️ 27.03.2023):

     git clone https://github.com/snap-stanford/deepsnap

  • PyPi (📥 670 / month · 📦 1 · ⏱️ 05.09.2021):

     pip install deepsnap
    
graph-nets (🥉18 · ⭐ 5.3K) - Build Graph Nets in Tensorflow. Apache-2
  • GitHub (👨‍💻 11 · 🔀 770 · 📋 130 - 4% open · ⏱️ 12.12.2022):

     git clone https://github.com/deepmind/graph_nets

  • PyPi (📥 3.1K / month):

     pip install graph-nets
    
GraphEmbedding (🥉16 · ⭐ 3.3K · 💤) - Implementation and experiments of graph embedding.. MIT
  • GitHub (👨‍💻 9 · 🔀 940 · 📦 27 · 📋 67 - 62% open · ⏱️ 21.06.2022):

     git clone https://github.com/shenweichen/GraphEmbedding
    
kglib (🥉16 · ⭐ 540 · 💤) - TypeDB-ML is the Machine Learning integrations library for TypeDB. Apache-2
  • GitHub (👨‍💻 11 · 🔀 98 · 📥 220 · 📋 62 - 17% open · ⏱️ 09.11.2022):

     git clone https://github.com/vaticle/kglib

  • PyPi (📥 77 / month · ⏱️ 19.08.2020):

     pip install grakn-kglib
    
OpenKE (🥉15 · ⭐ 3.5K · 💤) - An Open-Source Package for Knowledge Embedding (KE). MIT
  • GitHub (👨‍💻 11 · 🔀 950 · 📋 370 - 4% open · ⏱️ 03.11.2022):

     git clone https://github.com/thunlp/OpenKE
    
AutoGL (🥉15 · ⭐ 930) - An autoML framework & toolkit for machine learning on graphs. Apache-2
  • GitHub (👨‍💻 15 · 🔀 110 · 📋 30 - 30% open · ⏱️ 30.12.2022):

     git clone https://github.com/THUMNLab/AutoGL

  • PyPi (⏱️ 23.12.2020):

     pip install auto-graph-learning
    
OpenNE (🥉14 · ⭐ 1.6K · 💤) - An Open-Source Package for Network Embedding (NE). MIT
  • GitHub (👨‍💻 11 · 🔀 480 · 📋 100 - 4% open · ⏱️ 02.11.2022):

     git clone https://github.com/thunlp/OpenNE
    
Show 16 hidden projects...
  • networkx (🥇42 · ⭐ 13K) - Network Analysis in Python. ❗Unlicensed
  • igraph (🥇33 · ⭐ 1.1K) - Python interface for igraph. ❗️GPL-2.0
  • StellarGraph (🥈28 · ⭐ 2.7K · 💀) - StellarGraph - Machine Learning on Graphs. Apache-2
  • pygal (🥈27 · ⭐ 2.5K) - PYthon svg GrAph plotting Library. ❗️LGPL-3.0
  • Karate Club (🥈23 · ⭐ 1.9K) - Karate Club: An API Oriented Open-source Python Framework for.. ❗️GPL-3.0
  • DIG (🥈23 · ⭐ 1.5K) - A library for graph deep learning research. ❗️GPL-3.0
  • PyTorch-BigGraph (🥉21 · ⭐ 3.2K) - Generate embeddings from large-scale graph-structured.. ❗Unlicensed
  • DeepWalk (🥉21 · ⭐ 2.6K · 💀) - DeepWalk - Deep Learning for Graphs. ❗️GPL-3.0
  • pyRDF2Vec (🥉21 · ⭐ 200) - Python Implementation and Extension of RDF2Vec. MIT
  • Sematch (🥉17 · ⭐ 410 · 💀) - semantic similarity framework for knowledge graph. Apache-2
  • DeepGraph (🥉16 · ⭐ 270 · 💀) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
  • Euler (🥉15 · ⭐ 2.8K · 💀) - A distributed graph deep learning framework. Apache-2
  • GraphGym (🥉15 · ⭐ 1.4K) - Platform for designing and evaluating Graph Neural Networks.. ❗Unlicensed
  • GraphSAGE (🥉14 · ⭐ 3.1K · 💀) - Representation learning on large graphs using stochastic.. MIT
  • GraphVite (🥉14 · ⭐ 1.1K · 💀) - GraphVite: A General and High-performance Graph Embedding.. Apache-2
  • ptgnn (🥉14 · ⭐ 370 · 💀) - A PyTorch Graph Neural Network Library. MIT
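
The star, fork, and download counts in the entries are abbreviated (for example 4.9K or 1.2M), so they need converting before they can be compared numerically. The sketch below shows one way to do that in Python. The helper name `parse_count` and the suffix table are assumptions for illustration, and because the published figures are already rounded, the results are approximations.

```python
# Multipliers for the suffixes that appear in the entries (assumed: K and M only).
SUFFIXES = {"K": 1_000, "M": 1_000_000}


def parse_count(text: str) -> int:
    """Convert an abbreviated count such as '4.9K' or '1.2M' to an integer."""
    text = text.strip()
    if not text:
        raise ValueError("empty count")
    factor = SUFFIXES.get(text[-1].upper())
    if factor is None:
        return int(text)  # plain number such as '650'
    return round(float(text[:-1]) * factor)


# Star counts copied from a few Graph Data entries.
stars = {"dgl": "12K", "ogb": "1.7K", "deepsnap": "480"}
ranked = sorted(stars, key=lambda name: parse_count(stars[name]), reverse=True)
print(ranked)  # ['dgl', 'ogb', 'deepsnap']
```

Sorting on the parsed value avoids the pitfall of comparing the raw strings, where "480" would sort above "12K".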

Audio Data

Back to top

Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.

espnet (🥇37 · ⭐ 6.6K) - End-to-End Speech Processing Toolkit. Apache-2
  • GitHub (👨‍💻 350 · 🔀 1.9K · 📥 78 · 📦 190 · 📋 2.2K - 21% open · ⏱️ 31.05.2023):

     git clone https://github.com/espnet/espnet

  • PyPi (📥 22K / month · 📦 5 · ⏱️ 28.05.2022):

     pip install espnet
    
speechbrain (🥇36 · ⭐ 6K) - A PyTorch-based Speech Toolkit. Apache-2
  • GitHub (👨‍💻 200 · 🔀 1.1K · 📦 740 · 📋 950 - 17% open · ⏱️ 01.06.2023):

     git clone https://github.com/speechbrain/speechbrain

  • PyPi (📥 110K / month · 📦 11 · ⏱️ 29.08.2022):

     pip install speechbrain
    
torchaudio (🥇35 · ⭐ 2.1K) - Data manipulation and transformation for audio signal.. BSD-2
  • GitHub (👨‍💻 210 · 🔀 560 · 📋 860 - 27% open · ⏱️ 01.06.2023):

     git clone https://github.com/pytorch/audio

  • PyPi (📥 960K / month · 📦 330 · ⏱️ 15.12.2022):

     pip install torchaudio
    
Pydub (🥈34 · ⭐ 7.3K) - Manipulate audio with a simple and easy high level interface. MIT
  • GitHub (👨‍💻 95 · 🔀 940 · 📦 28K · 📋 560 - 53% open · ⏱️ 08.12.2022):

     git clone https://github.com/jiaaro/pydub

  • PyPi (📥 5.3M / month · 📦 1.1K · ⏱️ 10.03.2021):

     pip install pydub

  • Conda (📥 52K · ⏱️ 13.03.2021):

     conda install -c conda-forge pydub
    
SpeechRecognition (🥈34 · ⭐ 7.2K) - Speech recognition module for Python, supporting several.. BSD-3
  • GitHub (👨‍💻 49 · 🔀 2.3K · 📋 590 - 48% open · ⏱️ 13.03.2023):

     git clone https://github.com/Uberi/speech_recognition

  • PyPi (📥 470K / month · 📦 780 · ⏱️ 04.12.2022):

     pip install SpeechRecognition

  • Conda (📥 170K · ⏱️ 14.03.2023):

     conda install -c conda-forge speechrecognition
    
Coqui TTS (🥈33 · ⭐ 12K) - A deep learning toolkit for Text-to-Speech, battle-.. MPL-2.0
  • GitHub (👨‍💻 130 · 🔀 1.6K · 📥 890K · 📋 600 - 5% open · ⏱️ 22.05.2023):

     git clone https://github.com/coqui-ai/TTS

  • PyPi (📥 21K / month · 📦 14 · ⏱️ 11.01.2023):

     pip install tts

  • Conda (📥 8.2K · ⏱️ 15.12.2021):

     conda install -c conda-forge tts
    
librosa (🥈32 · ⭐ 6K · 📈) - Python library for audio and music analysis. ISC
  • GitHub (👨‍💻 110 · 🔀 890 · 📋 1.1K - 4% open · ⏱️ 16.05.2023):

     git clone https://github.com/librosa/librosa

  • PyPi (📥 2.5M / month · 📦 1.4K · ⏱️ 27.06.2022):

     pip install librosa

  • Conda (📥 640K · ⏱️ 17.03.2023):

     conda install -c conda-forge librosa
    
Magenta (🥈31 · ⭐ 19K) - Magenta: Music and Art Generation with Machine Intelligence. Apache-2
  • GitHub (👨‍💻 160 · 🔀 3.7K · 📦 440 · 📋 960 - 38% open · ⏱️ 01.05.2023):

     git clone https://github.com/magenta/magenta

  • PyPi (📥 5.8K / month · 📦 38 · ⏱️ 01.08.2022):

     pip install magenta
    
spleeter (🥈30 · ⭐ 23K) - Deezer source separation library including pretrained models. MIT
  • GitHub (👨‍💻 19 · 🔀 2.5K · 📥 2.4M · 📦 510 · 📋 750 - 26% open · ⏱️ 06.04.2023):

     git clone https://github.com/deezer/spleeter

  • PyPi (📥 17K / month · 📦 6 · ⏱️ 07.09.2022):

     pip install spleeter

  • Conda (📥 76K · ⏱️ 30.06.2020):

     conda install -c conda-forge spleeter
    
python-soundfile (🥈29 · ⭐ 550) - SoundFile is an audio library based on libsndfile, CFFI, and.. BSD-3
  • GitHub (👨‍💻 30 · 🔀 87 · 📥 15K · 📦 21K · 📋 210 - 39% open · ⏱️ 24.03.2023):

     git clone https://github.com/bastibe/python-soundfile

  • PyPi (📥 1.3M / month · 📦 150 · ⏱️ 27.09.2022):

     pip install soundfile

  • Conda:

     conda install -c anaconda pysoundfile
    
pyAudioAnalysis (🥈28 · ⭐ 5.3K · 💤) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
  • GitHub (👨‍💻 27 · 🔀 1.1K · 📦 350 · 📋 310 - 60% open · ⏱️ 18.09.2022):

     git clone https://github.com/tyiannak/pyAudioAnalysis

  • PyPi (📥 12K / month · 📦 19 · ⏱️ 07.02.2022):

     pip install pyAudioAnalysis
    
audiomentations (🥈28 · ⭐ 1.4K) - A Python library for audio data augmentation. Inspired by.. MIT
  • GitHub (👨‍💻 24 · 🔀 160 · 📦 270 · 📋 160 - 26% open · ⏱️ 26.05.2023):

     git clone https://github.com/iver56/audiomentations

  • PyPi (📥 7.8K / month · 📦 4 · ⏱️ 12.01.2023):

     pip install audiomentations
    
tinytag (🥈28 · ⭐ 600) - Read audio and music meta data and duration of MP3, OGG, OPUS, MP4, M4A,.. MIT
  • GitHub (👨‍💻 26 · 🔀 100 · 📦 780 · 📋 100 - 15% open · ⏱️ 21.04.2023):

     git clone https://github.com/devsnd/tinytag

  • PyPi (📥 19K / month · 📦 100 · ⏱️ 12.03.2022):

     pip install tinytag
    
audioread (🥈28 · ⭐ 440 · 💤) - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio.. MIT
  • GitHub (👨‍💻 23 · 🔀 100 · 📦 13K · 📋 87 - 40% open · ⏱️ 18.11.2022):

     git clone https://github.com/beetbox/audioread
    
  • PyPi (📥 1.6M / month · 📦 350 · ⏱️ 12.08.2022):

     pip install audioread
    
  • Conda (📥 650K · ⏱️ 29.10.2022):

     conda install -c conda-forge audioread
    
Porcupine (🥉27 · ⭐ 3K) - On-device wake word detection powered by deep learning. Apache-2
  • GitHub (👨‍💻 35 · 🔀 420 · 📦 19 · 📋 450 - 0% open · ⏱️ 31.05.2023):

     git clone https://github.com/Picovoice/Porcupine
    
  • PyPi (📥 3.2K / month · 📦 12 · ⏱️ 28.06.2022):

     pip install pvporcupine
    
DDSP (🥉26 · ⭐ 2.5K) - DDSP: Differentiable Digital Signal Processing. Apache-2
  • GitHub (👨‍💻 32 · 🔀 300 · 📦 40 · 📋 160 - 24% open · ⏱️ 25.05.2023):

     git clone https://github.com/magenta/ddsp
    
  • PyPi (📥 35K / month · 📦 1 · ⏱️ 04.10.2022):

     pip install ddsp
    
  • Conda (📥 14K · ⏱️ 08.06.2020):

     conda install -c conda-forge ddsp
    
kapre (🥉22 · ⭐ 880 · 💤) - kapre: Keras Audio Preprocessors. MIT
  • GitHub (👨‍💻 13 · 🔀 140 · 📥 23 · 📦 2.3K · 📋 96 - 14% open · ⏱️ 04.07.2022):

     git clone https://github.com/keunwoochoi/kapre
    
  • PyPi (📥 1.7K / month · 📦 14 · ⏱️ 21.01.2022):

     pip install kapre
    
nnAudio (🥉20 · ⭐ 840) - Audio processing by using pytorch 1D convolution network. MIT
  • GitHub (👨‍💻 14 · 🔀 82 · 📦 100 · 📋 59 - 28% open · ⏱️ 28.03.2023):

     git clone https://github.com/KinWaiCheuk/nnAudio
    
  • PyPi (📥 4.2K / month · 📦 3 · ⏱️ 09.10.2022):

     pip install nnAudio
    
Julius (🥉20 · ⭐ 340 · 💤) - Fast PyTorch based DSP for audio and 1D signals. MIT
  • GitHub (👨‍💻 2 · 🔀 23 · 📦 350 · 📋 10 - 10% open · ⏱️ 19.09.2022):

     git clone https://github.com/adefossez/julius
    
  • PyPi (📥 180K / month · 📦 6 · ⏱️ 20.09.2022):

     pip install julius
    
Show 10 hidden projects...
  • DeepSpeech (🥈34 · ⭐ 22K · 💀) - DeepSpeech is an open source embedded (offline, on-.. MPL-2.0
  • aubio (🥈28 · ⭐ 3K · 💀) - a library for audio and music analysis. ❗️GPL-3.0
  • Essentia (🥈28 · ⭐ 2.4K) - C++ library for audio and music analysis, description and.. ❗️AGPL-3.0
  • Madmom (🥉25 · ⭐ 1.1K · 💀) - Python audio and music signal processing library. BSD-3
  • python_speech_features (🥉24 · ⭐ 2.2K · 💀) - This library provides common speech features for ASR.. MIT
  • TTS (🥉22 · ⭐ 7.5K · 💀) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
  • TimeSide (🥉22 · ⭐ 350) - scalable audio processing framework and server written in Python. ❗️AGPL-3.0
  • Dejavu (🥉21 · ⭐ 6.1K · 💀) - Audio fingerprinting and recognition in Python. MIT
  • Muda (🥉17 · ⭐ 220 · 💀) - A library for augmenting annotated audio data. ISC
  • textlesslib (🥉9 · ⭐ 410 · 💀) - Library for Textless Spoken Language Processing. MIT

Geospatial Data

Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.

pydeck (🥇42 · ⭐ 11K) - WebGL2 powered visualization framework. MIT
  • GitHub (👨‍💻 230 · 🔀 2K · 📦 6K · 📋 2.7K - 8% open · ⏱️ 31.05.2023):

     git clone https://github.com/visgl/deck.gl
    
  • PyPi (📥 1.4M / month · 📦 42 · ⏱️ 04.11.2022):

     pip install pydeck
    
  • Conda (📥 380K · ⏱️ 04.11.2022):

     conda install -c conda-forge pydeck
    
  • npm (📥 410K / month · 📦 450 · ⏱️ 31.05.2023):

     npm install deck.gl
    
GeoPandas (🥇40 · ⭐ 3.7K) - Python tools for geographic data. BSD-3
  • GitHub (👨‍💻 200 · 🔀 820 · 📥 1.9K · 📦 23K · 📋 1.5K - 26% open · ⏱️ 27.05.2023):

     git clone https://github.com/geopandas/geopandas
    
  • PyPi (📥 4.8M / month · 📦 1.5K · ⏱️ 10.12.2022):

     pip install geopandas
    
  • Conda (📥 2.9M · ⏱️ 06.05.2023):

     conda install -c conda-forge geopandas
    
folium (🥇38 · ⭐ 6.3K) - Python Data. Leaflet.js Maps. MIT
  • GitHub (👨‍💻 150 · 🔀 2.2K · 📦 26K · 📋 1K - 5% open · ⏱️ 24.05.2023):

     git clone https://github.com/python-visualization/folium
    
  • PyPi (📥 730K / month · 📦 770 · ⏱️ 12.12.2022):

     pip install folium
    
  • Conda (📥 2M · ⏱️ 13.12.2022):

     conda install -c conda-forge folium
    
Rasterio (🥇38 · ⭐ 2K) - Rasterio reads and writes geospatial raster datasets. BSD-3
  • GitHub (👨‍💻 150 · 🔀 520 · 📥 800 · 📦 7.8K · 📋 1.7K - 7% open · ⏱️ 31.05.2023):

     git clone https://github.com/rasterio/rasterio
    
  • PyPi (📥 1.5M / month · 📦 950 · ⏱️ 13.02.2023):

     pip install rasterio
    
  • Conda (📥 2.3M · ⏱️ 01.06.2023):

     conda install -c conda-forge rasterio
    
Shapely (🥈36 · ⭐ 3.3K) - Manipulation and analysis of geometric objects. BSD-3
  • GitHub (👨‍💻 140 · 🔀 520 · 📥 1.5K · 📦 46K · 📋 1.1K - 19% open · ⏱️ 20.05.2023):

     git clone https://github.com/shapely/shapely
    
  • PyPi (📥 12M / month · 📦 770 · ⏱️ 30.01.2023):

     pip install shapely
    
  • Conda (📥 7.7M · ⏱️ 17.03.2023):

     conda install -c conda-forge shapely
    
pyproj (🥈36 · ⭐ 890) - Python interface to PROJ (cartographic projections and coordinate.. MIT
  • GitHub (👨‍💻 61 · 🔀 190 · 📦 21K · 📋 570 - 4% open · ⏱️ 31.05.2023):

     git clone https://github.com/pyproj4/pyproj
    
  • PyPi (📥 6.7M / month · 📦 1.9K · ⏱️ 13.12.2022):

     pip install pyproj
    
  • Conda (📥 6M · ⏱️ 29.03.2023):

     conda install -c conda-forge pyproj
    
Fiona (🥈35 · ⭐ 1K) - Fiona reads and writes geographic data files. BSD-3
  • GitHub (👨‍💻 68 · 🔀 200 · 📦 13K · 📋 750 - 4% open · ⏱️ 23.05.2023):

     git clone https://github.com/Toblerity/Fiona
    
  • PyPi (📥 5M / month · 📦 860 · ⏱️ 10.02.2023):

     pip install fiona
    
  • Conda (📥 4.5M · ⏱️ 17.05.2023):

     conda install -c conda-forge fiona
    
geopy (🥈33 · ⭐ 4K · 💤) - Geocoding library for Python. MIT
  • GitHub (👨‍💻 130 · 🔀 610 · 📋 280 - 12% open · ⏱️ 13.11.2022):

     git clone https://github.com/geopy/geopy
    
  • PyPi (📥 4.1M / month · 📦 4K · ⏱️ 13.11.2022):

     pip install geopy
    
  • Conda (📥 1.1M · ⏱️ 13.11.2022):

     conda install -c conda-forge geopy
    
ArcGIS API (🥈32 · ⭐ 1.6K) - Documentation and samples for ArcGIS API for Python. Apache-2
  • GitHub (👨‍💻 88 · 🔀 1K · 📥 8.3K · 📋 600 - 9% open · ⏱️ 01.06.2023):

     git clone https://github.com/Esri/arcgis-python-api
    
  • PyPi (📥 91K / month · 📦 31 · ⏱️ 27.01.2023):

     pip install arcgis
    
  • Docker Hub (📥 10K · ⭐ 40 · ⏱️ 17.06.2022):

     docker pull esridocker/arcgis-api-python-notebook
    
ipyleaflet (🥉31 · ⭐ 1.4K) - A Jupyter - Leaflet.js bridge. MIT
  • GitHub (👨‍💻 82 · 🔀 350 · 📦 4.5K · 📋 570 - 40% open · ⏱️ 10.02.2023):

     git clone https://github.com/jupyter-widgets/ipyleaflet
    
  • PyPi (📥 150K / month · 📦 150 · ⏱️ 19.10.2022):

     pip install ipyleaflet
    
  • Conda (📥 1M · ⏱️ 19.10.2022):

     conda install -c conda-forge ipyleaflet
    
  • npm (📥 40K / month · 📦 5 · ⏱️ 19.10.2022):

     npm install jupyter-leaflet
    
geojson (🥉31 · ⭐ 810) - Python bindings and utilities for GeoJSON. BSD-3
  • GitHub (👨‍💻 53 · 🔀 120 · 📦 13K · 📋 96 - 25% open · ⏱️ 28.05.2023):

     git clone https://github.com/jazzband/geojson
    
  • PyPi (📥 1.1M / month · 📦 1.2K · ⏱️ 26.01.2023):

     pip install geojson
    
  • Conda (📥 710K · ⏱️ 16.02.2023):

     conda install -c conda-forge geojson
    
GeoViews (🥉28 · ⭐ 480) - Simple, concise geographical visualization in Python. BSD-3
  • GitHub (👨‍💻 28 · 🔀 70 · 📦 680 · 📋 320 - 34% open · ⏱️ 25.05.2023):

     git clone https://github.com/holoviz/geoviews
    
  • PyPi (📥 6.3K / month · 📦 33 · ⏱️ 17.01.2023):

     pip install geoviews
    
  • Conda (📥 170K · ⏱️ 25.05.2023):

     conda install -c conda-forge geoviews
    
PySAL (🥉25 · ⭐ 1.1K) - PySAL: Python Spatial Analysis Library Meta-Package. BSD-3
  • GitHub (👨‍💻 78 · 🔀 280 · 📋 620 - 2% open · ⏱️ 15.03.2023):

     git clone https://github.com/pysal/pysal
    
  • PyPi (📥 15K / month · 📦 36 · ⏱️ 31.01.2023):

     pip install pysal
    
  • Conda (📥 490K · ⏱️ 31.01.2023):

     conda install -c conda-forge pysal
    
pymap3d (🥉24 · ⭐ 310) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
  • GitHub (👨‍💻 15 · 🔀 78 · 📦 270 · 📋 47 - 6% open · ⏱️ 05.03.2023):

     git clone https://github.com/geospace-code/pymap3d
    
  • PyPi (📥 60K / month · 📦 20 · ⏱️ 03.07.2022):

     pip install pymap3d
    
  • Conda (📥 52K · ⏱️ 18.04.2023):

     conda install -c conda-forge pymap3d
    
Show 8 hidden projects...
  • Geocoder (🥈32 · ⭐ 1.5K · 💀) - Python Geocoder. MIT
  • Satpy (🥉31 · ⭐ 920) - Python package for earth-observing satellite data processing. ❗️GPL-3.0
  • Sentinelsat (🥉29 · ⭐ 880) - Search and download Copernicus Sentinel satellite images. ❗️GPL-3.0
  • EarthPy (🥉26 · ⭐ 440 · 💀) - A package built to support working with spatial data using open.. BSD-3
  • prettymaps (🥉24 · ⭐ 9.8K) - A small set of Python functions to draw pretty maps from.. ❗️AGPL-3.0
  • gmaps (🥉23 · ⭐ 760 · 💀) - Google maps for Jupyter notebooks. BSD-3
  • Mapbox GL (🥉23 · ⭐ 640 · 💀) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
  • geoplotlib (🥉21 · ⭐ 990 · 💀) - python toolbox for visualizing geographical data and making maps. MIT

Financial Data

Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.

yfinance (🥇38 · ⭐ 9.6K) - Download market data from Yahoo! Finance's API. Apache-2
  • GitHub (👨‍💻 80 · 🔀 1.8K · 📦 23K · 📋 1.1K - 15% open · ⏱️ 23.05.2023):

     git clone https://github.com/ranaroussi/yfinance
    
  • PyPi (📥 640K / month):

     pip install yfinance
    
  • Conda (📥 77K · ⏱️ 10.07.2021):

     conda install -c ranaroussi yfinance
    
ta (🥈30 · ⭐ 3.6K · 💤) - Technical Analysis Library using Pandas and Numpy. MIT
  • GitHub (👨‍💻 29 · 🔀 780 · 📦 2.2K · 📋 230 - 56% open · ⏱️ 23.08.2022):