Skip to content

LukasMasuch/best-of-ml-tools

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

15 Commits
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

Contents

Explanation

  • πŸ₯‡πŸ₯ˆπŸ₯‰ Combined project-quality score
  • ⭐️ Star count from Github
  • 🐣 New project (less than 6 month old)
  • πŸ’€ Inactive project (6 month no activity)
  • πŸ’€ Dead project (12 month no activity)
  • ❗️ Warning (e.g. missing/risky license)
  • πŸ‘¨β€πŸ’» Contributors count from Github
  • πŸ”€ Fork count from Github
  • πŸ“‹ Issue count from Github
  • ⏱️ Last update timestamp on package manager
  • πŸ“₯ Download count from package manager
  • πŸ“¦ Number of dependent projects

IDEs & Notebook Editors

Back to top

Development environments and notebook editors suitable for machine learning & data science projects.

Visual Studio Code (πŸ₯‡37 Β· ⭐ 91K) - Visual Studio Code. MIT
  • GitHub (πŸ‘¨β€πŸ’» 1.3K Β· πŸ”€ 14K Β· πŸ“¦ 540 Β· πŸ“‹ 82K - 5% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/microsoft/vscode
    
  • NPM (πŸ“₯ 25K / month Β· πŸ“¦ 470 Β· ⏱️ 14.01.2020):

     npm install monaco-editor-core
    
JupyterLab (πŸ₯‡35 Β· ⭐ 9.3K) - JupyterLab computational environment. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 310 Β· πŸ”€ 1.5K Β· πŸ“¦ 13K Β· πŸ“‹ 4.4K - 30% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/jupyterlab/jupyterlab
    
  • PyPi (πŸ“₯ 320K / month Β· πŸ“¦ 2.4K Β· ⏱️ 24.01.2020):

     pip install jupyterlab
    
Jupyter (πŸ₯‡35 Β· ⭐ 7.4K) - Jupyter Interactive Notebook. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 470 Β· πŸ”€ 2.7K Β· πŸ“¦ 47K Β· πŸ“‹ 3.5K - 47% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/jupyter/notebook
    
  • PyPi (πŸ“₯ 2.6M / month Β· πŸ“¦ 13K Β· ⏱️ 12.08.2015):

     pip install jupyter
    
  • Dockerhub (πŸ“₯ 7.2M Β· ⭐ 590 Β· ⏱️ 02.12.2019):

     docker pull jupyter/datascience-notebook
    
Spyder (πŸ₯ˆ32 Β· ⭐ 5.1K) - Official repository for Spyder - The Scientific Python Development Environment. MIT
  • GitHub (πŸ‘¨β€πŸ’» 190 Β· πŸ”€ 1K Β· πŸ“¦ 9K Β· πŸ“‹ 9.3K - 8% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/spyder-ide/spyder
    
  • PyPi (πŸ“₯ 38K / month Β· πŸ“¦ 1.8K Β· ⏱️ 02.01.2020):

     pip install spyder
    
  • Conda (⏱️ 07.01.2020):

     conda install -c anaconda spyder
    
Eclipse Che (πŸ₯ˆ31 Β· ⭐ 6K) - Eclipse Che: Next-generation Eclipse IDE. Open source workspace server and.. EPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 170 Β· πŸ”€ 1.1K Β· πŸ“¦ 72 Β· πŸ“‹ 9.4K - 10% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/eclipse/che
    
  • Dockerhub (πŸ“₯ 4M Β· ⭐ 140 Β· ⏱️ 14.07.2019):

     docker pull eclipse/che
    
  • Maven (πŸ“¦ 3 Β· ⏱️ 05.07.2019):

     <dependency>
     	<groupId>org.eclipse.che</groupId>
     	<artifactId>bootstrapper</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Atom (πŸ₯ˆ30 Β· ⭐ 51K) - The hackable text editor. MIT
  • GitHub (πŸ‘¨β€πŸ’» 540 Β· πŸ”€ 14K Β· πŸ“₯ 5.1M Β· πŸ“‹ 16K - 3% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/atom/atom
    
Theia (πŸ₯ˆ29 Β· ⭐ 6.8K) - Eclipse Theia is a cloud & desktop IDE framework implemented in TypeScript. EPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 140 Β· πŸ”€ 870 Β· πŸ“¦ 200 Β· πŸ“‹ 4K - 29% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/eclipse-theia/theia
    
  • NPM (πŸ“₯ 58K / month Β· πŸ“¦ 110 Β· ⏱️ 19.01.2020):

     npm install @theia/application-manager
    
  • Dockerhub (πŸ“₯ 1.2M Β· ⭐ 37 Β· ⏱️ 24.01.2020):

     docker pull theiaide/theia
    
code-server (πŸ₯ˆ26 Β· ⭐ 27K) - Run VS Code on a remote server. MIT
  • GitHub (πŸ‘¨β€πŸ’» 69 Β· πŸ”€ 1.8K Β· πŸ“₯ 190K Β· πŸ“‹ 1K - 19% open Β· ⏱️ 17.01.2020):

     git clone https://github.com/cdr/code-server
    
  • Dockerhub (πŸ“₯ 8.1M Β· ⭐ 140 Β· ⏱️ 17.01.2020):

     docker pull codercom/code-server
    
nteract (πŸ₯ˆ26 Β· ⭐ 4.7K) - The interactive computing suite for you!. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 140 Β· πŸ”€ 470 Β· πŸ“₯ 740K Β· πŸ“‹ 1.4K - 11% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/nteract/nteract
    
  • PyPi (πŸ“₯ 2.5K / month Β· πŸ“¦ 5 Β· ⏱️ 16.07.2019):

     pip install nteract_on_jupyter
    
Jupyter Docker Stacks (πŸ₯‰24 Β· ⭐ 4.9K) - Ready-to-run Docker images containing Jupyter applications. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 160 Β· πŸ”€ 1.8K Β· πŸ“‹ 500 - 11% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/jupyter/docker-stacks
    
  • Dockerhub (πŸ“₯ 3.6M Β· ⭐ 210 Β· ⏱️ 02.12.2019):

     docker pull jupyter/scipy-notebook
    
DIGITS (πŸ₯‰24 Β· ⭐ 4K) - Deep Learning GPU Training System. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 48 Β· πŸ”€ 1.4K Β· πŸ“‹ 1.5K - 40% open Β· ⏱️ 05.11.2019):

     git clone https://github.com/NVIDIA/DIGITS
    
  • Dockerhub (πŸ“₯ 740K Β· ⭐ 65 Β· ⏱️ 01.05.2018):

     docker pull nvidia/digits
    
Zeppelin (πŸ₯‰23 Β· ⭐ 4.7K) - Web-based notebook that enables interactive data analytics. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 390 Β· πŸ”€ 2.2K Β· ⏱️ 21.01.2020):

     git clone https://github.com/apache/zeppelin
    
  • Dockerhub (πŸ“₯ 950K Β· ⭐ 110 Β· ⏱️ 07.10.2019):

     docker pull apache/zeppelin
    
  • Maven:

     <dependency>
     	<groupId>org.apache.zeppelin</groupId>
     	<artifactId>zeppelin-server</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
polynote (πŸ₯‰23 Β· ⭐ 3.5K) - A better notebook for Scala (and more). Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 22 Β· πŸ”€ 280 Β· πŸ“₯ 15K Β· πŸ“‹ 350 - 36% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/polynote/polynote
    
  • Dockerhub (πŸ“₯ 1.7K Β· ⭐ 3 Β· ⏱️ 14.01.2020):

     docker pull polynote/polynote
    
Pyodide (πŸ₯‰22 Β· ⭐ 3.4K) - The Python scientific stack, compiled to WebAssembly. MPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 33 Β· πŸ”€ 210 Β· πŸ“₯ 8.3K Β· πŸ“‹ 310 - 50% open Β· ⏱️ 02.01.2020):

     git clone https://github.com/iodide-project/pyodide
    
  • Dockerhub (πŸ“₯ 32K Β· ⭐ 3 Β· ⏱️ 05.11.2018):

     docker pull iodide/pyodide-env
    
Hydrogen (πŸ₯‰20 Β· ⭐ 3.5K) - Run code interactively, inspect data, and plot. All the power of Jupyter kernels,.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 74 Β· πŸ”€ 260 Β· πŸ“‹ 1.1K - 8% open Β· ⏱️ 14.12.2019):

     git clone https://github.com/nteract/hydrogen
    
Deepo (πŸ₯‰19 Β· ⭐ 5.4K) - Set up deep learning environment in a single command line. MIT
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 640 Β· πŸ“‹ 100 - 11% open Β· ⏱️ 06.01.2020):

     git clone https://github.com/ufoym/deepo
    
  • Dockerhub (πŸ“₯ 150K Β· ⭐ 150 Β· ⏱️ 06.01.2020):

     docker pull ufoym/deepo
    
Spark Notebook (πŸ₯‰18 Β· ⭐ 2.9K Β· πŸ’€) - Interactive and Reactive Data Science using Scala and Spark. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 83 Β· πŸ”€ 600 Β· πŸ“‹ 510 - 40% open Β· ⏱️ 11.03.2019):

     git clone https://github.com/spark-notebook/spark-notebook
    
DataLab (πŸ₯‰18 Β· ⭐ 900 Β· πŸ’€) - Interactive tools and developer experiences for Big Data on Google Cloud.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 50 Β· πŸ”€ 240 Β· πŸ“‹ 870 - 24% open Β· ⏱️ 05.06.2019):

     git clone https://github.com/googledatalab/datalab
    
ML Workspace (πŸ₯‰16 Β· ⭐ 890) - All-in-one web-based IDE specialized for machine learning and data science. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 92 Β· πŸ“‹ 20 - 55% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/ml-tooling/ml-workspace
    
  • Dockerhub (πŸ“₯ 48K Β· ⭐ 7 Β· ⏱️ 04.10.2019):

     docker pull mltooling/ml-workspace
    
h2o-flow (πŸ₯‰16 Β· ⭐ 100) - Web based interactive computing environment for H2O. MIT
  • GitHub (πŸ‘¨β€πŸ’» 36 Β· πŸ”€ 56 Β· ⏱️ 18.11.2019):

     git clone https://github.com/h2oai/h2o-flow
    
Judge0 IDE (πŸ₯‰13 Β· ⭐ 140) - Free and open-source online code editor that allows you to write and execute code.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 2 Β· πŸ”€ 28 Β· πŸ“‹ 32 - 18% open Β· ⏱️ 05.01.2020):

     git clone https://github.com/judge0/ide
    
Show 1 hidden projects...
RStudio (πŸ₯‰17 Β· ⭐ 3K) - RStudio is an integrated development environment (IDE) for R. ❗️AGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 93 Β· πŸ”€ 740 Β· πŸ“‹ 3.4K - 39% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/rstudio/rstudio
    

Machine Learning Platforms

Back to top

Platforms that enable large-scale and distributed machine learning.

H2O (πŸ₯‡30 Β· ⭐ 4.6K) - Open Source Fast Scalable Machine Learning Platform For Smarter Applications: Deep.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 190 Β· πŸ”€ 1.7K Β· πŸ“¦ 250 Β· ⏱️ 24.01.2020):

     git clone https://github.com/h2oai/h2o-3
    
  • PyPi (πŸ“₯ 96K / month Β· πŸ“¦ 59 Β· ⏱️ 20.01.2020):

     pip install h2o
    
Kubeflow (πŸ₯‡28 Β· ⭐ 8.3K) - Machine Learning Toolkit for Kubernetes. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 190 Β· πŸ”€ 1.3K Β· πŸ“₯ 57K Β· πŸ“‹ 2.4K - 11% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/kubeflow/kubeflow
    
  • PyPi (πŸ“₯ 17K / month Β· πŸ“¦ 4 Β· ⏱️ 20.01.2020):

     pip install kfp
    
PredictionIO (πŸ₯ˆ27 Β· ⭐ 12K) - PredictionIO, a machine learning server for developers and ML engineers. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 160 Β· πŸ”€ 2K Β· πŸ“₯ 3.8K Β· ⏱️ 12.12.2019):

     git clone https://github.com/apache/predictionio
    
  • PyPi (πŸ“₯ 1.4K / month Β· πŸ“¦ 15 Β· ⏱️ 24.10.2017):

     pip install predictionio
    
  • Dockerhub (πŸ“₯ 1.5K Β· ⭐ 3 Β· ⏱️ 19.11.2018):

     docker pull predictionio/pio
    
Pachyderm (πŸ₯ˆ26 Β· ⭐ 4.2K) - Reproducible Data Science at Scale!. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 110 Β· πŸ”€ 400 Β· πŸ“₯ 36K Β· πŸ“‹ 2.3K - 22% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/pachyderm/pachyderm
    
  • PyPi (πŸ“₯ 1K / month Β· πŸ“¦ 3 Β· ⏱️ 22.01.2020):

     pip install python-pachyderm
    
  • Dockerhub (πŸ“₯ 1.6M Β· ⭐ 2 Β· ⏱️ 23.01.2020):

     docker pull pachyderm/pachd
    
Polyaxon (πŸ₯ˆ25 Β· ⭐ 2.3K) - A platform for reproducible and scalable machine learning and deep learning on.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 74 Β· πŸ”€ 210 Β· πŸ“‹ 500 - 26% open Β· ⏱️ 16.01.2020):

     git clone https://github.com/polyaxon/polyaxon
    
  • PyPi (πŸ“₯ 2K / month Β· πŸ“¦ 1 Β· ⏱️ 13.01.2020):

     pip install polyaxon-cli
    
  • Dockerhub (πŸ“₯ 1.5M Β· ⏱️ 13.01.2020):

     docker pull polyaxon/polyaxon-api
    
Mahout (πŸ₯‰22 Β· ⭐ 1.8K) - Powerful, scalable machine-learning library that runs on top of Hadoop MapReduce. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 58 Β· πŸ”€ 910 Β· πŸ“¦ 55 Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/mahout
    
  • Maven (πŸ“¦ 7 Β· ⏱️ 15.04.2017):

     <dependency>
     	<groupId>org.apache.mahout</groupId>
     	<artifactId>mahout-math-scala_2.10</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
SystemML (πŸ₯‰22 Β· ⭐ 800) - A machine learning platform optimal for big data running on Apache Spark,. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 86 Β· πŸ”€ 290 Β· πŸ“¦ 3 Β· ⏱️ 16.01.2020):

     git clone https://github.com/apache/systemml
    
  • PyPi (πŸ“₯ 3.4K / month Β· ⏱️ 27.08.2018):

     pip install systemml
    
  • Maven:

     <dependency>
     	<groupId>org.apache.systemml</groupId>
     	<artifactId>systemml</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Oryx 2 (πŸ₯‰21 Β· ⭐ 1.7K) - Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 390 Β· πŸ“₯ 14K Β· πŸ“‹ 200 - 1% open Β· ⏱️ 24.11.2019):

     git clone https://github.com/OryxProject/oryx
    
  • Maven (πŸ“¦ 32 Β· ⏱️ 06.10.2018):

     <dependency>
     	<groupId>com.cloudera.oryx</groupId>
     	<artifactId>oryx-api</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Flyte (πŸ₯‰19 Β· ⭐ 640 Β· 🐣) - develop, execute, and monitor distributed workflows reliably at scale. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 15 Β· πŸ”€ 43 Β· πŸ“‹ 130 - 87% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/lyft/flyte
    
  • PyPi (πŸ“₯ 33K / month Β· πŸ“¦ 4 Β· ⏱️ 30.12.2019):

     pip install flytekit
    
  • Dockerhub (πŸ“₯ 7.2K Β· ⏱️ 09.12.2019):

     docker pull lyft/flyteadmin
    
Singa (πŸ₯‰17 Β· ⭐ 1.9K) - Flexible architecture for scalable distributed training, it is extensible to run.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 62 Β· πŸ”€ 480 Β· ⏱️ 18.01.2020):

     git clone https://github.com/apache/singa
    
  • Conda:

     conda install -c conda-forge singa-cpu
    
  • Dockerhub (πŸ“₯ 120 Β· ⭐ 2 Β· ⏱️ 04.06.2019):

     docker pull apache/singa
    
FfDL (πŸ₯‰17 Β· ⭐ 580 Β· πŸ’€) - Fabric for Deep Learning (FfDL, pronounced fiddle) is a Deep Learning Platform.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 15 Β· πŸ”€ 170 Β· πŸ“‹ 72 - 44% open Β· ⏱️ 27.04.2019):

     git clone https://github.com/IBM/FfDL
    
  • Dockerhub (πŸ“₯ 43K Β· ⏱️ 08.08.2019):

     docker pull ffdlops/ffdl
    
ML Hub (πŸ₯‰17 Β· ⭐ 55) - Multi-user development platform for machine learning teams. Simple to setup within.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 4 Β· πŸ”€ 15 Β· πŸ“₯ 95 Β· πŸ“‹ 3 - 33% open Β· ⏱️ 02.12.2019):

     git clone https://github.com/ml-tooling/ml-hub
    
  • Dockerhub (πŸ“₯ 20K Β· ⭐ 3 Β· ⏱️ 17.01.2020):

     docker pull mltooling/ml-hub
    
PennAI (πŸ₯‰14 Β· ⭐ 110) - the Penn AI engine. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 38 Β· πŸ”€ 29 Β· πŸ“₯ 250 Β· πŸ“‹ 190 - 30% open Β· ⏱️ 17.01.2020):

     git clone https://github.com/EpistasisLab/pennai
    
  • Dockerhub (πŸ“₯ 51 Β· ⏱️ 07.08.2019):

     docker pull moorelab/pennai_lab
    

Business Intelligence

Back to top

GUI-based business intelligence tools combining SQL query engines, data analytics, visualization, and dashboarding features.

Metabase (πŸ₯‡29 Β· ⭐ 19K) - The simplest, fastest way to get business intelligence and analytics to.. ❗️AGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 230 Β· πŸ”€ 2.5K Β· πŸ“₯ 1K Β· πŸ“¦ 1 Β· πŸ“‹ 6.8K - 31% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/metabase/metabase
    
  • Dockerhub (πŸ“₯ 33M Β· ⭐ 160 Β· ⏱️ 14.01.2020):

     docker pull metabase/metabase
    
Redash (πŸ₯ˆ28 Β· ⭐ 15K) - Make Your Company Data Driven. Connect to any data source, easily visualize,.. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 340 Β· πŸ”€ 2.5K Β· πŸ“₯ 14K Β· πŸ“‹ 1.9K - 25% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/getredash/redash
    
  • Dockerhub (πŸ“₯ 11M Β· ⭐ 140 Β· ⏱️ 23.01.2020):

     docker pull redash/redash
    
Superset (πŸ₯ˆ27 Β· ⭐ 28K) - Apache Superset (incubating) is a modern, enterprise-ready business.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 460 Β· πŸ”€ 5.5K Β· πŸ“¦ 2 Β· πŸ“‹ 4.2K - 6% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/incubator-superset
    
  • PyPi (πŸ“₯ 2.8K / month Β· ⏱️ 12.10.2019):

     pip install apache-superset
    
  • Dockerhub (πŸ“₯ 1.3M Β· ⭐ 220 Β· ⏱️ 23.01.2020):

     docker pull amancevice/superset
    
Hue (πŸ₯ˆ24 Β· ⭐ 4K) - Open source SQL Query Assistant for Databases/Warehouses. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 230 Β· πŸ”€ 1.5K Β· πŸ“¦ 3 Β· πŸ“‹ 730 - 28% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/cloudera/hue
    
  • Dockerhub (πŸ“₯ 560K Β· ⭐ 58 Β· ⏱️ 24.01.2020):

     docker pull gethue/hue
    
Azure Data Studio (πŸ₯‰22 Β· ⭐ 5.5K) - Azure Data Studio is a data management tool that enables working.. ❗️Custom EULA
  • GitHub (πŸ‘¨β€πŸ’» 97 Β· πŸ”€ 460 Β· πŸ“₯ 170K Β· πŸ“‹ 5.5K - 32% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/microsoft/azuredatastudio
    
Blazer (πŸ₯‰20 Β· ⭐ 2.4K) - Explore your data with SQL. Easily create charts and dashboards, and share them with.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 42 Β· πŸ”€ 290 Β· πŸ“¦ 240 Β· πŸ“‹ 160 - 6% open Β· ⏱️ 14.11.2019):

     git clone https://github.com/ankane/blazer
    
Poli (πŸ₯‰19 Β· ⭐ 1.6K) - An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 2 Β· πŸ”€ 220 Β· πŸ“₯ 3.6K Β· πŸ“‹ 48 - 25% open Β· ⏱️ 11.01.2020):

     git clone https://github.com/shzlw/poli
    
  • Dockerhub (πŸ“₯ 570 Β· ⏱️ 11.12.2019):

     docker pull zhonglu/poli
    
CBoard (πŸ₯‰17 Β· ⭐ 2.3K) - An easy to use, self-service open BI reporting and BI dashboard platform. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 30 Β· πŸ”€ 970 Β· πŸ“‹ 560 - 13% open Β· ⏱️ 22.09.2019):

     git clone https://github.com/TuiQiao/CBoard
    
  • Dockerhub (πŸ“₯ 3.5K Β· ⭐ 4 Β· ⏱️ 18.10.2017):

     docker pull peterzhang921/cboard
    
Meltano (πŸ₯‰15 Β· ⭐ 320) - Convention-over-configuration product for the whole data lifecycle, all the way from.. MIT
  • PyPi (πŸ“₯ 1.8K / month Β· ⏱️ 23.01.2020):

     pip install meltano
    
  • Dockerhub (πŸ“₯ 320K Β· ⭐ 4 Β· ⏱️ 23.01.2020):

     docker pull meltano/meltano
    

Job Scheduler & Pipelines

Back to top

Platforms and tools to schedule, orchestrate, and monitor jobs for workflow automation and data pipeline tasks.

Airflow (πŸ₯‡35 Β· ⭐ 15K) - Platform to programmatically author, schedule, and monitor workflows. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 1.3K Β· πŸ”€ 5.8K Β· πŸ“₯ 400 Β· πŸ“¦ 720 Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/airflow
    
  • PyPi (πŸ“₯ 470K / month Β· πŸ“¦ 290 Β· ⏱️ 24.12.2019):

     pip install apache-airflow
    
  • Conda:

     conda install -c conda-forge airflow
    
  • Dockerhub (πŸ“₯ 330K Β· ⭐ 80 Β· ⏱️ 24.01.2020):

     docker pull apache/airflow
    
luigi (πŸ₯‡33 Β· ⭐ 13K) - Luigi is a Python module that helps you build complex pipelines of batch jobs. It.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 530 Β· πŸ”€ 2.1K Β· πŸ“¦ 970 Β· πŸ“‹ 830 - 6% open Β· ⏱️ 16.01.2020):

     git clone https://github.com/spotify/luigi
    
  • PyPi (πŸ“₯ 170K / month Β· πŸ“¦ 680 Β· ⏱️ 02.01.2020):

     pip install luigi
    
  • Conda (⏱️ 17.12.2019):

     conda install -c anaconda luigi
    
argo (πŸ₯ˆ25 Β· ⭐ 4.5K) - Argo Workflows: Get stuff done with Kubernetes. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 160 Β· πŸ”€ 700 Β· πŸ“₯ 270K Β· πŸ“‹ 1.2K - 28% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/argoproj/argo
    
  • Dockerhub (πŸ“₯ 1.5M Β· ⭐ 1 Β· ⏱️ 16.12.2019):

     docker pull argoproj/argoui
    
Kubeflow Pipelines (πŸ₯ˆ25 Β· ⭐ 1.4K) - Machine Learning Pipelines for Kubeflow. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 110 Β· πŸ”€ 410 Β· πŸ“‹ 1.1K - 33% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/kubeflow/pipelines
    
  • PyPi (πŸ“₯ 17K / month Β· πŸ“¦ 4 Β· ⏱️ 20.01.2020):

     pip install kfp
    
Genie (πŸ₯ˆ25 Β· ⭐ 1.3K) - Distributed Big Data Orchestration Service. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 30 Β· πŸ”€ 290 Β· πŸ“‹ 160 - 0% open Β· ⏱️ 15.01.2020):

     git clone https://github.com/Netflix/genie
    
  • PyPi (πŸ“₯ 52K / month Β· πŸ“¦ 5 Β· ⏱️ 09.12.2019):

     pip install nflx-genie-client
    
  • Dockerhub (πŸ“₯ 7.1K Β· ⭐ 3 Β· ⏱️ 15.01.2020):

     docker pull netflixoss/genie-app
    
  • Maven (⏱️ 25.10.2018):

     <dependency>
     	<groupId>com.netflix.genie</groupId>
     	<artifactId>genie-common</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
n8n.io (πŸ₯ˆ24 Β· ⭐ 6K) - Free and open node based Workflow Automation Tool. Easily automate tasks across.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 35 Β· πŸ”€ 340 Β· πŸ“¦ 1 Β· πŸ“‹ 110 - 39% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/n8n-io/n8n
    
  • NPM (πŸ“₯ 3.1K / month Β· ⏱️ 19.01.2020):

     npm install n8n
    
  • Dockerhub (πŸ“₯ 330K Β· ⭐ 11 Β· ⏱️ 19.01.2020):

     docker pull n8nio/n8n
    
Cadence (πŸ₯ˆ24 Β· ⭐ 3.3K) - Cadence is a distributed, scalable, durable, and highly available orchestration.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 57 Β· πŸ”€ 270 Β· πŸ“₯ 7.5K Β· πŸ“‹ 1.1K - 28% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/uber/cadence
    
  • Dockerhub (πŸ“₯ 670K Β· ⭐ 2 Β· ⏱️ 24.01.2020):

     docker pull ubercadence/server
    
  • Maven (πŸ“¦ 2 Β· ⏱️ 06.08.2018):

     <dependency>
     	<groupId>com.uber.cadence</groupId>
     	<artifactId>cadence-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
dkron (πŸ₯‰23 Β· ⭐ 2.1K) - Dkron - Distributed, fault tolerant job scheduling system https://dkron.io. ❗️LGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 43 Β· πŸ”€ 200 Β· πŸ“₯ 260K Β· πŸ“‹ 330 - 7% open Β· ⏱️ 04.01.2020):

     git clone https://github.com/distribworks/dkron
    
  • Dockerhub (πŸ“₯ 480K Β· ⭐ 11 Β· ⏱️ 04.01.2020):

     docker pull dkron/dkron
    
Dolphin Scheduler (πŸ₯‰22 Β· ⭐ 3.4K) - Dolphin Scheduler is a distributed and easy-to-expand visual DAG.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 72 Β· πŸ”€ 1K Β· πŸ“₯ 5.6K Β· πŸ“‹ 730 - 44% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/incubator-dolphinscheduler
    
Oozie (πŸ₯‰22 Β· ⭐ 540) - Server-based workflow scheduling system to manage Hadoop jobs. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 36 Β· πŸ”€ 380 Β· πŸ“¦ 270 Β· ⏱️ 18.01.2020):

     git clone https://github.com/apache/oozie
    
  • Maven (πŸ“¦ 170 Β· ⏱️ 02.12.2016):

     <dependency>
     	<groupId>org.apache.oozie</groupId>
     	<artifactId>oozie-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Azkaban (πŸ₯‰21 Β· ⭐ 3K) - Batch workflow job scheduler created at LinkedIn to run Hadoop jobs. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 120 Β· πŸ”€ 1.2K Β· πŸ“₯ 960 Β· πŸ“‹ 1K - 63% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/azkaban/azkaban
    
Ofelia (πŸ₯‰19 Β· ⭐ 1.2K) - A docker job scheduler (aka. crontab for docker). MIT
  • GitHub (πŸ‘¨β€πŸ’» 16 Β· πŸ”€ 87 Β· πŸ“₯ 8.4K Β· πŸ“‹ 69 - 62% open Β· ⏱️ 06.01.2020):

     git clone https://github.com/mcuadros/ofelia
    
  • Dockerhub (πŸ“₯ 2.7M Β· ⭐ 15 Β· ⏱️ 06.01.2020):

     docker pull mcuadros/ofelia
    
Aurora (πŸ₯‰16 Β· ⭐ 620) - Apache Aurora - A Mesos framework for long-running services, cron jobs, and ad-.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 140 Β· πŸ”€ 220 Β· πŸ“‹ 36 - 30% open Β· ⏱️ 13.01.2020):

     git clone https://github.com/apache/aurora
    
Flotilla (πŸ₯‰11 Β· ⭐ 140) - Self-service framework for defining and executing containerized jobs. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 9 Β· πŸ”€ 7 Β· πŸ“‹ 51 - 37% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/stitchfix/flotilla-os
    

Data Integration & Ingestion

Back to top

Tools to integrate and ingest data from a variety of data sources. This includes query engines, ETL tools, data pipeline software, and command-line database clients.

Prisma (πŸ₯‡35 Β· ⭐ 17K) - Database Tools incl. ORM, Migrations and Admin UI (Postgres, MySQL & MongoDB). Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 250 Β· πŸ”€ 910 Β· πŸ“¦ 6.1K Β· πŸ“‹ 3.4K - 18% open Β· ⏱️ 17.01.2020):

     git clone https://github.com/prisma/prisma
    
  • NPM (πŸ“₯ 120K / month Β· πŸ“¦ 490 Β· ⏱️ 14.06.2019):

     npm install prisma-client-lib
    
  • Dockerhub (πŸ“₯ 20M Β· ⭐ 74 Β· ⏱️ 11.11.2019):

     docker pull prismagraphql/prisma
    
Presto (πŸ₯‡33 Β· ⭐ 10K) - High performance, distributed SQL query engine for a variety of data sources. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 460 Β· πŸ”€ 3.5K Β· πŸ“¦ 1 Β· πŸ“‹ 4.1K - 23% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/prestodb/presto
    
  • PyPi (πŸ“₯ 270K / month Β· ⏱️ 16.04.2019):

     pip install presto-python-client
    
  • NPM (πŸ“₯ 2.4K / month Β· πŸ“¦ 30 Β· ⏱️ 07.01.2020):

     npm install presto-client
    
  • Dockerhub (πŸ“₯ 110K Β· ⭐ 12 Β· ⏱️ 23.01.2020):

     docker pull prestosql/presto
    
  • Maven (πŸ“¦ 360 Β· ⏱️ 22.02.2019):

     <dependency>
     	<groupId>com.facebook.presto</groupId>
     	<artifactId>presto-spi</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Hive (πŸ₯‡33 Β· ⭐ 3K) - Data warehouse software facilitates reading, writing, and managing large datasets.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 350 Β· πŸ”€ 2.8K Β· πŸ“¦ 9 Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/hive
    
  • PyPi (πŸ“₯ 1M / month Β· πŸ“¦ 810 Β· ⏱️ 10.09.2018):

     pip install pyhive
    
  • Maven (πŸ“¦ 1.1K Β· ⏱️ 23.07.2018):

     <dependency>
     	<groupId>org.apache.hive</groupId>
     	<artifactId>hive-common</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Impala (πŸ₯ˆ30 Β· ⭐ 2.3K Β· πŸ’€) - Lightning-fast, distributed SQL queries for petabytes of data stored in.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 170 Β· πŸ”€ 810 Β· ⏱️ 26.06.2019):

     git clone https://github.com/cloudera/Impala
    
  • PyPi (πŸ“₯ 92K / month Β· πŸ“¦ 540 Β· ⏱️ 21.11.2019):

     pip install impyla
    
TinkerPop Gremlin (πŸ₯ˆ30 Β· ⭐ 1K) - Apache TinkerPop - a graph computing framework. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 150 Β· πŸ”€ 520 Β· πŸ“¦ 210 Β· ⏱️ 23.01.2020):

     git clone https://github.com/apache/tinkerpop
    
  • PyPi (πŸ“₯ 110K / month Β· πŸ“¦ 150 Β· ⏱️ 09.08.2019):

     pip install gremlinpython
    
  • NPM (πŸ“₯ 13K / month Β· πŸ“¦ 100 Β· ⏱️ 09.08.2019):

     npm install gremlin
    
  • Maven (πŸ“¦ 230 Β· ⏱️ 08.05.2018):

     <dependency>
     	<groupId>org.apache.tinkerpop</groupId>
     	<artifactId>gremlin-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Alluxio (πŸ₯ˆ29 Β· ⭐ 4.5K) - Alluxio, data orchestration for analytics and machine learning in the cloud. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 1.2K Β· πŸ”€ 2.3K Β· πŸ“₯ 20K Β· πŸ“¦ 1 Β· πŸ“‹ 670 - 42% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/Alluxio/alluxio
    
  • PyPi (πŸ“₯ 61 / month Β· ⏱️ 07.09.2017):

     pip install alluxio
    
  • Dockerhub (πŸ“₯ 550K Β· ⭐ 3 Β· ⏱️ 24.01.2020):

     docker pull alluxio/alluxio
    
  • Maven (πŸ“¦ 38 Β· ⏱️ 27.03.2018):

     <dependency>
     	<groupId>org.alluxio</groupId>
     	<artifactId>alluxio-core-client-fs</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Apache NiFi (πŸ₯ˆ29 Β· ⭐ 2K) - Integrated data logistics platform for automating the movement of data.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 370 Β· πŸ”€ 1.6K Β· ⏱️ 23.01.2020):

     git clone https://github.com/apache/nifi
    
  • Dockerhub (πŸ“₯ 1.3M Β· ⭐ 150 Β· ⏱️ 05.11.2019):

     docker pull apache/nifi
    
  • Maven (πŸ“¦ 210 Β· ⏱️ 23.10.2018):

     <dependency>
     	<groupId>org.apache.nifi</groupId>
     	<artifactId>nifi-api</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Camel (πŸ₯ˆ28 Β· ⭐ 3.1K) - Integration framework that empowers you to easily integrate various systems.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 810 Β· πŸ”€ 3.9K Β· πŸ“¦ 8 Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/camel
    
  • Maven (πŸ“¦ 6.9K Β· ⏱️ 24.11.2018):

     <dependency>
     	<groupId>org.apache.camel</groupId>
     	<artifactId>camel-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Tika (πŸ₯ˆ27 Β· ⭐ 970) - Toolkit for detecting and extracting metadata and structured text content from.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 100 Β· πŸ”€ 500 Β· πŸ“¦ 96 Β· ⏱️ 17.01.2020):

     git clone https://github.com/apache/tika
    
  • PyPi (πŸ“₯ 33K / month Β· πŸ“¦ 160 Β· ⏱️ 09.11.2019):

     pip install tika
    
  • NPM (πŸ“₯ 900 / month Β· πŸ“¦ 28 Β· ⏱️ 22.02.2017):

     npm install tika
    
  • Dockerhub (πŸ“₯ 1.2M Β· ⭐ 26 Β· ⏱️ 11.01.2020):

     docker pull logicalspark/docker-tikaserver
    
  • Maven (πŸ“¦ 2.7K Β· ⏱️ 20.04.2018):

     <dependency>
     	<groupId>org.apache.tika</groupId>
     	<artifactId>tika-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Drill (πŸ₯‰26 Β· ⭐ 1.3K) - Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 170 Β· πŸ”€ 780 Β· πŸ“¦ 14 Β· ⏱️ 23.01.2020):

     git clone https://github.com/apache/drill
    
  • PyPi (πŸ“₯ 1.5K / month Β· πŸ“¦ 3 Β· ⏱️ 24.04.2018):

     pip install pydrill
    
  • Dockerhub (πŸ“₯ 35K Β· ⭐ 9 Β· ⏱️ 26.12.2019):

     docker pull drill/apache-drill
    
  • Maven (πŸ“¦ 38 Β· ⏱️ 24.12.2018):

     <dependency>
     	<groupId>org.apache.drill</groupId>
     	<artifactId>drill-common</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Dagster (πŸ₯‰25 Β· ⭐ 1.2K) - A Python library for building data applications: ETL, ML, Data Pipelines, and.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 49 Β· πŸ”€ 110 Β· πŸ“¦ 64 Β· πŸ“‹ 1.1K - 26% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/dagster-io/dagster
    
  • PyPi (πŸ“₯ 6.4K / month Β· πŸ“¦ 4 Β· ⏱️ 14.01.2020):

     pip install dagster
    
Calcite (πŸ₯‰24 Β· ⭐ 1.7K) - Framework for building databases and data management systems. Includes a SQL.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 290 Β· πŸ”€ 990 Β· ⏱️ 23.01.2020):

     git clone https://github.com/apache/calcite
    
  • Maven (πŸ“¦ 500 Β· ⏱️ 16.07.2018):

     <dependency>
     	<groupId>org.apache.calcite</groupId>
     	<artifactId>calcite-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Embulk (πŸ₯‰24 Β· ⭐ 1.4K) - Parallel bulk data loader that helps data transfer between various storages,.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 49 Β· πŸ”€ 160 Β· πŸ“₯ 110K Β· πŸ“‹ 420 - 39% open Β· ⏱️ 24.12.2019):

     git clone https://github.com/embulk/embulk
    
Data Collector (πŸ₯‰24 Β· ⭐ 970) - StreamSets Data Collector - Continuous big data and cloud platform ingest.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 97 Β· πŸ”€ 480 Β· ⏱️ 24.01.2020):

     git clone https://github.com/streamsets/datacollector
    
  • PyPi (πŸ“₯ 260 / month Β· ⏱️ 25.10.2019):

     pip install streamsets
    
  • Dockerhub (πŸ“₯ 2.2M Β· ⭐ 57 Β· ⏱️ 24.01.2020):

     docker pull streamsets/datacollector
    
  • Maven (πŸ“¦ 3 Β· ⏱️ 24.05.2018):

     <dependency>
     	<groupId>com.streamsets</groupId>
     	<artifactId>streamsets-datacollector</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Zenko (πŸ₯‰22 Β· ⭐ 290) - Zenko is the open source multi-cloud data controller: own and keep control of your.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 40 Β· πŸ”€ 47 Β· πŸ“₯ 310 Β· πŸ“‹ 44 - 25% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/scality/Zenko
    
  • Dockerhub (πŸ“₯ 3.6M Β· ⭐ 11 Β· ⏱️ 17.01.2020):

     docker pull zenko/cloudserver
    
Gobblin (πŸ₯‰21 Β· ⭐ 1.7K) - Gobblin is a distributed big data integration framework (ingestion,.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 140 Β· πŸ”€ 600 Β· πŸ“₯ 120K Β· ⏱️ 17.01.2020):

     git clone https://github.com/apache/incubator-gobblin
    
  • Maven (⏱️ 20.06.2018):

     <dependency>
     	<groupId>org.apache.gobblin</groupId>
     	<artifactId>gobblin-api</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
s4cmd (πŸ₯‰21 Β· ⭐ 960 Β· πŸ’€) - Super S3 command line tool. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 30 Β· πŸ”€ 160 Β· πŸ“¦ 2 Β· πŸ“‹ 110 - 58% open Β· ⏱️ 03.04.2019):

     git clone https://github.com/bloomreach/s4cmd
    
  • PyPi (πŸ“₯ 11K / month Β· πŸ“¦ 7 Β· ⏱️ 13.08.2018):

     pip install s4cmd
    
Pentaho Kettle (πŸ₯‰20 Β· ⭐ 3.6K) - Pentaho Data Integration ( ETL ) a.k.a Kettle. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 350 Β· πŸ”€ 2.1K Β· ⏱️ 24.01.2020):

     git clone https://github.com/pentaho/pentaho-kettle
    
Delta Lake (πŸ₯‰19 Β· ⭐ 2.1K) - An open-source storage layer that brings scalable, ACID transactions to.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 55 Β· πŸ”€ 420 Β· πŸ“‹ 190 - 45% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/delta-io/delta
    
  • Maven:

     <dependency>
     	<groupId>io.delta</groupId>
     	<artifactId>delta-core_2.11</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Sqoop (πŸ₯‰18 Β· ⭐ 670) - Sqoop allows easy imports and exports of data sets between databases and HDFS. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 39 Β· πŸ”€ 450 Β· ⏱️ 16.10.2019):

     git clone https://github.com/apache/sqoop
    
  • PyPi (πŸ“₯ 220 / month Β· ⏱️ 26.11.2019):

     pip install pysqoop
    
Amundsen (πŸ₯‰16 Β· ⭐ 590) - Metadata driven application for improving the productivity of data engineers.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 24 Β· πŸ”€ 99 Β· πŸ“‹ 110 - 63% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/lyft/amundsen
    
  • Dockerhub (πŸ“₯ 19K Β· ⏱️ 20.12.2019):

     docker pull amundsendev/amundsen-search
    
Show 1 hidden projects...
s3cmd (πŸ₯‰23 Β· ⭐ 3.2K) - Official s3cmd repo -- Command line tool for managing Amazon S3 and CloudFront.. ❗️GPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 180 Β· πŸ”€ 740 Β· πŸ“₯ 2.7M Β· πŸ“‹ 700 - 41% open Β· ⏱️ 18.12.2019):

     git clone https://github.com/s3tools/s3cmd
    

Data Batch & Stream Processing

Back to top

Frameworks and computing-engines for large-scale (distributed) data batch- and stream-processing.

Spark (πŸ₯‡36 Β· ⭐ 25K) - Unified analytics engine for big data processing, with built-in modules for.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 2.2K Β· πŸ”€ 21K Β· πŸ“¦ 280 Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/spark
    
  • PyPi (πŸ“₯ 2.7M / month Β· πŸ“¦ 760 Β· ⏱️ 07.05.2019):

     pip install pyspark
    
  • Dockerhub (πŸ“₯ 170K Β· ⭐ 32 Β· ⏱️ 30.09.2019):

     docker pull bde2020/spark-master
    
  • Maven (πŸ“¦ 120 Β· ⏱️ 29.10.2018):

     <dependency>
     	<groupId>org.apache.spark</groupId>
     	<artifactId>spark-catalyst_2.11</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Flink (πŸ₯‡34 Β· ⭐ 12K) - Stream processing framework with powerful stream- and batch-processing capabilities. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 900 Β· πŸ”€ 6.2K Β· πŸ“¦ 120 Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/flink
    
  • Dockerhub (πŸ“₯ 50M Β· ⭐ 140 Β· ⏱️ 23.01.2020):

     docker pull flink
    
  • Maven (πŸ“¦ 530 Β· ⏱️ 30.09.2019):

     <dependency>
     	<groupId>org.apache.flink</groupId>
     	<artifactId>flink-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Beam (πŸ₯ˆ33 Β· ⭐ 3.7K) - Unified programming model to define and execute data processing pipelines,.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 740 Β· πŸ”€ 2.2K Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/beam
    
  • PyPi (πŸ“₯ 2.6M / month Β· πŸ“¦ 190 Β· ⏱️ 23.01.2020):

     pip install apache-beam
    
  • Maven (πŸ“¦ 360 Β· ⏱️ 17.12.2019):

     <dependency>
     	<groupId>org.apache.beam</groupId>
     	<artifactId>beam-sdks-java-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Storm (πŸ₯ˆ32 Β· ⭐ 6.1K) - Distributed real-time computational system for processing data streams. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 480 Β· πŸ”€ 4K Β· πŸ“¦ 3 Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/storm
    
  • Dockerhub (πŸ“₯ 3.2M Β· ⭐ 120 Β· ⏱️ 23.01.2020):

     docker pull storm
    
  • Maven (πŸ“¦ 3.9K Β· ⏱️ 29.04.2019):

     <dependency>
     	<groupId>org.apache.storm</groupId>
     	<artifactId>storm-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Kafka (πŸ₯ˆ29 Β· ⭐ 15K) - Distributed streaming platform that is used to build real time streaming data.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 780 Β· πŸ”€ 7.7K Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/kafka
    
  • PyPi (πŸ“₯ 260K / month Β· πŸ“¦ 350 Β· ⏱️ 07.10.2017):

     pip install kafka
    
  • Dockerhub (πŸ“₯ 20M Β· ⭐ 100 Β· ⏱️ 24.01.2020):

     docker pull bitnami/kafka
    
  • Maven (πŸ“¦ 52 Β· ⏱️ 22.11.2013):

     <dependency>
     	<groupId>org.apache.kafka</groupId>
     	<artifactId>kafka</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Hadoop (πŸ₯ˆ29 Β· ⭐ 10K) - Framework that allows for the distributed processing of large data sets across.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 420 Β· πŸ”€ 6.2K Β· πŸ“¦ 62 Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/hadoop
    
  • Maven (πŸ“¦ 12K Β· ⏱️ 02.08.2018):

     <dependency>
     	<groupId>org.apache.hadoop</groupId>
     	<artifactId>hadoop-common</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Heron (πŸ₯ˆ23 Β· ⭐ 3.4K) - Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 130 Β· πŸ”€ 600 Β· πŸ“₯ 58K Β· πŸ“‹ 940 - 40% open Β· ⏱️ 17.01.2020):

     git clone https://github.com/apache/incubator-heron
    
  • Dockerhub (πŸ“₯ 160K Β· ⭐ 2 Β· ⏱️ 01.04.2018):

     docker pull heron/heron
    
Hazelcast Jet (πŸ₯ˆ23 Β· ⭐ 400) - Distributed stream and batch processing engine, built on top of Hazelcast. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 32 Β· πŸ”€ 110 Β· πŸ“₯ 13 Β· πŸ“¦ 140 Β· πŸ“‹ 530 - 14% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/hazelcast/hazelcast-jet
    
  • Dockerhub (πŸ“₯ 8.7K Β· ⭐ 2 Β· ⏱️ 24.01.2020):

     docker pull hazelcast/hazelcast-jet
    
  • Maven (πŸ“¦ 12 Β· ⏱️ 14.06.2017):

     <dependency>
     	<groupId>com.hazelcast.jet</groupId>
     	<artifactId>hazelcast-jet</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Vespa (πŸ₯‰22 Β· ⭐ 3.1K) - Vespa is an engine for low-latency computation over large data sets. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 86 Β· πŸ”€ 360 Β· πŸ“¦ 1 Β· πŸ“‹ 310 - 28% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/vespa-engine/vespa
    
  • Dockerhub (πŸ“₯ 510K Β· ⭐ 5 Β· ⏱️ 24.01.2020):

     docker pull vespaengine/vespa
    
Flume (πŸ₯‰22 Β· ⭐ 1.9K) - Service for efficiently collecting, aggregating, and moving large amounts of log.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 68 Β· πŸ”€ 1.3K Β· πŸ“¦ 120 Β· ⏱️ 13.01.2020):

     git clone https://github.com/apache/flume
    
  • Maven (πŸ“¦ 1.4K Β· ⏱️ 15.09.2017):

     <dependency>
     	<groupId>org.apache.flume</groupId>
     	<artifactId>flume-ng-sdk</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Gearpump (πŸ₯‰19 Β· ⭐ 730) - Lightweight real-time big data streaming engine over Akka. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 39 Β· πŸ”€ 150 Β· πŸ“₯ 3.7K Β· πŸ“‹ 1.1K - 7% open Β· ⏱️ 01.12.2019):

     git clone https://github.com/gearpump/gearpump
    
  • Maven:

     <dependency>
     	<groupId>io.github.gearpump</groupId>
     	<artifactId>gearpump-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Brooklin (πŸ₯‰19 Β· ⭐ 520) - An extensible distributed system for reliable nearline data streaming at scale. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 35 Β· πŸ”€ 64 Β· πŸ“₯ 850 Β· πŸ“‹ 19 - 42% open Β· ⏱️ 22.11.2019):

     git clone https://github.com/linkedin/brooklin
    
  • Maven:

     <dependency>
     	<groupId>com.github.datastream</groupId>
     	<artifactId>datastream-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Wallaroo (πŸ₯‰18 Β· ⭐ 1.4K) - Distributed Stream Processing. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 34 Β· πŸ”€ 59 Β· πŸ“‹ 1.8K - 19% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/WallarooLabs/wallaroo
    
kapacitor (πŸ₯‰17 Β· ⭐ 1.9K) - Open source framework for processing, monitoring, and alerting on time series data. MIT
  • GitHub (πŸ‘¨β€πŸ’» 90 Β· πŸ”€ 410 Β· πŸ“‹ 1.6K - 42% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/influxdata/kapacitor
    
Samza (πŸ₯‰17 Β· ⭐ 610) - Near-realtime, asynchronous computational framework for stream processing. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 170 Β· πŸ”€ 250 Β· ⏱️ 23.01.2020):

     git clone https://github.com/apache/samza
    
  • Maven:

     <dependency>
     	<groupId>org.apache.samza</groupId>
     	<artifactId>samza-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Show 1 hidden projects...
Onyx (πŸ₯‰15 Β· ⭐ 1.9K) - Distributed, masterless, high performance, fault tolerant data processing. ❗️EPL-1.0
  • GitHub (πŸ‘¨β€πŸ’» 62 Β· πŸ”€ 200 Β· πŸ“‹ 590 - 13% open Β· ⏱️ 31.08.2019):

     git clone https://github.com/onyx-platform/onyx
    

Data Labeling & Annotation

Back to top

Tools to label and annotate any type of data (e.g. images, text, videos, and audio).

LabelImg (πŸ₯‡26 Β· ⭐ 9.6K) - LabelImg is a graphical image annotation tool and label object bounding boxes in.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 66 Β· πŸ”€ 3.3K Β· πŸ“¦ 53 Β· πŸ“‹ 430 - 40% open Β· ⏱️ 10.01.2020):

     git clone https://github.com/tzutalin/labelImg
    
  • PyPi (πŸ“₯ 4.7K / month Β· πŸ“¦ 9 Β· ⏱️ 26.05.2019):

     pip install labelImg
    
Labelme (πŸ₯‡26 Β· ⭐ 4K) - Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point.. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 38 Β· πŸ”€ 1.3K Β· πŸ“₯ 7K Β· πŸ“¦ 47 Β· πŸ“‹ 340 - 16% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/wkentaro/labelme
    
  • PyPi (πŸ“₯ 8.3K / month Β· πŸ“¦ 8 Β· ⏱️ 14.01.2020):

     pip install labelme
    
  • Dockerhub (πŸ“₯ 1.1K Β· ⭐ 2 Β· ⏱️ 23.01.2020):

     docker pull wkentaro/labelme
    
doccano (πŸ₯ˆ24 Β· ⭐ 2.3K) - Open source text annotation tool for machine learning practitioner. MIT
  • GitHub (πŸ‘¨β€πŸ’» 37 Β· πŸ”€ 480 Β· πŸ“‹ 340 - 16% open Β· ⏱️ 10.01.2020):

     git clone https://github.com/chakki-works/doccano
    
  • Dockerhub (πŸ“₯ 390K Β· ⭐ 8 Β· ⏱️ 29.11.2019):

     docker pull chakkiworks/doccano
    
VoTT (πŸ₯ˆ20 Β· ⭐ 2.1K) - Visual Object Tagging Tool: An electron app for building end to end Object Detection.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 13 Β· πŸ”€ 450 Β· πŸ“₯ 45K Β· πŸ“‹ 390 - 42% open Β· ⏱️ 04.10.2019):

     git clone https://github.com/Microsoft/VoTT
    
CVAT (πŸ₯ˆ19 Β· ⭐ 3.1K) - Powerful and efficient Computer Vision Annotation Tool (CVAT). MIT
  • GitHub (πŸ‘¨β€πŸ’» 53 Β· πŸ”€ 710 Β· πŸ“‹ 560 - 26% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/opencv/cvat
    
Label Studio (πŸ₯ˆ19 Β· ⭐ 2.2K) - Label Studio is a multi-type data labeling and annotation tool with.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 12 Β· πŸ”€ 130 Β· πŸ“₯ 3 Β· πŸ“¦ 1 Β· πŸ“‹ 60 - 33% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/heartexlabs/label-studio
    
  • PyPi (πŸ“₯ 630 / month Β· ⏱️ 23.01.2020):

     pip install label-studio
    
  • NPM (πŸ“₯ 250 / month Β· ⏱️ 10.01.2020):

     npm install label-studio
    
  • Dockerhub (πŸ“₯ 300 Β· ⏱️ 23.01.2020):

     docker pull heartexlabs/label-studio
    
PixelAnnotationTool (πŸ₯‰17 Β· ⭐ 670) - Annotate quickly images. ❗️LGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 7 Β· πŸ”€ 170 Β· πŸ“₯ 11K Β· πŸ“‹ 42 - 38% open Β· ⏱️ 07.01.2020):

     git clone https://github.com/abreheret/PixelAnnotationTool
    
Semantic Segmentation Editor (πŸ₯‰17 Β· ⭐ 520) - Web labeling tool for camera and LIDAR data. MIT
  • GitHub (πŸ‘¨β€πŸ’» 4 Β· πŸ”€ 160 Β· πŸ“‹ 71 - 21% open Β· ⏱️ 04.01.2020):

     git clone https://github.com/Hitachi-Automotive-And-Industry-Lab/semantic-segmentation-editor
    
  • Dockerhub (πŸ“₯ 850 Β· ⭐ 6 Β· ⏱️ 30.07.2018):

     docker pull hitachiail/semantic-segmentation-editor
    
Labelbox (πŸ₯‰15 Β· ⭐ 1.3K) - Labelbox is the fastest way to annotate data to build and ship computer vision.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 15 Β· πŸ”€ 190 Β· ⏱️ 26.11.2019):

     git clone https://github.com/Labelbox/Labelbox
    
LOST (πŸ₯‰15 Β· ⭐ 280) - Label Objects and Save Time (LOST) - Design your own smart Image Annotation process in a.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 5 Β· πŸ”€ 39 Β· πŸ“‹ 50 - 40% open Β· ⏱️ 17.10.2019):

     git clone https://github.com/l3p-cv/lost
    
  • Dockerhub (πŸ“₯ 550 Β· ⏱️ 17.10.2019):

     docker pull l3pcv/lost
    
ImgLab (πŸ₯‰14 Β· ⭐ 570) - To speedup and simplify image labeling/ annotation process with multiple supported.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 29 Β· πŸ”€ 310 Β· πŸ“‹ 100 - 28% open Β· ⏱️ 19.10.2019):

     git clone https://github.com/NaturalIntelligence/imglab
    
OpenLabeling (πŸ₯‰14 Β· ⭐ 520) - Label images and video for Computer Vision applications. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 14 Β· πŸ”€ 140 Β· πŸ“‹ 34 - 26% open Β· ⏱️ 16.01.2020):

     git clone https://github.com/Cartucho/OpenLabeling
    
makesense.ai (πŸ₯‰13 Β· ⭐ 930) - Free to use online tool for labelling photos. https://makesense.ai. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 8 Β· πŸ”€ 100 Β· πŸ“‹ 30 - 50% open Β· ⏱️ 08.01.2020):

     git clone https://github.com/SkalskiP/make-sense
    
Show 2 hidden projects...
jupyter-innotater (πŸ₯‰12 Β· ⭐ 44) - Inline data annotator for Jupyter notebooks. MIT
  • GitHub (πŸ‘¨β€πŸ’» 1 Β· πŸ”€ 4 Β· πŸ“‹ 10 - 40% open Β· ⏱️ 06.01.2020):

     git clone https://github.com/ideonate/jupyter-innotater
    
  • PyPi (πŸ“₯ 69 / month Β· ⏱️ 29.07.2019):

     pip install jupyter_innotater
    
superintendent (πŸ₯‰9 Β· ⭐ 85) - Practical active learning in python. ❗️Unlicensed
  • GitHub (πŸ‘¨β€πŸ’» 6 Β· πŸ”€ 9 Β· πŸ“¦ 3 Β· πŸ“‹ 15 - 26% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/janfreyberg/superintendent
    

Data Visualization & Exploration

Back to top

GUI tools to visualize, explore, and analyze data.

Grafana (πŸ₯‡33 Β· ⭐ 34K) - The tool for beautiful monitoring and metric analytics & dashboards for.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 1.2K Β· πŸ”€ 6.4K Β· πŸ“¦ 7 Β· πŸ“‹ 15K - 16% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/grafana/grafana
    
  • Dockerhub (πŸ“₯ 720M Β· ⭐ 1.3K Β· ⏱️ 24.01.2020):

     docker pull grafana/grafana
    
Kibana (πŸ₯‡31 Β· ⭐ 15K) - Your window into the Elastic Stack. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 520 Β· πŸ”€ 5.4K Β· πŸ“¦ 5 Β· πŸ“‹ 22K - 27% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/elastic/kibana
    
  • Dockerhub (πŸ“₯ 76M Β· ⭐ 1.7K Β· ⏱️ 23.01.2020):

     docker pull kibana
    
Orange (πŸ₯ˆ27 Β· ⭐ 2.1K) - Orange: Interactive data analysis https://orange.biolab.si. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 99 Β· πŸ”€ 580 Β· πŸ“₯ 140 Β· πŸ“¦ 250 Β· πŸ“‹ 1.4K - 5% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/biolab/orange3
    
  • PyPi (πŸ“₯ 4.8K / month Β· πŸ“¦ 110 Β· ⏱️ 20.12.2019):

     pip install orange3
    
  • Conda (⏱️ 16.10.2019):

     conda install -c anaconda orange3
    
Gephi (πŸ₯ˆ25 Β· ⭐ 3.6K) - Gephi - The Open Graph Viz Platform. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 81 Β· πŸ”€ 1.3K Β· πŸ“₯ 1.8M Β· πŸ“¦ 20 Β· πŸ“‹ 2K - 23% open Β· ⏱️ 13.01.2020):

     git clone https://github.com/gephi/gephi
    
  • Maven (πŸ“¦ 12 Β· ⏱️ 14.02.2016):

     <dependency>
     	<groupId>org.gephi</groupId>
     	<artifactId>project-api</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
CARTO (πŸ₯ˆ25 Β· ⭐ 2.3K) - Location Intelligence & Data Visualization tool. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 180 Β· πŸ”€ 620 Β· πŸ“‹ 8.5K - 1% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/CartoDB/cartodb
    
  • Dockerhub (πŸ“₯ 31K Β· ⭐ 37 Β· ⏱️ 07.11.2019):

     docker pull sverhoeven/cartodb
    
SandDance (πŸ₯‰21 Β· ⭐ 4K) - Visually explore, understand, and present your data. MIT
  • GitHub (πŸ‘¨β€πŸ’» 8 Β· πŸ”€ 260 Β· πŸ“‹ 90 - 51% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/microsoft/SandDance
    
  • NPM (πŸ“₯ 390 / month Β· πŸ“¦ 6 Β· ⏱️ 09.01.2020):

     npm install @msrvida/sanddance
    
Facette (πŸ₯‰19 Β· ⭐ 1.1K) - Time series data visualization software. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 30 Β· πŸ”€ 76 Β· πŸ“₯ 5.9K Β· πŸ“‹ 350 - 10% open Β· ⏱️ 20.11.2019):

     git clone https://github.com/facette/facette
    
Voyager 2 (πŸ₯‰19 Β· ⭐ 920 Β· πŸ’€) - Visualization Tool for Data Exploration. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 16 Β· πŸ”€ 110 Β· πŸ“¦ 24 Β· πŸ“‹ 450 - 18% open Β· ⏱️ 18.05.2019):

     git clone https://github.com/vega/voyager
    
  • NPM (πŸ“₯ 250 / month Β· πŸ“¦ 12 Β· ⏱️ 06.07.2018):

     npm install datavoyager
    
ParaView (πŸ₯‰19 Β· ⭐ 580) - VTK-based Data Analysis and Visualization Application. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 270 Β· πŸ”€ 260 Β· ⏱️ 24.01.2020):

     git clone https://github.com/Kitware/ParaView
    
Datawrapper (πŸ₯‰18 Β· ⭐ 1K) - An open source data visualization platform helping everyone to create simple,.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 47 Β· πŸ”€ 240 Β· ⏱️ 23.01.2020):

     git clone https://github.com/datawrapper/datawrapper
    
Falcon (πŸ₯‰17 Β· ⭐ 410) - Brushing and linking for big data. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 5 Β· πŸ”€ 23 Β· πŸ“‹ 98 - 9% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/uwdata/falcon
    
  • NPM (πŸ“₯ 82 / month Β· ⏱️ 29.08.2019):

     npm install falcon-vis
    
Banana (πŸ₯‰16 Β· ⭐ 640) - Banana for Solr - A Port of Kibana. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 44 Β· πŸ”€ 220 Β· πŸ“‹ 150 - 67% open Β· ⏱️ 13.11.2019):

     git clone https://github.com/LucidWorks/banana
    
Papaya (πŸ₯‰15 Β· ⭐ 340 Β· πŸ’€) - A pure JavaScript medical research image viewer. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 130 Β· πŸ“‹ 170 - 20% open Β· ⏱️ 05.05.2019):

     git clone https://github.com/rii-mango/Papaya
    
  • NPM (πŸ“₯ 190 / month Β· ⏱️ 05.05.2019):

     npm install papaya-viewer
    
Dex (πŸ₯‰13 Β· ⭐ 1.2K Β· πŸ’€) - Dex : The Data Explorer -- A data visualization tool written in.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 1 Β· πŸ”€ 300 Β· πŸ“‹ 13 - 23% open Β· ⏱️ 12.02.2019):

     git clone https://github.com/PatMartin/Dex
    
Show 3 hidden projects...
Visdom (πŸ₯ˆ26 Β· ⭐ 7K) - A flexible tool for creating, organizing, and sharing visualizations of.. ❗️CC-BY-NC-4.0
  • GitHub (πŸ‘¨β€πŸ’» 95 Β· πŸ”€ 840 Β· πŸ“¦ 1.7K Β· πŸ“‹ 450 - 12% open Β· ⏱️ 12.12.2019):

     git clone https://github.com/facebookresearch/visdom
    
  • PyPi (πŸ“₯ 26K / month Β· πŸ“¦ 410 Β· ⏱️ 12.09.2019):

     pip install visdom
    
Chronograf (πŸ₯ˆ26 Β· ⭐ 1.2K) - Open source monitoring and visualization UI for the TICK stack. ❗️AGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 79 Β· πŸ”€ 200 Β· πŸ“‹ 3.1K - 1% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/influxdata/chronograf
    
  • Dockerhub (πŸ“₯ 71M Β· ⭐ 190 Β· ⏱️ 24.01.2020):

     docker pull chronograf
    
Bumblebee (πŸ₯‰11 Β· ⭐ 47) - An agnostic data profiling GUI to make your data science tasks easier. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 5 Β· πŸ”€ 6 Β· πŸ“¦ 2 Β· πŸ“‹ 11 - 54% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/ironmussa/bumblebee
    
  • Dockerhub (πŸ“₯ 10 Β· ⏱️ 10.12.2019):

     docker pull ironmussa/bumblebee
    

Model Visualization

Back to top

Tools to visualize, explore, and understand neural networks and other machine learning models.

Netron (πŸ₯‡23 Β· ⭐ 7.8K) - Visualizer for neural network, deep learning and machine learning models. MIT
  • GitHub (πŸ‘¨β€πŸ’» 2 Β· πŸ”€ 950 Β· πŸ“₯ 41K Β· πŸ“‹ 380 - 2% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/lutzroeder/netron
    
  • PyPi (πŸ“₯ 1.6K / month Β· πŸ“¦ 4 Β· ⏱️ 21.01.2020):

     pip install netron
    
TensorSpace.js (πŸ₯ˆ19 Β· ⭐ 4.1K Β· πŸ’€) - Neural network 3D visualization framework, build interactive and.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 360 Β· πŸ“¦ 10 Β· πŸ“‹ 200 - 12% open Β· ⏱️ 21.04.2019):

     git clone https://github.com/tensorspace-team/tensorspace
    
  • NPM (πŸ“₯ 140 / month Β· πŸ“¦ 1 Β· ⏱️ 02.04.2019):

     npm install tensorspace
    
Netwulf (πŸ₯ˆ18 Β· ⭐ 180) - Interactive visualization of networks based on Ulf Aslak's d3 web app. MIT
  • GitHub (πŸ‘¨β€πŸ’» 5 Β· πŸ”€ 16 Β· πŸ“¦ 6 Β· πŸ“‹ 32 - 25% open Β· ⏱️ 17.01.2020):

     git clone https://github.com/benmaier/netwulf
    
  • PyPi (πŸ“₯ 190 / month Β· ⏱️ 09.09.2019):

     pip install netwulf
    
PlotNeuralNet (πŸ₯‰16 Β· ⭐ 7.5K) - Latex code for making neural networks diagrams. MIT
  • GitHub (πŸ‘¨β€πŸ’» 9 Β· πŸ”€ 1K Β· πŸ“‹ 78 - 53% open Β· ⏱️ 17.01.2020):

     git clone https://github.com/HarisIqbal88/PlotNeuralNet
    
BertViz (πŸ₯‰15 Β· ⭐ 1.3K) - Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert,.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 230 Β· πŸ“‹ 30 - 43% open Β· ⏱️ 02.12.2019):

     git clone https://github.com/jessevig/bertviz
    
GANDissect (πŸ₯‰13 Β· ⭐ 1.5K Β· πŸ’€) - Pytorch-based tools for visualizing and understanding the neurons of a GAN... MIT
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 220 Β· πŸ“‹ 13 - 46% open Β· ⏱️ 11.03.2019):

     git clone https://github.com/CSAILVision/gandissect
    
exBERT (πŸ₯‰10 Β· ⭐ 190 Β· 🐣) - A Visual Analysis Tool to Explore Learned Representations in Transformers.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 21 Β· πŸ“‹ 4 - 75% open Β· ⏱️ 15.10.2019):

     git clone https://github.com/bhoov/exbert
    
Show 1 hidden projects...
Fabrik (πŸ₯‰14 Β· ⭐ 1K Β· πŸ’€) - Collaboratively build, visualize, and design neural nets in browser. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 45 Β· πŸ”€ 240 Β· πŸ“‹ 130 - 30% open Β· ⏱️ 12.12.2018):

     git clone https://github.com/Cloud-CV/Fabrik
    

Model Deployment

Back to top

Tools and platforms to deploy, run, and serve machine learning models.

TensorFlow.js (πŸ₯‡31 Β· ⭐ 13K) - A WebGL accelerated JavaScript library for training and deploying ML models. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 220 Β· πŸ”€ 1K Β· πŸ“₯ 12 Β· πŸ“¦ 6.3K Β· πŸ“‹ 1.9K - 25% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/tensorflow/tfjs
    
  • NPM (πŸ“₯ 52K / month Β· πŸ“¦ 1.3K Β· ⏱️ 20.12.2019):

     npm install @tensorflow/tfjs
    
TensorFlow Serving (πŸ₯‡30 Β· ⭐ 4.2K) - A flexible, high-performance serving system for machine learning.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 150 Β· πŸ”€ 1.6K Β· πŸ“‹ 1.1K - 5% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/tensorflow/serving
    
  • PyPi (πŸ“₯ 1.7M / month Β· πŸ“¦ 86 Β· ⏱️ 14.01.2020):

     pip install tensorflow-serving-api
    
  • Dockerhub (πŸ“₯ 9.9M Β· ⭐ 73 Β· ⏱️ 24.01.2020):

     docker pull tensorflow/serving
    
ONNX Runtime (πŸ₯‡28 Β· ⭐ 1.7K) - ONNX Runtime: cross-platform, high performance scoring engine for ML models. MIT
  • GitHub (πŸ‘¨β€πŸ’» 130 Β· πŸ”€ 360 Β· πŸ“₯ 6.4K Β· πŸ“¦ 150 Β· πŸ“‹ 710 - 20% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/microsoft/onnxruntime
    
  • PyPi (πŸ“₯ 65K / month Β· πŸ“¦ 33 Β· ⏱️ 23.01.2020):

     pip install onnxruntime
    
  • Dockerhub (πŸ“₯ 1K Β· ⭐ 7 Β· ⏱️ 20.12.2019):

     docker pull onnx/onnx-ecosystem
    
Seldon (πŸ₯ˆ27 Β· ⭐ 1.4K) - Machine Learning Deployment for Kubernetes. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 73 Β· πŸ”€ 280 Β· πŸ“‹ 700 - 13% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/SeldonIO/seldon-core
    
  • PyPi (πŸ“₯ 4.7K / month Β· πŸ“¦ 13 Β· ⏱️ 15.01.2020):

     pip install seldon-core
    
  • Dockerhub (πŸ“₯ 1.8M Β· ⏱️ 24.01.2020):

     docker pull seldonio/seldon-core-operator
    
plaidML (πŸ₯ˆ26 Β· ⭐ 2.8K) - PlaidML is a framework for making deep learning work everywhere. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 39 Β· πŸ”€ 270 Β· πŸ“₯ 240 Β· πŸ“‹ 350 - 35% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/plaidml/plaidml
    
  • PyPi (πŸ“₯ 36K / month Β· πŸ“¦ 22 Β· ⏱️ 17.01.2020):

     pip install plaidml-keras
    
MLeap (πŸ₯ˆ25 Β· ⭐ 1K) - MLeap: Deploy Spark Pipelines to Production. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 58 Β· πŸ”€ 240 Β· πŸ“¦ 62 Β· πŸ“‹ 350 - 26% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/combust/mleap
    
  • PyPi (πŸ“₯ 270K / month Β· πŸ“¦ 15 Β· ⏱️ 09.10.2017):

     pip install mleap
    
  • Dockerhub (πŸ“₯ 15K Β· ⭐ 5 Β· ⏱️ 23.01.2020):

     docker pull combustml/mleap-serving
    
  • Maven (⏱️ 10.09.2018):

     <dependency>
     	<groupId>ml.combust.mleap</groupId>
     	<artifactId>mleap-base_2.11</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
MXNet Model Server (πŸ₯ˆ22 Β· ⭐ 580) - Multi Model Server is a tool for serving neural net models for.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 54 Β· πŸ”€ 160 Β· πŸ“‹ 330 - 7% open Β· ⏱️ 08.01.2020):

     git clone https://github.com/awslabs/mxnet-model-server
    
  • PyPi (πŸ“₯ 3.5K / month Β· πŸ“¦ 2 Β· ⏱️ 14.11.2019):

     pip install mxnet-model-server
    
  • Dockerhub (πŸ“₯ 33K Β· ⭐ 4 Β· ⏱️ 04.12.2019):

     docker pull awsdeeplearningteam/mxnet-model-server
    
KFServing (πŸ₯ˆ22 Β· ⭐ 300) - Serverless Inferencing on Kubernetes. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 34 Β· πŸ”€ 100 Β· πŸ“₯ 37 Β· πŸ“¦ 9 Β· πŸ“‹ 270 - 36% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/kubeflow/kfserving
    
  • PyPi (πŸ“₯ 1.5K / month Β· πŸ“¦ 6 Β· ⏱️ 27.11.2019):

     pip install kfserving
    
DeepDetect (πŸ₯ˆ21 Β· ⭐ 2K) - Deep Learning API and Server in C++11 support for Caffe, Caffe2,.. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 25 Β· πŸ”€ 490 Β· πŸ“‹ 370 - 19% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/jolibrain/deepdetect
    
  • Dockerhub (πŸ“₯ 28K Β· ⏱️ 23.01.2020):

     docker pull jolibrain/deepdetect_cpu
    
PipelineAI (πŸ₯‰20 Β· ⭐ 3.9K) - PipelineAI Kubeflow Distribution. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 13 Β· πŸ”€ 950 Β· πŸ“‹ 250 - 0% open Β· ⏱️ 12.01.2020):

     git clone https://github.com/PipelineAI/pipeline
    
  • PyPi (πŸ“₯ 2.4K / month Β· ⏱️ 17.05.2019):

     pip install cli-pipeline
    
OpenPAI (πŸ₯‰20 Β· ⭐ 1.6K) - Resource scheduling and cluster management for AI. MIT
  • GitHub (πŸ‘¨β€πŸ’» 81 Β· πŸ”€ 360 Β· πŸ“‹ 1.4K - 10% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/Microsoft/pai
    
  • PyPi (πŸ“₯ 140 / month Β· ⏱️ 22.07.2019):

     pip install paicli
    
TensorRT Inference Server (πŸ₯‰20 Β· ⭐ 960) - The TensorRT Inference Server provides a cloud inferencing.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 23 Β· πŸ”€ 190 Β· πŸ“₯ 12K Β· πŸ“‹ 320 - 6% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/NVIDIA/tensorrt-inference-server
    
BentoML (πŸ₯‰20 Β· ⭐ 770) - Model Serving made easy. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 13 Β· πŸ”€ 88 Β· πŸ“₯ 96 Β· πŸ“¦ 7 Β· πŸ“‹ 68 - 7% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/bentoml/bentoml
    
  • PyPi (πŸ“₯ 1K / month Β· πŸ“¦ 4 Β· ⏱️ 23.01.2020):

     pip install bentoml
    
  • Dockerhub:

     docker pull bentoml/bento_api_server
    
Clipper (πŸ₯‰19 Β· ⭐ 1.1K) - A low-latency prediction-serving system. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 36 Β· πŸ”€ 240 Β· πŸ“‹ 370 - 28% open Β· ⏱️ 09.07.2019):

     git clone https://github.com/ucbrise/clipper
    
  • PyPi (πŸ“₯ 340 / month Β· ⏱️ 07.06.2019):

     pip install clipper-admin
    
  • Dockerhub (πŸ“₯ 1.2M Β· ⏱️ 23.01.2020):

     docker pull clipper/management_frontend
    
ONNX.js (πŸ₯‰19 Β· ⭐ 980 Β· πŸ’€) - ONNX.js: run ONNX models using JavaScript. MIT
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 68 Β· πŸ“¦ 53 Β· πŸ“‹ 57 - 40% open Β· ⏱️ 01.06.2019):

     git clone https://github.com/microsoft/onnxjs
    
  • NPM (πŸ“₯ 380 / month Β· πŸ“¦ 10 Β· ⏱️ 01.06.2019):

     npm install onnxjs
    
Simple TensorFlow Serving (πŸ₯‰19 Β· ⭐ 640) - Generic and easy-to-use serving service for machine learning.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 8 Β· πŸ”€ 160 Β· πŸ“‹ 65 - 49% open Β· ⏱️ 11.11.2019):

     git clone https://github.com/tobegit3hub/simple_tensorflow_serving
    
  • PyPi (πŸ“₯ 120 / month Β· ⏱️ 23.09.2019):

     pip install simple_tensorflow_serving
    
  • Dockerhub (πŸ“₯ 3.7K Β· ⭐ 1 Β· ⏱️ 11.11.2019):

     docker pull tobegit3hub/simple_tensorflow_serving
    
Hydrosphere Serving (πŸ₯‰19 Β· ⭐ 190) - Machine Learning Serving cluster. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 31 Β· πŸ“₯ 5.9K Β· πŸ“‹ 120 - 18% open Β· ⏱️ 17.01.2020):

     git clone https://github.com/Hydrospheredata/hydro-serving
    
  • PyPi (πŸ“₯ 150 / month Β· ⏱️ 12.11.2019):

     pip install hs
    
  • Dockerhub (πŸ“₯ 190K Β· ⏱️ 13.01.2020):

     docker pull hydrosphere/serving-manager
    
Cortex (πŸ₯‰18 Β· ⭐ 2.8K) - Deploy machine learning models in production. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 10 Β· πŸ”€ 190 Β· πŸ“‹ 400 - 21% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/cortexlabs/cortex
    
OpenVINO DLDT (πŸ₯‰14 Β· ⭐ 1K) - OpenVINO Toolkit - Deep Learning Deployment Toolkit repository. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 7 Β· πŸ”€ 330 Β· πŸ“₯ 140 Β· πŸ“‹ 320 - 67% open Β· ⏱️ 15.11.2019):

     git clone https://github.com/opencv/dldt
    
Show 3 hidden projects...
RedisAI (πŸ₯‰15 Β· ⭐ 300) - A Redis module for serving tensors and executing deep learning graphs. ❗️RSAL
  • GitHub (πŸ‘¨β€πŸ’» 15 Β· πŸ”€ 31 Β· πŸ“‹ 120 - 28% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/RedisAI/RedisAI
    
  • Dockerhub (πŸ“₯ 14K Β· ⏱️ 24.01.2020):

     docker pull redisai/redisai
    
GraphPipe (πŸ₯‰14 Β· ⭐ 690 Β· πŸ’€) - Machine Learning Model Deployment Made Simple. ❗️UPL-1.0
  • GitHub (πŸ‘¨β€πŸ’» 4 Β· πŸ”€ 96 Β· πŸ“‹ 13 - 92% open Β· ⏱️ 16.10.2018):

     git clone https://github.com/oracle/graphpipe
    
  • PyPi (πŸ“₯ 180 / month Β· πŸ“¦ 2 Β· ⏱️ 15.08.2018):

     pip install graphpipe
    
  • Dockerhub (πŸ“₯ 1K Β· ⏱️ 02.11.2018):

     docker pull sleepsonthefloor/graphpipe-tf
    
Openscoring (πŸ₯‰13 Β· ⭐ 490 Β· πŸ’€) - REST web service for the true real-time scoring (1 ms) of R, Scikit-.. ❗️AGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 1 Β· πŸ”€ 140 Β· πŸ“₯ 1.5K Β· πŸ“‹ 50 - 12% open Β· ⏱️ 19.06.2019):

     git clone https://github.com/openscoring/openscoring
    
  • Maven (⏱️ 12.01.2019):

     <dependency>
     	<groupId>org.openscoring</groupId>
     	<artifactId>openscoring</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    

ML Applications & Services

Back to top

Task-specific machine learning applications and services.

bert-as-service (πŸ₯‡27 Β· ⭐ 6.6K) - Mapping a variable-length sentence to a fixed-length vector using BERT model. MIT
  • GitHub (πŸ‘¨β€πŸ’» 38 Β· πŸ”€ 1.3K Β· πŸ“‹ 390 - 44% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/hanxiao/bert-as-service
    
  • PyPi (πŸ“₯ 10K / month Β· πŸ“¦ 10 Β· ⏱️ 20.12.2019):

     pip install bert-serving-server
    
OCRmyPDF (πŸ₯‡24 Β· ⭐ 2.2K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be.. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 28 Β· πŸ”€ 270 Β· πŸ“¦ 25 Β· πŸ“‹ 440 - 12% open Β· ⏱️ 18.01.2020):

     git clone https://github.com/jbarlow83/OCRmyPDF
    
  • PyPi (πŸ“₯ 6.9K / month Β· πŸ“¦ 12 Β· ⏱️ 19.01.2020):

     pip install ocrmypdf
    
GNES (πŸ₯ˆ23 Β· ⭐ 1K Β· 🐣) - GNES is Generic Neural Elastic Search, a cloud-native semantic search system.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 11 Β· πŸ”€ 180 Β· πŸ“¦ 2 Β· πŸ“‹ 23 - 69% open Β· ⏱️ 24.10.2019):

     git clone https://github.com/gnes-ai/gnes
    
  • PyPi (πŸ“₯ 250 / month Β· πŸ“¦ 1 Β· ⏱️ 06.11.2019):

     pip install gnes
    
  • Dockerhub (πŸ“₯ 94K Β· ⏱️ 12.11.2019):

     docker pull gnes/gnes
    
face-api.js (πŸ₯ˆ22 Β· ⭐ 880) - Context aware, pluggable and customizable data protection and anonymization.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 75 Β· πŸ“‹ 54 - 31% open Β· ⏱️ 19.01.2020):

     git clone https://github.com/microsoft/presidio
    
  • NPM (πŸ“₯ 14K / month Β· πŸ“¦ 160 Β· ⏱️ 15.12.2019):

     npm install face-api.js
    
Hastic (πŸ₯ˆ18 Β· ⭐ 180) - Server for managing data for analytics. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 5 Β· πŸ”€ 8 Β· πŸ“₯ 410 Β· πŸ“‹ 490 - 36% open Β· ⏱️ 27.12.2019):

     git clone https://github.com/hastic/hastic-server
    
  • Dockerhub (πŸ“₯ 41K Β· ⏱️ 07.11.2019):

     docker pull hastic/server
    
Real-Time-Voice-Cloning (πŸ₯ˆ17 Β· ⭐ 15K) - Clone a voice in 5 seconds to generate arbitrary speech in real-time. MIT
  • GitHub (πŸ‘¨β€πŸ’» 9 Β· πŸ”€ 2.6K Β· πŸ“‹ 230 - 41% open Β· ⏱️ 13.11.2019):

     git clone https://github.com/CorentinJ/Real-Time-Voice-Cloning
    
DeOldify (πŸ₯ˆ17 Β· ⭐ 8.9K) - A Deep Learning based project for colorizing and restoring old images (and video!). MIT
  • GitHub (πŸ‘¨β€πŸ’» 23 Β· πŸ”€ 960 Β· πŸ“‹ 130 - 11% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/jantic/DeOldify
    
DeepFaceLab (πŸ₯‰16 Β· ⭐ 12K) - DeepFaceLab is a tool that utilizes machine learning to replace faces in.. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 2.7K Β· πŸ“‹ 490 - 35% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/iperov/DeepFaceLab
    
Automatic-Speech-Recognition (πŸ₯‰15 Β· ⭐ 2.6K) - End-to-end Automatic Speech Recognition for Madarian and.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 9 Β· πŸ”€ 510 Β· πŸ“‹ 86 - 77% open Β· ⏱️ 17.10.2019):

     git clone https://github.com/zzw922cn/Automatic_Speech_Recognition
    
Presidio (πŸ₯‰15 Β· ⭐ 880) - Context aware, pluggable and customizable data protection and anonymization service.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 75 Β· πŸ“‹ 54 - 31% open Β· ⏱️ 19.01.2020):

     git clone https://github.com/microsoft/presidio
    
  • Dockerhub:

     docker pull mcr.microsoft.com/presidio-api
    
neural-style (πŸ₯‰13 Β· ⭐ 5K) - Neural style in TensorFlow!. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 1.4K Β· πŸ“‹ 120 - 4% open Β· ⏱️ 06.10.2019):

     git clone https://github.com/anishathalye/neural-style
    
Deep Colorization (πŸ₯‰13 Β· ⭐ 2.1K) - Deep learning software for colorizing black and white images with a few.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 7 Β· πŸ”€ 340 Β· πŸ“‹ 63 - 31% open Β· ⏱️ 03.10.2019):

     git clone https://github.com/junyanz/interactive-deep-colorization
    
Show 3 hidden projects...
FastPhotoStyle (πŸ₯‰13 Β· ⭐ 10K Β· πŸ’€) - Style transfer, deep learning, feature transform. ❗️CC-BY-4.0
  • GitHub (πŸ‘¨β€πŸ’» 5 Β· πŸ”€ 1K Β· πŸ“‹ 78 - 55% open Β· ⏱️ 27.02.2019):

     git clone https://github.com/NVIDIA/FastPhotoStyle
    
AlphaPose (πŸ₯‰12 Β· ⭐ 3.5K) - Real-Time and Accurate Multi-Person Pose Estimation&Tracking System. ❗️Unlicensed
  • GitHub (πŸ‘¨β€πŸ’» 4 Β· πŸ”€ 950 Β· πŸ“‹ 490 - 13% open Β· ⏱️ 14.01.2020):

     git clone https://github.com/MVIG-SJTU/AlphaPose
    
BERTsearch (πŸ₯‰7 Β· ⭐ 340 Β· 🐣) - Elasticsearch with BERT for advanced document search. ❗️Unlicensed
  • GitHub (πŸ‘¨β€πŸ’» 2 Β· πŸ”€ 58 Β· πŸ“‹ 9 - 22% open Β· ⏱️ 14.11.2019):

     git clone https://github.com/Hironsan/bertsearch
    

Performance Optimization & Accelerators

Back to top

Compilers, accelerators, and libraries to improve compute performance and optimize machine learning models.

Cython (πŸ₯‡36 Β· ⭐ 4.8K) - The most widely used Python to C compiler. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 370 Β· πŸ”€ 950 Β· πŸ“¦ 41K Β· πŸ“‹ 2.1K - 36% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/cython/cython
    
  • PyPi (πŸ“₯ 3.6M / month Β· πŸ“¦ 27K Β· ⏱️ 01.11.2019):

     pip install Cython
    
  • Conda (⏱️ 20.11.2019):

     conda install -c anaconda cython
    
Numba (πŸ₯ˆ35 Β· ⭐ 4.9K) - NumPy aware dynamic Python compiler using LLVM. BSD-2
  • GitHub (πŸ‘¨β€πŸ’» 180 Β· πŸ”€ 590 Β· πŸ“¦ 14K Β· πŸ“‹ 2.9K - 36% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/numba/numba
    
  • PyPi (πŸ“₯ 860K / month Β· πŸ“¦ 4K Β· ⏱️ 03.01.2020):

     pip install numba
    
  • Conda (⏱️ 17.01.2020):

     conda install -c anaconda numba
    
CuPy (πŸ₯ˆ30 Β· ⭐ 3.9K) - NumPy-like API accelerated with CUDA. MIT
  • GitHub (πŸ‘¨β€πŸ’» 220 Β· πŸ”€ 330 Β· πŸ“‹ 800 - 41% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/cupy/cupy
    
  • PyPi (πŸ“₯ 9.3K / month Β· πŸ“¦ 190 Β· ⏱️ 22.01.2020):

     pip install cupy
    
  • Dockerhub (πŸ“₯ 44K Β· ⭐ 5 Β· ⏱️ 24.01.2020):

     docker pull cupy/cupy
    
mkl-dnn (πŸ₯ˆ24 Β· ⭐ 1.8K) - Deep Neural Network Library (DNNL). Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 110 Β· πŸ”€ 430 Β· πŸ“₯ 1.9M Β· πŸ“‹ 550 - 4% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/intel/mkl-dnn
    
  • Conda (⏱️ 18.09.2019):

     conda install -c anaconda mkl
    
TVM (πŸ₯‰23 Β· ⭐ 4.9K) - Open deep learning compiler stack for cpu, gpu and specialized accelerators. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 320 Β· πŸ”€ 1.3K Β· πŸ“₯ 59 Β· πŸ“‹ 1.3K - 10% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/incubator-tvm
    
  • Dockerhub (πŸ“₯ 1.7K Β· ⭐ 4 Β· ⏱️ 27.09.2019):

     docker pull tvmai/demo-cpu
    
nGraph (πŸ₯‰22 Β· ⭐ 1.2K) - nGraph - open source C++ library, compiler and runtime for Deep Learning. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 94 Β· πŸ”€ 180 Β· πŸ“¦ 1 Β· πŸ“‹ 250 - 38% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/NervanaSystems/ngraph
    
  • PyPi (πŸ“₯ 300 / month Β· ⏱️ 09.11.2019):

     pip install ngraph-core
    
OpenBLAS (πŸ₯‰21 Β· ⭐ 3.1K) - OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 160 Β· πŸ”€ 860 Β· πŸ“₯ 1.5K Β· πŸ“‹ 1.4K - 10% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/xianyi/OpenBLAS
    
  • Conda (⏱️ 18.07.2019):

     conda install -c anaconda openblas
    
Glow (πŸ₯‰17 Β· ⭐ 1.9K) - Compiler for Neural Network hardware accelerators. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 130 Β· πŸ”€ 340 Β· πŸ“‹ 590 - 33% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/pytorch/glow
    
TensorRT (πŸ₯‰16 Β· ⭐ 1.9K) - TensorRT is a C++ library for high performance inference on NVIDIA GPUs and.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 8 Β· πŸ”€ 370 Β· πŸ“‹ 300 - 26% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/NVIDIA/TensorRT
    
PocketFlow (πŸ₯‰14 Β· ⭐ 2.4K Β· πŸ’€) - An Automatic Model Compression (AutoMC) framework for developing smaller.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 11 Β· πŸ”€ 450 Β· πŸ“‹ 270 - 25% open Β· ⏱️ 28.05.2019):

     git clone https://github.com/Tencent/PocketFlow
    

Data Storage

Back to top

Dart Storage tools such as relational, document, graph, time-series, and key-value databases. For a more in-depth comparison, take a look at the DB Engines Ranking.

Elasticsearch (πŸ₯‡40 Β· ⭐ 51K) - Open Source, Distributed, RESTful Search Engine. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 1.5K Β· πŸ”€ 16K Β· πŸ“‹ 23K - 10% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/elastic/elasticsearch
    
  • PyPi (πŸ“₯ 3.1M / month Β· πŸ“¦ 11K Β· ⏱️ 19.01.2020):

     pip install elasticsearch
    
  • NPM (πŸ“₯ 970K / month Β· πŸ“¦ 8K Β· ⏱️ 15.01.2020):

     npm install elasticsearch
    
  • Dockerhub (πŸ“₯ 350M Β· ⭐ 4.1K Β· ⏱️ 23.01.2020):

     docker pull elasticsearch
    
  • Maven (πŸ“¦ 2.4K Β· ⏱️ 06.09.2019):

     <dependency>
     	<groupId>org.elasticsearch.client</groupId>
     	<artifactId>transport</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
MongoDB (πŸ₯‡38 Β· ⭐ 24K) - The MongoDB Database. ❗️SSPL
  • GitHub (πŸ‘¨β€πŸ’» 660 Β· πŸ”€ 4.3K Β· ⏱️ 24.01.2020):

     git clone https://github.com/mongodb/mongo
    
  • PyPi (πŸ“₯ 5.4M / month Β· πŸ“¦ 36K Β· ⏱️ 08.01.2020):

     pip install pymongo
    
  • NPM (πŸ“₯ 5.3M / month Β· πŸ“¦ 230K Β· ⏱️ 17.01.2020):

     npm install mongodb
    
  • Conda (⏱️ 02.11.2018):

     conda install -c anaconda mongodb
    
  • Dockerhub (πŸ“₯ 1.5B Β· ⭐ 6.5K Β· ⏱️ 18.01.2020):

     docker pull mongo
    
  • Maven (πŸ“¦ 210 Β· ⏱️ 17.01.2020):

     <dependency>
     	<groupId>org.mongodb</groupId>
     	<artifactId>mongodb-driver-sync</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
PostgreSQL (πŸ₯‡38 Β· ⭐ 15K) - Object-relational database that supports an extended subset of the SQL.. ❗️PostgreSQL
  • GitHub (πŸ‘¨β€πŸ’» 50 Β· πŸ”€ 2.1K Β· ⏱️ 24.01.2020):

     git clone https://github.com/postgres/postgres
    
  • PyPi (πŸ“₯ 5.9M / month Β· πŸ“¦ 120K Β· ⏱️ 14.04.2019):

     pip install psycopg2
    
  • NPM (πŸ“₯ 4.1M / month Β· πŸ“¦ 91K Β· ⏱️ 10.01.2020):

     npm install pg
    
  • Dockerhub (πŸ“₯ 2B Β· ⭐ 7.4K Β· ⏱️ 24.01.2020):

     docker pull postgres
    
  • Maven (πŸ“¦ 52K Β· ⏱️ 15.03.2018):

     <dependency>
     	<groupId>org.postgresql</groupId>
     	<artifactId>postgresql</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Redis (πŸ₯‡37 Β· ⭐ 49K) - Redis is an in-memory database that persists on disk. The data model is key-value,.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 400 Β· πŸ”€ 16K Β· πŸ“‹ 4.8K - 52% open Β· ⏱️ 15.01.2020):

     git clone https://github.com/antirez/redis
    
  • PyPi (πŸ“₯ 7.4M / month Β· πŸ“¦ 49K Β· ⏱️ 13.10.2019):

     pip install redis
    
  • NPM (πŸ“₯ 4.9M / month Β· πŸ“¦ 93K Β· ⏱️ 08.08.2017):

     npm install redis
    
  • Conda (⏱️ 24.01.2019):

     conda install -c anaconda redis
    
  • Dockerhub (πŸ“₯ 1.7B Β· ⭐ 7.7K Β· ⏱️ 18.01.2020):

     docker pull redis
    
  • Maven (πŸ“¦ 40K Β· ⏱️ 02.12.2018):

     <dependency>
     	<groupId>redis.clients</groupId>
     	<artifactId>jedis</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
InfluxDB (πŸ₯‡37 Β· ⭐ 19K) - Scalable datastore for metrics, events, and real-time analytics. MIT
  • GitHub (πŸ‘¨β€πŸ’» 480 Β· πŸ”€ 2.5K Β· πŸ“‹ 9.3K - 6% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/influxdata/influxdb
    
  • PyPi (πŸ“₯ 750K / month Β· πŸ“¦ 1.9K Β· ⏱️ 26.08.2019):

     pip install influxdb
    
  • NPM (πŸ“₯ 140K / month Β· πŸ“¦ 970 Β· ⏱️ 13.11.2019):

     npm install influx
    
  • Dockerhub (πŸ“₯ 400M Β· ⭐ 900 Β· ⏱️ 23.01.2020):

     docker pull influxdb
    
  • Maven (πŸ“¦ 960 Β· ⏱️ 12.09.2018):

     <dependency>
     	<groupId>org.influxdb</groupId>
     	<artifactId>influxdb-java</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Prometheus (πŸ₯ˆ35 Β· ⭐ 30K) - The Prometheus monitoring system and time series database. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 480 Β· πŸ”€ 4.2K Β· πŸ“₯ 12M Β· πŸ“‹ 3.3K - 14% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/prometheus/prometheus
    
  • PyPi (πŸ“₯ 4.7M / month Β· πŸ“¦ 1.2K Β· ⏱️ 20.06.2019):

     pip install prometheus_client
    
  • Conda:

     conda install -c conda-forge prometheus_client
    
  • Dockerhub (πŸ“₯ 510M Β· ⭐ 930 Β· ⏱️ 22.01.2020):

     docker pull prom/prometheus
    
  • Maven (πŸ“¦ 1.1K Β· ⏱️ 30.07.2018):

     <dependency>
     	<groupId>io.prometheus</groupId>
     	<artifactId>simpleclient</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Solr (πŸ₯ˆ35 Β· ⭐ 4K) - Apache Lucene and Solr open-source search software. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 240 Β· πŸ”€ 2.2K Β· πŸ“¦ 2.8K Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/lucene-solr
    
  • PyPi (πŸ“₯ 51K / month Β· πŸ“¦ 2.3K Β· ⏱️ 03.10.2018):

     pip install pysolr
    
  • NPM (πŸ“₯ 50K / month Β· πŸ“¦ 840 Β· ⏱️ 19.05.2017):

     npm install solr-client
    
  • Dockerhub (πŸ“₯ 60M Β· ⭐ 730 Β· ⏱️ 23.01.2020):

     docker pull solr
    
  • Maven (πŸ“¦ 3K Β· ⏱️ 28.12.2019):

     <dependency>
     	<groupId>org.apache.solr</groupId>
     	<artifactId>solr-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Minio (πŸ₯ˆ34 Β· ⭐ 20K) - MinIO is a high performance object storage server compatible with Amazon S3 APIs. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 220 Β· πŸ”€ 1.9K Β· πŸ“‹ 3.8K - 2% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/minio/minio
    
  • PyPi (πŸ“₯ 120K / month Β· πŸ“¦ 340 Β· ⏱️ 30.12.2019):

     pip install minio
    
  • NPM (πŸ“₯ 72K / month Β· πŸ“¦ 350 Β· ⏱️ 24.12.2019):

     npm install minio
    
  • Conda:

     conda install -c conda-forge minio
    
  • Dockerhub (πŸ“₯ 310M Β· ⭐ 290 Β· ⏱️ 24.01.2020):

     docker pull minio/minio
    
  • Maven (πŸ“¦ 200 Β· ⏱️ 11.09.2019):

     <dependency>
     	<groupId>io.minio</groupId>
     	<artifactId>minio</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Neo4j (πŸ₯ˆ34 Β· ⭐ 8.1K) - Graphs for Everyone. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 260 Β· πŸ”€ 1.7K Β· πŸ“¦ 560 Β· πŸ“‹ 2.7K - 7% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/neo4j/neo4j
    
  • PyPi (πŸ“₯ 47K / month Β· πŸ“¦ 62 Β· ⏱️ 11.11.2019):

     pip install neo4j
    
  • NPM (πŸ“₯ 65K / month Β· πŸ“¦ 980 Β· ⏱️ 16.12.2019):

     npm install neo4j-driver
    
  • Dockerhub (πŸ“₯ 66M Β· ⭐ 750 Β· ⏱️ 23.01.2020):

     docker pull neo4j
    
  • Maven (πŸ“¦ 3K Β· ⏱️ 17.10.2019):

     <dependency>
     	<groupId>org.neo4j</groupId>
     	<artifactId>neo4j</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Arrow (πŸ₯ˆ34 Β· ⭐ 5K) - Apache Arrow is a cross-language development platform for in-memory data. It.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 430 Β· πŸ”€ 1.3K Β· πŸ“¦ 5 Β· πŸ“‹ 510 - 15% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/arrow
    
  • PyPi (πŸ“₯ 4.9M / month Β· πŸ“¦ 800 Β· ⏱️ 01.11.2019):

     pip install pyarrow
    
  • NPM (πŸ“₯ 7.7K / month Β· πŸ“¦ 30 Β· ⏱️ 01.11.2019):

     npm install apache-arrow
    
  • Conda:

     conda install -c conda-forge pyarrow
    
  • Maven (πŸ“¦ 250 Β· ⏱️ 30.09.2019):

     <dependency>
     	<groupId>org.apache.arrow</groupId>
     	<artifactId>arrow-vector</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
etcd (πŸ₯ˆ33 Β· ⭐ 29K) - Distributed reliable key-value store for the most critical data of a distributed.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 620 Β· πŸ”€ 6K Β· πŸ“₯ 22M Β· πŸ“‹ 4.8K - 12% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/etcd-io/etcd
    
  • PyPi (πŸ“₯ 74K / month Β· πŸ“¦ 56 Β· ⏱️ 07.12.2019):

     pip install etcd3
    
  • NPM (πŸ“₯ 8.8K / month Β· πŸ“¦ 34 Β· ⏱️ 03.07.2019):

     npm install etcd3
    
  • Dockerhub (πŸ“₯ 4M Β· ⭐ 25 Β· ⏱️ 24.01.2020):

     docker pull bitnami/etcd
    
  • Maven:

     <dependency>
     	<groupId>io.etcd</groupId>
     	<artifactId>jetcd-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
LevelDB (πŸ₯ˆ33 Β· ⭐ 20K) - LevelDB is a fast key-value storage library written at Google that provides an.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 46 Β· πŸ”€ 4.5K Β· πŸ“‹ 590 - 21% open Β· ⏱️ 15.01.2020):

     git clone https://github.com/google/leveldb
    
  • PyPi (πŸ“₯ 44K / month Β· πŸ“¦ 630 Β· ⏱️ 22.01.2020):

     pip install plyvel
    
  • NPM (πŸ“₯ 940K / month Β· πŸ“¦ 15K Β· ⏱️ 04.10.2019):

     npm install levelup
    
Hazelcast (πŸ₯ˆ33 Β· ⭐ 3.6K) - Open Source In-Memory Data Grid. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 250 Β· πŸ”€ 1.2K Β· πŸ“¦ 9.5K Β· πŸ“‹ 6.1K - 12% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/hazelcast/hazelcast
    
  • PyPi (πŸ“₯ 5.7K / month Β· πŸ“¦ 1 Β· ⏱️ 15.07.2019):

     pip install hazelcast-python-client
    
  • NPM (πŸ“₯ 1.7K / month Β· πŸ“¦ 38 Β· ⏱️ 06.05.2019):

     npm install hazelcast-client
    
  • Dockerhub (πŸ“₯ 6.1M Β· ⭐ 58 Β· ⏱️ 10.01.2020):

     docker pull hazelcast/hazelcast
    
  • Maven (πŸ“¦ 5.5K Β· ⏱️ 23.02.2019):

     <dependency>
     	<groupId>com.hazelcast</groupId>
     	<artifactId>hazelcast</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
HBase (πŸ₯ˆ33 Β· ⭐ 3.3K) - Non-relational distributed database modeled after Google's Bigtable and written.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 470 Β· πŸ”€ 2.2K Β· πŸ“¦ 2.6K Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/hbase
    
  • PyPi (πŸ“₯ 180K / month Β· πŸ“¦ 410 Β· ⏱️ 03.04.2017):

     pip install happybase
    
  • Maven (πŸ“¦ 5.1K Β· ⏱️ 27.10.2018):

     <dependency>
     	<groupId>org.apache.hbase</groupId>
     	<artifactId>hbase-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
RethinkDB (πŸ₯ˆ32 Β· ⭐ 24K) - The open-source database for the realtime web. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 240 Β· πŸ”€ 1.8K Β· πŸ“₯ 2.9K Β· πŸ“¦ 260 Β· πŸ“‹ 6.2K - 23% open Β· ⏱️ 13.01.2020):

     git clone https://github.com/rethinkdb/rethinkdb
    
  • PyPi (πŸ“₯ 21K / month Β· πŸ“¦ 870 Β· ⏱️ 02.11.2019):

     pip install rethinkdb
    
  • NPM (πŸ“₯ 54K / month Β· πŸ“¦ 3.3K Β· ⏱️ 13.12.2019):

     npm install rethinkdb
    
  • Dockerhub (πŸ“₯ 54M Β· ⭐ 530 Β· ⏱️ 16.01.2020):

     docker pull rethinkdb
    
  • Maven (⏱️ 26.07.2016):

     <dependency>
     	<groupId>com.rethinkdb</groupId>
     	<artifactId>rethinkdb-driver</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
OrientDB (πŸ₯ˆ32 Β· ⭐ 4.2K) - OrientDB is the most versatile DBMS supporting Graph, Document, Reactive,.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 180 Β· πŸ”€ 810 Β· πŸ“¦ 350 Β· πŸ“‹ 8.4K - 17% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/orientechnologies/orientdb
    
  • PyPi (πŸ“₯ 3K / month Β· πŸ“¦ 40 Β· ⏱️ 29.04.2017):

     pip install pyorient
    
  • NPM (πŸ“₯ 18K / month Β· πŸ“¦ 200 Β· ⏱️ 11.12.2019):

     npm install orientjs
    
  • Dockerhub (πŸ“₯ 10M Β· ⭐ 130 Β· ⏱️ 09.01.2020):

     docker pull orientdb
    
  • Maven (πŸ“¦ 660 Β· ⏱️ 03.07.2019):

     <dependency>
     	<groupId>com.orientechnologies</groupId>
     	<artifactId>orientdb-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
ClickHouse (πŸ₯ˆ31 Β· ⭐ 9.7K) - ClickHouse is a free analytics DBMS for big data. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 540 Β· πŸ”€ 1.7K Β· πŸ“‹ 3.8K - 31% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/ClickHouse/ClickHouse
    
  • PyPi (πŸ“₯ 290K / month Β· πŸ“¦ 18 Β· ⏱️ 20.09.2019):

     pip install clickhouse-driver
    
  • NPM (πŸ“₯ 4.5K / month Β· πŸ“¦ 12 Β· ⏱️ 04.11.2018):

     npm install clickhouse
    
  • Dockerhub (πŸ“₯ 4.5M Β· ⭐ 210 Β· ⏱️ 23.01.2020):

     docker pull yandex/clickhouse-server
    
ArangoDB (πŸ₯ˆ31 Β· ⭐ 9.3K) - ArangoDB is a native multi-model database with flexible data models for.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 120 Β· πŸ”€ 590 Β· πŸ“‹ 3.6K - 15% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/arangodb/arangodb
    
  • PyPi (πŸ“₯ 5.8K / month Β· πŸ“¦ 10 Β· ⏱️ 30.10.2019):

     pip install pyarango
    
  • NPM (πŸ“₯ 14K / month Β· πŸ“¦ 440 Β· ⏱️ 24.01.2020):

     npm install arangojs
    
  • Dockerhub (πŸ“₯ 17M Β· ⭐ 180 Β· ⏱️ 23.01.2020):

     docker pull arangodb
    
  • Maven (πŸ“¦ 130 Β· ⏱️ 05.09.2019):

     <dependency>
     	<groupId>com.arangodb</groupId>
     	<artifactId>arangodb-java-driver</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Ignite (πŸ₯ˆ31 Β· ⭐ 3K) - Memory-centric distributed database, caching and processing platform designed to.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 330 Β· πŸ”€ 1.4K Β· πŸ“¦ 1 Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/ignite
    
  • PyPi (πŸ“₯ 1.6K / month Β· ⏱️ 23.11.2018):

     pip install pyignite
    
  • NPM (πŸ“₯ 330 / month Β· ⏱️ 10.12.2018):

     npm install apache-ignite-client
    
  • Dockerhub (πŸ“₯ 9.6M Β· ⭐ 58 Β· ⏱️ 19.09.2019):

     docker pull apacheignite/ignite
    
  • Maven (πŸ“¦ 460 Β· ⏱️ 10.07.2018):

     <dependency>
     	<groupId>org.apache.ignite</groupId>
     	<artifactId>ignite-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Cassandra (πŸ₯‰30 Β· ⭐ 6.8K) - Distributed, wide column store, NoSQL database designed to handle large.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 400 Β· πŸ”€ 2.5K Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/cassandra
    
  • PyPi (πŸ“₯ 220K / month Β· πŸ“¦ 1.2K Β· ⏱️ 15.01.2020):

     pip install cassandra-driver
    
  • NPM (πŸ“₯ 130K / month Β· πŸ“¦ 1.3K Β· ⏱️ 06.11.2019):

     npm install cassandra-driver
    
  • Dockerhub (πŸ“₯ 92M Β· ⭐ 1.1K Β· ⏱️ 28.12.2019):

     docker pull cassandra
    
  • Maven:

     <dependency>
     	<groupId>com.datastax.oss</groupId>
     	<artifactId>java-driver-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Graphite (πŸ₯‰30 Β· ⭐ 4.9K) - A highly scalable real-time graphing system. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 410 Β· πŸ”€ 1.2K Β· πŸ“¦ 45 Β· πŸ“‹ 1.1K - 33% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/graphite-project/graphite-web
    
  • Dockerhub (πŸ“₯ 8.2M Β· ⭐ 70 Β· ⏱️ 24.10.2019):

     docker pull graphiteapp/graphite-statsd
    
TiDB (πŸ₯‰28 Β· ⭐ 22K) - TiDB is an open source distributed HTAP database compatible with the MySQL protocol. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 400 Β· πŸ”€ 3.3K Β· πŸ“‹ 3.8K - 33% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/pingcap/tidb
    
  • Dockerhub (πŸ“₯ 520K Β· ⭐ 51 Β· ⏱️ 24.01.2020):

     docker pull pingcap/tidb
    
CrateDB (πŸ₯‰28 Β· ⭐ 2.8K) - CrateDB is a distributed SQL database that makes it simple to store and analyze.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 79 Β· πŸ”€ 350 Β· πŸ“‹ 860 - 10% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/crate/crate
    
  • PyPi (πŸ“₯ 11K / month Β· πŸ“¦ 42 Β· ⏱️ 19.09.2019):

     pip install crate
    
  • Dockerhub (πŸ“₯ 14M Β· ⭐ 140 Β· ⏱️ 23.01.2020):

     docker pull crate
    
  • Maven:

     <dependency>
     	<groupId>io.crate</groupId>
     	<artifactId>crate-jdbc</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Dgraph (πŸ₯‰27 Β· ⭐ 12K) - Fast, Distributed Graph DB. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 130 Β· πŸ”€ 860 Β· πŸ“₯ 110K Β· πŸ“‹ 2K - 11% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/dgraph-io/dgraph
    
  • PyPi (πŸ“₯ 4K / month Β· πŸ“¦ 10 Β· ⏱️ 10.09.2019):

     pip install pydgraph
    
  • NPM (πŸ“₯ 3.6K / month Β· πŸ“¦ 16 Β· ⏱️ 01.10.2019):

     npm install dgraph-js
    
  • Dockerhub (πŸ“₯ 1.8M Β· ⭐ 51 Β· ⏱️ 24.01.2020):

     docker pull dgraph/dgraph
    
  • Maven:

     <dependency>
     	<groupId>io.dgraph</groupId>
     	<artifactId>dgraph4j</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Geode (πŸ₯‰27 Β· ⭐ 1.7K) - Data management platform that provides real-time, consistent access to data-.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 170 Β· πŸ”€ 570 Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/geode
    
  • Dockerhub (πŸ“₯ 59K Β· ⭐ 19 Β· ⏱️ 30.12.2019):

     docker pull apachegeode/geode
    
  • Maven (πŸ“¦ 140 Β· ⏱️ 20.12.2019):

     <dependency>
     	<groupId>org.apache.geode</groupId>
     	<artifactId>geode-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
OmniSciDB (πŸ₯‰26 Β· ⭐ 2.1K) - OmniSciDB (formerly MapD Core). Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 80 Β· πŸ”€ 290 Β· πŸ“‹ 350 - 35% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/omnisci/omniscidb
    
  • PyPi (πŸ“₯ 2.4K / month Β· πŸ“¦ 20 Β· ⏱️ 05.12.2019):

     pip install pymapd
    
  • NPM (πŸ“₯ 120 / month Β· πŸ“¦ 26 Β· ⏱️ 22.08.2019):

     npm install @mapd/connector
    
  • Conda:

     conda install -c conda-forge pymapd
    
Druid (πŸ₯‰25 Β· ⭐ 9.1K) - Apache Druid: a high performance real-time analytics database. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 380 Β· πŸ”€ 2.2K Β· πŸ“₯ 120 Β· πŸ“‹ 3.2K - 26% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/apache/incubator-druid
    
  • Dockerhub (πŸ“₯ 29K Β· ⭐ 9 Β· ⏱️ 10.12.2019):

     docker pull apache/incubator-druid
    
JanusGraph (πŸ₯‰25 Β· ⭐ 3.1K) - JanusGraph: an open-source, distributed graph database. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 140 Β· πŸ”€ 750 Β· πŸ“₯ 110K Β· πŸ“‹ 1.2K - 36% open Β· ⏱️ 15.01.2020):

     git clone https://github.com/JanusGraph/janusgraph
    
  • Dockerhub (πŸ“₯ 29K Β· ⭐ 6 Β· ⏱️ 21.10.2019):

     docker pull janusgraph/janusgraph
    
  • Maven (πŸ“¦ 84 Β· ⏱️ 08.10.2018):

     <dependency>
     	<groupId>org.janusgraph</groupId>
     	<artifactId>janusgraph-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Cayley (πŸ₯‰24 Β· ⭐ 13K) - An open-source graph database. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 100 Β· πŸ”€ 1.2K Β· πŸ“₯ 26K Β· πŸ“‹ 460 - 16% open Β· ⏱️ 18.01.2020):

     git clone https://github.com/cayleygraph/cayley
    
  • PyPi (πŸ“₯ 160 / month Β· ⏱️ 26.10.2019):

     pip install pyley
    
  • NPM (πŸ“₯ 46 / month Β· ⏱️ 06.11.2019):

     npm install @cayleygraph/cayley
    
  • Dockerhub (πŸ“₯ 3.6K Β· ⭐ 5 Β· ⏱️ 18.01.2020):

     docker pull cayleygraph/cayley
    
Riak (πŸ₯‰24 Β· ⭐ 3.4K) - Riak is a decentralized datastore from Basho Technologies. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 120 Β· πŸ”€ 510 Β· πŸ“₯ 63 Β· πŸ“‹ 440 - 31% open Β· ⏱️ 08.01.2020):

     git clone https://github.com/basho/riak
    
  • PyPi (πŸ“₯ 5.6K / month Β· πŸ“¦ 170 Β· ⏱️ 12.12.2016):

     pip install riak
    
  • NPM (πŸ“₯ 780 / month Β· πŸ“¦ 60 Β· ⏱️ 12.12.2016):

     npm install basho-riak-client
    
  • Dockerhub (πŸ“₯ 650K Β· ⭐ 28 Β· ⏱️ 04.04.2017):

     docker pull basho/riak-kv
    
  • Maven (πŸ“¦ 300 Β· ⏱️ 15.12.2016):

     <dependency>
     	<groupId>com.basho.riak</groupId>
     	<artifactId>riak-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Tile38 (πŸ₯‰23 Β· ⭐ 6.6K) - Real-time Geospatial and Geofencing. MIT
  • GitHub (πŸ‘¨β€πŸ’» 36 Β· πŸ”€ 370 Β· πŸ“₯ 39K Β· πŸ“‹ 410 - 21% open Β· ⏱️ 11.12.2019):

     git clone https://github.com/tidwall/tile38
    
  • Dockerhub (πŸ“₯ 680K Β· ⭐ 15 Β· ⏱️ 11.12.2019):

     docker pull tile38/tile38
    
Quilt (πŸ₯‰23 Β· ⭐ 860) - Quilt is a versioned data portal for AWS. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 29 Β· πŸ”€ 51 Β· πŸ“¦ 16 Β· πŸ“‹ 91 - 51% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/quiltdata/quilt
    
  • PyPi (πŸ“₯ 1.1K / month Β· πŸ“¦ 4 Β· ⏱️ 18.01.2020):

     pip install quilt3
    
Sonic (πŸ₯‰22 Β· ⭐ 8K) - Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that.. MPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 22 Β· πŸ”€ 230 Β· πŸ“‹ 170 - 12% open Β· ⏱️ 09.01.2020):

     git clone https://github.com/valeriansaliou/sonic
    
  • PyPi (πŸ“₯ 140 / month Β· ⏱️ 03.08.2019):

     pip install sonic-client
    
  • NPM (πŸ“₯ 830 / month Β· ⏱️ 26.04.2019):

     npm install sonic-channel
    
  • Dockerhub (πŸ“₯ 4.1K Β· ⭐ 4 Β· ⏱️ 14.10.2019):

     docker pull valeriansaliou/sonic
    
  • Maven:

     <dependency>
     	<groupId>com.github.twohou</groupId>
     	<artifactId>java-sonic</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
rqlite (πŸ₯‰21 Β· ⭐ 5.5K) - The lightweight, distributed relational database built on SQLite. MIT
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 300 Β· πŸ“₯ 5.9K Β· πŸ“‹ 270 - 19% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/rqlite/rqlite
    
  • PyPi:

     pip install sqlalchemy_rqlite
    
  • NPM (πŸ“₯ 140 / month Β· ⏱️ 22.07.2019):

     npm install rqlite-js
    
  • Dockerhub (πŸ“₯ 17K Β· ⭐ 8 Β· ⏱️ 11.01.2020):

     docker pull rqlite/rqlite
    
  • Maven:

     <dependency>
     	<groupId>com.rqlite</groupId>
     	<artifactId>rqlite-java</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Doris (πŸ₯‰21 Β· ⭐ 1.5K) - MPP-based interactive SQL data warehousing for reporting and analysis. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 82 Β· πŸ”€ 420 Β· πŸ“‹ 960 - 35% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/apache/incubator-doris
    
  • Dockerhub (πŸ“₯ 1.6K Β· ⭐ 4 Β· ⏱️ 04.01.2020):

     docker pull apachedoris/doris-dev
    
Gaffer (πŸ₯‰20 Β· ⭐ 1.6K) - A large-scale entity and relation database supporting aggregation of properties. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 40 Β· πŸ”€ 320 Β· πŸ“₯ 26 Β· πŸ“‹ 1.1K - 8% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/gchq/Gaffer
    
  • Maven (πŸ“¦ 8 Β· ⏱️ 06.01.2020):

     <dependency>
     	<groupId>uk.gov.gchq.gaffer</groupId>
     	<artifactId>graph</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
EdgeDB (πŸ₯‰19 Β· ⭐ 3.5K) - The next generation relational database. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 79 Β· πŸ“‹ 390 - 25% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/edgedb/edgedb
    
  • PyPi (πŸ“₯ 2.9K / month Β· ⏱️ 11.01.2020):

     pip install edgedb
    
  • Dockerhub (πŸ“₯ 4.4K Β· ⭐ 2 Β· ⏱️ 24.01.2020):

     docker pull edgedb/edgedb
    
Nebula (πŸ₯‰19 Β· ⭐ 2.3K) - A distributed, fast open-source graph database featuring horizontal scalability.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 40 Β· πŸ”€ 300 Β· πŸ“₯ 400 Β· πŸ“‹ 660 - 32% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/vesoft-inc/nebula
    
  • Dockerhub (πŸ“₯ 2.2K Β· ⭐ 3 Β· ⏱️ 24.01.2020):

     docker pull vesoft/nebula-graph
    
TileDB-Inc/TileDB (πŸ₯‰19 Β· ⭐ 570) - The Fastest Array Storage Engine. MIT
  • GitHub (πŸ‘¨β€πŸ’» 27 Β· πŸ”€ 72 Β· πŸ“₯ 380 Β· πŸ“‹ 620 - 16% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/TileDB-Inc/TileDB
    
  • Conda:

     conda install -c conda-forge tiledb
    
  • Dockerhub (πŸ“₯ 520 Β· ⏱️ 09.10.2019):

     docker pull tiledb/tiledb
    
Show 5 hidden projects...
MySQL (πŸ₯ˆ36 Β· ⭐ 24K) - MySQL Server, the world's most popular open source database, and MySQL Cluster, a.. ❗️GPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 570 Β· πŸ”€ 2.1K Β· ⏱️ 09.12.2019):

     git clone https://github.com/mysql/mysql-server
    
  • PyPi (πŸ“₯ 2.1M / month Β· πŸ“¦ 6.6K Β· ⏱️ 21.11.2019):

     pip install mysqlclient
    
  • NPM (πŸ“₯ 2.1M / month Β· πŸ“¦ 100K Β· ⏱️ 23.01.2020):

     npm install mysql
    
  • Conda (⏱️ 18.11.2019):

     conda install -c anaconda mysql-connector-python
    
  • Dockerhub (πŸ“₯ 1.4B Β· ⭐ 9.1K Β· ⏱️ 15.01.2020):

     docker pull mysql
    
  • Maven (πŸ“¦ 300K Β· ⏱️ 27.09.2018):

     <dependency>
     	<groupId>mysql</groupId>
     	<artifactId>mysql-connector-java</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
MariaDB (πŸ₯‰28 Β· ⭐ 6.4K) - MariaDB server is a community developed fork of MySQL server. Started by core.. ❗️GPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 1.6K Β· πŸ”€ 920 Β· ⏱️ 24.01.2020):

     git clone https://github.com/MariaDB/server
    
  • NPM (πŸ“₯ 33K / month Β· πŸ“¦ 580 Β· ⏱️ 19.07.2017):

     npm install mariadb
    
  • Dockerhub (πŸ“₯ 1.1B Β· ⭐ 3.2K Β· ⏱️ 16.01.2020):

     docker pull mariadb
    
TimescaleDB (πŸ₯‰25 Β· ⭐ 8.1K) - An open-source time-series SQL database optimized for fast ingest and.. ❗️Unlicensed
  • GitHub (πŸ‘¨β€πŸ’» 43 Β· πŸ”€ 430 Β· πŸ“₯ 3.5K Β· πŸ“‹ 660 - 28% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/timescale/timescaledb
    
  • Dockerhub (πŸ“₯ 16M Β· ⭐ 65 Β· ⏱️ 15.01.2020):

     docker pull timescale/timescaledb
    
RavenDB (πŸ₯‰25 Β· ⭐ 2.4K) - ACID Document Database. ❗️AGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 320 Β· πŸ”€ 690 Β· πŸ“¦ 2.2K Β· πŸ“‹ 400 - 1% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/ravendb/ravendb
    
  • PyPi (πŸ“₯ 140 / month Β· πŸ“¦ 2 Β· ⏱️ 27.02.2019):

     pip install pyravendb
    
  • NPM (πŸ“₯ 600 / month Β· πŸ“¦ 28 Β· ⏱️ 16.09.2019):

     npm install ravendb
    
  • Dockerhub (πŸ“₯ 340K Β· ⭐ 27 Β· ⏱️ 21.01.2020):

     docker pull ravendb/ravendb
    
  • Maven (⏱️ 07.02.2018):

     <dependency>
     	<groupId>net.ravendb</groupId>
     	<artifactId>ravendb</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Grakn (πŸ₯‰25 Β· ⭐ 1.9K) - Grakn Core: The Knowledge Graph. ❗️AGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 54 Β· πŸ”€ 230 Β· πŸ“₯ 13K Β· πŸ“¦ 18 Β· πŸ“‹ 1.9K - 10% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/graknlabs/grakn
    
  • PyPi (πŸ“₯ 450 / month Β· πŸ“¦ 6 Β· ⏱️ 25.11.2019):

     pip install grakn-client
    
  • NPM (πŸ“₯ 790 / month Β· πŸ“¦ 2 Β· ⏱️ 25.11.2019):

     npm install grakn-client
    
  • Dockerhub (πŸ“₯ 290K Β· ⭐ 10 Β· ⏱️ 17.01.2020):

     docker pull graknlabs/grakn
    
  • Maven:

     <dependency>
     	<groupId>io.grakn.client</groupId>
     	<artifactId>grakn-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    

Database GUIs

Back to top

GUI tools for database administration and data management for a variety of databases.

mongo-express (πŸ₯‡31 Β· ⭐ 4.1K) - Web-based MongoDB admin interface, written with Node.js and express. MIT
  • GitHub (πŸ‘¨β€πŸ’» 110 Β· πŸ”€ 660 Β· πŸ“¦ 590 Β· πŸ“‹ 330 - 24% open Β· ⏱️ 30.12.2019):

     git clone https://github.com/mongo-express/mongo-express
    
  • NPM (πŸ“₯ 6.1K / month Β· πŸ“¦ 390 Β· ⏱️ 24.12.2019):

     npm install mongo-express
    
  • Dockerhub (πŸ“₯ 45M Β· ⭐ 600 Β· ⏱️ 25.12.2019):

     docker pull mongo-express
    
Adminer (πŸ₯‡29 Β· ⭐ 4.2K) - Database management in a single PHP file. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 130 Β· πŸ”€ 750 Β· πŸ“₯ 2.3M Β· πŸ“¦ 98 Β· ⏱️ 20.12.2019):

     git clone https://github.com/vrana/adminer
    
  • Dockerhub (πŸ“₯ 100M Β· ⭐ 330 Β· ⏱️ 24.01.2020):

     docker pull adminer
    
SQLite Browser (πŸ₯ˆ26 Β· ⭐ 12K) - Official home of the DB Browser for SQLite (DB4S) project. Previously.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 120 Β· πŸ”€ 1.5K Β· πŸ“₯ 7.6M Β· πŸ“‹ 1.7K - 23% open Β· ⏱️ 17.01.2020):

     git clone https://github.com/sqlitebrowser/sqlitebrowser
    
DBeaver (πŸ₯ˆ26 Β· ⭐ 12K) - Free universal database tool and SQL client. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 150 Β· πŸ”€ 1K Β· πŸ“₯ 370K Β· πŸ“‹ 7.1K - 19% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/dbeaver/dbeaver
    
pgweb (πŸ₯ˆ24 Β· ⭐ 6.3K) - Cross-platform client for PostgreSQL databases. MIT
  • GitHub (πŸ‘¨β€πŸ’» 45 Β· πŸ”€ 420 Β· πŸ“₯ 72K Β· πŸ“‹ 260 - 16% open Β· ⏱️ 16.12.2019):

     git clone https://github.com/sosedoff/pgweb
    
  • Dockerhub (πŸ“₯ 4.3M Β· ⭐ 23 Β· ⏱️ 16.12.2019):

     docker pull sosedoff/pgweb
    
Sequel Pro (πŸ₯ˆ23 Β· ⭐ 7.4K) - MySQL/MariaDB database management for macOS. MIT
  • GitHub (πŸ‘¨β€πŸ’» 41 Β· πŸ”€ 640 Β· πŸ“₯ 4M Β· πŸ“‹ 3.5K - 29% open Β· ⏱️ 30.09.2019):

     git clone https://github.com/sequelpro/sequelpro
    
Dejavu (πŸ₯ˆ22 Β· ⭐ 6.5K) - The Missing Web UI for Elasticsearch: Import, browse and edit data with rich filters.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 24 Β· πŸ”€ 410 Β· πŸ“‹ 290 - 11% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/appbaseio/dejavu
    
  • Dockerhub (πŸ“₯ 1.3M Β· ⭐ 24 Β· ⏱️ 19.12.2019):

     docker pull appbaseio/dejavu
    
Redis Desktop Manager (πŸ₯ˆ21 Β· ⭐ 15K) - Cross-platform GUI management tool for Redis. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 70 Β· πŸ”€ 2.5K Β· πŸ“₯ 1.5M Β· πŸ“‹ 4.3K - 0% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/uglide/RedisDesktopManager
    
Kafka Manager (πŸ₯ˆ21 Β· ⭐ 8.6K) - CMAK is a tool for managing Apache Kafka clusters. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 85 Β· πŸ”€ 2K Β· πŸ“‹ 530 - 70% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/yahoo/kafka-manager
    
  • Dockerhub (πŸ“₯ 350K Β· ⭐ 41 Β· ⏱️ 12.04.2019):

     docker pull kafkamanager/kafka-manager
    
elasticsearch-head (πŸ₯ˆ21 Β· ⭐ 6.8K) - A web front end for an elastic search cluster. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 68 Β· πŸ”€ 1.4K Β· πŸ“‹ 320 - 49% open Β· ⏱️ 24.09.2019):

     git clone https://github.com/mobz/elasticsearch-head
    
  • Dockerhub (πŸ“₯ 1.5M Β· ⭐ 56 Β· ⏱️ 31.01.2017):

     docker pull mobz/elasticsearch-head
    
OmniDB (πŸ₯ˆ21 Β· ⭐ 1.9K) - Web tool for database management. MIT
  • GitHub (πŸ‘¨β€πŸ’» 22 Β· πŸ”€ 240 Β· πŸ“₯ 6.6K Β· πŸ“‹ 500 - 37% open Β· ⏱️ 05.12.2019):

     git clone https://github.com/OmniDB/OmniDB
    
ElectroCRUD (πŸ₯‰20 Β· ⭐ 980) - Database CRUD Application Built on Electron | MySQL, Postgres. MIT
  • GitHub (πŸ‘¨β€πŸ’» 44 Β· πŸ”€ 220 Β· πŸ“₯ 27K Β· πŸ“‹ 46 - 17% open Β· ⏱️ 17.12.2019):

     git clone https://github.com/garrylachman/ElectroCRUD
    
Robo 3T (πŸ₯‰19 Β· ⭐ 7.8K Β· πŸ’€) - Native cross-platform MongoDB management tool. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 39 Β· πŸ”€ 660 Β· πŸ“₯ 41K Β· πŸ“‹ 1.5K - 42% open Β· ⏱️ 12.04.2019):

     git clone https://github.com/Studio3T/robomongo
    
Mongoku (πŸ₯‰19 Β· ⭐ 810) - The Web-scale GUI for MongoDB. MIT
  • GitHub (πŸ‘¨β€πŸ’» 7 Β· πŸ”€ 40 Β· πŸ“¦ 3 Β· πŸ“‹ 29 - 41% open Β· ⏱️ 10.01.2020):

     git clone https://github.com/huggingface/Mongoku
    
  • NPM (πŸ“₯ 260 / month Β· ⏱️ 31.07.2019):

     npm install mongoku
    
  • Dockerhub (πŸ“₯ 210K Β· ⭐ 1 Β· ⏱️ 31.07.2019):

     docker pull huggingface/mongoku
    
Mirage (πŸ₯‰18 Β· ⭐ 2K) - GUI for simplifying Elasticsearch Query DSL. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 9 Β· πŸ”€ 100 Β· πŸ“‹ 64 - 20% open Β· ⏱️ 11.10.2019):

     git clone https://github.com/appbaseio/mirage
    
  • Dockerhub (πŸ“₯ 320K Β· ⭐ 6 Β· ⏱️ 11.10.2019):

     docker pull appbaseio/mirage
    
Sequeler (πŸ₯‰16 Β· ⭐ 510) - SQL Client built in Vala. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 34 Β· πŸ”€ 48 Β· πŸ“‹ 220 - 18% open Β· ⏱️ 17.11.2019):

     git clone https://github.com/Alecaddd/sequeler
    
FastoNoSQL (πŸ₯‰15 Β· ⭐ 620) - FastoNoSQL is a crossplatform Redis, Memcached, SSDB, LevelDB, RocksDB,.. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 3 Β· πŸ”€ 53 Β· πŸ“‹ 69 - 15% open Β· ⏱️ 14.01.2020):

     git clone https://github.com/fastogt/fastonosql
    
Franchise (πŸ₯‰14 Β· ⭐ 3.5K) - a notebook sql client. what you get when have a lot of sequels. MIT
  • GitHub (πŸ‘¨β€πŸ’» 15 Β· πŸ”€ 230 Β· πŸ“‹ 56 - 53% open Β· ⏱️ 13.11.2019):

     git clone https://github.com/HVF/franchise
    
  • Dockerhub (πŸ“₯ 390 Β· ⭐ 2 Β· ⏱️ 27.02.2019):

     docker pull binakot/franchise
    
Show 7 hidden projects...
Nosqlclient (πŸ₯‡27 Β· ⭐ 3.1K) - Cross-platform and self hosted, easy to use, intuitive mongodb management.. ❗️AGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 28 Β· πŸ”€ 320 Β· πŸ“₯ 170K Β· πŸ“¦ 190 Β· πŸ“‹ 390 - 5% open Β· ⏱️ 09.01.2020):

     git clone https://github.com/nosqlclient/nosqlclient
    
  • Dockerhub (πŸ“₯ 8.9M Β· ⭐ 79 Β· ⏱️ 09.01.2020):

     docker pull mongoclient/mongoclient
    
phpMyAdmin (πŸ₯ˆ23 Β· ⭐ 4.7K) - A web interface for MySQL and MariaDB. ❗️GPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 1.7K Β· πŸ”€ 2.6K Β· πŸ“‹ 12K - 4% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/phpmyadmin/phpmyadmin
    
pgAdmin (πŸ₯ˆ21 Β· ⭐ 840) - Web-based administration tool for the PostgreSQL database. ❗️PostgreSQL
  • GitHub (πŸ‘¨β€πŸ’» 100 Β· πŸ”€ 160 Β· ⏱️ 24.01.2020):

     git clone https://github.com/postgres/pgadmin4
    
  • Dockerhub (πŸ“₯ 29M Β· ⭐ 470 Β· ⏱️ 24.01.2020):

     docker pull dpage/pgadmin4
    
Sqlectron (πŸ₯‰20 Β· ⭐ 2.9K Β· πŸ’€) - UNMAINTAINED - SEE BELOW. A simple and lightweight SQL client desktop with.. MIT
  • GitHub (πŸ‘¨β€πŸ’» 29 Β· πŸ”€ 330 Β· πŸ“₯ 200K Β· πŸ“‹ 330 - 30% open Β· ⏱️ 31.10.2018):

     git clone https://github.com/sqlectron/sqlectron-gui
    
Filestash (πŸ₯‰18 Β· ⭐ 2.3K) - A modern web client for SFTP, S3, FTP, WebDAV, Git, Minio, LDAP, CalDAV,.. ❗️AGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 120 Β· πŸ“₯ 210 Β· πŸ“‹ 190 - 13% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/mickael-kerjean/filestash
    
  • Dockerhub (πŸ“₯ 540K Β· ⭐ 9 Β· ⏱️ 21.01.2020):

     docker pull machines/filestash
    
HeidiSQL (πŸ₯‰18 Β· ⭐ 1.7K) - A lightweight client for managing MariaDB, MySQL, SQL Server and PostgreSQL,.. ❗️GPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 25 Β· πŸ”€ 180 Β· πŸ“‹ 830 - 36% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/HeidiSQL/HeidiSQL
    
MySQL Workbench (πŸ₯‰10 Β· ⭐ 430) - MySQL Workbench is a unified visual tool for database architects,.. ❗️GPL-2.0
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 140 Β· ⏱️ 15.12.2019):

     git clone https://github.com/mysql/mysql-workbench
    

Others

Back to top

Netdata (πŸ₯‡29 Β· ⭐ 44K) - Real-time performance monitoring, done right! https://my-netdata.io/. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 360 Β· πŸ”€ 4K Β· πŸ“₯ 500K Β· πŸ“‹ 4.6K - 14% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/netdata/netdata
    
  • Dockerhub (πŸ“₯ 99M Β· ⭐ 130 Β· ⏱️ 17.01.2020):

     docker pull netdata/netdata
    
Apollo (πŸ₯‡29 Β· ⭐ 16K) - An open autonomous driving platform. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 300 Β· πŸ”€ 5.5K Β· πŸ“₯ 36K Β· πŸ“‹ 2.1K - 25% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/ApolloAuto/apollo
    
  • Dockerhub (πŸ“₯ 1.8M Β· ⭐ 35 Β· ⏱️ 23.01.2020):

     docker pull apolloauto/apollo
    
Glances (πŸ₯‡28 Β· ⭐ 15K) - Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac.. ❗️LGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 120 Β· πŸ”€ 990 Β· πŸ“₯ 350 Β· πŸ“¦ 160 Β· πŸ“‹ 1.1K - 10% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/nicolargo/glances
    
  • PyPi (πŸ“₯ 41K / month Β· πŸ“¦ 44 Β· ⏱️ 27.08.2019):

     pip install glances
    
  • Dockerhub (πŸ“₯ 32M Β· ⭐ 51 Β· ⏱️ 29.01.2019):

     docker pull nicolargo/glances
    
ungit (πŸ₯ˆ27 Β· ⭐ 8.7K) - The easiest way to use git. On any platform. Anywhere. MIT
  • GitHub (πŸ‘¨β€πŸ’» 90 Β· πŸ”€ 570 Β· πŸ“₯ 1.2K Β· πŸ“¦ 49 Β· πŸ“‹ 730 - 25% open Β· ⏱️ 18.01.2020):

     git clone https://github.com/FredrikNoren/ungit
    
  • NPM (πŸ“₯ 4.6K / month Β· πŸ“¦ 36 Β· ⏱️ 22.11.2019):

     npm install ungit
    
File Browser (πŸ₯ˆ25 Β· ⭐ 6.4K) - Web File Browser which can be used as a middleware or standalone app. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 77 Β· πŸ”€ 820 Β· πŸ“₯ 40K Β· πŸ“‹ 790 - 16% open Β· ⏱️ 09.01.2020):

     git clone https://github.com/filebrowser/filebrowser
    
  • Dockerhub (πŸ“₯ 12M Β· ⭐ 67 Β· ⏱️ 09.01.2020):

     docker pull filebrowser/filebrowser
    
guess (πŸ₯ˆ25 Β· ⭐ 6.1K) - Libraries & tools for enabling Machine Learning driven user-experiences on the web. MIT
  • GitHub (πŸ‘¨β€πŸ’» 18 Β· πŸ”€ 150 Β· πŸ“‹ 63 - 11% open Β· ⏱️ 23.01.2020):

     git clone https://github.com/guess-js/guess
    
  • NPM (πŸ“₯ 12K / month Β· πŸ“¦ 78 Β· ⏱️ 23.01.2020):

     npm install guess-webpack
    
Portia (πŸ₯ˆ24 Β· ⭐ 7.5K) - Visual scraping for Scrapy. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 50 Β· πŸ”€ 1.2K Β· πŸ“₯ 110 Β· πŸ“¦ 9 Β· πŸ“‹ 430 - 22% open Β· ⏱️ 10.07.2019):

     git clone https://github.com/scrapinghub/portia
    
  • PyPi (πŸ“₯ 200 / month Β· πŸ“¦ 7 Β· ⏱️ 28.06.2017):

     pip install slybot
    
  • Dockerhub (πŸ“₯ 340K Β· ⭐ 26 Β· ⏱️ 10.07.2019):

     docker pull scrapinghub/portia
    
Kylin (πŸ₯ˆ24 Β· ⭐ 2.5K) - Distributed Analytics Engine that provides SQL interface and multi-dimensional.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 230 Β· πŸ”€ 1.1K Β· πŸ“¦ 1 Β· ⏱️ 23.01.2020):

     git clone https://github.com/apache/kylin
    
  • PyPi (πŸ“₯ 690 / month Β· ⏱️ 07.04.2019):

     pip install kylinpy
    
  • Dockerhub (πŸ“₯ 2.2K Β· ⭐ 2 Β· ⏱️ 28.08.2019):

     docker pull apachekylin/apache-kylin-standalone
    
OpenRefine (πŸ₯‰23 Β· ⭐ 6.8K) - OpenRefine is a free, open source power tool for working with messy data and.. BSD-3
  • GitHub (πŸ‘¨β€πŸ’» 140 Β· πŸ”€ 1.2K Β· πŸ“₯ 910K Β· πŸ“‹ 1.7K - 25% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/OpenRefine/OpenRefine
    
  • Dockerhub (πŸ“₯ 95K Β· ⭐ 8 Β· ⏱️ 23.01.2020):

     docker pull vimagick/openrefine
    
Pravega (πŸ₯‰22 Β· ⭐ 900) - Pravega - Streaming as a new software defined storage primitive. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 57 Β· πŸ”€ 210 Β· πŸ“₯ 4.6K Β· πŸ“‹ 2.6K - 12% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/pravega/pravega
    
  • Dockerhub (πŸ“₯ 200K Β· ⭐ 1 Β· ⏱️ 06.01.2020):

     docker pull pravega/pravega
    
Feast (πŸ₯‰21 Β· ⭐ 620) - Feature Store for Machine Learning. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 23 Β· πŸ”€ 96 Β· πŸ“₯ 340 Β· πŸ“¦ 4 Β· πŸ“‹ 180 - 20% open Β· ⏱️ 21.01.2020):

     git clone https://github.com/gojek/feast
    
  • PyPi (πŸ“₯ 2K / month Β· ⏱️ 08.01.2020):

     pip install feast
    
TensorFlow for R (πŸ₯‰20 Β· ⭐ 1.2K) - TensorFlow for R. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 21 Β· πŸ”€ 320 Β· πŸ“₯ 8 Β· πŸ“‹ 260 - 5% open Β· ⏱️ 09.01.2020):

     git clone https://github.com/rstudio/tensorflow
    
Aleph (πŸ₯‰20 Β· ⭐ 960) - Search and browse documents and data; find the people and companies you look for. MIT
  • GitHub (πŸ‘¨β€πŸ’» 39 Β· πŸ”€ 140 Β· πŸ“‹ 470 - 7% open Β· ⏱️ 19.01.2020):

     git clone https://github.com/alephdata/aleph
    
  • Dockerhub (πŸ“₯ 220K Β· ⭐ 1 Β· ⏱️ 23.01.2020):

     docker pull alephdata/aleph
    
Katib (πŸ₯‰20 Β· ⭐ 620) - Repository for hyperparameter tuning. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 45 Β· πŸ”€ 160 Β· πŸ“₯ 95 Β· πŸ“‹ 410 - 17% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/kubeflow/katib
    
  • Dockerhub (πŸ“₯ 660K Β· ⏱️ 08.05.2019):

     docker pull katib/metrics-collector
    
Handout (πŸ₯‰18 Β· ⭐ 1.8K) - Turn Python scripts into handouts with Markdown and figures. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 2 Β· πŸ”€ 92 Β· πŸ“¦ 7 Β· πŸ“‹ 39 - 33% open Β· ⏱️ 08.11.2019):

     git clone https://github.com/danijar/handout
    
  • PyPi (πŸ“₯ 220 / month Β· πŸ“¦ 1 Β· ⏱️ 08.11.2019):

     pip install handout
    
Shiny (πŸ₯‰17 Β· ⭐ 3.7K) - Easy interactive web applications with R. ❗️GPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 53 Β· πŸ”€ 1.5K Β· πŸ“‹ 1.9K - 24% open Β· ⏱️ 16.01.2020):

     git clone https://github.com/rstudio/shiny
    
SQLFlow (πŸ₯‰17 Β· ⭐ 3.5K) - Brings SQL and AI together. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 30 Β· πŸ”€ 540 Β· πŸ“‹ 660 - 26% open Β· ⏱️ 22.01.2020):

     git clone https://github.com/sql-machine-learning/sqlflow
    
  • Dockerhub (πŸ“₯ 16K Β· ⭐ 1 Β· ⏱️ 24.01.2020):

     docker pull sqlflow/sqlflow
    
MediaPipe (πŸ₯‰15 Β· ⭐ 4.3K) - MediaPipe is a cross-platform framework for building multimodal applied.. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 2 Β· πŸ”€ 760 Β· πŸ“‹ 400 - 16% open Β· ⏱️ 18.01.2020):

     git clone https://github.com/google/mediapipe
    
EuclidesDB (πŸ₯‰14 Β· ⭐ 570) - A multi-model machine learning feature embedding database. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 2 Β· πŸ”€ 26 Β· πŸ“₯ 220 Β· πŸ“¦ 1 Β· πŸ“‹ 22 - 50% open Β· ⏱️ 15.09.2019):

     git clone https://github.com/perone/euclidesdb
    
  • PyPi (πŸ“₯ 60 / month Β· ⏱️ 12.02.2019):

     pip install euclides
    
  • Dockerhub (πŸ“₯ 220 Β· ⭐ 1 Β· ⏱️ 12.02.2019):

     docker pull euclidesdb/euclidesdb
    
Show 2 hidden projects...
Botpress (πŸ₯‡28 Β· ⭐ 8.3K) - The Conversational Platform with built-in language understanding (NLU),.. ❗️AGPL-3.0
  • GitHub (πŸ‘¨β€πŸ’» 120 Β· πŸ”€ 920 Β· πŸ“¦ 340 Β· πŸ“‹ 940 - 12% open Β· ⏱️ 24.01.2020):

     git clone https://github.com/botpress/botpress
    
  • NPM (πŸ“₯ 1.2K / month Β· πŸ“¦ 170 Β· ⏱️ 19.01.2019):

     npm install botpress
    
MLDB (πŸ₯‰14 Β· ⭐ 560 Β· πŸ’€) - MLDB is the Machine Learning Database. Apache-2
  • GitHub (πŸ‘¨β€πŸ’» 17 Β· πŸ”€ 81 Β· πŸ“‹ 27 - 88% open Β· ⏱️ 09.10.2018):

     git clone https://github.com/mldbai/mldb
    

About

πŸ† A ranked list of awesome machine learning tools. Updated weekly.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published