Skip to content

Latest commit

 

History

History
4428 lines (3456 loc) · 189 KB

README.md

File metadata and controls

4428 lines (3456 loc) · 189 KB

Contents

Explanation

  • 🥇🥈🥉 Combined project-quality score
  • ⭐️ Star count from Github
  • 🐣 New project (less than 6 month old)
  • 💤 Inactive project (6 month no activity)
  • 💀 Dead project (12 month no activity)
  • ❗️ Warning (e.g. missing/risky license)
  • 👨‍💻 Contributors count from Github
  • 🔀 Fork count from Github
  • 📋 Issue count from Github
  • ⏱️ Last update timestamp on package manager
  • 📥 Download count from package manager
  • 📦 Number of dependent projects

IDEs & Notebook Editors

Back to top

Development environments and notebook editors suitable for machine learning & data science projects.

Visual Studio Code (🥇37 · ⭐ 91K) - Visual Studio Code. MIT
  • GitHub (👨‍💻 1.3K · 🔀 14K · 📦 540 · 📋 82K - 5% open · ⏱️ 24.01.2020):

     git clone https://github.com/microsoft/vscode
    
  • NPM (📥 25K / month · 📦 470 · ⏱️ 14.01.2020):

     npm install monaco-editor-core
    
JupyterLab (🥇35 · ⭐ 9.3K) - JupyterLab computational environment. BSD-3
  • GitHub (👨‍💻 310 · 🔀 1.5K · 📦 13K · 📋 4.4K - 30% open · ⏱️ 24.01.2020):

     git clone https://github.com/jupyterlab/jupyterlab
    
  • PyPi (📥 320K / month · 📦 2.4K · ⏱️ 24.01.2020):

     pip install jupyterlab
    
Jupyter (🥇35 · ⭐ 7.4K) - Jupyter Interactive Notebook. BSD-3
  • GitHub (👨‍💻 470 · 🔀 2.7K · 📦 47K · 📋 3.5K - 47% open · ⏱️ 23.01.2020):

     git clone https://github.com/jupyter/notebook
    
  • PyPi (📥 2.6M / month · 📦 13K · ⏱️ 12.08.2015):

     pip install jupyter
    
  • Dockerhub (📥 7.2M · ⭐ 590 · ⏱️ 02.12.2019):

     docker pull jupyter/datascience-notebook
    
Spyder (🥈32 · ⭐ 5.1K) - Official repository for Spyder - The Scientific Python Development Environment. MIT
  • GitHub (👨‍💻 190 · 🔀 1K · 📦 9K · 📋 9.3K - 8% open · ⏱️ 23.01.2020):

     git clone https://github.com/spyder-ide/spyder
    
  • PyPi (📥 38K / month · 📦 1.8K · ⏱️ 02.01.2020):

     pip install spyder
    
  • Conda (⏱️ 07.01.2020):

     conda install -c anaconda spyder
    
Eclipse Che (🥈31 · ⭐ 6K) - Eclipse Che: Next-generation Eclipse IDE. Open source workspace server and.. EPL-2.0
  • GitHub (👨‍💻 170 · 🔀 1.1K · 📦 72 · 📋 9.4K - 10% open · ⏱️ 24.01.2020):

     git clone https://github.com/eclipse/che
    
  • Dockerhub (📥 4M · ⭐ 140 · ⏱️ 14.07.2019):

     docker pull eclipse/che
    
  • Maven (📦 3 · ⏱️ 05.07.2019):

     <dependency>
     	<groupId>org.eclipse.che</groupId>
     	<artifactId>bootstrapper</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Atom (🥈30 · ⭐ 51K) - The hackable text editor. MIT
  • GitHub (👨‍💻 540 · 🔀 14K · 📥 5.1M · 📋 16K - 3% open · ⏱️ 24.01.2020):

     git clone https://github.com/atom/atom
    
Theia (🥈29 · ⭐ 6.8K) - Eclipse Theia is a cloud & desktop IDE framework implemented in TypeScript. EPL-2.0
  • GitHub (👨‍💻 140 · 🔀 870 · 📦 200 · 📋 4K - 29% open · ⏱️ 24.01.2020):

     git clone https://github.com/eclipse-theia/theia
    
  • NPM (📥 58K / month · 📦 110 · ⏱️ 19.01.2020):

     npm install @theia/application-manager
    
  • Dockerhub (📥 1.2M · ⭐ 37 · ⏱️ 24.01.2020):

     docker pull theiaide/theia
    
code-server (🥈26 · ⭐ 27K) - Run VS Code on a remote server. MIT
  • GitHub (👨‍💻 69 · 🔀 1.8K · 📥 190K · 📋 1K - 19% open · ⏱️ 17.01.2020):

     git clone https://github.com/cdr/code-server
    
  • Dockerhub (📥 8.1M · ⭐ 140 · ⏱️ 17.01.2020):

     docker pull codercom/code-server
    
nteract (🥈26 · ⭐ 4.7K) - The interactive computing suite for you!. BSD-3
  • GitHub (👨‍💻 140 · 🔀 470 · 📥 740K · 📋 1.4K - 11% open · ⏱️ 24.01.2020):

     git clone https://github.com/nteract/nteract
    
  • PyPi (📥 2.5K / month · 📦 5 · ⏱️ 16.07.2019):

     pip install nteract_on_jupyter
    
Jupyter Docker Stacks (🥉24 · ⭐ 4.9K) - Ready-to-run Docker images containing Jupyter applications. BSD-3
  • GitHub (👨‍💻 160 · 🔀 1.8K · 📋 500 - 11% open · ⏱️ 23.01.2020):

     git clone https://github.com/jupyter/docker-stacks
    
  • Dockerhub (📥 3.6M · ⭐ 210 · ⏱️ 02.12.2019):

     docker pull jupyter/scipy-notebook
    
DIGITS (🥉24 · ⭐ 4K) - Deep Learning GPU Training System. BSD-3
  • GitHub (👨‍💻 48 · 🔀 1.4K · 📋 1.5K - 40% open · ⏱️ 05.11.2019):

     git clone https://github.com/NVIDIA/DIGITS
    
  • Dockerhub (📥 740K · ⭐ 65 · ⏱️ 01.05.2018):

     docker pull nvidia/digits
    
Zeppelin (🥉23 · ⭐ 4.7K) - Web-based notebook that enables interactive data analytics. Apache-2
  • GitHub (👨‍💻 390 · 🔀 2.2K · ⏱️ 21.01.2020):

     git clone https://github.com/apache/zeppelin
    
  • Dockerhub (📥 950K · ⭐ 110 · ⏱️ 07.10.2019):

     docker pull apache/zeppelin
    
  • Maven:

     <dependency>
     	<groupId>org.apache.zeppelin</groupId>
     	<artifactId>zeppelin-server</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
polynote (🥉23 · ⭐ 3.5K) - A better notebook for Scala (and more). Apache-2
  • GitHub (👨‍💻 22 · 🔀 280 · 📥 15K · 📋 350 - 36% open · ⏱️ 22.01.2020):

     git clone https://github.com/polynote/polynote
    
  • Dockerhub (📥 1.7K · ⭐ 3 · ⏱️ 14.01.2020):

     docker pull polynote/polynote
    
Pyodide (🥉22 · ⭐ 3.4K) - The Python scientific stack, compiled to WebAssembly. MPL-2.0
  • GitHub (👨‍💻 33 · 🔀 210 · 📥 8.3K · 📋 310 - 50% open · ⏱️ 02.01.2020):

     git clone https://github.com/iodide-project/pyodide
    
  • Dockerhub (📥 32K · ⭐ 3 · ⏱️ 05.11.2018):

     docker pull iodide/pyodide-env
    
Hydrogen (🥉20 · ⭐ 3.5K) - Run code interactively, inspect data, and plot. All the power of Jupyter kernels,.. MIT
  • GitHub (👨‍💻 74 · 🔀 260 · 📋 1.1K - 8% open · ⏱️ 14.12.2019):

     git clone https://github.com/nteract/hydrogen
    
Deepo (🥉19 · ⭐ 5.4K) - Set up deep learning environment in a single command line. MIT
  • GitHub (👨‍💻 10 · 🔀 640 · 📋 100 - 11% open · ⏱️ 06.01.2020):

     git clone https://github.com/ufoym/deepo
    
  • Dockerhub (📥 150K · ⭐ 150 · ⏱️ 06.01.2020):

     docker pull ufoym/deepo
    
Spark Notebook (🥉18 · ⭐ 2.9K · 💤) - Interactive and Reactive Data Science using Scala and Spark. Apache-2
  • GitHub (👨‍💻 83 · 🔀 600 · 📋 510 - 40% open · ⏱️ 11.03.2019):

     git clone https://github.com/spark-notebook/spark-notebook
    
DataLab (🥉18 · ⭐ 900 · 💤) - Interactive tools and developer experiences for Big Data on Google Cloud.. Apache-2
  • GitHub (👨‍💻 50 · 🔀 240 · 📋 870 - 24% open · ⏱️ 05.06.2019):

     git clone https://github.com/googledatalab/datalab
    
ML Workspace (🥉16 · ⭐ 890) - All-in-one web-based IDE specialized for machine learning and data science. Apache-2
  • GitHub (👨‍💻 6 · 🔀 92 · 📋 20 - 55% open · ⏱️ 24.01.2020):

     git clone https://github.com/ml-tooling/ml-workspace
    
  • Dockerhub (📥 48K · ⭐ 7 · ⏱️ 04.10.2019):

     docker pull mltooling/ml-workspace
    
h2o-flow (🥉16 · ⭐ 100) - Web based interactive computing environment for H2O. MIT
  • GitHub (👨‍💻 36 · 🔀 56 · ⏱️ 18.11.2019):

     git clone https://github.com/h2oai/h2o-flow
    
Judge0 IDE (🥉13 · ⭐ 140) - Free and open-source online code editor that allows you to write and execute code.. MIT
  • GitHub (👨‍💻 2 · 🔀 28 · 📋 32 - 18% open · ⏱️ 05.01.2020):

     git clone https://github.com/judge0/ide
    
Show 1 hidden projects...
RStudio (🥉17 · ⭐ 3K) - RStudio is an integrated development environment (IDE) for R. ❗️AGPL-3.0
  • GitHub (👨‍💻 93 · 🔀 740 · 📋 3.4K - 39% open · ⏱️ 24.01.2020):

     git clone https://github.com/rstudio/rstudio
    

Machine Learning Platforms

Back to top

Platforms that enable large-scale and distributed machine learning.

H2O (🥇30 · ⭐ 4.6K) - Open Source Fast Scalable Machine Learning Platform For Smarter Applications: Deep.. Apache-2
  • GitHub (👨‍💻 190 · 🔀 1.7K · 📦 250 · ⏱️ 24.01.2020):

     git clone https://github.com/h2oai/h2o-3
    
  • PyPi (📥 96K / month · 📦 59 · ⏱️ 20.01.2020):

     pip install h2o
    
Kubeflow (🥇28 · ⭐ 8.3K) - Machine Learning Toolkit for Kubernetes. Apache-2
  • GitHub (👨‍💻 190 · 🔀 1.3K · 📥 57K · 📋 2.4K - 11% open · ⏱️ 23.01.2020):

     git clone https://github.com/kubeflow/kubeflow
    
  • PyPi (📥 17K / month · 📦 4 · ⏱️ 20.01.2020):

     pip install kfp
    
PredictionIO (🥈27 · ⭐ 12K) - PredictionIO, a machine learning server for developers and ML engineers. Apache-2
  • GitHub (👨‍💻 160 · 🔀 2K · 📥 3.8K · ⏱️ 12.12.2019):

     git clone https://github.com/apache/predictionio
    
  • PyPi (📥 1.4K / month · 📦 15 · ⏱️ 24.10.2017):

     pip install predictionio
    
  • Dockerhub (📥 1.5K · ⭐ 3 · ⏱️ 19.11.2018):

     docker pull predictionio/pio
    
Pachyderm (🥈26 · ⭐ 4.2K) - Reproducible Data Science at Scale!. Apache-2
  • GitHub (👨‍💻 110 · 🔀 400 · 📥 36K · 📋 2.3K - 22% open · ⏱️ 24.01.2020):

     git clone https://github.com/pachyderm/pachyderm
    
  • PyPi (📥 1K / month · 📦 3 · ⏱️ 22.01.2020):

     pip install python-pachyderm
    
  • Dockerhub (📥 1.6M · ⭐ 2 · ⏱️ 23.01.2020):

     docker pull pachyderm/pachd
    
Polyaxon (🥈25 · ⭐ 2.3K) - A platform for reproducible and scalable machine learning and deep learning on.. Apache-2
  • GitHub (👨‍💻 74 · 🔀 210 · 📋 500 - 26% open · ⏱️ 16.01.2020):

     git clone https://github.com/polyaxon/polyaxon
    
  • PyPi (📥 2K / month · 📦 1 · ⏱️ 13.01.2020):

     pip install polyaxon-cli
    
  • Dockerhub (📥 1.5M · ⏱️ 13.01.2020):

     docker pull polyaxon/polyaxon-api
    
Mahout (🥉22 · ⭐ 1.8K) - Powerful, scalable machine-learning library that runs on top of Hadoop MapReduce. Apache-2
  • GitHub (👨‍💻 58 · 🔀 910 · 📦 55 · ⏱️ 24.01.2020):

     git clone https://github.com/apache/mahout
    
  • Maven (📦 7 · ⏱️ 15.04.2017):

     <dependency>
     	<groupId>org.apache.mahout</groupId>
     	<artifactId>mahout-math-scala_2.10</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
SystemML (🥉22 · ⭐ 800) - A machine learning platform optimal for big data running on Apache Spark,. Apache-2
  • GitHub (👨‍💻 86 · 🔀 290 · 📦 3 · ⏱️ 16.01.2020):

     git clone https://github.com/apache/systemml
    
  • PyPi (📥 3.4K / month · ⏱️ 27.08.2018):

     pip install systemml
    
  • Maven:

     <dependency>
     	<groupId>org.apache.systemml</groupId>
     	<artifactId>systemml</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Oryx 2 (🥉21 · ⭐ 1.7K) - Oryx 2: Lambda architecture on Apache Spark, Apache Kafka for real-time large.. Apache-2
  • GitHub (👨‍💻 17 · 🔀 390 · 📥 14K · 📋 200 - 1% open · ⏱️ 24.11.2019):

     git clone https://github.com/OryxProject/oryx
    
  • Maven (📦 32 · ⏱️ 06.10.2018):

     <dependency>
     	<groupId>com.cloudera.oryx</groupId>
     	<artifactId>oryx-api</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Flyte (🥉19 · ⭐ 640 · 🐣) - develop, execute, and monitor distributed workflows reliably at scale. Apache-2
  • GitHub (👨‍💻 15 · 🔀 43 · 📋 130 - 87% open · ⏱️ 23.01.2020):

     git clone https://github.com/lyft/flyte
    
  • PyPi (📥 33K / month · 📦 4 · ⏱️ 30.12.2019):

     pip install flytekit
    
  • Dockerhub (📥 7.2K · ⏱️ 09.12.2019):

     docker pull lyft/flyteadmin
    
Singa (🥉17 · ⭐ 1.9K) - Flexible architecture for scalable distributed training, it is extensible to run.. Apache-2
  • GitHub (👨‍💻 62 · 🔀 480 · ⏱️ 18.01.2020):

     git clone https://github.com/apache/singa
    
  • Conda:

     conda install -c conda-forge singa-cpu
    
  • Dockerhub (📥 120 · ⭐ 2 · ⏱️ 04.06.2019):

     docker pull apache/singa
    
FfDL (🥉17 · ⭐ 580 · 💤) - Fabric for Deep Learning (FfDL, pronounced fiddle) is a Deep Learning Platform.. Apache-2
  • GitHub (👨‍💻 15 · 🔀 170 · 📋 72 - 44% open · ⏱️ 27.04.2019):

     git clone https://github.com/IBM/FfDL
    
  • Dockerhub (📥 43K · ⏱️ 08.08.2019):

     docker pull ffdlops/ffdl
    
ML Hub (🥉17 · ⭐ 55) - Multi-user development platform for machine learning teams. Simple to setup within.. Apache-2
  • GitHub (👨‍💻 4 · 🔀 15 · 📥 95 · 📋 3 - 33% open · ⏱️ 02.12.2019):

     git clone https://github.com/ml-tooling/ml-hub
    
  • Dockerhub (📥 20K · ⭐ 3 · ⏱️ 17.01.2020):

     docker pull mltooling/ml-hub
    
PennAI (🥉14 · ⭐ 110) - the Penn AI engine. ❗️GPL-3.0
  • GitHub (👨‍💻 38 · 🔀 29 · 📥 250 · 📋 190 - 30% open · ⏱️ 17.01.2020):

     git clone https://github.com/EpistasisLab/pennai
    
  • Dockerhub (📥 51 · ⏱️ 07.08.2019):

     docker pull moorelab/pennai_lab
    

Business Intelligence

Back to top

GUI-based business intelligence tools combining SQL query engines, data analytics, visualization, and dashboarding features.

Metabase (🥇29 · ⭐ 19K) - The simplest, fastest way to get business intelligence and analytics to.. ❗️AGPL-3.0
  • GitHub (👨‍💻 230 · 🔀 2.5K · 📥 1K · 📦 1 · 📋 6.8K - 31% open · ⏱️ 24.01.2020):

     git clone https://github.com/metabase/metabase
    
  • Dockerhub (📥 33M · ⭐ 160 · ⏱️ 14.01.2020):

     docker pull metabase/metabase
    
Redash (🥈28 · ⭐ 15K) - Make Your Company Data Driven. Connect to any data source, easily visualize,.. BSD-2
  • GitHub (👨‍💻 340 · 🔀 2.5K · 📥 14K · 📋 1.9K - 25% open · ⏱️ 23.01.2020):

     git clone https://github.com/getredash/redash
    
  • Dockerhub (📥 11M · ⭐ 140 · ⏱️ 23.01.2020):

     docker pull redash/redash
    
Superset (🥈27 · ⭐ 28K) - Apache Superset (incubating) is a modern, enterprise-ready business.. Apache-2
  • GitHub (👨‍💻 460 · 🔀 5.5K · 📦 2 · 📋 4.2K - 6% open · ⏱️ 24.01.2020):

     git clone https://github.com/apache/incubator-superset
    
  • PyPi (📥 2.8K / month · ⏱️ 12.10.2019):

     pip install apache-superset
    
  • Dockerhub (📥 1.3M · ⭐ 220 · ⏱️ 23.01.2020):

     docker pull amancevice/superset
    
Hue (🥈24 · ⭐ 4K) - Open source SQL Query Assistant for Databases/Warehouses. Apache-2
  • GitHub (👨‍💻 230 · 🔀 1.5K · 📦 3 · 📋 730 - 28% open · ⏱️ 22.01.2020):

     git clone https://github.com/cloudera/hue
    
  • Dockerhub (📥 560K · ⭐ 58 · ⏱️ 24.01.2020):

     docker pull gethue/hue
    
Azure Data Studio (🥉22 · ⭐ 5.5K) - Azure Data Studio is a data management tool that enables working.. ❗️Custom EULA
  • GitHub (👨‍💻 97 · 🔀 460 · 📥 170K · 📋 5.5K - 32% open · ⏱️ 24.01.2020):

     git clone https://github.com/microsoft/azuredatastudio
    
Blazer (🥉20 · ⭐ 2.4K) - Explore your data with SQL. Easily create charts and dashboards, and share them with.. MIT
  • GitHub (👨‍💻 42 · 🔀 290 · 📦 240 · 📋 160 - 6% open · ⏱️ 14.11.2019):

     git clone https://github.com/ankane/blazer
    
Poli (🥉19 · ⭐ 1.6K) - An easy-to-use BI server built for SQL lovers. Power data analysis in SQL and gain.. MIT
  • GitHub (👨‍💻 2 · 🔀 220 · 📥 3.6K · 📋 48 - 25% open · ⏱️ 11.01.2020):

     git clone https://github.com/shzlw/poli
    
  • Dockerhub (📥 570 · ⏱️ 11.12.2019):

     docker pull zhonglu/poli
    
CBoard (🥉17 · ⭐ 2.3K) - An easy to use, self-service open BI reporting and BI dashboard platform. Apache-2
  • GitHub (👨‍💻 30 · 🔀 970 · 📋 560 - 13% open · ⏱️ 22.09.2019):

     git clone https://github.com/TuiQiao/CBoard
    
  • Dockerhub (📥 3.5K · ⭐ 4 · ⏱️ 18.10.2017):

     docker pull peterzhang921/cboard
    
Meltano (🥉15 · ⭐ 320) - Convention-over-configuration product for the whole data lifecycle, all the way from.. MIT
  • PyPi (📥 1.8K / month · ⏱️ 23.01.2020):

     pip install meltano
    
  • Dockerhub (📥 320K · ⭐ 4 · ⏱️ 23.01.2020):

     docker pull meltano/meltano
    

Job Scheduler & Pipelines

Back to top

Platforms and tools to schedule, orchestrate, and monitor jobs for workflow automation and data pipeline tasks.

Airflow (🥇35 · ⭐ 15K) - Platform to programmatically author, schedule, and monitor workflows. Apache-2
  • GitHub (👨‍💻 1.3K · 🔀 5.8K · 📥 400 · 📦 720 · ⏱️ 24.01.2020):

     git clone https://github.com/apache/airflow
    
  • PyPi (📥 470K / month · 📦 290 · ⏱️ 24.12.2019):

     pip install apache-airflow
    
  • Conda:

     conda install -c conda-forge airflow
    
  • Dockerhub (📥 330K · ⭐ 80 · ⏱️ 24.01.2020):

     docker pull apache/airflow
    
luigi (🥇33 · ⭐ 13K) - Luigi is a Python module that helps you build complex pipelines of batch jobs. It.. Apache-2
  • GitHub (👨‍💻 530 · 🔀 2.1K · 📦 970 · 📋 830 - 6% open · ⏱️ 16.01.2020):

     git clone https://github.com/spotify/luigi
    
  • PyPi (📥 170K / month · 📦 680 · ⏱️ 02.01.2020):

     pip install luigi
    
  • Conda (⏱️ 17.12.2019):

     conda install -c anaconda luigi
    
argo (🥈25 · ⭐ 4.5K) - Argo Workflows: Get stuff done with Kubernetes. Apache-2
  • GitHub (👨‍💻 160 · 🔀 700 · 📥 270K · 📋 1.2K - 28% open · ⏱️ 24.01.2020):

     git clone https://github.com/argoproj/argo
    
  • Dockerhub (📥 1.5M · ⭐ 1 · ⏱️ 16.12.2019):

     docker pull argoproj/argoui
    
Kubeflow Pipelines (🥈25 · ⭐ 1.4K) - Machine Learning Pipelines for Kubeflow. Apache-2
  • GitHub (👨‍💻 110 · 🔀 410 · 📋 1.1K - 33% open · ⏱️ 24.01.2020):

     git clone https://github.com/kubeflow/pipelines
    
  • PyPi (📥 17K / month · 📦 4 · ⏱️ 20.01.2020):

     pip install kfp
    
Genie (🥈25 · ⭐ 1.3K) - Distributed Big Data Orchestration Service. Apache-2
  • GitHub (👨‍💻 30 · 🔀 290 · 📋 160 - 0% open · ⏱️ 15.01.2020):

     git clone https://github.com/Netflix/genie
    
  • PyPi (📥 52K / month · 📦 5 · ⏱️ 09.12.2019):

     pip install nflx-genie-client
    
  • Dockerhub (📥 7.1K · ⭐ 3 · ⏱️ 15.01.2020):

     docker pull netflixoss/genie-app
    
  • Maven (⏱️ 25.10.2018):

     <dependency>
     	<groupId>com.netflix.genie</groupId>
     	<artifactId>genie-common</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
n8n.io (🥈24 · ⭐ 6K) - Free and open node based Workflow Automation Tool. Easily automate tasks across.. Apache-2
  • GitHub (👨‍💻 35 · 🔀 340 · 📦 1 · 📋 110 - 39% open · ⏱️ 24.01.2020):

     git clone https://github.com/n8n-io/n8n
    
  • NPM (📥 3.1K / month · ⏱️ 19.01.2020):

     npm install n8n
    
  • Dockerhub (📥 330K · ⭐ 11 · ⏱️ 19.01.2020):

     docker pull n8nio/n8n
    
Cadence (🥈24 · ⭐ 3.3K) - Cadence is a distributed, scalable, durable, and highly available orchestration.. MIT
  • GitHub (👨‍💻 57 · 🔀 270 · 📥 7.5K · 📋 1.1K - 28% open · ⏱️ 24.01.2020):

     git clone https://github.com/uber/cadence
    
  • Dockerhub (📥 670K · ⭐ 2 · ⏱️ 24.01.2020):

     docker pull ubercadence/server
    
  • Maven (📦 2 · ⏱️ 06.08.2018):

     <dependency>
     	<groupId>com.uber.cadence</groupId>
     	<artifactId>cadence-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
dkron (🥉23 · ⭐ 2.1K) - Dkron - Distributed, fault tolerant job scheduling system https://dkron.io. ❗️LGPL-3.0
  • GitHub (👨‍💻 43 · 🔀 200 · 📥 260K · 📋 330 - 7% open · ⏱️ 04.01.2020):

     git clone https://github.com/distribworks/dkron
    
  • Dockerhub (📥 480K · ⭐ 11 · ⏱️ 04.01.2020):

     docker pull dkron/dkron
    
Dolphin Scheduler (🥉22 · ⭐ 3.4K) - Dolphin Scheduler is a distributed and easy-to-expand visual DAG.. Apache-2
  • GitHub (👨‍💻 72 · 🔀 1K · 📥 5.6K · 📋 730 - 44% open · ⏱️ 24.01.2020):

     git clone https://github.com/apache/incubator-dolphinscheduler
    
Oozie (🥉22 · ⭐ 540) - Server-based workflow scheduling system to manage Hadoop jobs. Apache-2
  • GitHub (👨‍💻 36 · 🔀 380 · 📦 270 · ⏱️ 18.01.2020):

     git clone https://github.com/apache/oozie
    
  • Maven (📦 170 · ⏱️ 02.12.2016):

     <dependency>
     	<groupId>org.apache.oozie</groupId>
     	<artifactId>oozie-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Azkaban (🥉21 · ⭐ 3K) - Batch workflow job scheduler created at LinkedIn to run Hadoop jobs. Apache-2
  • GitHub (👨‍💻 120 · 🔀 1.2K · 📥 960 · 📋 1K - 63% open · ⏱️ 21.01.2020):

     git clone https://github.com/azkaban/azkaban
    
Ofelia (🥉19 · ⭐ 1.2K) - A docker job scheduler (aka. crontab for docker). MIT
  • GitHub (👨‍💻 16 · 🔀 87 · 📥 8.4K · 📋 69 - 62% open · ⏱️ 06.01.2020):

     git clone https://github.com/mcuadros/ofelia
    
  • Dockerhub (📥 2.7M · ⭐ 15 · ⏱️ 06.01.2020):

     docker pull mcuadros/ofelia
    
Aurora (🥉16 · ⭐ 620) - Apache Aurora - A Mesos framework for long-running services, cron jobs, and ad-.. Apache-2
  • GitHub (👨‍💻 140 · 🔀 220 · 📋 36 - 30% open · ⏱️ 13.01.2020):

     git clone https://github.com/apache/aurora
    
Flotilla (🥉11 · ⭐ 140) - Self-service framework for defining and executing containerized jobs. Apache-2
  • GitHub (👨‍💻 9 · 🔀 7 · 📋 51 - 37% open · ⏱️ 23.01.2020):

     git clone https://github.com/stitchfix/flotilla-os
    

Data Integration & Ingestion

Back to top

Tools to integrate and ingest data from a variety of data sources. This includes query engines, ETL tools, data pipeline software, and command-line database clients.

Prisma (🥇35 · ⭐ 17K) - Database Tools incl. ORM, Migrations and Admin UI (Postgres, MySQL & MongoDB). Apache-2
  • GitHub (👨‍💻 250 · 🔀 910 · 📦 6.1K · 📋 3.4K - 18% open · ⏱️ 17.01.2020):

     git clone https://github.com/prisma/prisma
    
  • NPM (📥 120K / month · 📦 490 · ⏱️ 14.06.2019):

     npm install prisma-client-lib
    
  • Dockerhub (📥 20M · ⭐ 74 · ⏱️ 11.11.2019):

     docker pull prismagraphql/prisma
    
Presto (🥇33 · ⭐ 10K) - High performance, distributed SQL query engine for a variety of data sources. Apache-2
  • GitHub (👨‍💻 460 · 🔀 3.5K · 📦 1 · 📋 4.1K - 23% open · ⏱️ 24.01.2020):

     git clone https://github.com/prestodb/presto
    
  • PyPi (📥 270K / month · ⏱️ 16.04.2019):

     pip install presto-python-client
    
  • NPM (📥 2.4K / month · 📦 30 · ⏱️ 07.01.2020):

     npm install presto-client
    
  • Dockerhub (📥 110K · ⭐ 12 · ⏱️ 23.01.2020):

     docker pull prestosql/presto
    
  • Maven (📦 360 · ⏱️ 22.02.2019):

     <dependency>
     	<groupId>com.facebook.presto</groupId>
     	<artifactId>presto-spi</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Hive (🥇33 · ⭐ 3K) - Data warehouse software facilitates reading, writing, and managing large datasets.. Apache-2
  • GitHub (👨‍💻 350 · 🔀 2.8K · 📦 9 · ⏱️ 24.01.2020):

     git clone https://github.com/apache/hive
    
  • PyPi (📥 1M / month · 📦 810 · ⏱️ 10.09.2018):

     pip install pyhive
    
  • Maven (📦 1.1K · ⏱️ 23.07.2018):

     <dependency>
     	<groupId>org.apache.hive</groupId>
     	<artifactId>hive-common</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Impala (🥈30 · ⭐ 2.3K · 💤) - Lightning-fast, distributed SQL queries for petabytes of data stored in.. Apache-2
  • GitHub (👨‍💻 170 · 🔀 810 · ⏱️ 26.06.2019):

     git clone https://github.com/cloudera/Impala
    
  • PyPi (📥 92K / month · 📦 540 · ⏱️ 21.11.2019):

     pip install impyla
    
TinkerPop Gremlin (🥈30 · ⭐ 1K) - Apache TinkerPop - a graph computing framework. Apache-2
  • GitHub (👨‍💻 150 · 🔀 520 · 📦 210 · ⏱️ 23.01.2020):

     git clone https://github.com/apache/tinkerpop
    
  • PyPi (📥 110K / month · 📦 150 · ⏱️ 09.08.2019):

     pip install gremlinpython
    
  • NPM (📥 13K / month · 📦 100 · ⏱️ 09.08.2019):

     npm install gremlin
    
  • Maven (📦 230 · ⏱️ 08.05.2018):

     <dependency>
     	<groupId>org.apache.tinkerpop</groupId>
     	<artifactId>gremlin-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Alluxio (🥈29 · ⭐ 4.5K) - Alluxio, data orchestration for analytics and machine learning in the cloud. Apache-2
  • GitHub (👨‍💻 1.2K · 🔀 2.3K · 📥 20K · 📦 1 · 📋 670 - 42% open · ⏱️ 24.01.2020):

     git clone https://github.com/Alluxio/alluxio
    
  • PyPi (📥 61 / month · ⏱️ 07.09.2017):

     pip install alluxio
    
  • Dockerhub (📥 550K · ⭐ 3 · ⏱️ 24.01.2020):

     docker pull alluxio/alluxio
    
  • Maven (📦 38 · ⏱️ 27.03.2018):

     <dependency>
     	<groupId>org.alluxio</groupId>
     	<artifactId>alluxio-core-client-fs</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Apache NiFi (🥈29 · ⭐ 2K) - Integrated data logistics platform for automating the movement of data.. Apache-2
  • GitHub (👨‍💻 370 · 🔀 1.6K · ⏱️ 23.01.2020):

     git clone https://github.com/apache/nifi
    
  • Dockerhub (📥 1.3M · ⭐ 150 · ⏱️ 05.11.2019):

     docker pull apache/nifi
    
  • Maven (📦 210 · ⏱️ 23.10.2018):

     <dependency>
     	<groupId>org.apache.nifi</groupId>
     	<artifactId>nifi-api</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Camel (🥈28 · ⭐ 3.1K) - Integration framework that empowers you to easily integrate various systems.. Apache-2
  • GitHub (👨‍💻 810 · 🔀 3.9K · 📦 8 · ⏱️ 24.01.2020):

     git clone https://github.com/apache/camel
    
  • Maven (📦 6.9K · ⏱️ 24.11.2018):

     <dependency>
     	<groupId>org.apache.camel</groupId>
     	<artifactId>camel-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Tika (🥈27 · ⭐ 970) - Toolkit for detecting and extracting metadata and structured text content from.. Apache-2
  • GitHub (👨‍💻 100 · 🔀 500 · 📦 96 · ⏱️ 17.01.2020):

     git clone https://github.com/apache/tika
    
  • PyPi (📥 33K / month · 📦 160 · ⏱️ 09.11.2019):

     pip install tika
    
  • NPM (📥 900 / month · 📦 28 · ⏱️ 22.02.2017):

     npm install tika
    
  • Dockerhub (📥 1.2M · ⭐ 26 · ⏱️ 11.01.2020):

     docker pull logicalspark/docker-tikaserver
    
  • Maven (📦 2.7K · ⏱️ 20.04.2018):

     <dependency>
     	<groupId>org.apache.tika</groupId>
     	<artifactId>tika-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Drill (🥉26 · ⭐ 1.3K) - Schema-free SQL Query Engine for Hadoop, NoSQL and Cloud Storage. Apache-2
  • GitHub (👨‍💻 170 · 🔀 780 · 📦 14 · ⏱️ 23.01.2020):

     git clone https://github.com/apache/drill
    
  • PyPi (📥 1.5K / month · 📦 3 · ⏱️ 24.04.2018):

     pip install pydrill
    
  • Dockerhub (📥 35K · ⭐ 9 · ⏱️ 26.12.2019):

     docker pull drill/apache-drill
    
  • Maven (📦 38 · ⏱️ 24.12.2018):

     <dependency>
     	<groupId>org.apache.drill</groupId>
     	<artifactId>drill-common</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Dagster (🥉25 · ⭐ 1.2K) - A Python library for building data applications: ETL, ML, Data Pipelines, and.. Apache-2
  • GitHub (👨‍💻 49 · 🔀 110 · 📦 64 · 📋 1.1K - 26% open · ⏱️ 24.01.2020):

     git clone https://github.com/dagster-io/dagster
    
  • PyPi (📥 6.4K / month · 📦 4 · ⏱️ 14.01.2020):

     pip install dagster
    
Calcite (🥉24 · ⭐ 1.7K) - Framework for building databases and data management systems. Includes a SQL.. Apache-2
  • GitHub (👨‍💻 290 · 🔀 990 · ⏱️ 23.01.2020):

     git clone https://github.com/apache/calcite
    
  • Maven (📦 500 · ⏱️ 16.07.2018):

     <dependency>
     	<groupId>org.apache.calcite</groupId>
     	<artifactId>calcite-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Embulk (🥉24 · ⭐ 1.4K) - Parallel bulk data loader that helps data transfer between various storages,.. Apache-2
  • GitHub (👨‍💻 49 · 🔀 160 · 📥 110K · 📋 420 - 39% open · ⏱️ 24.12.2019):

     git clone https://github.com/embulk/embulk
    
Data Collector (🥉24 · ⭐ 970) - StreamSets Data Collector - Continuous big data and cloud platform ingest.. Apache-2
  • GitHub (👨‍💻 97 · 🔀 480 · ⏱️ 24.01.2020):

     git clone https://github.com/streamsets/datacollector
    
  • PyPi (📥 260 / month · ⏱️ 25.10.2019):

     pip install streamsets
    
  • Dockerhub (📥 2.2M · ⭐ 57 · ⏱️ 24.01.2020):

     docker pull streamsets/datacollector
    
  • Maven (📦 3 · ⏱️ 24.05.2018):

     <dependency>
     	<groupId>com.streamsets</groupId>
     	<artifactId>streamsets-datacollector</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Zenko (🥉22 · ⭐ 290) - Zenko is the open source multi-cloud data controller: own and keep control of your.. Apache-2
  • GitHub (👨‍💻 40 · 🔀 47 · 📥 310 · 📋 44 - 25% open · ⏱️ 21.01.2020):

     git clone https://github.com/scality/Zenko
    
  • Dockerhub (📥 3.6M · ⭐ 11 · ⏱️ 17.01.2020):

     docker pull zenko/cloudserver
    
Gobblin (🥉21 · ⭐ 1.7K) - Gobblin is a distributed big data integration framework (ingestion,.. Apache-2
  • GitHub (👨‍💻 140 · 🔀 600 · 📥 120K · ⏱️ 17.01.2020):

     git clone https://github.com/apache/incubator-gobblin
    
  • Maven (⏱️ 20.06.2018):

     <dependency>
     	<groupId>org.apache.gobblin</groupId>
     	<artifactId>gobblin-api</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
s4cmd (🥉21 · ⭐ 960 · 💤) - Super S3 command line tool. Apache-2
  • GitHub (👨‍💻 30 · 🔀 160 · 📦 2 · 📋 110 - 58% open · ⏱️ 03.04.2019):

     git clone https://github.com/bloomreach/s4cmd
    
  • PyPi (📥 11K / month · 📦 7 · ⏱️ 13.08.2018):

     pip install s4cmd
    
Pentaho Kettle (🥉20 · ⭐ 3.6K) - Pentaho Data Integration ( ETL ) a.k.a Kettle. Apache-2
  • GitHub (👨‍💻 350 · 🔀 2.1K · ⏱️ 24.01.2020):

     git clone https://github.com/pentaho/pentaho-kettle
    
Delta Lake (🥉19 · ⭐ 2.1K) - An open-source storage layer that brings scalable, ACID transactions to.. Apache-2
  • GitHub (👨‍💻 55 · 🔀 420 · 📋 190 - 45% open · ⏱️ 21.01.2020):

     git clone https://github.com/delta-io/delta
    
  • Maven:

     <dependency>
     	<groupId>io.delta</groupId>
     	<artifactId>delta-core_2.11</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Sqoop (🥉18 · ⭐ 670) - Sqoop allows easy imports and exports of data sets between databases and HDFS. Apache-2
  • GitHub (👨‍💻 39 · 🔀 450 · ⏱️ 16.10.2019):

     git clone https://github.com/apache/sqoop
    
  • PyPi (📥 220 / month · ⏱️ 26.11.2019):

     pip install pysqoop
    
Amundsen (🥉16 · ⭐ 590) - Metadata driven application for improving the productivity of data engineers.. Apache-2
  • GitHub (👨‍💻 24 · 🔀 99 · 📋 110 - 63% open · ⏱️ 24.01.2020):

     git clone https://github.com/lyft/amundsen
    
  • Dockerhub (📥 19K · ⏱️ 20.12.2019):

     docker pull amundsendev/amundsen-search
    
Show 1 hidden projects...
s3cmd (🥉23 · ⭐ 3.2K) - Official s3cmd repo -- Command line tool for managing Amazon S3 and CloudFront.. ❗️GPL-2.0
  • GitHub (👨‍💻 180 · 🔀 740 · 📥 2.7M · 📋 700 - 41% open · ⏱️ 18.12.2019):

     git clone https://github.com/s3tools/s3cmd
    

Data Batch & Stream Processing

Back to top

Frameworks and computing-engines for large-scale (distributed) data batch- and stream-processing.

Spark (🥇36 · ⭐ 25K) - Unified analytics engine for big data processing, with built-in modules for.. Apache-2
  • GitHub (👨‍💻 2.2K · 🔀 21K · 📦 280 · ⏱️ 24.01.2020):

     git clone https://github.com/apache/spark
    
  • PyPi (📥 2.7M / month · 📦 760 · ⏱️ 07.05.2019):

     pip install pyspark
    
  • Dockerhub (📥 170K · ⭐ 32 · ⏱️ 30.09.2019):

     docker pull bde2020/spark-master
    
  • Maven (📦 120 · ⏱️ 29.10.2018):

     <dependency>
     	<groupId>org.apache.spark</groupId>
     	<artifactId>spark-catalyst_2.11</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Flink (🥇34 · ⭐ 12K) - Stream processing framework with powerful stream- and batch-processing capabilities. Apache-2
  • GitHub (👨‍💻 900 · 🔀 6.2K · 📦 120 · ⏱️ 24.01.2020):

     git clone https://github.com/apache/flink
    
  • Dockerhub (📥 50M · ⭐ 140 · ⏱️ 23.01.2020):

     docker pull flink
    
  • Maven (📦 530 · ⏱️ 30.09.2019):

     <dependency>
     	<groupId>org.apache.flink</groupId>
     	<artifactId>flink-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Beam (🥈33 · ⭐ 3.7K) - Unified programming model to define and execute data processing pipelines,.. Apache-2
  • GitHub (👨‍💻 740 · 🔀 2.2K · ⏱️ 24.01.2020):

     git clone https://github.com/apache/beam
    
  • PyPi (📥 2.6M / month · 📦 190 · ⏱️ 23.01.2020):

     pip install apache-beam
    
  • Maven (📦 360 · ⏱️ 17.12.2019):

     <dependency>
     	<groupId>org.apache.beam</groupId>
     	<artifactId>beam-sdks-java-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Storm (🥈32 · ⭐ 6.1K) - Distributed real-time computational system for processing data streams. Apache-2
  • GitHub (👨‍💻 480 · 🔀 4K · 📦 3 · ⏱️ 24.01.2020):

     git clone https://github.com/apache/storm
    
  • Dockerhub (📥 3.2M · ⭐ 120 · ⏱️ 23.01.2020):

     docker pull storm
    
  • Maven (📦 3.9K · ⏱️ 29.04.2019):

     <dependency>
     	<groupId>org.apache.storm</groupId>
     	<artifactId>storm-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Kafka (🥈29 · ⭐ 15K) - Distributed streaming platform that is used to build real time streaming data.. Apache-2
  • GitHub (👨‍💻 780 · 🔀 7.7K · ⏱️ 24.01.2020):

     git clone https://github.com/apache/kafka
    
  • PyPi (📥 260K / month · 📦 350 · ⏱️ 07.10.2017):

     pip install kafka
    
  • Dockerhub (📥 20M · ⭐ 100 · ⏱️ 24.01.2020):

     docker pull bitnami/kafka
    
  • Maven (📦 52 · ⏱️ 22.11.2013):

     <dependency>
     	<groupId>org.apache.kafka</groupId>
     	<artifactId>kafka</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Hadoop (🥈29 · ⭐ 10K) - Framework that allows for the distributed processing of large data sets across.. Apache-2
  • GitHub (👨‍💻 420 · 🔀 6.2K · 📦 62 · ⏱️ 24.01.2020):

     git clone https://github.com/apache/hadoop
    
  • Maven (📦 12K · ⏱️ 02.08.2018):

     <dependency>
     	<groupId>org.apache.hadoop</groupId>
     	<artifactId>hadoop-common</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Heron (🥈23 · ⭐ 3.4K) - Apache Heron (Incubating) is a realtime, distributed, fault-tolerant stream.. Apache-2
  • GitHub (👨‍💻 130 · 🔀 600 · 📥 58K · 📋 940 - 40% open · ⏱️ 17.01.2020):

     git clone https://github.com/apache/incubator-heron
    
  • Dockerhub (📥 160K · ⭐ 2 · ⏱️ 01.04.2018):

     docker pull heron/heron
    
Hazelcast Jet (🥈23 · ⭐ 400) - Distributed stream and batch processing engine, built on top of Hazelcast. Apache-2
  • GitHub (👨‍💻 32 · 🔀 110 · 📥 13 · 📦 140 · 📋 530 - 14% open · ⏱️ 24.01.2020):

     git clone https://github.com/hazelcast/hazelcast-jet
    
  • Dockerhub (📥 8.7K · ⭐ 2 · ⏱️ 24.01.2020):

     docker pull hazelcast/hazelcast-jet
    
  • Maven (📦 12 · ⏱️ 14.06.2017):

     <dependency>
     	<groupId>com.hazelcast.jet</groupId>
     	<artifactId>hazelcast-jet</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Vespa (🥉22 · ⭐ 3.1K) - Vespa is an engine for low-latency computation over large data sets. Apache-2
  • GitHub (👨‍💻 86 · 🔀 360 · 📦 1 · 📋 310 - 28% open · ⏱️ 24.01.2020):

     git clone https://github.com/vespa-engine/vespa
    
  • Dockerhub (📥 510K · ⭐ 5 · ⏱️ 24.01.2020):

     docker pull vespaengine/vespa
    
Flume (🥉22 · ⭐ 1.9K) - Service for efficiently collecting, aggregating, and moving large amounts of log.. Apache-2
  • GitHub (👨‍💻 68 · 🔀 1.3K · 📦 120 · ⏱️ 13.01.2020):

     git clone https://github.com/apache/flume
    
  • Maven (📦 1.4K · ⏱️ 15.09.2017):

     <dependency>
     	<groupId>org.apache.flume</groupId>
     	<artifactId>flume-ng-sdk</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Gearpump (🥉19 · ⭐ 730) - Lightweight real-time big data streaming engine over Akka. Apache-2
  • GitHub (👨‍💻 39 · 🔀 150 · 📥 3.7K · 📋 1.1K - 7% open · ⏱️ 01.12.2019):

     git clone https://github.com/gearpump/gearpump
    
  • Maven:

     <dependency>
     	<groupId>io.github.gearpump</groupId>
     	<artifactId>gearpump-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Brooklin (🥉19 · ⭐ 520) - An extensible distributed system for reliable nearline data streaming at scale. BSD-2
  • GitHub (👨‍💻 35 · 🔀 64 · 📥 850 · 📋 19 - 42% open · ⏱️ 22.11.2019):

     git clone https://github.com/linkedin/brooklin
    
  • Maven:

     <dependency>
     	<groupId>com.github.datastream</groupId>
     	<artifactId>datastream-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Wallaroo (🥉18 · ⭐ 1.4K) - Distributed Stream Processing. Apache-2
  • GitHub (👨‍💻 34 · 🔀 59 · 📋 1.8K - 19% open · ⏱️ 21.01.2020):

     git clone https://github.com/WallarooLabs/wallaroo
    
kapacitor (🥉17 · ⭐ 1.9K) - Open source framework for processing, monitoring, and alerting on time series data. MIT
  • GitHub (👨‍💻 90 · 🔀 410 · 📋 1.6K - 42% open · ⏱️ 21.01.2020):

     git clone https://github.com/influxdata/kapacitor
    
Samza (🥉17 · ⭐ 610) - Near-realtime, asynchronous computational framework for stream processing. Apache-2
  • GitHub (👨‍💻 170 · 🔀 250 · ⏱️ 23.01.2020):

     git clone https://github.com/apache/samza
    
  • Maven:

     <dependency>
     	<groupId>org.apache.samza</groupId>
     	<artifactId>samza-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Show 1 hidden projects...
Onyx (🥉15 · ⭐ 1.9K) - Distributed, masterless, high performance, fault tolerant data processing. ❗️EPL-1.0
  • GitHub (👨‍💻 62 · 🔀 200 · 📋 590 - 13% open · ⏱️ 31.08.2019):

     git clone https://github.com/onyx-platform/onyx
    

Data Labeling & Annotation

Back to top

Tools to label and annotate any type of data (e.g. images, text, videos, and audio).

LabelImg (🥇26 · ⭐ 9.6K) - LabelImg is a graphical image annotation tool and label object bounding boxes in.. MIT
  • GitHub (👨‍💻 66 · 🔀 3.3K · 📦 53 · 📋 430 - 40% open · ⏱️ 10.01.2020):

     git clone https://github.com/tzutalin/labelImg
    
  • PyPi (📥 4.7K / month · 📦 9 · ⏱️ 26.05.2019):

     pip install labelImg
    
Labelme (🥇26 · ⭐ 4K) - Image Polygonal Annotation with Python (polygon, rectangle, circle, line, point.. ❗️GPL-3.0
  • GitHub (👨‍💻 38 · 🔀 1.3K · 📥 7K · 📦 47 · 📋 340 - 16% open · ⏱️ 23.01.2020):

     git clone https://github.com/wkentaro/labelme
    
  • PyPi (📥 8.3K / month · 📦 8 · ⏱️ 14.01.2020):

     pip install labelme
    
  • Dockerhub (📥 1.1K · ⭐ 2 · ⏱️ 23.01.2020):

     docker pull wkentaro/labelme
    
doccano (🥈24 · ⭐ 2.3K) - Open source text annotation tool for machine learning practitioner. MIT
  • GitHub (👨‍💻 37 · 🔀 480 · 📋 340 - 16% open · ⏱️ 10.01.2020):

     git clone https://github.com/chakki-works/doccano
    
  • Dockerhub (📥 390K · ⭐ 8 · ⏱️ 29.11.2019):

     docker pull chakkiworks/doccano
    
VoTT (🥈20 · ⭐ 2.1K) - Visual Object Tagging Tool: An electron app for building end to end Object Detection.. MIT
  • GitHub (👨‍💻 13 · 🔀 450 · 📥 45K · 📋 390 - 42% open · ⏱️ 04.10.2019):

     git clone https://github.com/Microsoft/VoTT
    
CVAT (🥈19 · ⭐ 3.1K) - Powerful and efficient Computer Vision Annotation Tool (CVAT). MIT
  • GitHub (👨‍💻 53 · 🔀 710 · 📋 560 - 26% open · ⏱️ 24.01.2020):

     git clone https://github.com/opencv/cvat
    
Label Studio (🥈19 · ⭐ 2.2K) - Label Studio is a multi-type data labeling and annotation tool with.. Apache-2
  • GitHub (👨‍💻 12 · 🔀 130 · 📥 3 · 📦 1 · 📋 60 - 33% open · ⏱️ 23.01.2020):

     git clone https://github.com/heartexlabs/label-studio
    
  • PyPi (📥 630 / month · ⏱️ 23.01.2020):

     pip install label-studio
    
  • NPM (📥 250 / month · ⏱️ 10.01.2020):

     npm install label-studio
    
  • Dockerhub (📥 300 · ⏱️ 23.01.2020):

     docker pull heartexlabs/label-studio
    
PixelAnnotationTool (🥉17 · ⭐ 670) - Annotate quickly images. ❗️LGPL-3.0
  • GitHub (👨‍💻 7 · 🔀 170 · 📥 11K · 📋 42 - 38% open · ⏱️ 07.01.2020):

     git clone https://github.com/abreheret/PixelAnnotationTool
    
Semantic Segmentation Editor (🥉17 · ⭐ 520) - Web labeling tool for camera and LIDAR data. MIT
  • GitHub (👨‍💻 4 · 🔀 160 · 📋 71 - 21% open · ⏱️ 04.01.2020):

     git clone https://github.com/Hitachi-Automotive-And-Industry-Lab/semantic-segmentation-editor
    
  • Dockerhub (📥 850 · ⭐ 6 · ⏱️ 30.07.2018):

     docker pull hitachiail/semantic-segmentation-editor
    
Labelbox (🥉15 · ⭐ 1.3K) - Labelbox is the fastest way to annotate data to build and ship computer vision.. Apache-2
  • GitHub (👨‍💻 15 · 🔀 190 · ⏱️ 26.11.2019):

     git clone https://github.com/Labelbox/Labelbox
    
LOST (🥉15 · ⭐ 280) - Label Objects and Save Time (LOST) - Design your own smart Image Annotation process in a.. MIT
  • GitHub (👨‍💻 5 · 🔀 39 · 📋 50 - 40% open · ⏱️ 17.10.2019):

     git clone https://github.com/l3p-cv/lost
    
  • Dockerhub (📥 550 · ⏱️ 17.10.2019):

     docker pull l3pcv/lost
    
ImgLab (🥉14 · ⭐ 570) - To speedup and simplify image labeling/ annotation process with multiple supported.. MIT
  • GitHub (👨‍💻 29 · 🔀 310 · 📋 100 - 28% open · ⏱️ 19.10.2019):

     git clone https://github.com/NaturalIntelligence/imglab
    
OpenLabeling (🥉14 · ⭐ 520) - Label images and video for Computer Vision applications. Apache-2
  • GitHub (👨‍💻 14 · 🔀 140 · 📋 34 - 26% open · ⏱️ 16.01.2020):

     git clone https://github.com/Cartucho/OpenLabeling
    
makesense.ai (🥉13 · ⭐ 930) - Free to use online tool for labelling photos. https://makesense.ai. ❗️GPL-3.0
  • GitHub (👨‍💻 8 · 🔀 100 · 📋 30 - 50% open · ⏱️ 08.01.2020):

     git clone https://github.com/SkalskiP/make-sense
    
Show 2 hidden projects...
jupyter-innotater (🥉12 · ⭐ 44) - Inline data annotator for Jupyter notebooks. MIT
  • GitHub (👨‍💻 1 · 🔀 4 · 📋 10 - 40% open · ⏱️ 06.01.2020):

     git clone https://github.com/ideonate/jupyter-innotater
    
  • PyPi (📥 69 / month · ⏱️ 29.07.2019):

     pip install jupyter_innotater
    
superintendent (🥉9 · ⭐ 85) - Practical active learning in python. ❗️Unlicensed
  • GitHub (👨‍💻 6 · 🔀 9 · 📦 3 · 📋 15 - 26% open · ⏱️ 24.01.2020):

     git clone https://github.com/janfreyberg/superintendent
    

Data Visualization & Exploration

Back to top

GUI tools to visualize, explore, and analyze data.

Grafana (🥇33 · ⭐ 34K) - The tool for beautiful monitoring and metric analytics & dashboards for.. Apache-2
  • GitHub (👨‍💻 1.2K · 🔀 6.4K · 📦 7 · 📋 15K - 16% open · ⏱️ 24.01.2020):

     git clone https://github.com/grafana/grafana
    
  • Dockerhub (📥 720M · ⭐ 1.3K · ⏱️ 24.01.2020):

     docker pull grafana/grafana
    
Kibana (🥇31 · ⭐ 15K) - Your window into the Elastic Stack. Apache-2
  • GitHub (👨‍💻 520 · 🔀 5.4K · 📦 5 · 📋 22K - 27% open · ⏱️ 24.01.2020):

     git clone https://github.com/elastic/kibana
    
  • Dockerhub (📥 76M · ⭐ 1.7K · ⏱️ 23.01.2020):

     docker pull kibana
    
Orange (🥈27 · ⭐ 2.1K) - Orange: Interactive data analysis https://orange.biolab.si. ❗️GPL-3.0
  • GitHub (👨‍💻 99 · 🔀 580 · 📥 140 · 📦 250 · 📋 1.4K - 5% open · ⏱️ 24.01.2020):

     git clone https://github.com/biolab/orange3
    
  • PyPi (📥 4.8K / month · 📦 110 · ⏱️ 20.12.2019):

     pip install orange3
    
  • Conda (⏱️ 16.10.2019):

     conda install -c anaconda orange3
    
Gephi (🥈25 · ⭐ 3.6K) - Gephi - The Open Graph Viz Platform. ❗️GPL-3.0
  • GitHub (👨‍💻 81 · 🔀 1.3K · 📥 1.8M · 📦 20 · 📋 2K - 23% open · ⏱️ 13.01.2020):

     git clone https://github.com/gephi/gephi
    
  • Maven (📦 12 · ⏱️ 14.02.2016):

     <dependency>
     	<groupId>org.gephi</groupId>
     	<artifactId>project-api</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
CARTO (🥈25 · ⭐ 2.3K) - Location Intelligence & Data Visualization tool. BSD-3
  • GitHub (👨‍💻 180 · 🔀 620 · 📋 8.5K - 1% open · ⏱️ 24.01.2020):

     git clone https://github.com/CartoDB/cartodb
    
  • Dockerhub (📥 31K · ⭐ 37 · ⏱️ 07.11.2019):

     docker pull sverhoeven/cartodb
    
SandDance (🥉21 · ⭐ 4K) - Visually explore, understand, and present your data. MIT
  • GitHub (👨‍💻 8 · 🔀 260 · 📋 90 - 51% open · ⏱️ 22.01.2020):

     git clone https://github.com/microsoft/SandDance
    
  • NPM (📥 390 / month · 📦 6 · ⏱️ 09.01.2020):

     npm install @msrvida/sanddance
    
Facette (🥉19 · ⭐ 1.1K) - Time series data visualization software. BSD-3
  • GitHub (👨‍💻 30 · 🔀 76 · 📥 5.9K · 📋 350 - 10% open · ⏱️ 20.11.2019):

     git clone https://github.com/facette/facette
    
Voyager 2 (🥉19 · ⭐ 920 · 💤) - Visualization Tool for Data Exploration. BSD-3
  • GitHub (👨‍💻 16 · 🔀 110 · 📦 24 · 📋 450 - 18% open · ⏱️ 18.05.2019):

     git clone https://github.com/vega/voyager
    
  • NPM (📥 250 / month · 📦 12 · ⏱️ 06.07.2018):

     npm install datavoyager
    
ParaView (🥉19 · ⭐ 580) - VTK-based Data Analysis and Visualization Application. BSD-3
  • GitHub (👨‍💻 270 · 🔀 260 · ⏱️ 24.01.2020):

     git clone https://github.com/Kitware/ParaView
    
Datawrapper (🥉18 · ⭐ 1K) - An open source data visualization platform helping everyone to create simple,.. MIT
  • GitHub (👨‍💻 47 · 🔀 240 · ⏱️ 23.01.2020):

     git clone https://github.com/datawrapper/datawrapper
    
Falcon (🥉17 · ⭐ 410) - Brushing and linking for big data. BSD-3
  • GitHub (👨‍💻 5 · 🔀 23 · 📋 98 - 9% open · ⏱️ 24.01.2020):

     git clone https://github.com/uwdata/falcon
    
  • NPM (📥 82 / month · ⏱️ 29.08.2019):

     npm install falcon-vis
    
Banana (🥉16 · ⭐ 640) - Banana for Solr - A Port of Kibana. Apache-2
  • GitHub (👨‍💻 44 · 🔀 220 · 📋 150 - 67% open · ⏱️ 13.11.2019):

     git clone https://github.com/LucidWorks/banana
    
Papaya (🥉15 · ⭐ 340 · 💤) - A pure JavaScript medical research image viewer. BSD-3
  • GitHub (👨‍💻 10 · 🔀 130 · 📋 170 - 20% open · ⏱️ 05.05.2019):

     git clone https://github.com/rii-mango/Papaya
    
  • NPM (📥 190 / month · ⏱️ 05.05.2019):

     npm install papaya-viewer
    
Dex (🥉13 · ⭐ 1.2K · 💤) - Dex : The Data Explorer -- A data visualization tool written in.. Apache-2
  • GitHub (👨‍💻 1 · 🔀 300 · 📋 13 - 23% open · ⏱️ 12.02.2019):

     git clone https://github.com/PatMartin/Dex
    
Show 3 hidden projects...
Visdom (🥈26 · ⭐ 7K) - A flexible tool for creating, organizing, and sharing visualizations of.. ❗️CC-BY-NC-4.0
  • GitHub (👨‍💻 95 · 🔀 840 · 📦 1.7K · 📋 450 - 12% open · ⏱️ 12.12.2019):

     git clone https://github.com/facebookresearch/visdom
    
  • PyPi (📥 26K / month · 📦 410 · ⏱️ 12.09.2019):

     pip install visdom
    
Chronograf (🥈26 · ⭐ 1.2K) - Open source monitoring and visualization UI for the TICK stack. ❗️AGPL-3.0
  • GitHub (👨‍💻 79 · 🔀 200 · 📋 3.1K - 1% open · ⏱️ 23.01.2020):

     git clone https://github.com/influxdata/chronograf
    
  • Dockerhub (📥 71M · ⭐ 190 · ⏱️ 24.01.2020):

     docker pull chronograf
    
Bumblebee (🥉11 · ⭐ 47) - An agnostic data profiling GUI to make your data science tasks easier. Apache-2
  • GitHub (👨‍💻 5 · 🔀 6 · 📦 2 · 📋 11 - 54% open · ⏱️ 21.01.2020):

     git clone https://github.com/ironmussa/bumblebee
    
  • Dockerhub (📥 10 · ⏱️ 10.12.2019):

     docker pull ironmussa/bumblebee
    

Model Visualization

Back to top

Tools to visualize, explore, and understand neural networks and other machine learning models.

Netron (🥇23 · ⭐ 7.8K) - Visualizer for neural network, deep learning and machine learning models. MIT
  • GitHub (👨‍💻 2 · 🔀 950 · 📥 41K · 📋 380 - 2% open · ⏱️ 24.01.2020):

     git clone https://github.com/lutzroeder/netron
    
  • PyPi (📥 1.6K / month · 📦 4 · ⏱️ 21.01.2020):

     pip install netron
    
TensorSpace.js (🥈19 · ⭐ 4.1K · 💤) - Neural network 3D visualization framework, build interactive and.. Apache-2
  • GitHub (👨‍💻 10 · 🔀 360 · 📦 10 · 📋 200 - 12% open · ⏱️ 21.04.2019):

     git clone https://github.com/tensorspace-team/tensorspace
    
  • NPM (📥 140 / month · 📦 1 · ⏱️ 02.04.2019):

     npm install tensorspace
    
Netwulf (🥈18 · ⭐ 180) - Interactive visualization of networks based on Ulf Aslak's d3 web app. MIT
  • GitHub (👨‍💻 5 · 🔀 16 · 📦 6 · 📋 32 - 25% open · ⏱️ 17.01.2020):

     git clone https://github.com/benmaier/netwulf
    
  • PyPi (📥 190 / month · ⏱️ 09.09.2019):

     pip install netwulf
    
PlotNeuralNet (🥉16 · ⭐ 7.5K) - Latex code for making neural networks diagrams. MIT
  • GitHub (👨‍💻 9 · 🔀 1K · 📋 78 - 53% open · ⏱️ 17.01.2020):

     git clone https://github.com/HarisIqbal88/PlotNeuralNet
    
BertViz (🥉15 · ⭐ 1.3K) - Tool for visualizing attention in the Transformer model (BERT, GPT-2, Albert,.. Apache-2
  • GitHub (👨‍💻 3 · 🔀 230 · 📋 30 - 43% open · ⏱️ 02.12.2019):

     git clone https://github.com/jessevig/bertviz
    
GANDissect (🥉13 · ⭐ 1.5K · 💤) - Pytorch-based tools for visualizing and understanding the neurons of a GAN... MIT
  • GitHub (👨‍💻 3 · 🔀 220 · 📋 13 - 46% open · ⏱️ 11.03.2019):

     git clone https://github.com/CSAILVision/gandissect
    
exBERT (🥉10 · ⭐ 190 · 🐣) - A Visual Analysis Tool to Explore Learned Representations in Transformers.. Apache-2
  • GitHub (👨‍💻 3 · 🔀 21 · 📋 4 - 75% open · ⏱️ 15.10.2019):

     git clone https://github.com/bhoov/exbert
    
Show 1 hidden projects...
Fabrik (🥉14 · ⭐ 1K · 💀) - Collaboratively build, visualize, and design neural nets in browser. ❗️GPL-3.0
  • GitHub (👨‍💻 45 · 🔀 240 · 📋 130 - 30% open · ⏱️ 12.12.2018):

     git clone https://github.com/Cloud-CV/Fabrik
    

Model Deployment

Back to top

Tools and platforms to deploy, run, and serve machine learning models.

TensorFlow.js (🥇31 · ⭐ 13K) - A WebGL accelerated JavaScript library for training and deploying ML models. Apache-2
  • GitHub (👨‍💻 220 · 🔀 1K · 📥 12 · 📦 6.3K · 📋 1.9K - 25% open · ⏱️ 24.01.2020):

     git clone https://github.com/tensorflow/tfjs
    
  • NPM (📥 52K / month · 📦 1.3K · ⏱️ 20.12.2019):

     npm install @tensorflow/tfjs
    
TensorFlow Serving (🥇30 · ⭐ 4.2K) - A flexible, high-performance serving system for machine learning.. Apache-2
  • GitHub (👨‍💻 150 · 🔀 1.6K · 📋 1.1K - 5% open · ⏱️ 24.01.2020):

     git clone https://github.com/tensorflow/serving
    
  • PyPi (📥 1.7M / month · 📦 86 · ⏱️ 14.01.2020):

     pip install tensorflow-serving-api
    
  • Dockerhub (📥 9.9M · ⭐ 73 · ⏱️ 24.01.2020):

     docker pull tensorflow/serving
    
ONNX Runtime (🥇28 · ⭐ 1.7K) - ONNX Runtime: cross-platform, high performance scoring engine for ML models. MIT
  • GitHub (👨‍💻 130 · 🔀 360 · 📥 6.4K · 📦 150 · 📋 710 - 20% open · ⏱️ 24.01.2020):

     git clone https://github.com/microsoft/onnxruntime
    
  • PyPi (📥 65K / month · 📦 33 · ⏱️ 23.01.2020):

     pip install onnxruntime
    
  • Dockerhub (📥 1K · ⭐ 7 · ⏱️ 20.12.2019):

     docker pull onnx/onnx-ecosystem
    
Seldon (🥈27 · ⭐ 1.4K) - Machine Learning Deployment for Kubernetes. Apache-2
  • GitHub (👨‍💻 73 · 🔀 280 · 📋 700 - 13% open · ⏱️ 24.01.2020):

     git clone https://github.com/SeldonIO/seldon-core
    
  • PyPi (📥 4.7K / month · 📦 13 · ⏱️ 15.01.2020):

     pip install seldon-core
    
  • Dockerhub (📥 1.8M · ⏱️ 24.01.2020):

     docker pull seldonio/seldon-core-operator
    
plaidML (🥈26 · ⭐ 2.8K) - PlaidML is a framework for making deep learning work everywhere. Apache-2
  • GitHub (👨‍💻 39 · 🔀 270 · 📥 240 · 📋 350 - 35% open · ⏱️ 24.01.2020):

     git clone https://github.com/plaidml/plaidml
    
  • PyPi (📥 36K / month · 📦 22 · ⏱️ 17.01.2020):

     pip install plaidml-keras
    
MLeap (🥈25 · ⭐ 1K) - MLeap: Deploy Spark Pipelines to Production. Apache-2
  • GitHub (👨‍💻 58 · 🔀 240 · 📦 62 · 📋 350 - 26% open · ⏱️ 23.01.2020):

     git clone https://github.com/combust/mleap
    
  • PyPi (📥 270K / month · 📦 15 · ⏱️ 09.10.2017):

     pip install mleap
    
  • Dockerhub (📥 15K · ⭐ 5 · ⏱️ 23.01.2020):

     docker pull combustml/mleap-serving
    
  • Maven (⏱️ 10.09.2018):

     <dependency>
     	<groupId>ml.combust.mleap</groupId>
     	<artifactId>mleap-base_2.11</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
MXNet Model Server (🥈22 · ⭐ 580) - Multi Model Server is a tool for serving neural net models for.. Apache-2
  • GitHub (👨‍💻 54 · 🔀 160 · 📋 330 - 7% open · ⏱️ 08.01.2020):

     git clone https://github.com/awslabs/mxnet-model-server
    
  • PyPi (📥 3.5K / month · 📦 2 · ⏱️ 14.11.2019):

     pip install mxnet-model-server
    
  • Dockerhub (📥 33K · ⭐ 4 · ⏱️ 04.12.2019):

     docker pull awsdeeplearningteam/mxnet-model-server
    
KFServing (🥈22 · ⭐ 300) - Serverless Inferencing on Kubernetes. Apache-2
  • GitHub (👨‍💻 34 · 🔀 100 · 📥 37 · 📦 9 · 📋 270 - 36% open · ⏱️ 24.01.2020):

     git clone https://github.com/kubeflow/kfserving
    
  • PyPi (📥 1.5K / month · 📦 6 · ⏱️ 27.11.2019):

     pip install kfserving
    
DeepDetect (🥈21 · ⭐ 2K) - Deep Learning API and Server in C++11 support for Caffe, Caffe2,.. ❗️GPL-3.0
  • GitHub (👨‍💻 25 · 🔀 490 · 📋 370 - 19% open · ⏱️ 24.01.2020):

     git clone https://github.com/jolibrain/deepdetect
    
  • Dockerhub (📥 28K · ⏱️ 23.01.2020):

     docker pull jolibrain/deepdetect_cpu
    
PipelineAI (🥉20 · ⭐ 3.9K) - PipelineAI Kubeflow Distribution. Apache-2
  • GitHub (👨‍💻 13 · 🔀 950 · 📋 250 - 0% open · ⏱️ 12.01.2020):

     git clone https://github.com/PipelineAI/pipeline
    
  • PyPi (📥 2.4K / month · ⏱️ 17.05.2019):

     pip install cli-pipeline
    
OpenPAI (🥉20 · ⭐ 1.6K) - Resource scheduling and cluster management for AI. MIT
  • GitHub (👨‍💻 81 · 🔀 360 · 📋 1.4K - 10% open · ⏱️ 21.01.2020):

     git clone https://github.com/Microsoft/pai
    
  • PyPi (📥 140 / month · ⏱️ 22.07.2019):

     pip install paicli
    
TensorRT Inference Server (🥉20 · ⭐ 960) - The TensorRT Inference Server provides a cloud inferencing.. BSD-3
  • GitHub (👨‍💻 23 · 🔀 190 · 📥 12K · 📋 320 - 6% open · ⏱️ 24.01.2020):

     git clone https://github.com/NVIDIA/tensorrt-inference-server
    
BentoML (🥉20 · ⭐ 770) - Model Serving made easy. Apache-2
  • GitHub (👨‍💻 13 · 🔀 88 · 📥 96 · 📦 7 · 📋 68 - 7% open · ⏱️ 23.01.2020):

     git clone https://github.com/bentoml/bentoml
    
  • PyPi (📥 1K / month · 📦 4 · ⏱️ 23.01.2020):

     pip install bentoml
    
  • Dockerhub:

     docker pull bentoml/bento_api_server
    
Clipper (🥉19 · ⭐ 1.1K) - A low-latency prediction-serving system. Apache-2
  • GitHub (👨‍💻 36 · 🔀 240 · 📋 370 - 28% open · ⏱️ 09.07.2019):

     git clone https://github.com/ucbrise/clipper
    
  • PyPi (📥 340 / month · ⏱️ 07.06.2019):

     pip install clipper-admin
    
  • Dockerhub (📥 1.2M · ⏱️ 23.01.2020):

     docker pull clipper/management_frontend
    
ONNX.js (🥉19 · ⭐ 980 · 💤) - ONNX.js: run ONNX models using JavaScript. MIT
  • GitHub (👨‍💻 10 · 🔀 68 · 📦 53 · 📋 57 - 40% open · ⏱️ 01.06.2019):

     git clone https://github.com/microsoft/onnxjs
    
  • NPM (📥 380 / month · 📦 10 · ⏱️ 01.06.2019):

     npm install onnxjs
    
Simple TensorFlow Serving (🥉19 · ⭐ 640) - Generic and easy-to-use serving service for machine learning.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 160 · 📋 65 - 49% open · ⏱️ 11.11.2019):

     git clone https://github.com/tobegit3hub/simple_tensorflow_serving
    
  • PyPi (📥 120 / month · ⏱️ 23.09.2019):

     pip install simple_tensorflow_serving
    
  • Dockerhub (📥 3.7K · ⭐ 1 · ⏱️ 11.11.2019):

     docker pull tobegit3hub/simple_tensorflow_serving
    
Hydrosphere Serving (🥉19 · ⭐ 190) - Machine Learning Serving cluster. Apache-2
  • GitHub (👨‍💻 18 · 🔀 31 · 📥 5.9K · 📋 120 - 18% open · ⏱️ 17.01.2020):

     git clone https://github.com/Hydrospheredata/hydro-serving
    
  • PyPi (📥 150 / month · ⏱️ 12.11.2019):

     pip install hs
    
  • Dockerhub (📥 190K · ⏱️ 13.01.2020):

     docker pull hydrosphere/serving-manager
    
Cortex (🥉18 · ⭐ 2.8K) - Deploy machine learning models in production. Apache-2
  • GitHub (👨‍💻 10 · 🔀 190 · 📋 400 - 21% open · ⏱️ 23.01.2020):

     git clone https://github.com/cortexlabs/cortex
    
OpenVINO DLDT (🥉14 · ⭐ 1K) - OpenVINO Toolkit - Deep Learning Deployment Toolkit repository. Apache-2
  • GitHub (👨‍💻 7 · 🔀 330 · 📥 140 · 📋 320 - 67% open · ⏱️ 15.11.2019):

     git clone https://github.com/opencv/dldt
    
Show 3 hidden projects...
RedisAI (🥉15 · ⭐ 300) - A Redis module for serving tensors and executing deep learning graphs. ❗️RSAL
  • GitHub (👨‍💻 15 · 🔀 31 · 📋 120 - 28% open · ⏱️ 24.01.2020):

     git clone https://github.com/RedisAI/RedisAI
    
  • Dockerhub (📥 14K · ⏱️ 24.01.2020):

     docker pull redisai/redisai
    
GraphPipe (🥉14 · ⭐ 690 · 💀) - Machine Learning Model Deployment Made Simple. ❗️UPL-1.0
  • GitHub (👨‍💻 4 · 🔀 96 · 📋 13 - 92% open · ⏱️ 16.10.2018):

     git clone https://github.com/oracle/graphpipe
    
  • PyPi (📥 180 / month · 📦 2 · ⏱️ 15.08.2018):

     pip install graphpipe
    
  • Dockerhub (📥 1K · ⏱️ 02.11.2018):

     docker pull sleepsonthefloor/graphpipe-tf
    
Openscoring (🥉13 · ⭐ 490 · 💤) - REST web service for the true real-time scoring (1 ms) of R, Scikit-.. ❗️AGPL-3.0
  • GitHub (👨‍💻 1 · 🔀 140 · 📥 1.5K · 📋 50 - 12% open · ⏱️ 19.06.2019):

     git clone https://github.com/openscoring/openscoring
    
  • Maven (⏱️ 12.01.2019):

     <dependency>
     	<groupId>org.openscoring</groupId>
     	<artifactId>openscoring</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    

ML Applications & Services

Back to top

Task-specific machine learning applications and services.

bert-as-service (🥇27 · ⭐ 6.6K) - Mapping a variable-length sentence to a fixed-length vector using BERT model. MIT
  • GitHub (👨‍💻 38 · 🔀 1.3K · 📋 390 - 44% open · ⏱️ 21.01.2020):

     git clone https://github.com/hanxiao/bert-as-service
    
  • PyPi (📥 10K / month · 📦 10 · ⏱️ 20.12.2019):

     pip install bert-serving-server
    
OCRmyPDF (🥇24 · ⭐ 2.2K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be.. ❗️GPL-3.0
  • GitHub (👨‍💻 28 · 🔀 270 · 📦 25 · 📋 440 - 12% open · ⏱️ 18.01.2020):

     git clone https://github.com/jbarlow83/OCRmyPDF
    
  • PyPi (📥 6.9K / month · 📦 12 · ⏱️ 19.01.2020):

     pip install ocrmypdf
    
GNES (🥈23 · ⭐ 1K · 🐣) - GNES is Generic Neural Elastic Search, a cloud-native semantic search system.. Apache-2
  • GitHub (👨‍💻 11 · 🔀 180 · 📦 2 · 📋 23 - 69% open · ⏱️ 24.10.2019):

     git clone https://github.com/gnes-ai/gnes
    
  • PyPi (📥 250 / month · 📦 1 · ⏱️ 06.11.2019):

     pip install gnes
    
  • Dockerhub (📥 94K · ⏱️ 12.11.2019):

     docker pull gnes/gnes
    
face-api.js (🥈22 · ⭐ 880) - Context aware, pluggable and customizable data protection and anonymization.. MIT
  • GitHub (👨‍💻 18 · 🔀 75 · 📋 54 - 31% open · ⏱️ 19.01.2020):

     git clone https://github.com/microsoft/presidio
    
  • NPM (📥 14K / month · 📦 160 · ⏱️ 15.12.2019):

     npm install face-api.js
    
Hastic (🥈18 · ⭐ 180) - Server for managing data for analytics. Apache-2
  • GitHub (👨‍💻 5 · 🔀 8 · 📥 410 · 📋 490 - 36% open · ⏱️ 27.12.2019):

     git clone https://github.com/hastic/hastic-server
    
  • Dockerhub (📥 41K · ⏱️ 07.11.2019):

     docker pull hastic/server
    
Real-Time-Voice-Cloning (🥈17 · ⭐ 15K) - Clone a voice in 5 seconds to generate arbitrary speech in real-time. MIT
  • GitHub (👨‍💻 9 · 🔀 2.6K · 📋 230 - 41% open · ⏱️ 13.11.2019):

     git clone https://github.com/CorentinJ/Real-Time-Voice-Cloning
    
DeOldify (🥈17 · ⭐ 8.9K) - A Deep Learning based project for colorizing and restoring old images (and video!). MIT
  • GitHub (👨‍💻 23 · 🔀 960 · 📋 130 - 11% open · ⏱️ 21.01.2020):

     git clone https://github.com/jantic/DeOldify
    
DeepFaceLab (🥉16 · ⭐ 12K) - DeepFaceLab is a tool that utilizes machine learning to replace faces in.. ❗️GPL-3.0
  • GitHub (👨‍💻 17 · 🔀 2.7K · 📋 490 - 35% open · ⏱️ 24.01.2020):

     git clone https://github.com/iperov/DeepFaceLab
    
Automatic-Speech-Recognition (🥉15 · ⭐ 2.6K) - End-to-end Automatic Speech Recognition for Madarian and.. MIT
  • GitHub (👨‍💻 9 · 🔀 510 · 📋 86 - 77% open · ⏱️ 17.10.2019):

     git clone https://github.com/zzw922cn/Automatic_Speech_Recognition
    
Presidio (🥉15 · ⭐ 880) - Context aware, pluggable and customizable data protection and anonymization service.. MIT
  • GitHub (👨‍💻 18 · 🔀 75 · 📋 54 - 31% open · ⏱️ 19.01.2020):

     git clone https://github.com/microsoft/presidio
    
  • Dockerhub:

     docker pull mcr.microsoft.com/presidio-api
    
neural-style (🥉13 · ⭐ 5K) - Neural style in TensorFlow!. ❗️GPL-3.0
  • GitHub (👨‍💻 17 · 🔀 1.4K · 📋 120 - 4% open · ⏱️ 06.10.2019):

     git clone https://github.com/anishathalye/neural-style
    
Deep Colorization (🥉13 · ⭐ 2.1K) - Deep learning software for colorizing black and white images with a few.. MIT
  • GitHub (👨‍💻 7 · 🔀 340 · 📋 63 - 31% open · ⏱️ 03.10.2019):

     git clone https://github.com/junyanz/interactive-deep-colorization
    
Show 3 hidden projects...
FastPhotoStyle (🥉13 · ⭐ 10K · 💤) - Style transfer, deep learning, feature transform. ❗️CC-BY-4.0
  • GitHub (👨‍💻 5 · 🔀 1K · 📋 78 - 55% open · ⏱️ 27.02.2019):

     git clone https://github.com/NVIDIA/FastPhotoStyle
    
AlphaPose (🥉12 · ⭐ 3.5K) - Real-Time and Accurate Multi-Person Pose Estimation&Tracking System. ❗️Unlicensed
  • GitHub (👨‍💻 4 · 🔀 950 · 📋 490 - 13% open · ⏱️ 14.01.2020):

     git clone https://github.com/MVIG-SJTU/AlphaPose
    
BERTsearch (🥉7 · ⭐ 340 · 🐣) - Elasticsearch with BERT for advanced document search. ❗️Unlicensed
  • GitHub (👨‍💻 2 · 🔀 58 · 📋 9 - 22% open · ⏱️ 14.11.2019):

     git clone https://github.com/Hironsan/bertsearch
    

Performance Optimization & Accelerators

Back to top

Compilers, accelerators, and libraries to improve compute performance and optimize machine learning models.

Cython (🥇36 · ⭐ 4.8K) - The most widely used Python to C compiler. Apache-2
  • GitHub (👨‍💻 370 · 🔀 950 · 📦 41K · 📋 2.1K - 36% open · ⏱️ 23.01.2020):

     git clone https://github.com/cython/cython
    
  • PyPi (📥 3.6M / month · 📦 27K · ⏱️ 01.11.2019):

     pip install Cython
    
  • Conda (⏱️ 20.11.2019):

     conda install -c anaconda cython
    
Numba (🥈35 · ⭐ 4.9K) - NumPy aware dynamic Python compiler using LLVM. BSD-2
  • GitHub (👨‍💻 180 · 🔀 590 · 📦 14K · 📋 2.9K - 36% open · ⏱️ 22.01.2020):

     git clone https://github.com/numba/numba
    
  • PyPi (📥 860K / month · 📦 4K · ⏱️ 03.01.2020):

     pip install numba
    
  • Conda (⏱️ 17.01.2020):

     conda install -c anaconda numba
    
CuPy (🥈30 · ⭐ 3.9K) - NumPy-like API accelerated with CUDA. MIT
  • GitHub (👨‍💻 220 · 🔀 330 · 📋 800 - 41% open · ⏱️ 24.01.2020):

     git clone https://github.com/cupy/cupy
    
  • PyPi (📥 9.3K / month · 📦 190 · ⏱️ 22.01.2020):

     pip install cupy
    
  • Dockerhub (📥 44K · ⭐ 5 · ⏱️ 24.01.2020):

     docker pull cupy/cupy
    
mkl-dnn (🥈24 · ⭐ 1.8K) - Deep Neural Network Library (DNNL). Apache-2
  • GitHub (👨‍💻 110 · 🔀 430 · 📥 1.9M · 📋 550 - 4% open · ⏱️ 21.01.2020):

     git clone https://github.com/intel/mkl-dnn
    
  • Conda (⏱️ 18.09.2019):

     conda install -c anaconda mkl
    
TVM (🥉23 · ⭐ 4.9K) - Open deep learning compiler stack for cpu, gpu and specialized accelerators. Apache-2
  • GitHub (👨‍💻 320 · 🔀 1.3K · 📥 59 · 📋 1.3K - 10% open · ⏱️ 24.01.2020):

     git clone https://github.com/apache/incubator-tvm
    
  • Dockerhub (📥 1.7K · ⭐ 4 · ⏱️ 27.09.2019):

     docker pull tvmai/demo-cpu
    
nGraph (🥉22 · ⭐ 1.2K) - nGraph - open source C++ library, compiler and runtime for Deep Learning. Apache-2
  • GitHub (👨‍💻 94 · 🔀 180 · 📦 1 · 📋 250 - 38% open · ⏱️ 24.01.2020):

     git clone https://github.com/NervanaSystems/ngraph
    
  • PyPi (📥 300 / month · ⏱️ 09.11.2019):

     pip install ngraph-core
    
OpenBLAS (🥉21 · ⭐ 3.1K) - OpenBLAS is an optimized BLAS library based on GotoBLAS2 1.13 BSD version. BSD-3
  • GitHub (👨‍💻 160 · 🔀 860 · 📥 1.5K · 📋 1.4K - 10% open · ⏱️ 23.01.2020):

     git clone https://github.com/xianyi/OpenBLAS
    
  • Conda (⏱️ 18.07.2019):

     conda install -c anaconda openblas
    
Glow (🥉17 · ⭐ 1.9K) - Compiler for Neural Network hardware accelerators. Apache-2
  • GitHub (👨‍💻 130 · 🔀 340 · 📋 590 - 33% open · ⏱️ 24.01.2020):

     git clone https://github.com/pytorch/glow
    
TensorRT (🥉16 · ⭐ 1.9K) - TensorRT is a C++ library for high performance inference on NVIDIA GPUs and.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 370 · 📋 300 - 26% open · ⏱️ 23.01.2020):

     git clone https://github.com/NVIDIA/TensorRT
    
PocketFlow (🥉14 · ⭐ 2.4K · 💤) - An Automatic Model Compression (AutoMC) framework for developing smaller.. BSD-3
  • GitHub (👨‍💻 11 · 🔀 450 · 📋 270 - 25% open · ⏱️ 28.05.2019):

     git clone https://github.com/Tencent/PocketFlow
    

Data Storage

Back to top

Dart Storage tools such as relational, document, graph, time-series, and key-value databases. For a more in-depth comparison, take a look at the DB Engines Ranking.

Elasticsearch (🥇40 · ⭐ 51K) - Open Source, Distributed, RESTful Search Engine. Apache-2
  • GitHub (👨‍💻 1.5K · 🔀 16K · 📋 23K - 10% open · ⏱️ 24.01.2020):

     git clone https://github.com/elastic/elasticsearch
    
  • PyPi (📥 3.1M / month · 📦 11K · ⏱️ 19.01.2020):

     pip install elasticsearch
    
  • NPM (📥 970K / month · 📦 8K · ⏱️ 15.01.2020):

     npm install elasticsearch
    
  • Dockerhub (📥 350M · ⭐ 4.1K · ⏱️ 23.01.2020):

     docker pull elasticsearch
    
  • Maven (📦 2.4K · ⏱️ 06.09.2019):

     <dependency>
     	<groupId>org.elasticsearch.client</groupId>
     	<artifactId>transport</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
MongoDB (🥇38 · ⭐ 24K) - The MongoDB Database. ❗️SSPL
  • GitHub (👨‍💻 660 · 🔀 4.3K · ⏱️ 24.01.2020):

     git clone https://github.com/mongodb/mongo
    
  • PyPi (📥 5.4M / month · 📦 36K · ⏱️ 08.01.2020):

     pip install pymongo
    
  • NPM (📥 5.3M / month · 📦 230K · ⏱️ 17.01.2020):

     npm install mongodb
    
  • Conda (⏱️ 02.11.2018):

     conda install -c anaconda mongodb
    
  • Dockerhub (📥 1.5B · ⭐ 6.5K · ⏱️ 18.01.2020):

     docker pull mongo
    
  • Maven (📦 210 · ⏱️ 17.01.2020):

     <dependency>
     	<groupId>org.mongodb</groupId>
     	<artifactId>mongodb-driver-sync</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
PostgreSQL (🥇38 · ⭐ 15K) - Object-relational database that supports an extended subset of the SQL.. ❗️PostgreSQL
  • GitHub (👨‍💻 50 · 🔀 2.1K · ⏱️ 24.01.2020):

     git clone https://github.com/postgres/postgres
    
  • PyPi (📥 5.9M / month · 📦 120K · ⏱️ 14.04.2019):

     pip install psycopg2
    
  • NPM (📥 4.1M / month · 📦 91K · ⏱️ 10.01.2020):

     npm install pg
    
  • Dockerhub (📥 2B · ⭐ 7.4K · ⏱️ 24.01.2020):

     docker pull postgres
    
  • Maven (📦 52K · ⏱️ 15.03.2018):

     <dependency>
     	<groupId>org.postgresql</groupId>
     	<artifactId>postgresql</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Redis (🥇37 · ⭐ 49K) - Redis is an in-memory database that persists on disk. The data model is key-value,.. BSD-3
  • GitHub (👨‍💻 400 · 🔀 16K · 📋 4.8K - 52% open · ⏱️ 15.01.2020):

     git clone https://github.com/antirez/redis
    
  • PyPi (📥 7.4M / month · 📦 49K · ⏱️ 13.10.2019):

     pip install redis
    
  • NPM (📥 4.9M / month · 📦 93K · ⏱️ 08.08.2017):

     npm install redis
    
  • Conda (⏱️ 24.01.2019):

     conda install -c anaconda redis
    
  • Dockerhub (📥 1.7B · ⭐ 7.7K · ⏱️ 18.01.2020):

     docker pull redis
    
  • Maven (📦 40K · ⏱️ 02.12.2018):

     <dependency>
     	<groupId>redis.clients</groupId>
     	<artifactId>jedis</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
InfluxDB (🥇37 · ⭐ 19K) - Scalable datastore for metrics, events, and real-time analytics. MIT
  • GitHub (👨‍💻 480 · 🔀 2.5K · 📋 9.3K - 6% open · ⏱️ 24.01.2020):

     git clone https://github.com/influxdata/influxdb
    
  • PyPi (📥 750K / month · 📦 1.9K · ⏱️ 26.08.2019):

     pip install influxdb
    
  • NPM (📥 140K / month · 📦 970 · ⏱️ 13.11.2019):

     npm install influx
    
  • Dockerhub (📥 400M · ⭐ 900 · ⏱️ 23.01.2020):

     docker pull influxdb
    
  • Maven (📦 960 · ⏱️ 12.09.2018):

     <dependency>
     	<groupId>org.influxdb</groupId>
     	<artifactId>influxdb-java</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Prometheus (🥈35 · ⭐ 30K) - The Prometheus monitoring system and time series database. Apache-2
  • GitHub (👨‍💻 480 · 🔀 4.2K · 📥 12M · 📋 3.3K - 14% open · ⏱️ 22.01.2020):

     git clone https://github.com/prometheus/prometheus
    
  • PyPi (📥 4.7M / month · 📦 1.2K · ⏱️ 20.06.2019):

     pip install prometheus_client
    
  • Conda:

     conda install -c conda-forge prometheus_client
    
  • Dockerhub (📥 510M · ⭐ 930 · ⏱️ 22.01.2020):

     docker pull prom/prometheus
    
  • Maven (📦 1.1K · ⏱️ 30.07.2018):

     <dependency>
     	<groupId>io.prometheus</groupId>
     	<artifactId>simpleclient</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Solr (🥈35 · ⭐ 4K) - Apache Lucene and Solr open-source search software. Apache-2
  • GitHub (👨‍💻 240 · 🔀 2.2K · 📦 2.8K · ⏱️ 24.01.2020):

     git clone https://github.com/apache/lucene-solr
    
  • PyPi (📥 51K / month · 📦 2.3K · ⏱️ 03.10.2018):

     pip install pysolr
    
  • NPM (📥 50K / month · 📦 840 · ⏱️ 19.05.2017):

     npm install solr-client
    
  • Dockerhub (📥 60M · ⭐ 730 · ⏱️ 23.01.2020):

     docker pull solr
    
  • Maven (📦 3K · ⏱️ 28.12.2019):

     <dependency>
     	<groupId>org.apache.solr</groupId>
     	<artifactId>solr-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Minio (🥈34 · ⭐ 20K) - MinIO is a high performance object storage server compatible with Amazon S3 APIs. Apache-2
  • GitHub (👨‍💻 220 · 🔀 1.9K · 📋 3.8K - 2% open · ⏱️ 23.01.2020):

     git clone https://github.com/minio/minio
    
  • PyPi (📥 120K / month · 📦 340 · ⏱️ 30.12.2019):

     pip install minio
    
  • NPM (📥 72K / month · 📦 350 · ⏱️ 24.12.2019):

     npm install minio
    
  • Conda:

     conda install -c conda-forge minio
    
  • Dockerhub (📥 310M · ⭐ 290 · ⏱️ 24.01.2020):

     docker pull minio/minio
    
  • Maven (📦 200 · ⏱️ 11.09.2019):

     <dependency>
     	<groupId>io.minio</groupId>
     	<artifactId>minio</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Neo4j (🥈34 · ⭐ 8.1K) - Graphs for Everyone. ❗️GPL-3.0
  • GitHub (👨‍💻 260 · 🔀 1.7K · 📦 560 · 📋 2.7K - 7% open · ⏱️ 22.01.2020):

     git clone https://github.com/neo4j/neo4j
    
  • PyPi (📥 47K / month · 📦 62 · ⏱️ 11.11.2019):

     pip install neo4j
    
  • NPM (📥 65K / month · 📦 980 · ⏱️ 16.12.2019):

     npm install neo4j-driver
    
  • Dockerhub (📥 66M · ⭐ 750 · ⏱️ 23.01.2020):

     docker pull neo4j
    
  • Maven (📦 3K · ⏱️ 17.10.2019):

     <dependency>
     	<groupId>org.neo4j</groupId>
     	<artifactId>neo4j</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Arrow (🥈34 · ⭐ 5K) - Apache Arrow is a cross-language development platform for in-memory data. It.. Apache-2
  • GitHub (👨‍💻 430 · 🔀 1.3K · 📦 5 · 📋 510 - 15% open · ⏱️ 24.01.2020):

     git clone https://github.com/apache/arrow
    
  • PyPi (📥 4.9M / month · 📦 800 · ⏱️ 01.11.2019):

     pip install pyarrow
    
  • NPM (📥 7.7K / month · 📦 30 · ⏱️ 01.11.2019):

     npm install apache-arrow
    
  • Conda:

     conda install -c conda-forge pyarrow
    
  • Maven (📦 250 · ⏱️ 30.09.2019):

     <dependency>
     	<groupId>org.apache.arrow</groupId>
     	<artifactId>arrow-vector</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
etcd (🥈33 · ⭐ 29K) - Distributed reliable key-value store for the most critical data of a distributed.. Apache-2
  • GitHub (👨‍💻 620 · 🔀 6K · 📥 22M · 📋 4.8K - 12% open · ⏱️ 23.01.2020):

     git clone https://github.com/etcd-io/etcd
    
  • PyPi (📥 74K / month · 📦 56 · ⏱️ 07.12.2019):

     pip install etcd3
    
  • NPM (📥 8.8K / month · 📦 34 · ⏱️ 03.07.2019):

     npm install etcd3
    
  • Dockerhub (📥 4M · ⭐ 25 · ⏱️ 24.01.2020):

     docker pull bitnami/etcd
    
  • Maven:

     <dependency>
     	<groupId>io.etcd</groupId>
     	<artifactId>jetcd-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
LevelDB (🥈33 · ⭐ 20K) - LevelDB is a fast key-value storage library written at Google that provides an.. Apache-2
  • GitHub (👨‍💻 46 · 🔀 4.5K · 📋 590 - 21% open · ⏱️ 15.01.2020):

     git clone https://github.com/google/leveldb
    
  • PyPi (📥 44K / month · 📦 630 · ⏱️ 22.01.2020):

     pip install plyvel
    
  • NPM (📥 940K / month · 📦 15K · ⏱️ 04.10.2019):

     npm install levelup
    
Hazelcast (🥈33 · ⭐ 3.6K) - Open Source In-Memory Data Grid. Apache-2
  • GitHub (👨‍💻 250 · 🔀 1.2K · 📦 9.5K · 📋 6.1K - 12% open · ⏱️ 24.01.2020):

     git clone https://github.com/hazelcast/hazelcast
    
  • PyPi (📥 5.7K / month · 📦 1 · ⏱️ 15.07.2019):

     pip install hazelcast-python-client
    
  • NPM (📥 1.7K / month · 📦 38 · ⏱️ 06.05.2019):

     npm install hazelcast-client
    
  • Dockerhub (📥 6.1M · ⭐ 58 · ⏱️ 10.01.2020):

     docker pull hazelcast/hazelcast
    
  • Maven (📦 5.5K · ⏱️ 23.02.2019):

     <dependency>
     	<groupId>com.hazelcast</groupId>
     	<artifactId>hazelcast</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
HBase (🥈33 · ⭐ 3.3K) - Non-relational distributed database modeled after Google's Bigtable and written.. Apache-2
  • GitHub (👨‍💻 470 · 🔀 2.2K · 📦 2.6K · ⏱️ 24.01.2020):

     git clone https://github.com/apache/hbase
    
  • PyPi (📥 180K / month · 📦 410 · ⏱️ 03.04.2017):

     pip install happybase
    
  • Maven (📦 5.1K · ⏱️ 27.10.2018):

     <dependency>
     	<groupId>org.apache.hbase</groupId>
     	<artifactId>hbase-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
RethinkDB (🥈32 · ⭐ 24K) - The open-source database for the realtime web. Apache-2
  • GitHub (👨‍💻 240 · 🔀 1.8K · 📥 2.9K · 📦 260 · 📋 6.2K - 23% open · ⏱️ 13.01.2020):

     git clone https://github.com/rethinkdb/rethinkdb
    
  • PyPi (📥 21K / month · 📦 870 · ⏱️ 02.11.2019):

     pip install rethinkdb
    
  • NPM (📥 54K / month · 📦 3.3K · ⏱️ 13.12.2019):

     npm install rethinkdb
    
  • Dockerhub (📥 54M · ⭐ 530 · ⏱️ 16.01.2020):

     docker pull rethinkdb
    
  • Maven (⏱️ 26.07.2016):

     <dependency>
     	<groupId>com.rethinkdb</groupId>
     	<artifactId>rethinkdb-driver</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
OrientDB (🥈32 · ⭐ 4.2K) - OrientDB is the most versatile DBMS supporting Graph, Document, Reactive,.. Apache-2
  • GitHub (👨‍💻 180 · 🔀 810 · 📦 350 · 📋 8.4K - 17% open · ⏱️ 24.01.2020):

     git clone https://github.com/orientechnologies/orientdb
    
  • PyPi (📥 3K / month · 📦 40 · ⏱️ 29.04.2017):

     pip install pyorient
    
  • NPM (📥 18K / month · 📦 200 · ⏱️ 11.12.2019):

     npm install orientjs
    
  • Dockerhub (📥 10M · ⭐ 130 · ⏱️ 09.01.2020):

     docker pull orientdb
    
  • Maven (📦 660 · ⏱️ 03.07.2019):

     <dependency>
     	<groupId>com.orientechnologies</groupId>
     	<artifactId>orientdb-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
ClickHouse (🥈31 · ⭐ 9.7K) - ClickHouse is a free analytics DBMS for big data. Apache-2
  • GitHub (👨‍💻 540 · 🔀 1.7K · 📋 3.8K - 31% open · ⏱️ 24.01.2020):

     git clone https://github.com/ClickHouse/ClickHouse
    
  • PyPi (📥 290K / month · 📦 18 · ⏱️ 20.09.2019):

     pip install clickhouse-driver
    
  • NPM (📥 4.5K / month · 📦 12 · ⏱️ 04.11.2018):

     npm install clickhouse
    
  • Dockerhub (📥 4.5M · ⭐ 210 · ⏱️ 23.01.2020):

     docker pull yandex/clickhouse-server
    
ArangoDB (🥈31 · ⭐ 9.3K) - ArangoDB is a native multi-model database with flexible data models for.. Apache-2
  • GitHub (👨‍💻 120 · 🔀 590 · 📋 3.6K - 15% open · ⏱️ 24.01.2020):

     git clone https://github.com/arangodb/arangodb
    
  • PyPi (📥 5.8K / month · 📦 10 · ⏱️ 30.10.2019):

     pip install pyarango
    
  • NPM (📥 14K / month · 📦 440 · ⏱️ 24.01.2020):

     npm install arangojs
    
  • Dockerhub (📥 17M · ⭐ 180 · ⏱️ 23.01.2020):

     docker pull arangodb
    
  • Maven (📦 130 · ⏱️ 05.09.2019):

     <dependency>
     	<groupId>com.arangodb</groupId>
     	<artifactId>arangodb-java-driver</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Ignite (🥈31 · ⭐ 3K) - Memory-centric distributed database, caching and processing platform designed to.. Apache-2
  • GitHub (👨‍💻 330 · 🔀 1.4K · 📦 1 · ⏱️ 24.01.2020):

     git clone https://github.com/apache/ignite
    
  • PyPi (📥 1.6K / month · ⏱️ 23.11.2018):

     pip install pyignite
    
  • NPM (📥 330 / month · ⏱️ 10.12.2018):

     npm install apache-ignite-client
    
  • Dockerhub (📥 9.6M · ⭐ 58 · ⏱️ 19.09.2019):

     docker pull apacheignite/ignite
    
  • Maven (📦 460 · ⏱️ 10.07.2018):

     <dependency>
     	<groupId>org.apache.ignite</groupId>
     	<artifactId>ignite-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Cassandra (🥉30 · ⭐ 6.8K) - Distributed, wide column store, NoSQL database designed to handle large.. Apache-2
  • GitHub (👨‍💻 400 · 🔀 2.5K · ⏱️ 24.01.2020):

     git clone https://github.com/apache/cassandra
    
  • PyPi (📥 220K / month · 📦 1.2K · ⏱️ 15.01.2020):

     pip install cassandra-driver
    
  • NPM (📥 130K / month · 📦 1.3K · ⏱️ 06.11.2019):

     npm install cassandra-driver
    
  • Dockerhub (📥 92M · ⭐ 1.1K · ⏱️ 28.12.2019):

     docker pull cassandra
    
  • Maven:

     <dependency>
     	<groupId>com.datastax.oss</groupId>
     	<artifactId>java-driver-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Graphite (🥉30 · ⭐ 4.9K) - A highly scalable real-time graphing system. Apache-2
  • GitHub (👨‍💻 410 · 🔀 1.2K · 📦 45 · 📋 1.1K - 33% open · ⏱️ 23.01.2020):

     git clone https://github.com/graphite-project/graphite-web
    
  • Dockerhub (📥 8.2M · ⭐ 70 · ⏱️ 24.10.2019):

     docker pull graphiteapp/graphite-statsd
    
TiDB (🥉28 · ⭐ 22K) - TiDB is an open source distributed HTAP database compatible with the MySQL protocol. Apache-2
  • GitHub (👨‍💻 400 · 🔀 3.3K · 📋 3.8K - 33% open · ⏱️ 23.01.2020):

     git clone https://github.com/pingcap/tidb
    
  • Dockerhub (📥 520K · ⭐ 51 · ⏱️ 24.01.2020):

     docker pull pingcap/tidb
    
CrateDB (🥉28 · ⭐ 2.8K) - CrateDB is a distributed SQL database that makes it simple to store and analyze.. Apache-2
  • GitHub (👨‍💻 79 · 🔀 350 · 📋 860 - 10% open · ⏱️ 24.01.2020):

     git clone https://github.com/crate/crate
    
  • PyPi (📥 11K / month · 📦 42 · ⏱️ 19.09.2019):

     pip install crate
    
  • Dockerhub (📥 14M · ⭐ 140 · ⏱️ 23.01.2020):

     docker pull crate
    
  • Maven:

     <dependency>
     	<groupId>io.crate</groupId>
     	<artifactId>crate-jdbc</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Dgraph (🥉27 · ⭐ 12K) - Fast, Distributed Graph DB. Apache-2
  • GitHub (👨‍💻 130 · 🔀 860 · 📥 110K · 📋 2K - 11% open · ⏱️ 24.01.2020):

     git clone https://github.com/dgraph-io/dgraph
    
  • PyPi (📥 4K / month · 📦 10 · ⏱️ 10.09.2019):

     pip install pydgraph
    
  • NPM (📥 3.6K / month · 📦 16 · ⏱️ 01.10.2019):

     npm install dgraph-js
    
  • Dockerhub (📥 1.8M · ⭐ 51 · ⏱️ 24.01.2020):

     docker pull dgraph/dgraph
    
  • Maven:

     <dependency>
     	<groupId>io.dgraph</groupId>
     	<artifactId>dgraph4j</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Geode (🥉27 · ⭐ 1.7K) - Data management platform that provides real-time, consistent access to data-.. Apache-2
  • GitHub (👨‍💻 170 · 🔀 570 · ⏱️ 24.01.2020):

     git clone https://github.com/apache/geode
    
  • Dockerhub (📥 59K · ⭐ 19 · ⏱️ 30.12.2019):

     docker pull apachegeode/geode
    
  • Maven (📦 140 · ⏱️ 20.12.2019):

     <dependency>
     	<groupId>org.apache.geode</groupId>
     	<artifactId>geode-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
OmniSciDB (🥉26 · ⭐ 2.1K) - OmniSciDB (formerly MapD Core). Apache-2
  • GitHub (👨‍💻 80 · 🔀 290 · 📋 350 - 35% open · ⏱️ 21.01.2020):

     git clone https://github.com/omnisci/omniscidb
    
  • PyPi (📥 2.4K / month · 📦 20 · ⏱️ 05.12.2019):

     pip install pymapd
    
  • NPM (📥 120 / month · 📦 26 · ⏱️ 22.08.2019):

     npm install @mapd/connector
    
  • Conda:

     conda install -c conda-forge pymapd
    
Druid (🥉25 · ⭐ 9.1K) - Apache Druid: a high performance real-time analytics database. Apache-2
  • GitHub (👨‍💻 380 · 🔀 2.2K · 📥 120 · 📋 3.2K - 26% open · ⏱️ 24.01.2020):

     git clone https://github.com/apache/incubator-druid
    
  • Dockerhub (📥 29K · ⭐ 9 · ⏱️ 10.12.2019):

     docker pull apache/incubator-druid
    
JanusGraph (🥉25 · ⭐ 3.1K) - JanusGraph: an open-source, distributed graph database. Apache-2
  • GitHub (👨‍💻 140 · 🔀 750 · 📥 110K · 📋 1.2K - 36% open · ⏱️ 15.01.2020):

     git clone https://github.com/JanusGraph/janusgraph
    
  • Dockerhub (📥 29K · ⭐ 6 · ⏱️ 21.10.2019):

     docker pull janusgraph/janusgraph
    
  • Maven (📦 84 · ⏱️ 08.10.2018):

     <dependency>
     	<groupId>org.janusgraph</groupId>
     	<artifactId>janusgraph-core</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Cayley (🥉24 · ⭐ 13K) - An open-source graph database. Apache-2
  • GitHub (👨‍💻 100 · 🔀 1.2K · 📥 26K · 📋 460 - 16% open · ⏱️ 18.01.2020):

     git clone https://github.com/cayleygraph/cayley
    
  • PyPi (📥 160 / month · ⏱️ 26.10.2019):

     pip install pyley
    
  • NPM (📥 46 / month · ⏱️ 06.11.2019):

     npm install @cayleygraph/cayley
    
  • Dockerhub (📥 3.6K · ⭐ 5 · ⏱️ 18.01.2020):

     docker pull cayleygraph/cayley
    
Riak (🥉24 · ⭐ 3.4K) - Riak is a decentralized datastore from Basho Technologies. Apache-2
  • GitHub (👨‍💻 120 · 🔀 510 · 📥 63 · 📋 440 - 31% open · ⏱️ 08.01.2020):

     git clone https://github.com/basho/riak
    
  • PyPi (📥 5.6K / month · 📦 170 · ⏱️ 12.12.2016):

     pip install riak
    
  • NPM (📥 780 / month · 📦 60 · ⏱️ 12.12.2016):

     npm install basho-riak-client
    
  • Dockerhub (📥 650K · ⭐ 28 · ⏱️ 04.04.2017):

     docker pull basho/riak-kv
    
  • Maven (📦 300 · ⏱️ 15.12.2016):

     <dependency>
     	<groupId>com.basho.riak</groupId>
     	<artifactId>riak-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Tile38 (🥉23 · ⭐ 6.6K) - Real-time Geospatial and Geofencing. MIT
  • GitHub (👨‍💻 36 · 🔀 370 · 📥 39K · 📋 410 - 21% open · ⏱️ 11.12.2019):

     git clone https://github.com/tidwall/tile38
    
  • Dockerhub (📥 680K · ⭐ 15 · ⏱️ 11.12.2019):

     docker pull tile38/tile38
    
Quilt (🥉23 · ⭐ 860) - Quilt is a versioned data portal for AWS. Apache-2
  • GitHub (👨‍💻 29 · 🔀 51 · 📦 16 · 📋 91 - 51% open · ⏱️ 22.01.2020):

     git clone https://github.com/quiltdata/quilt
    
  • PyPi (📥 1.1K / month · 📦 4 · ⏱️ 18.01.2020):

     pip install quilt3
    
Sonic (🥉22 · ⭐ 8K) - Fast, lightweight & schema-less search backend. An alternative to Elasticsearch that.. MPL-2.0
  • GitHub (👨‍💻 22 · 🔀 230 · 📋 170 - 12% open · ⏱️ 09.01.2020):

     git clone https://github.com/valeriansaliou/sonic
    
  • PyPi (📥 140 / month · ⏱️ 03.08.2019):

     pip install sonic-client
    
  • NPM (📥 830 / month · ⏱️ 26.04.2019):

     npm install sonic-channel
    
  • Dockerhub (📥 4.1K · ⭐ 4 · ⏱️ 14.10.2019):

     docker pull valeriansaliou/sonic
    
  • Maven:

     <dependency>
     	<groupId>com.github.twohou</groupId>
     	<artifactId>java-sonic</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
rqlite (🥉21 · ⭐ 5.5K) - The lightweight, distributed relational database built on SQLite. MIT
  • GitHub (👨‍💻 17 · 🔀 300 · 📥 5.9K · 📋 270 - 19% open · ⏱️ 24.01.2020):

     git clone https://github.com/rqlite/rqlite
    
  • PyPi:

     pip install sqlalchemy_rqlite
    
  • NPM (📥 140 / month · ⏱️ 22.07.2019):

     npm install rqlite-js
    
  • Dockerhub (📥 17K · ⭐ 8 · ⏱️ 11.01.2020):

     docker pull rqlite/rqlite
    
  • Maven:

     <dependency>
     	<groupId>com.rqlite</groupId>
     	<artifactId>rqlite-java</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Doris (🥉21 · ⭐ 1.5K) - MPP-based interactive SQL data warehousing for reporting and analysis. Apache-2
  • GitHub (👨‍💻 82 · 🔀 420 · 📋 960 - 35% open · ⏱️ 21.01.2020):

     git clone https://github.com/apache/incubator-doris
    
  • Dockerhub (📥 1.6K · ⭐ 4 · ⏱️ 04.01.2020):

     docker pull apachedoris/doris-dev
    
Gaffer (🥉20 · ⭐ 1.6K) - A large-scale entity and relation database supporting aggregation of properties. Apache-2
  • GitHub (👨‍💻 40 · 🔀 320 · 📥 26 · 📋 1.1K - 8% open · ⏱️ 23.01.2020):

     git clone https://github.com/gchq/Gaffer
    
  • Maven (📦 8 · ⏱️ 06.01.2020):

     <dependency>
     	<groupId>uk.gov.gchq.gaffer</groupId>
     	<artifactId>graph</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
EdgeDB (🥉19 · ⭐ 3.5K) - The next generation relational database. Apache-2
  • GitHub (👨‍💻 18 · 🔀 79 · 📋 390 - 25% open · ⏱️ 23.01.2020):

     git clone https://github.com/edgedb/edgedb
    
  • PyPi (📥 2.9K / month · ⏱️ 11.01.2020):

     pip install edgedb
    
  • Dockerhub (📥 4.4K · ⭐ 2 · ⏱️ 24.01.2020):

     docker pull edgedb/edgedb
    
Nebula (🥉19 · ⭐ 2.3K) - A distributed, fast open-source graph database featuring horizontal scalability.. Apache-2
  • GitHub (👨‍💻 40 · 🔀 300 · 📥 400 · 📋 660 - 32% open · ⏱️ 21.01.2020):

     git clone https://github.com/vesoft-inc/nebula
    
  • Dockerhub (📥 2.2K · ⭐ 3 · ⏱️ 24.01.2020):

     docker pull vesoft/nebula-graph
    
TileDB-Inc/TileDB (🥉19 · ⭐ 570) - The Fastest Array Storage Engine. MIT
  • GitHub (👨‍💻 27 · 🔀 72 · 📥 380 · 📋 620 - 16% open · ⏱️ 22.01.2020):

     git clone https://github.com/TileDB-Inc/TileDB
    
  • Conda:

     conda install -c conda-forge tiledb
    
  • Dockerhub (📥 520 · ⏱️ 09.10.2019):

     docker pull tiledb/tiledb
    
Show 5 hidden projects...
MySQL (🥈36 · ⭐ 24K) - MySQL Server, the world's most popular open source database, and MySQL Cluster, a.. ❗️GPL-2.0
  • GitHub (👨‍💻 570 · 🔀 2.1K · ⏱️ 09.12.2019):

     git clone https://github.com/mysql/mysql-server
    
  • PyPi (📥 2.1M / month · 📦 6.6K · ⏱️ 21.11.2019):

     pip install mysqlclient
    
  • NPM (📥 2.1M / month · 📦 100K · ⏱️ 23.01.2020):

     npm install mysql
    
  • Conda (⏱️ 18.11.2019):

     conda install -c anaconda mysql-connector-python
    
  • Dockerhub (📥 1.4B · ⭐ 9.1K · ⏱️ 15.01.2020):

     docker pull mysql
    
  • Maven (📦 300K · ⏱️ 27.09.2018):

     <dependency>
     	<groupId>mysql</groupId>
     	<artifactId>mysql-connector-java</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
MariaDB (🥉28 · ⭐ 6.4K) - MariaDB server is a community developed fork of MySQL server. Started by core.. ❗️GPL-2.0
  • GitHub (👨‍💻 1.6K · 🔀 920 · ⏱️ 24.01.2020):

     git clone https://github.com/MariaDB/server
    
  • NPM (📥 33K / month · 📦 580 · ⏱️ 19.07.2017):

     npm install mariadb
    
  • Dockerhub (📥 1.1B · ⭐ 3.2K · ⏱️ 16.01.2020):

     docker pull mariadb
    
TimescaleDB (🥉25 · ⭐ 8.1K) - An open-source time-series SQL database optimized for fast ingest and.. ❗️Unlicensed
  • GitHub (👨‍💻 43 · 🔀 430 · 📥 3.5K · 📋 660 - 28% open · ⏱️ 23.01.2020):

     git clone https://github.com/timescale/timescaledb
    
  • Dockerhub (📥 16M · ⭐ 65 · ⏱️ 15.01.2020):

     docker pull timescale/timescaledb
    
RavenDB (🥉25 · ⭐ 2.4K) - ACID Document Database. ❗️AGPL-3.0
  • GitHub (👨‍💻 320 · 🔀 690 · 📦 2.2K · 📋 400 - 1% open · ⏱️ 24.01.2020):

     git clone https://github.com/ravendb/ravendb
    
  • PyPi (📥 140 / month · 📦 2 · ⏱️ 27.02.2019):

     pip install pyravendb
    
  • NPM (📥 600 / month · 📦 28 · ⏱️ 16.09.2019):

     npm install ravendb
    
  • Dockerhub (📥 340K · ⭐ 27 · ⏱️ 21.01.2020):

     docker pull ravendb/ravendb
    
  • Maven (⏱️ 07.02.2018):

     <dependency>
     	<groupId>net.ravendb</groupId>
     	<artifactId>ravendb</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
Grakn (🥉25 · ⭐ 1.9K) - Grakn Core: The Knowledge Graph. ❗️AGPL-3.0
  • GitHub (👨‍💻 54 · 🔀 230 · 📥 13K · 📦 18 · 📋 1.9K - 10% open · ⏱️ 22.01.2020):

     git clone https://github.com/graknlabs/grakn
    
  • PyPi (📥 450 / month · 📦 6 · ⏱️ 25.11.2019):

     pip install grakn-client
    
  • NPM (📥 790 / month · 📦 2 · ⏱️ 25.11.2019):

     npm install grakn-client
    
  • Dockerhub (📥 290K · ⭐ 10 · ⏱️ 17.01.2020):

     docker pull graknlabs/grakn
    
  • Maven:

     <dependency>
     	<groupId>io.grakn.client</groupId>
     	<artifactId>grakn-client</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    

Database GUIs

Back to top

GUI tools for database administration and data management for a variety of databases.

mongo-express (🥇31 · ⭐ 4.1K) - Web-based MongoDB admin interface, written with Node.js and express. MIT
  • GitHub (👨‍💻 110 · 🔀 660 · 📦 590 · 📋 330 - 24% open · ⏱️ 30.12.2019):

     git clone https://github.com/mongo-express/mongo-express
    
  • NPM (📥 6.1K / month · 📦 390 · ⏱️ 24.12.2019):

     npm install mongo-express
    
  • Dockerhub (📥 45M · ⭐ 600 · ⏱️ 25.12.2019):

     docker pull mongo-express
    
Adminer (🥇29 · ⭐ 4.2K) - Database management in a single PHP file. Apache-2
  • GitHub (👨‍💻 130 · 🔀 750 · 📥 2.3M · 📦 98 · ⏱️ 20.12.2019):

     git clone https://github.com/vrana/adminer
    
  • Dockerhub (📥 100M · ⭐ 330 · ⏱️ 24.01.2020):

     docker pull adminer
    
SQLite Browser (🥈26 · ⭐ 12K) - Official home of the DB Browser for SQLite (DB4S) project. Previously.. Apache-2
  • GitHub (👨‍💻 120 · 🔀 1.5K · 📥 7.6M · 📋 1.7K - 23% open · ⏱️ 17.01.2020):

     git clone https://github.com/sqlitebrowser/sqlitebrowser
    
DBeaver (🥈26 · ⭐ 12K) - Free universal database tool and SQL client. Apache-2
  • GitHub (👨‍💻 150 · 🔀 1K · 📥 370K · 📋 7.1K - 19% open · ⏱️ 24.01.2020):

     git clone https://github.com/dbeaver/dbeaver
    
pgweb (🥈24 · ⭐ 6.3K) - Cross-platform client for PostgreSQL databases. MIT
  • GitHub (👨‍💻 45 · 🔀 420 · 📥 72K · 📋 260 - 16% open · ⏱️ 16.12.2019):

     git clone https://github.com/sosedoff/pgweb
    
  • Dockerhub (📥 4.3M · ⭐ 23 · ⏱️ 16.12.2019):

     docker pull sosedoff/pgweb
    
Sequel Pro (🥈23 · ⭐ 7.4K) - MySQL/MariaDB database management for macOS. MIT
  • GitHub (👨‍💻 41 · 🔀 640 · 📥 4M · 📋 3.5K - 29% open · ⏱️ 30.09.2019):

     git clone https://github.com/sequelpro/sequelpro
    
Dejavu (🥈22 · ⭐ 6.5K) - The Missing Web UI for Elasticsearch: Import, browse and edit data with rich filters.. MIT
  • GitHub (👨‍💻 24 · 🔀 410 · 📋 290 - 11% open · ⏱️ 24.01.2020):

     git clone https://github.com/appbaseio/dejavu
    
  • Dockerhub (📥 1.3M · ⭐ 24 · ⏱️ 19.12.2019):

     docker pull appbaseio/dejavu
    
Redis Desktop Manager (🥈21 · ⭐ 15K) - Cross-platform GUI management tool for Redis. ❗️GPL-3.0
  • GitHub (👨‍💻 70 · 🔀 2.5K · 📥 1.5M · 📋 4.3K - 0% open · ⏱️ 23.01.2020):

     git clone https://github.com/uglide/RedisDesktopManager
    
Kafka Manager (🥈21 · ⭐ 8.6K) - CMAK is a tool for managing Apache Kafka clusters. Apache-2
  • GitHub (👨‍💻 85 · 🔀 2K · 📋 530 - 70% open · ⏱️ 23.01.2020):

     git clone https://github.com/yahoo/kafka-manager
    
  • Dockerhub (📥 350K · ⭐ 41 · ⏱️ 12.04.2019):

     docker pull kafkamanager/kafka-manager
    
elasticsearch-head (🥈21 · ⭐ 6.8K) - A web front end for an elastic search cluster. Apache-2
  • GitHub (👨‍💻 68 · 🔀 1.4K · 📋 320 - 49% open · ⏱️ 24.09.2019):

     git clone https://github.com/mobz/elasticsearch-head
    
  • Dockerhub (📥 1.5M · ⭐ 56 · ⏱️ 31.01.2017):

     docker pull mobz/elasticsearch-head
    
OmniDB (🥈21 · ⭐ 1.9K) - Web tool for database management. MIT
  • GitHub (👨‍💻 22 · 🔀 240 · 📥 6.6K · 📋 500 - 37% open · ⏱️ 05.12.2019):

     git clone https://github.com/OmniDB/OmniDB
    
ElectroCRUD (🥉20 · ⭐ 980) - Database CRUD Application Built on Electron | MySQL, Postgres. MIT
  • GitHub (👨‍💻 44 · 🔀 220 · 📥 27K · 📋 46 - 17% open · ⏱️ 17.12.2019):

     git clone https://github.com/garrylachman/ElectroCRUD
    
Robo 3T (🥉19 · ⭐ 7.8K · 💤) - Native cross-platform MongoDB management tool. ❗️GPL-3.0
  • GitHub (👨‍💻 39 · 🔀 660 · 📥 41K · 📋 1.5K - 42% open · ⏱️ 12.04.2019):

     git clone https://github.com/Studio3T/robomongo
    
Mongoku (🥉19 · ⭐ 810) - The Web-scale GUI for MongoDB. MIT
  • GitHub (👨‍💻 7 · 🔀 40 · 📦 3 · 📋 29 - 41% open · ⏱️ 10.01.2020):

     git clone https://github.com/huggingface/Mongoku
    
  • NPM (📥 260 / month · ⏱️ 31.07.2019):

     npm install mongoku
    
  • Dockerhub (📥 210K · ⭐ 1 · ⏱️ 31.07.2019):

     docker pull huggingface/mongoku
    
Mirage (🥉18 · ⭐ 2K) - GUI for simplifying Elasticsearch Query DSL. Apache-2
  • GitHub (👨‍💻 9 · 🔀 100 · 📋 64 - 20% open · ⏱️ 11.10.2019):

     git clone https://github.com/appbaseio/mirage
    
  • Dockerhub (📥 320K · ⭐ 6 · ⏱️ 11.10.2019):

     docker pull appbaseio/mirage
    
Sequeler (🥉16 · ⭐ 510) - SQL Client built in Vala. ❗️GPL-3.0
  • GitHub (👨‍💻 34 · 🔀 48 · 📋 220 - 18% open · ⏱️ 17.11.2019):

     git clone https://github.com/Alecaddd/sequeler
    
FastoNoSQL (🥉15 · ⭐ 620) - FastoNoSQL is a crossplatform Redis, Memcached, SSDB, LevelDB, RocksDB,.. ❗️GPL-3.0
  • GitHub (👨‍💻 3 · 🔀 53 · 📋 69 - 15% open · ⏱️ 14.01.2020):

     git clone https://github.com/fastogt/fastonosql
    
Franchise (🥉14 · ⭐ 3.5K) - a notebook sql client. what you get when have a lot of sequels. MIT
  • GitHub (👨‍💻 15 · 🔀 230 · 📋 56 - 53% open · ⏱️ 13.11.2019):

     git clone https://github.com/HVF/franchise
    
  • Dockerhub (📥 390 · ⭐ 2 · ⏱️ 27.02.2019):

     docker pull binakot/franchise
    
Show 7 hidden projects...
Nosqlclient (🥇27 · ⭐ 3.1K) - Cross-platform and self hosted, easy to use, intuitive mongodb management.. ❗️AGPL-3.0
  • GitHub (👨‍💻 28 · 🔀 320 · 📥 170K · 📦 190 · 📋 390 - 5% open · ⏱️ 09.01.2020):

     git clone https://github.com/nosqlclient/nosqlclient
    
  • Dockerhub (📥 8.9M · ⭐ 79 · ⏱️ 09.01.2020):

     docker pull mongoclient/mongoclient
    
phpMyAdmin (🥈23 · ⭐ 4.7K) - A web interface for MySQL and MariaDB. ❗️GPL-2.0
  • GitHub (👨‍💻 1.7K · 🔀 2.6K · 📋 12K - 4% open · ⏱️ 24.01.2020):

     git clone https://github.com/phpmyadmin/phpmyadmin
    
pgAdmin (🥈21 · ⭐ 840) - Web-based administration tool for the PostgreSQL database. ❗️PostgreSQL
  • GitHub (👨‍💻 100 · 🔀 160 · ⏱️ 24.01.2020):

     git clone https://github.com/postgres/pgadmin4
    
  • Dockerhub (📥 29M · ⭐ 470 · ⏱️ 24.01.2020):

     docker pull dpage/pgadmin4
    
Sqlectron (🥉20 · ⭐ 2.9K · 💀) - UNMAINTAINED - SEE BELOW. A simple and lightweight SQL client desktop with.. MIT
  • GitHub (👨‍💻 29 · 🔀 330 · 📥 200K · 📋 330 - 30% open · ⏱️ 31.10.2018):

     git clone https://github.com/sqlectron/sqlectron-gui
    
Filestash (🥉18 · ⭐ 2.3K) - A modern web client for SFTP, S3, FTP, WebDAV, Git, Minio, LDAP, CalDAV,.. ❗️AGPL-3.0
  • GitHub (👨‍💻 18 · 🔀 120 · 📥 210 · 📋 190 - 13% open · ⏱️ 21.01.2020):

     git clone https://github.com/mickael-kerjean/filestash
    
  • Dockerhub (📥 540K · ⭐ 9 · ⏱️ 21.01.2020):

     docker pull machines/filestash
    
HeidiSQL (🥉18 · ⭐ 1.7K) - A lightweight client for managing MariaDB, MySQL, SQL Server and PostgreSQL,.. ❗️GPL-2.0
  • GitHub (👨‍💻 25 · 🔀 180 · 📋 830 - 36% open · ⏱️ 23.01.2020):

     git clone https://github.com/HeidiSQL/HeidiSQL
    
MySQL Workbench (🥉10 · ⭐ 430) - MySQL Workbench is a unified visual tool for database architects,.. ❗️GPL-2.0
  • GitHub (👨‍💻 18 · 🔀 140 · ⏱️ 15.12.2019):

     git clone https://github.com/mysql/mysql-workbench
    

Others

Back to top

Netdata (🥇29 · ⭐ 44K) - Real-time performance monitoring, done right! https://my-netdata.io/. ❗️GPL-3.0
  • GitHub (👨‍💻 360 · 🔀 4K · 📥 500K · 📋 4.6K - 14% open · ⏱️ 24.01.2020):

     git clone https://github.com/netdata/netdata
    
  • Dockerhub (📥 99M · ⭐ 130 · ⏱️ 17.01.2020):

     docker pull netdata/netdata
    
Apollo (🥇29 · ⭐ 16K) - An open autonomous driving platform. Apache-2
  • GitHub (👨‍💻 300 · 🔀 5.5K · 📥 36K · 📋 2.1K - 25% open · ⏱️ 24.01.2020):

     git clone https://github.com/ApolloAuto/apollo
    
  • Dockerhub (📥 1.8M · ⭐ 35 · ⏱️ 23.01.2020):

     docker pull apolloauto/apollo
    
Glances (🥇28 · ⭐ 15K) - Glances an Eye on your system. A top/htop alternative for GNU/Linux, BSD, Mac.. ❗️LGPL-3.0
  • GitHub (👨‍💻 120 · 🔀 990 · 📥 350 · 📦 160 · 📋 1.1K - 10% open · ⏱️ 22.01.2020):

     git clone https://github.com/nicolargo/glances
    
  • PyPi (📥 41K / month · 📦 44 · ⏱️ 27.08.2019):

     pip install glances
    
  • Dockerhub (📥 32M · ⭐ 51 · ⏱️ 29.01.2019):

     docker pull nicolargo/glances
    
ungit (🥈27 · ⭐ 8.7K) - The easiest way to use git. On any platform. Anywhere. MIT
  • GitHub (👨‍💻 90 · 🔀 570 · 📥 1.2K · 📦 49 · 📋 730 - 25% open · ⏱️ 18.01.2020):

     git clone https://github.com/FredrikNoren/ungit
    
  • NPM (📥 4.6K / month · 📦 36 · ⏱️ 22.11.2019):

     npm install ungit
    
File Browser (🥈25 · ⭐ 6.4K) - Web File Browser which can be used as a middleware or standalone app. Apache-2
  • GitHub (👨‍💻 77 · 🔀 820 · 📥 40K · 📋 790 - 16% open · ⏱️ 09.01.2020):

     git clone https://github.com/filebrowser/filebrowser
    
  • Dockerhub (📥 12M · ⭐ 67 · ⏱️ 09.01.2020):

     docker pull filebrowser/filebrowser
    
guess (🥈25 · ⭐ 6.1K) - Libraries & tools for enabling Machine Learning driven user-experiences on the web. MIT
  • GitHub (👨‍💻 18 · 🔀 150 · 📋 63 - 11% open · ⏱️ 23.01.2020):

     git clone https://github.com/guess-js/guess
    
  • NPM (📥 12K / month · 📦 78 · ⏱️ 23.01.2020):

     npm install guess-webpack
    
Portia (🥈24 · ⭐ 7.5K) - Visual scraping for Scrapy. BSD-3
  • GitHub (👨‍💻 50 · 🔀 1.2K · 📥 110 · 📦 9 · 📋 430 - 22% open · ⏱️ 10.07.2019):

     git clone https://github.com/scrapinghub/portia
    
  • PyPi (📥 200 / month · 📦 7 · ⏱️ 28.06.2017):

     pip install slybot
    
  • Dockerhub (📥 340K · ⭐ 26 · ⏱️ 10.07.2019):

     docker pull scrapinghub/portia
    
Kylin (🥈24 · ⭐ 2.5K) - Distributed Analytics Engine that provides SQL interface and multi-dimensional.. Apache-2
  • GitHub (👨‍💻 230 · 🔀 1.1K · 📦 1 · ⏱️ 23.01.2020):

     git clone https://github.com/apache/kylin
    
  • PyPi (📥 690 / month · ⏱️ 07.04.2019):

     pip install kylinpy
    
  • Dockerhub (📥 2.2K · ⭐ 2 · ⏱️ 28.08.2019):

     docker pull apachekylin/apache-kylin-standalone
    
OpenRefine (🥉23 · ⭐ 6.8K) - OpenRefine is a free, open source power tool for working with messy data and.. BSD-3
  • GitHub (👨‍💻 140 · 🔀 1.2K · 📥 910K · 📋 1.7K - 25% open · ⏱️ 24.01.2020):

     git clone https://github.com/OpenRefine/OpenRefine
    
  • Dockerhub (📥 95K · ⭐ 8 · ⏱️ 23.01.2020):

     docker pull vimagick/openrefine
    
Pravega (🥉22 · ⭐ 900) - Pravega - Streaming as a new software defined storage primitive. Apache-2
  • GitHub (👨‍💻 57 · 🔀 210 · 📥 4.6K · 📋 2.6K - 12% open · ⏱️ 24.01.2020):

     git clone https://github.com/pravega/pravega
    
  • Dockerhub (📥 200K · ⭐ 1 · ⏱️ 06.01.2020):

     docker pull pravega/pravega
    
Feast (🥉21 · ⭐ 620) - Feature Store for Machine Learning. Apache-2
  • GitHub (👨‍💻 23 · 🔀 96 · 📥 340 · 📦 4 · 📋 180 - 20% open · ⏱️ 21.01.2020):

     git clone https://github.com/gojek/feast
    
  • PyPi (📥 2K / month · ⏱️ 08.01.2020):

     pip install feast
    
TensorFlow for R (🥉20 · ⭐ 1.2K) - TensorFlow for R. Apache-2
  • GitHub (👨‍💻 21 · 🔀 320 · 📥 8 · 📋 260 - 5% open · ⏱️ 09.01.2020):

     git clone https://github.com/rstudio/tensorflow
    
Aleph (🥉20 · ⭐ 960) - Search and browse documents and data; find the people and companies you look for. MIT
  • GitHub (👨‍💻 39 · 🔀 140 · 📋 470 - 7% open · ⏱️ 19.01.2020):

     git clone https://github.com/alephdata/aleph
    
  • Dockerhub (📥 220K · ⭐ 1 · ⏱️ 23.01.2020):

     docker pull alephdata/aleph
    
Katib (🥉20 · ⭐ 620) - Repository for hyperparameter tuning. Apache-2
  • GitHub (👨‍💻 45 · 🔀 160 · 📥 95 · 📋 410 - 17% open · ⏱️ 24.01.2020):

     git clone https://github.com/kubeflow/katib
    
  • Dockerhub (📥 660K · ⏱️ 08.05.2019):

     docker pull katib/metrics-collector
    
Handout (🥉18 · ⭐ 1.8K) - Turn Python scripts into handouts with Markdown and figures. Apache-2
  • GitHub (👨‍💻 2 · 🔀 92 · 📦 7 · 📋 39 - 33% open · ⏱️ 08.11.2019):

     git clone https://github.com/danijar/handout
    
  • PyPi (📥 220 / month · 📦 1 · ⏱️ 08.11.2019):

     pip install handout
    
Shiny (🥉17 · ⭐ 3.7K) - Easy interactive web applications with R. ❗️GPL-3.0
  • GitHub (👨‍💻 53 · 🔀 1.5K · 📋 1.9K - 24% open · ⏱️ 16.01.2020):

     git clone https://github.com/rstudio/shiny
    
SQLFlow (🥉17 · ⭐ 3.5K) - Brings SQL and AI together. Apache-2
  • GitHub (👨‍💻 30 · 🔀 540 · 📋 660 - 26% open · ⏱️ 22.01.2020):

     git clone https://github.com/sql-machine-learning/sqlflow
    
  • Dockerhub (📥 16K · ⭐ 1 · ⏱️ 24.01.2020):

     docker pull sqlflow/sqlflow
    
MediaPipe (🥉15 · ⭐ 4.3K) - MediaPipe is a cross-platform framework for building multimodal applied.. Apache-2
  • GitHub (👨‍💻 2 · 🔀 760 · 📋 400 - 16% open · ⏱️ 18.01.2020):

     git clone https://github.com/google/mediapipe
    
EuclidesDB (🥉14 · ⭐ 570) - A multi-model machine learning feature embedding database. Apache-2
  • GitHub (👨‍💻 2 · 🔀 26 · 📥 220 · 📦 1 · 📋 22 - 50% open · ⏱️ 15.09.2019):

     git clone https://github.com/perone/euclidesdb
    
  • PyPi (📥 60 / month · ⏱️ 12.02.2019):

     pip install euclides
    
  • Dockerhub (📥 220 · ⭐ 1 · ⏱️ 12.02.2019):

     docker pull euclidesdb/euclidesdb
    
Show 2 hidden projects...
Botpress (🥇28 · ⭐ 8.3K) - The Conversational Platform with built-in language understanding (NLU),.. ❗️AGPL-3.0
  • GitHub (👨‍💻 120 · 🔀 920 · 📦 340 · 📋 940 - 12% open · ⏱️ 24.01.2020):

     git clone https://github.com/botpress/botpress
    
  • NPM (📥 1.2K / month · 📦 170 · ⏱️ 19.01.2019):

     npm install botpress
    
MLDB (🥉14 · ⭐ 560 · 💀) - MLDB is the Machine Learning Database. Apache-2
  • GitHub (👨‍💻 17 · 🔀 81 · 📋 27 - 88% open · ⏱️ 09.10.2018):

     git clone https://github.com/mldbai/mldb