GitHub - TatraDev/pipertool: Platform for data science and machine learning prototyping. Developed by Tatradev.com

Website • Docs • Chat (Community & Support) • Tutorials

Piper is an open-source platform for data science and machine learning prototyping. Concentrate only on your goals. Key features:

Simple python contexts experience. Helps to create and deploy pipelines. Does not depend on any proprietary online services.
Connect each module into a pipeline. Run it via docker or virtual environment. Then build whole infrastructure by using venv, Docker or Cloud.
Decreases routine and repetitive tasks. Speed up process from idea to production.
Well-tested and reproducible. Easily extendable by your own Executor.

Piper aims to help data-scientists and machine-learning developers to create and build full infrastructure for their projects.

Contents

How Piper works
Quick start
Quick start pipertool package compose env
Quick start pipertool package compose env
Installation
- pip (PyPI)
Comparison to related technologies
Contributing
Mailing List
Copyright

How Piper works

|Flowchart|

Quick start

Quick start pipertool package compose env

In root directory project run command in terminal

sudo -u root /bin/bash
create and activate venv
pip install -r requirements.txt
in configuration.py rename for correctly path for new directory
python setup.py install
piper --env-type compose start
0.0.0.0:7585 - FastApi
0.0.0.0:9001 - Milvus Console (minioadmin/minioadmin)
piper --env-type compose stop
pip uninstall piper

Quick start pipertool package compose env

In root directory project run command in terminal

sudo -u root /bin/bash
create and activate venv
pip install -r requirements.txt
in configuration.py rename for correctly path for new directory
python main.py
await click CTRL+C from compose env

Installation

pip (PyPI)

Comparison to related technologies

Jupyter - is the de facto experimental environment for most data scientists. However, it is desirable to write experimental code.
Data Engineering tools such as AirFlow or Luigi - These are very popular ML pipeline build tools. Airflow can be connected to a kubernetes cluster or collect tasks through a simple PythonOperator. The downside is that their functionality is generally limited on this, that is, they do not provide ML modules out of the box. Moreover, all developments will still have to be wrapped in a scheduler and this is not always a trivial task. However, we like them and we use Airflow and Luigi as possible context for executors.
Azure ML / Amazon SageMaker / Google Cloud - Cloud platforms really allow you to assemble an entire system from ready-made modules and put it into operation relatively quickly. Of the minuses: high cost, binding to a specific cloud, as well as small customization for specific business needs. For a large business, this is the most logical option - to build an ML infrastructure in the cloud. We also maintain cloud options as posible ways for the deployment step.
DataRobot/Baseten - They offer an interesting, but small set of ready-made modules. However, in Baseten, all integration is implied in the kubernetes cluster. This is not always convenient and necessary for Proof-of-Concept. Piper also provides an open-source framework in which you can build a truly customized pipeline from many modules. Basically, such companies either do not provide an open-source framework, or provide a very truncated set of modules for experiments, which limits the freedom, functionality, and applicability of these platforms. This is partly similar to the hub of models and datasets in huggingface.
Mlflow / DVC - There are also many excellent projects on the market for tracking experiments, serving and storing machine learning models. But they are increasingly utilitarian and do not directly help in the task of accelerating the construction of a machine learning MVP project. We plan to add integrations to Piper with the most popular frameworks for the needs of DS and ML specialists.

Contributing

|Maintainability| |Donate|

Contributions are welcome! Please see our Contributing Guide for more details. Thanks to all our contributors!

Mailing List

Copyright

This project is distributed under the Apache license version 2.0 (see the LICENSE file in the project root).

By submitting a pull request to this project, you agree to license your contribution under the Apache license version 2.0 to this project.

Name		Name	Last commit message	Last commit date
Latest commit History 102 Commits
.github/workflows		.github/workflows
gateway		gateway
piper		piper
templates		templates
tests		tests
usecases		usecases
.gitignore		.gitignore
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
Makefile		Makefile
Readme.rst		Readme.rst
main.py		main.py
piper_logo.jpg		piper_logo.jpg
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

How Piper works

Quick start

Quick start pipertool package compose env

Quick start pipertool package compose env

Installation

pip (PyPI)

Comparison to related technologies

Contributing

Mailing List

Copyright

About

Releases

Packages

Contributors 5

Languages

License

TatraDev/pipertool

Folders and files

Latest commit

History

Repository files navigation

How Piper works

Quick start

Quick start pipertool package compose env

Quick start pipertool package compose env

Installation

pip (PyPI)

Comparison to related technologies

Contributing

Mailing List

Copyright

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 5

Languages

Packages