No frameworks. No YAML. Just write your data processing code directly in Python, R or Julia.
๐ก Watch the full narrated video to learn more about building data pipelines in Orchest.
Note: Orchest is in beta.
- Visually construct pipelines through our user-friendly UI
- Code in Notebooks and scripts (quickstart)
- Run any subset of a pipelines directly or periodically (jobs)
- Easily define your dependencies to run on any machine (environments)
- Spin up services whose lifetime spans across the entire pipeline run (services)
- Version your projects using git (projects)
When to use Orchest? Read it in the docs.
๐ Get started with our quickstart tutorial or have a look at our video tutorials explaining some of Orchest's core concepts.
Missing a feature? Have a look at our public roadmap to see what the team is working on in the short and medium term. Still missing it? Please let us know by opening an issue!
Get started with an example project:
- Train and compare 3 regression models
- Connecting to an external database using SQLAlchemy
- Run dbt in Orchest for a dbt + Python transform pipeline
- Use PySpark in Orchest
๐ Check out the full list of example projects.
Want to skip the installation and jump right in? Then try out our managed service: Orchest Cloud.
Join our Slack to chat about Orchest, ask questions, and share tips.
The software in this repository is licensed as follows:
- All content residing under the
orchest-sdk/
andorchest-cli/
directories of this repository are licensed under theApache-2.0
license as defined inorchest-sdk/LICENSE
andorchest-cli/LICENSE
respectively. - Content outside of the above mentioned directories is available under the
AGPL-3.0
license.
Contributions are more than welcome! Please see our contributor guides for more details.
Alternatively, you can submit your pipeline to the curated list of Orchest examples that are automatically loaded in every Orchest deployment! ๐ฅ