A central repository for all open lakehouse related tools and utilities. Contributions are welcome on anything that helps manage the underlying files or table formats, or that provides ways to interact with the lakehouse using different engines such as PrestoDB, Db2, Netezza, Apache Spark, and Apache Flink.
- Ingestion tools: Any tools that help with batch and streaming ingestion mechanisms to load data from various data sources onto the open data lakehouse.
- Metastore tools: Any tools that help understand, analyze, or migrate the contents of the metadata repository.
- Table formats: Any tools that help understand or analyze data stored in different open table formats.
- File formats: Any tools that help understand or analyze data stored in different open file formats.
Thank you for considering contributing to the open lakehouse tools project. Please go through the guidelines below before starting your contributions.
We believe in the power of the community and its ability to grow together. Please ensure you treat other community members with respect.
There are a number of different ways in which you can help grow the community, including:
- Reporting bugs in published utilities
- Enhancing documentation
- Adding new features/functions
- Fork this project
- Clone the repository to your local machine
git clone https://github.com/IBM/open-lakehouse-tools.git
- Add new code/make changes to existing code
- Push updates to your fork
- Create a pull request to this repository from your fork
- Wait for reviewers to approve the changes and merge!
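
The steps above can be sketched as a small shell script. This is only an illustration, not the project's official process. It assumes your fork keeps the default name `open-lakehouse-tools` under your own account, and the `GITHUB_USER` and `BRANCH` values and the commit message are hypothetical placeholders. The script is a dry run by default: it prints each command and executes it only when `DRY_RUN=0` is set.

```shell
#!/bin/sh
# Sketch of a fork-based contribution workflow (assumptions noted inline).
set -eu

GITHUB_USER="${GITHUB_USER:-your-username}"  # hypothetical: your GitHub account
BRANCH="${BRANCH:-my-change}"                # hypothetical: name of your working branch
DRY_RUN="${DRY_RUN:-1}"                      # 1 = only print commands, 0 = execute them

# Print a command, and execute it only when DRY_RUN=0.
run() {
    echo "+ $*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

contribute() {
    # Clone your fork (assumed to exist already, created with GitHub's Fork button).
    run git clone "https://github.com/${GITHUB_USER}/open-lakehouse-tools.git"
    run cd open-lakehouse-tools
    # Work on a separate branch so the pull request stays focused.
    run git checkout -b "$BRANCH"
    # ... add new code or change existing code here ...
    run git add -A
    run git commit -m "Describe your change"
    # Push the branch to your fork, then open a pull request on GitHub.
    run git push -u origin "$BRANCH"
}

contribute
```

In practice you would likely run these commands one at a time, since the commit step fails when there are no changes to record. After the push, the pull request itself is opened from your fork's page on GitHub.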