Skip to content
Anton Le edited this page Nov 5, 2021 · 6 revisions

Refinery

Refinery is a tool to extract the data from excel spreadsheets (both in .xls and .xlsx format) in a declarative way. The idea is that you focus on what you want to extract, but not on how the data should be extracted.

At Vortexa, we receive tons of excel spreadsheets with important information. Unfortunately, these spreadsheets have different formats and are populated in many different ways depending on the data source. That is why we came up with the idea of the refinery. The library allows you to do both extraction of the data and validation/transformation of the extracted data along the way.

This documentation is a step-by-step guide of all features, starting from the simple one and then covering more and more tricky cases. As the sample data, we will use Champions League 2021 stats located in /examples folder

Clone this wiki locally