- Meltano - Extraction and Loading
- DuckDB - Database
- Dbt - Data Transformation
- Streamlit - Visualisation
-
pip install -r requirements.txt
- This will install all the required dependencies which are necessary to run the project.
-
make build
- This step will install meltano extractors, loaders & utilities, which are defined in
meltano.yml
file.
- This step will install meltano extractors, loaders & utilities, which are defined in
-
make pipeline
- This step will take the file from the
data
folder, and will generate a parquet file from it. - After that the required
dbt
dependencies will be installed. - Finally,
dbt-duckdb build
command will be run, which will generate all the tables and views from the models folder.
- This step will take the file from the
-
make streamlit-visuals
- Will launch the streamlit app on port 8501
tap-spreadsheets-anywhere
extractor is used to load the local csv file.target-parquet
loader is used to output the ingested file as a parquet file, with addidional metadata.dbt-duckdb
utility is used to run the dbt models and load the data into DuckDB tables.