hynekdav/data-engineering

Data Engineering showcase

A simple project showcasing my data engineering skills. It downloads data from the Czech Police using Dagster, cleans and processes it, and provides a simple visualization using Streamlit.

Requirements

Running

First you need to build the application image using:

docker build -t data-engineering .

And then run:

docker run --rm -it -p 8501:8501 data-engineering

First, Dagster loads and processes the necessary data and stores it in /tmp/criminality.pqt (using /tmp is just for convenience here). You should see a lot of debug logs from Dagster telling you how it's progressing. Once the data is prepared, Streamlit starts, and you can access it either at the local URL shown in the Docker logs or at http://0.0.0.0:8501. If your browser does not open 0.0.0.0, http://localhost:8501 should reach the same published port.
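
Because Streamlit only starts after the Dagster step finishes, the page is not available right away. The following is a minimal sketch, not part of this repository, of a helper that polls the published port until it answers with HTTP 200. The function name `wait_for_http`, the default timeouts, and the use of `localhost` are assumptions made for illustration.

```python
import http.client
import time
import urllib.request


def wait_for_http(url: str, timeout: float = 300.0, interval: float = 2.0) -> bool:
    """Poll `url` until it returns HTTP 200; return False if `timeout` seconds pass first.

    Hypothetical helper for illustration; the project itself does not ship it.
    """
    deadline = time.monotonic() + timeout
    while True:
        try:
            # A successful response means the server behind the port is up.
            with urllib.request.urlopen(url, timeout=interval) as response:
                if response.status == 200:
                    return True
        except (OSError, http.client.HTTPException):
            # Connection refused, reset, timed out, or a non-2xx answer:
            # the app is not ready yet, so fall through and retry.
            pass
        if time.monotonic() >= deadline:
            return False
        time.sleep(interval)
```

With the container running, calling `wait_for_http("http://localhost:8501")` from the host should return `True` once Streamlit is serving. An HTTP request is used instead of a bare TCP connection because Docker may accept connections on the published port before the application inside the container is listening.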

Author

License

MIT
