Skip to content
No description, website, or topics provided.
Branch: master
Clone or download
Latest commit 28f10ee Jul 9, 2019
Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
data
.env
Dockerfile first commit Jul 2, 2019
README.md update README.md Jul 2, 2019
docker-compose.yml first commit Jul 2, 2019
insert.py update Jul 9, 2019
utils.py update Jul 9, 2019

README.md

This repo refers to blog post: https://fruty.io/2019/07/02/excel-to-pandas-to-postgresql-data-science-for-strategy-consulting/

TEST

Start the containers. docker-compose up -d Pgweb container often starts too soon compared to postgres, you might have to run another docker-compose up -d once postgres had time to start. The worker container is in exit mode, that's normal, it's not supposed to be running all the time.

When postgres and pgweb containers are up, log into the worker container: docker-compose run worker /bin/bash Now import the excel spreadsheet into postgres: python insert.py exit

Pandas ability to auto-detect column data format is a killer feature, since it allows you to just process Excel sheets without having to specify all the headers, types and such.

Now go to pgweb interface in your web browser on localhost:8000 and you should see the table "sheet_1" with the excel sheet data in it.

Do this with all your spreadsheet sheets, and you can then perform all kinds of analysis on clean, structured data!

You can’t perform that action at this time.