ETL Pipeline on Crypto Data
is a team-based Python data analytics project done on daily crypto data for 6 coins and 14 index funds. It:
- extracts raw data in three different ways: direct file reading with pandas (.csv)
- transforms that data into aggregates by month (average, max, min, max growth)
- loads that data into a final .sql database
Along with the analysis, project also involved a written report.
- Python
- Pandas
- PostgreSQL
- pgAdmin
- Google
- Google Docs
- Python reading and API requesting
- Python web scraping
- Cleaning, sorting, filtering
- Summary statistics, aggregating
- Loading data into .SQL
- Synthesizing results for tentative conclusions
- Acknowledging potential pitfalls with results and techniques