Skip to content

Web scraping, Excel automation, and data analysis in Python — projects include scraping real-world data (movies, COVID, trading), NFL player dataset analysis, and Excel ↔ Python pipelines.

License

Notifications You must be signed in to change notification settings

deepfusi0n/data-projects-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Python Data Projects

This repository highlights applied Python work across web scraping, Excel automation, API integration, and data analysis.
It demonstrates my ability to collect real-world data, transform it, and build reproducible workflows that generate insights.


Project Categories

Web Scraping

  • bible/ — Scrapes Bible text into structured data for downstream NLP or analysis.
  • covid/ — Parses public health tables into clean CSVs for reporting.
  • movies/ — Collects titles, ratings, and genres for exploratory data analysis.
  • tradingview/ — Pulls market price data for monitoring and trend analysis.

Excel Automation

  • excel-automation/ — Moves data between Excel and Python: cleaning, transformation, and export.
    Demonstrates ETL (Extract, Transform, Load) workflows.

API Projects

  • api-calls/ — Demonstrates API integration and data retrieval.
    • fbi.py: Fetches data from FBI’s public crime API.
    • chucknorris.py: Retrieves random jokes via the Chuck Norris API.
      These projects showcase how to interact with REST APIs and handle JSON responses in Python.

NFL Player Analysis

  • nfl-analysis/ — Data analysis project using a raw NFL players dataset (NFL-players.csv).
    • Cleaned and prepared the dataset using pandas.
    • Conducted exploratory data analysis (EDA) to highlight player performance trends.
    • Demonstrates structured analysis of real-world sports data with reproducible Python code.

Miscellaneous

  • misc/ — Additional scripts and prototypes, including general data processing examples.

How to Run

  1. Clone this repo or download a project folder.
  2. Install dependencies: pip install -r requirements.txt
  3. Navigate to the desired folder and run:
  4. Outputs (CSV/Excel/JSON) are saved locally as defined in each script.

Tech Stack

  • Python 3.x
  • Libraries: pandas, requests, beautifulsoup4, lxml, openpyxl

Notes

  • Web scraping projects respect site terms of service and are for educational/portfolio use.
  • The NFL project highlights data cleaning and analysis workflows without machine learning.
  • API projects demonstrate calling REST endpoints, parsing JSON, and integrating external data.

About

Web scraping, Excel automation, and data analysis in Python — projects include scraping real-world data (movies, COVID, trading), NFL player dataset analysis, and Excel ↔ Python pipelines.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages