Statr

Sports statistics are often hard to get a hand on. You can get them on official league websites but they are often limited and hard to grasp. Likewise, you can use 3rd party services but they are often outdated and also bloated with ads and unecessary data.

Statr aims to solve these issues by providing an end-to-end, simple to use web interface for getting the data you want easily. The platform aims to provide immersive data visualizations and experiences that are accessible and usable.

These experiences are powered by an advanced web scraping data ingestion pipeline to power the underlying applications data layer that the applications backend uses to provide the UI with up-to-date sports data as well as historical analytics.

High Level Architecture

The architecture diagram is a high level overview of the application and its services. It is not a complete representation of the application and its services. To see a more detailed architecture diagram, please see the docs folder.

Application Tech Stack

Technology	Purpose
React	Typescript Web UI for Data Visualization using D3 and ChartJS
FastAPI	Python REST API
Python	Web Scraper Jobs (Daily Statistics and Historical Statistics 1 time run)
Spark	PySpark ETL pipeline to process S3 files and send off to Postgres to be the REST API data source
MongoDB	User and application data management
Airflow	Job Scheduler for web scrapers and ETL jobs
Docker	Containerization of application and services
AWS ECS	Container orchestration for application and services
AWS S3	Staging datalake for the raw CSV files from scraper jobs. Data serves as source for ETL jobs that will enrich, clean and process the data
AWS RDS	Postgres RDBMS for storing processed data from ETL jobs and serving as REST API data source

Kubernetes is also being considered as an alternative to AWS ECS but is not currently being used as this project is still in the early stages of development and not expected to scale to a point where Kubernetes is necessary.

Status

This application is under planning phase and therefore is more of a proposal.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
api		api
client		client
docs/architecture		docs/architecture
jobs		jobs
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

api

api

client

client

docs/architecture

docs/architecture

jobs

jobs

.gitignore

.gitignore

README.md

README.md

Repository files navigation

Statr

High Level Architecture

Application Tech Stack

Status

About

Releases

Packages

Languages

TannerBarcelos/statr

Folders and files

Latest commit

History

Repository files navigation

Statr

High Level Architecture

Application Tech Stack

Status

About

Resources

Stars

Watchers

Forks

Languages