triathlon

My motivation for this project was wanting to know, as a first time triathlon competitor, what kind of swim time I could aim for given my age, gender, and expected bike/run times (which I had much better idea of, given I had competed in those sports before). The outcome was a project that did the following:

Web-scraped triathlon race results from athlinks.com. They had the most comprehensive results and also easily filterable by distance (I specifically wanted Olympic distance results, since that's what I would be racing in). However, their website structure was not ideal for web-scraping. The relevant data was spread across many web-pages (even within races), and web-pages involved a lot of javascript. Maybe they were intentionally discouraging web-scraping efforts! The main script that completes this part is scraper.py
Built a machine learning model in Scikit-Learn. The data inspection, cleaning, and model building process can all be found in triathlon.ipynb
Designed a flask application that allows for users to input their own variables (age, gender, bike time, run time, etc.), and get back a predicted swim time (app.py)
Deployed web-app to AWS ec2 on a Docker container: http://ec2-34-216-76-122.us-west-2.compute.amazonaws.com/

Feel free to reach out with questions or feedback!

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
static		static
templates		templates
Dockerfile		Dockerfile
README.md		README.md
app.py		app.py
classes.py		classes.py
requirements.txt		requirements.txt
scraper.py		scraper.py
triathlon.ipynb		triathlon.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

triathlon

About

Releases

Packages

Languages

jgoffin/triathlon

Folders and files

Latest commit

History

Repository files navigation

triathlon

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages