ML project to predict usage of WRM
Data input file (compressed with 7z because standard zip had over 130MB which is 1/7 of the original file):
Packages:
tensorflow>=2.1
scikit-learn
seaborn>=0.10.0
pandas>=1.0.1
notebook>=6.0.3
matplotlib>=3.2.0
jupyter-core>=4.6.3
xlrd>=1.2.0
Best way to install dependencies and avoid unnecessary problems is to setup Anaconda env and run following command inside the environment
pip install -r requirements.txt
after that you should be able to run:
jupyter notebook
to open one of the notebooks.
Give an example
Explain what these tests test and why
Give an example
For weather data description check Files description This Link
For each day we're extracting data in format:
[
'date',
'time',
'totalSnow_cm',
'sunrise',
'sunset',
'tempC',
'FeelsLikeC',
'HeatIndexC',
'windspeedKmph',
'weatherCode',
'precipMM',
'humidity',
'visibility',
'pressure',
'cloudcover'
]
Most of the data comes directly from the source except:
- time - source data converted from string
1300
to number13*60
- sunrise - date converted from
HH:mm AM
to minutes, started from midnight - sunset - date converted from
HH:mm PM
to minutes, started from midnight
Process weather data
python weather_parser.py
Generate WRM data per year and list of places in data/bike_data
directory
python data_parser.py
Generated dataset has format of:
'bike_number',
'start_time',
'end_time',
'rental_place',
'return_place',
'year',
'week_day',
'totalSnow_cm',
'sunrise',
'sunset',
'tempC',
'FeelsLikeC',
'HeatIndexC',
'windspeedKmph',
'weatherCode',
'precipMM',
'humidity',
'visibility',
'pressure',
'cloudcover'
WARNING!!! It takes a while to run.
Add additional notes about how to deploy this on a live system
- Tensorflow - An end-to-end open source machine learning platform for everyone
- Pandas - Open source data analysis and manipulation tool
- WOD - Wroclaw Open Dataset
We use SemVer for versioning. For the versions available, see the https://github.com/burnpiro/wod-usage-predictor/tags.
See also the list of contributors who participated in this project.
This project is licensed under the MIT License - see the LICENSE.md file for details
- Hat tip to anyone whose code was used
- Inspiration
- etc