HydroSentry

Model Metrics

Intro

HydroSentry is a system designed to monitor the quality and condition of water pipe infrastructure, with a focus on identifying potential pipe failures before they occur.

The project utilizes data from the Water Point Data Exchange (WPdx), which can be accessed here: https://www.waterpointdata.org

Specifically, we use the following dataset: https://data.waterpointdata.org/dataset/Water-Point-Data-Exchange-Plus-WPdx-/eqje-vguj/about_data

HydroSentry incorporates a machine learning model trained on the above data to analyze patterns in water pipe conditions, usage, and failure history. Using these insights, the model predicts when pipes are likely to require maintenance or repair, which allows repairs to occur before pipes break and therefore take away access to water for some populations.

How It Was Setup

Data Cleaning

We first filtered through the several columns which the WPdx-Plus had (we used plus as it was precleaned to some extent) determining which ones could be important for a pipe's longevity. As a result we managed to narrow down the total of 73 columns to 10. From there we did even further narrowing, manipulating certain datasets such as report date and installment year to get years of service, and dropped report date and so on.

We also ended up cleaning the data by removing any NaNs, grouping together duplicates we thought were redundant, ensuring binary values for datapoints we needed the model to have its scope as (not functional and functional), etc.

Using the Model

Once the model was trained, it was saved with its encoder and integrated into the backend to make predictions on new data. When new water pipe data is provided as inputs on the frontend, the model then analyzes the input and predicts when the water point will need maintenance.

Making the Fullstack

The application was implemented using Django. The backend is written in Python and is responsible for loading the trained model and handling prediction requests.

When a user submits data into the HTML frontend, the backend processes the input, runs it through the model, and returns a prediction to be displayed to the frontend.

What we Learned and Challenges we Overcame

We learned a lot about different Machine Learning Models and the pipeline of turning it into an actual product. One of us didn't know at all about ML before this whilst the other two knew some of it but not expansive knowledge. However, as a result we all ended up getting stuck once we realized how far out of scope this all was. We needed models different than what we were used to and using different code which we had to learn. Furthermore, even though we may have had some experience with either full stack or making a model somewhat, we never had experience in the full pipeline of training the model, as a result we ran into several issues; we didn't know how to actually get the model after it was done training, we didn't know how to then make that interact with the frontend as there was no obvious API to just use, we even ran into several issues trying to refactor the model and figuring out which model to use as the ones which we were familiar with using such as random forest or k nearest neighbours and so required a lot of research and failures. Nonetheless, we managed to go against the odds and learn how to do these things within the 36 hour period of bitcamp and produce a working model and product.

Future Prospects

We are hoping to continue developing and refining the model and then perhaps making it useful enough for NGOs or other major coproations to use it and improve existing infrastructure making it such that communities are less impacted by water pipe infrastructure issues.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
hydrosentry		hydrosentry
model_assets		model_assets
predictor		predictor
static/predictor		static/predictor
templates/predictor		templates/predictor
Hydro_Sentry_Initial_Prep.ipynb		Hydro_Sentry_Initial_Prep.ipynb
LICENSE		LICENSE
README.md		README.md
db.sqlite3		db.sqlite3
manage.py		manage.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

HydroSentry

Model Metrics

Intro

How It Was Setup

Data Cleaning

Using the Model

Making the Fullstack

What we Learned and Challenges we Overcame

Future Prospects

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

HydroSentry

Model Metrics

Intro

How It Was Setup

Data Cleaning

Using the Model

Making the Fullstack

What we Learned and Challenges we Overcame

Future Prospects

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages