I did this project for a data analytics course. In it, I try to figure out which factors most affect the potability of water. I have also trained a machine learning model that predicts whether water is safe to drink, using DecisionTreeClassifier. At the end I have plotted the tree to give an ide…

prasadposture/Water-Potability-Project

Water Potability Project

Introduction

The Water Potability Prediction project aims to predict whether water is potable based on its physicochemical properties, including pH, hardness, solids, and other chemical measurements. This was a semester-end project for the machine learning course offered by Jai Hind College, Mumbai. I did the analysis and preprocessing in a Jupyter notebook and created a web application using Python's Streamlit library.

The project is intended as a tool for researchers, policymakers, and the general public to assess the potability of water and make informed decisions based on the analysis. The notebook helps identify which physicochemical properties matter most in determining water potability, while the web app provides a user-friendly way to predict potability from the values of those properties.

Important Links:

  1. Web App
  2. EDA and MLA
  3. Source Code

Workflow:

  1. Importing the required libraries and functions and loading the data.
  2. Exploratory data analysis and data visualization to understand the relationships between the attributes.
  3. Data preprocessing, which involves filling in the missing values and scaling the numerical columns.
  4. Splitting the data into train and test sets, training the model, and evaluating its performance using a confusion matrix.
  5. Visualizing the decision tree and getting the feature importances.
  6. Saving the model for further use and concluding the notebook.
  7. Creating an interactive and user-friendly web application that predicts water potability from the given parameter values using the trained model.
  8. Deploying the web application on share.streamlit.io to make it available to everyone.
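
Steps 3 to 6 of the workflow can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code. It assumes scikit-learn, pandas, and NumPy are installed. The data is a synthetic stand-in with made-up column names and a hypothetical labelling rule. Mean imputation, the tree depth, and pickle-based saving are assumptions chosen for the example rather than details confirmed by the project.

```python
import pickle

import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer
from sklearn.metrics import confusion_matrix
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.tree import DecisionTreeClassifier, export_text

# Synthetic stand-in for the real dataset (column names and values are assumptions).
rng = np.random.default_rng(0)
n = 400
df = pd.DataFrame({
    "ph": rng.normal(7.0, 1.5, n),
    "Hardness": rng.normal(200.0, 30.0, n),
    "Solids": rng.normal(20000.0, 8000.0, n),
})
# Hypothetical labelling rule, only so the example has something to learn.
df["Potability"] = ((df["ph"] > 6.5) & (df["ph"] < 8.5)).astype(int)
# Blank out some values to mimic missing entries in real measurements.
df.loc[rng.choice(n, 40, replace=False), "ph"] = np.nan

# Step 4: split the data (not the model) into train and test sets.
X = df.drop(columns="Potability")
y = df["Potability"]
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y
)

# Step 3 and model training: impute missing values, scale, fit a decision tree.
model = Pipeline([
    ("impute", SimpleImputer(strategy="mean")),
    ("scale", StandardScaler()),
    ("tree", DecisionTreeClassifier(max_depth=4, random_state=0)),
])
model.fit(X_train, y_train)

# Step 4: evaluate with a confusion matrix (rows: true class, columns: predicted).
cm = confusion_matrix(y_test, model.predict(X_test))
print(cm)

# Step 5: inspect the fitted tree as text and read off the feature importances.
tree = model.named_steps["tree"]
importances = pd.Series(tree.feature_importances_, index=X.columns)
print(importances.sort_values(ascending=False))
print(export_text(tree, feature_names=list(X.columns)))

# Step 6: serialize the fitted pipeline; pickle.dump to a file works the same way.
restored = pickle.loads(pickle.dumps(model))
```

Wrapping the imputer and scaler in a `Pipeline` means the saved object carries its own preprocessing, so a web app that loads it can pass raw parameter values straight to `predict`.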
