Help Htoo Resale

Machine Learning Project to predict HDB Resale Prices

About The Project

This project was built to help bring more transparency on the real estate market by giving both buyers and sellers an estimate on the fair value of a flat. On the buyer's end, this also helps to tackle the issue with information asymmetry, who can only estimate the market value of the flat based on information provided by the seller and their own assessment of the flat and its surrounding amenities.

Process

I've created a slide deck to pen down my thought process, check it out here!

Project Directory Structure


|── archive   <- consists of previous iterations of the project housed in jupyter notebooks
|── frontend  <- consists of frontend assets for a simple web application to try out the project
|── model     <- houses all the regression models and cluster coordinates
|── main.py   <- backend entry point to the project.
|                It retrieves the appropriate pickle files from the model folder based on the information given from the frontend
|── tableau   <- consists of tableau files used for exploratory data analysis
|── .ipynb    <- jupyter notebooks ordered by logical order of progress

Starting the project

Run main.py first

python main.py

cd into frontend, and run

npm run dev

What I Learnt

This was my first time doing a machine learning project, and hence to go from data extraction and cleaning, to EDA, to building the model was difficult and required a lot of reading from different sources and youtube channels.
The original idea was to use a single regression model across all the flats in Singapore. But after performing my initial regression model, and looking at the insights from the EDA again, I realized that I could also cluster not just my sector code, but also my coordinates using KMeans clustering, to provide a more accurate model.

To do

Explore other clustering methods such as DBSCAN and Hierarchical-based clustering to further understand the tradeoffs between each clustering model

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
.ipynb_checkpoints		.ipynb_checkpoints
__pycache__		__pycache__
archive		archive
frontend		frontend
model		model
tableau		tableau
01. Data Cleaning.ipynb		01. Data Cleaning.ipynb
02. Clustering.ipynb		02. Clustering.ipynb
03. Regression.ipynb		03. Regression.ipynb
04. Prediction.ipynb		04. Prediction.ipynb
Mall_df.csv		Mall_df.csv
README.md		README.md
data.csv		data.csv
main.py		main.py
station_df.csv		station_df.csv
transformed_cluster.csv		transformed_cluster.csv
transformed_df.csv		transformed_df.csv
transformed_df_features2.csv		transformed_df_features2.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Help Htoo Resale

Help Htoo Resale

About The Project

Process

Project Directory Structure

Starting the project

What I Learnt

To do

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Help Htoo Resale

Help Htoo Resale

About The Project

Process

Project Directory Structure

Starting the project

What I Learnt

To do

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages