Handling Imbalanced Dataset

This project shows how we can handle imbalanced dataset using various methods!

😇 Motivation

While learning Machine Leanring, I came across few datasets which were highly imbalanced which resulted in me getting stuck in the very beginning. So I thought of making a notebook which will help in quickly refering and revising different ways to handle imbalanced datasets.

⭐ Features

Under-sampling
Over-sampling
imbalanced-learn module
Random Over-sampling and under-sampling
Tomek links
SMOTE
Over-sampling followed by under-sampling

Using Recall to measure accuracy
Performed Logistic Regression for all the preprocessed data
Used Recall Score as metric to measure how well the model is performing!

📁 Dataset

The dataset used can be downloaded here (Kaggle) - Click to Download

❤️ Owner

Made with ❤️ by Sahil Chachra

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
sampleImage		sampleImage
Handling Imbalance Dataset.ipynb		Handling Imbalance Dataset.ipynb
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Handling Imbalanced Dataset

This project shows how we can handle imbalanced dataset using various methods!

😇 Motivation

⭐ Features

📁 Dataset

❤️ Owner

👀 License

About

Languages

License

SahilChachra/Handling-Imbalanced-Dataset

Folders and files

Latest commit

History

Repository files navigation

Handling Imbalanced Dataset

This project shows how we can handle imbalanced dataset using various methods!

😇 Motivation

⭐ Features

📁 Dataset

❤️ Owner

👀 License

About

Topics

Resources

License

Stars

Watchers

Forks

Languages