The Leukaemia Image Classification project is a joint collaboration between Amanda Ho and Tiana Chen.
This project aims to build an image classification model for leukemia cell detection in microscopic images. This project is built upon the Leukemia Classification Challenge available in Kaggle. All code is implemented and tested on Google Colab using Pytorch packages.
Methodology:
- Training and validation datasets are merged and split into training and testing sets via patient ID.
- EfficientNet models are trained using K-fold cross validation and nested epochs.
- Ensemble EfficientNet model is implemented using the trained EfficientNet models as base models. Ensemble model uses majority voting as voting system.
- As a further improvement, Noisy Student training is implemented in addition to EfficientNet model. Noisy student training is carried out within a K-fold cross validation and nested epoch loop.
- Ensemble Noisy Student model is implemented using the trained Noisy Student models as base models. Ensemble model uses majority voting as voting system.
- The 4 models implemented in the project (EfficientNet, Ensemble EfficientNet, Noisy Student, Ensemble Noisy Student) are tested and evaluated for performance.