aerial-cactus-image-recognition/3-transfer-learning-model at master · cyleung428/aerial-cactus-image-recognition

History

Name		Name	Last commit message	Last commit date
parent directory ..
README.md		README.md
Transfer learning notebook.ipynb		Transfer learning notebook.ipynb

README.md

Transfer learning model

Overview

In this part, we use transfer learning to build a model for our competition. Transfer learning means that we use a pre-trained model and change it to fit our data. We do this by changing the top layers of the pre-trained model. This allows us to ‘transfer’ the knowledge of the model from its original domain to our domain.

Advantages of transfer learning

In general, transfer learning offers the following benefits:

Helps solve complex real-world problems with several constraints
Tackle problems like having little or almost no labeled data availability
Ease of transferring knowledge from one model to another based on domains and tasks
Use extremely complex model structures without having to develop them yourself
Reduce computational resources since only the top layers of a model need to be trained

In our case, the biggest benefit of using transfer learning is that we can use an extremely complex model structure without having to develop it and train it ourselves.

Pre-trained model we use

For our model, we use the pre-trained VGG16 model. VGG16 was developed by researchers at the University of Oxford and offered an improvement over AlexNet. The model achieves 92.7% top-5 test accuracy in ImageNet, which is a dataset of over 14 million images belonging to 1000 classes. The model consists of 13 convolutions and 5 poolings. It also includes 3 layers of fully connected layers. Here you can see an illustration of VGG16:

Customizing the pre-trained model

To fit VGG16 to our purpose of aerial cactus identification, we replace the top 3 layers with our own layers. This allows us to take advantage of the 13 convolutions and 5 poolings of VGG16 and at the same time ensures that our model is customized to cactus detection.

Our code

We have completed the following steps in our code:

Setting up
- Import packages
- Set directories
Data preparation
- Import data
- Prepare data for modelling
Import transfer model
- Download VGG16 model structure and weights
- Freeze weights of lower layers
Training our own custom layers on top of transfer model
- Conduct grid search to find optimal structure of top classification layers
- Train top classification layers
Testing model and prepare submission
- Run model on test data
- Prepare submission file for Kaggle competition

Results

Running this transfer model with 10 epochs yielded 94.2% accuracy on the Kaggle kernel. The same model with 500 epochs yielded 96.3%, and 97.7% at 750. We believe there is further room to train the model and one can reach 99% accuracy at 1000+ epochs.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

3-transfer-learning-model

3-transfer-learning-model

README.md

Transfer learning model

Overview

Advantages of transfer learning

Pre-trained model we use

Customizing the pre-trained model

Our code

Results

Files

3-transfer-learning-model

Directory actions

More options

Directory actions

More options

Latest commit

History

3-transfer-learning-model

Folders and files

parent directory

README.md

Transfer learning model

Overview

Advantages of transfer learning

Pre-trained model we use

Customizing the pre-trained model

Our code

Results