As part of NTU's CZ/CE4041 Machine Learning module, the team was tasked to compete in a (closed) Kaggle competition. The team chose the Northeastern SMILE Lab - Recognizing Faces in the Wild competition, whose aim is to determine the probability of kinship between two people given their images. In summary, our group used a Siamese network to achieve a score of 0.907 on the Kaggle public leaderboard.
Sankar Samiksha
Jia Min
Xing Kun
Fan Yupei
Tabu
Training images
Training relationships CSV
Test images
Test sample submission CSV: used to check the format of all other submissions
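The code later in this section relies on relationshipDict, notRelationshipDict and personPathFile, which are not constructed in this report. The sketch below shows one plausible way to build them from train_relationships.csv and the train/FXXXX/MIDY/ image folders; the TRAIN_DIR path, the sample size of 30 and the cross-family negative sampling are illustrative assumptions (later versions of the model also drew same-family non-kin negatives).
import glob
import os
import random

import pandas as pd

TRAIN_DIR = "train"  # assumed folder of training images, laid out as train/FXXXX/MIDY/*.jpg

# personPathFile: "FXXXX/MIDY" -> list of image paths for that person
personPathFile = {}
for famDir in sorted(os.listdir(TRAIN_DIR)):
    for midDir in sorted(os.listdir(os.path.join(TRAIN_DIR, famDir))):
        key = famDir + "/" + midDir
        paths = glob.glob(os.path.join(TRAIN_DIR, famDir, midDir, "*.jpg"))
        if paths:  # skip persons with no photos
            personPathFile[key] = paths

# relationshipDict: person -> list of persons they are kin with
relationships = pd.read_csv("train_relationships.csv")  # columns p1, p2
relationshipDict = {}
for p1, p2 in zip(relationships["p1"], relationships["p2"]):
    if p1 in personPathFile and p2 in personPathFile:  # skip pairs whose folders are missing
        relationshipDict.setdefault(p1, []).append(p2)

# notRelationshipDict: person -> a sample of persons from other families (assumed non-kin)
families = {}
for key in personPathFile:
    families.setdefault(key.split("/")[0], []).append(key)
notRelationshipDict = {}
for key in personPathFile:
    fam = key.split("/")[0]
    others = [p for f, members in families.items() if f != fam for p in members]
    notRelationshipDict[key] = random.sample(others, k=min(30, len(others)))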
Randomized data set
Randomizing which photo of each person is used in a pair gives us a greater variety of image pairs to train on.
import random

trainData = []
targetRelatedCount = 36000  # target number of related (positive) pairs
relatedCount = 0
while relatedCount < targetRelatedCount:
    for k, v in relationshipDict.items():
        for relation in v:
            i2 = random.randint(0, len(personPathFile[k]) - 1)         # random photo of person1
            i3 = random.randint(0, len(personPathFile[relation]) - 1)  # random photo of person2
            trainData.append((personPathFile[k][i2], personPathFile[relation][i3], 1))
            relatedCount += 1
            if relatedCount >= targetRelatedCount:
                break
        if relatedCount >= targetRelatedCount:
            break

# remove duplicate pairs
trainData = list(set(trainData))
positiveRelationsCount = len(trainData)
print("Current length of positive relationships:", len(trainData))

# make the non-relationships more random, with the same count as the positive pairs
notRelationAddedCount = 0
# the same pair may be chosen twice; duplicates are removed later when trainData
# is converted to a set and back to a list
while notRelationAddedCount < positiveRelationsCount:
    for k, v in notRelationshipDict.items():
        i1 = random.randint(0, len(v) - 1)                      # random non-relation of person1
        i2 = random.randint(0, len(personPathFile[k]) - 1)      # random photo of person1
        i3 = random.randint(0, len(personPathFile[v[i1]]) - 1)  # random photo of person2
        trainData.append((personPathFile[k][i2], personPathFile[v[i1]][i3], 0))
        notRelationAddedCount += 1
        if notRelationAddedCount >= positiveRelationsCount:
            break

print("Current length of not relationships:", notRelationAddedCount)
print("Current length of total relationships:", len(trainData))

# remove duplicate pairs again
trainData = list(set(trainData))
print("Current length of total relationships after deduplication:", len(trainData))
Fully Connected Layers
Changing the fully connected layers gives us more flexibility and control over the model while still using a pre-trained backbone. The most important of these layers are the dropout layer and BatchNorm1d.
model.classifier = nn.Sequential(
    nn.Linear(3584, 2048),
    nn.ReLU(),
    nn.Dropout(0.55),      # dropout for regularization
    nn.Linear(2048, 512),
    nn.ReLU(),
    nn.Linear(512, 256),
    nn.ReLU(),
    nn.BatchNorm1d(256),   # batch normalization before the output layer
    nn.Linear(256, 2)
)
Pre-trained Model, Learning Rate, Adam Optimizer and Loss Criterion
A pre-trained model was used: FaceNet (InceptionResnetV1), which has been trained on the VGGFace2 face dataset.
# Create the Siamese network around the pre-trained FaceNet backbone
net = SiameseNetwork(InceptionResnetV1(pretrained='vggface2', classify=False)).cuda()
# Define the loss criterion (cross-entropy over the two output classes)
criterion = nn.CrossEntropyLoss()
# Define the optimizer (Adam)
optimizer = optim.Adam(net.parameters(), lr=0.0005)
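The SiameseNetwork class itself is not reproduced in this section. The sketch below shows one way such a wrapper could be written: a shared FaceNet backbone whose two 512-dimensional embeddings are combined into a pair feature vector and passed to the fully connected classifier shown earlier. The particular seven-way feature combination (7 × 512 = 3584, matching the classifier's input size) is an assumption, not necessarily the team's exact formulation, and the classifier is built inside the wrapper here only to keep the sketch self-contained.
import torch
import torch.nn as nn

class SiameseNetwork(nn.Module):
    """Illustrative sketch: shared FaceNet backbone plus a pair-feature classifier.

    The feature combination below is an assumption chosen so that
    7 * 512 = 3584 matches the classifier's input size.
    """

    def __init__(self, backbone):
        super().__init__()
        self.backbone = backbone          # InceptionResnetV1, outputs 512-d embeddings
        self.classifier = nn.Sequential(
            nn.Linear(3584, 2048),
            nn.ReLU(),
            nn.Dropout(0.55),
            nn.Linear(2048, 512),
            nn.ReLU(),
            nn.Linear(512, 256),
            nn.ReLU(),
            nn.BatchNorm1d(256),
            nn.Linear(256, 2),
        )

    def forward(self, img1, img2):
        e1 = self.backbone(img1)          # shared weights: the same backbone embeds both images
        e2 = self.backbone(img2)
        feats = torch.cat(
            [e1, e2, e1 + e2, e1 - e2, e1 * e2, (e1 - e2) ** 2, torch.abs(e1 - e2)],
            dim=1,
        )                                 # 7 * 512 = 3584 features per pair
        return self.classifier(feats)     # two logits: not-kin / kin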
Constantly changing data augmentation
The team implemented a data augmentation scheme that changes as training progresses. To our knowledge, this approach was not used by other competitors.
if epoch % 10 in (0, 1, 2, 3):
    print("Data Augmentation: None")
    trainloader = createTrain([transforms.Resize((IMG_SIZE, IMG_SIZE)), transforms.ToTensor()])
elif epoch % 10 in (4, 5):
    print("Data Augmentation: RandomGrayscale(0.5)")
    trainloader = createTrain([transforms.Resize((IMG_SIZE, IMG_SIZE)), transforms.RandomGrayscale(p=0.5), transforms.ToTensor()])
elif epoch % 10 in (6, 7):
    print("Data Augmentation: RandomCrop((80,80)), RandomGrayscale(0.8), RandomHorizontalFlip, GaussianBlur(kernel_size=5, sigma=(0.1, 3.0))")
    trainloader = createTrain([transforms.RandomCrop((80, 80)), transforms.Resize((IMG_SIZE, IMG_SIZE)), transforms.RandomGrayscale(p=0.8), transforms.RandomHorizontalFlip(), transforms.GaussianBlur(kernel_size=5, sigma=(0.1, 3.0)), transforms.ToTensor()])
elif epoch % 10 in (8, 9):
    print("Data Augmentation: RandomGrayscale(0.5), RandomHorizontalFlip, ColorJitter(brightness=0.7, contrast=0.3)")
    trainloader = createTrain([transforms.Resize((IMG_SIZE, IMG_SIZE)), transforms.RandomGrayscale(p=0.5), transforms.RandomHorizontalFlip(), transforms.ColorJitter(brightness=0.7, contrast=0.3), transforms.ToTensor()])
else:
    print("Data Augmentation: None")
    trainloader = createTrain([transforms.Resize((IMG_SIZE, IMG_SIZE)), transforms.ToTensor()])
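The createTrain helper referenced above is not defined in this section. Assuming the KinshipPairDataset sketched earlier and the trainData pairs built above, a minimal version could look like this; the batch size of 64 matches the Version 4 notes below.
from torch.utils.data import DataLoader
from torchvision import transforms

def createTrain(transformList, batch_size=64):
    """Illustrative sketch: wrap trainData in a DataLoader using the given transform list."""
    transform = transforms.Compose(transformList)
    dataset = KinshipPairDataset(trainData, transform)  # Dataset class from the earlier sketch (assumed)
    return DataLoader(dataset, batch_size=batch_size, shuffle=True, num_workers=2)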
Facenet Version 4 (0.807 @ 30 Epochs)
- Added a lot more data, growing the training set to about 59K pairs
- The added data is mostly from within the same family: the image pairs show people who are family members but not kin, and these pairs make up the bulk of the dataset
- The fully connected layers after the FaceNet convolutional backbone include a dropout layer (p = 0.7) to prevent overfitting, as well as batch normalization
- The data augmentation is varied across epochs to introduce more variation and prevent overfitting
- Grayscale is important because some of the test images are grayscale; blur is important because some of the images are heavily blurred; random horizontal flips add further variation
- The learning rate is set to 0.005 and the batch size to 64 (a sketch of the per-epoch training step is given after this list)
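The training loop itself is not reproduced in this report. A minimal sketch of the per-epoch step, assuming the epoch-dependent trainloader above together with the criterion and optimizer defined earlier, is:
for epoch in range(30):
    # (the epoch-dependent augmentation block shown earlier rebuilds trainloader here)
    net.train()
    running_loss = 0.0
    for img1, img2, label in trainloader:
        img1, img2, label = img1.cuda(), img2.cuda(), label.cuda()
        optimizer.zero_grad()
        output = net(img1, img2)          # two logits per image pair
        loss = criterion(output, label)   # cross-entropy against the 0/1 kinship label
        loss.backward()
        optimizer.step()
        running_loss += loss.item()
    print(f"Epoch {epoch}: loss = {running_loss / len(trainloader):.4f}")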
Facenet Version 5 (0.867 @ 30 Epochs)
- Updated from Version 4
- Randomized data set (as described above)
Facenet Version 7 (0.907 @ 30 Epochs)
- https://colab.research.google.com/drive/1FM-ls2q-9VKl3Ny2cOQjxbP5MXSQN2Wk?usp=sharing
- Changed the dropout rate from 0.7 to 0.55
- Added random-cropping augmentation, as some images contain two faces and some faces are obscured by sunglasses or other accessories
- Added color-jitter augmentation, as the images vary in brightness and contrast
- Randomized the non-relationship dataset
- All relationships in the CSV were used, but the pictures selected for each pair were randomized
- Submitted CSV (a sketch of how the submission file can be generated is given below)
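For completeness, the sketch below shows one way the submission CSV could be produced from the sample submission file. The column names (img_pair, is_related) follow the competition's sample submission format, the flat test/ folder and the loadTestImage helper are illustrative assumptions, and IMG_SIZE, net are assumed to be defined as in the earlier code.
import pandas as pd
import torch
import torch.nn.functional as F
from PIL import Image
from torchvision import transforms

testTransform = transforms.Compose([transforms.Resize((IMG_SIZE, IMG_SIZE)), transforms.ToTensor()])

def loadTestImage(name):
    # hypothetical helper: test images are assumed to live in a flat "test/" folder
    return testTransform(Image.open(f"test/{name}").convert("RGB")).unsqueeze(0).cuda()

submission = pd.read_csv("sample_submission.csv")  # columns assumed: img_pair, is_related
net.eval()
probs = []
with torch.no_grad():
    for pair in submission["img_pair"]:
        name1, name2 = pair.split("-")
        output = net(loadTestImage(name1), loadTestImage(name2))
        probs.append(F.softmax(output, dim=1)[0, 1].item())  # probability of the "kin" class
submission["is_related"] = probs
submission.to_csv("submission.csv", index=False)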