Links
-
- Provides ResNet pre-trained networks that "won the 1st places in: ImageNet classification, ImageNet detection, ImageNet localization, COCO detection, and COCO segmentation", which could be good for the CNN layer.
- Implement the CNN-RNN architecture from the "Show and Tell" paper, download the pre-trained weights so save time if possible.
- test it on the mscoc dataset
- convert it to work with the fMRI data
- Add an attention mechanism
- Use a transformer network
- Create separate test set, to check for overfitting