The full-sized images used for training are split across sources: the EOL images are on Hugging Face, while the iNat21 and BIOSCAN image sets are available through their own distribution sources.
However, the images were resized to 224x224 prior to training. For this project, the nearest-neighbor images must be presented in the form they had during training, so that users get an accurate representation of what the model 'knows'. This would probably best be done by putting the full set of webdataset-formatted TAR files into a private Hugging Face space, used strictly for serving a handful of images at a time for this project rather than as a redistribution method. This should provide random access to individual images by filename.
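As a rough illustration of random access by filename, here is a minimal sketch that indexes an uncompressed TAR shard once and then reads single members by seeking to their data. It assumes the shards are plain (uncompressed) TAR files reachable as local paths; the function names `build_tar_index` and `read_member` are hypothetical, and how the shards would actually be served from Hugging Face is not addressed here.

```python
import tarfile


def build_tar_index(tar_path):
    """Map each regular member's name to (data offset, size).

    Assumption: the shard is an uncompressed TAR, so byte offsets
    into the file correspond directly to member data.
    """
    index = {}
    with tarfile.open(tar_path, mode="r:") as tar:  # "r:" = no compression
        for member in tar:
            if member.isfile():
                index[member.name] = (member.offset_data, member.size)
    return index


def read_member(tar_path, index, name):
    """Return the raw bytes of one member without scanning the archive."""
    offset, size = index[name]
    with open(tar_path, "rb") as f:
        f.seek(offset)
        return f.read(size)
```

The index only needs to be built once per shard and could be cached, so each nearest-neighbor lookup costs one seek and one read. This does not work on compressed archives such as .tar.gz, where byte offsets into the compressed stream do not map to member data.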
Would it also be useful to see the nearest-neighbor images at full original resolution? This would be possible with ratarmount, which could FUSE-mount the contents of each dataset (.tar.gz for EOL and iNat, .zip for BIOSCAN) on the API server filesystem, enabling random access to individual full-sized images as well.