The following repository provides a snippet to the proposed lightweight modifications to the vision transformer architectures.
- Training_Models folder comprises of the different variants of MobileVit and SWIN proposed in this research. Code corresponding to AID dataset has been provided. The framework remains consistent across different datasets and different splits, hence redundancy has been avoided.
- H5 folder comprises of the best models for each dataset that was generated in the training process. This can be loaded using predictor.py and used for inference.