A PyTorch implementation of "EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks".
Updated Jul 24, 2024 - Python
The largest collection of PyTorch image encoders / backbones, including train, eval, inference, and export scripts, and pretrained weights: ResNet, ResNeXT, EfficientNet, NFNet, Vision Transformer (ViT), MobileNetV4, MobileNet-V3 & V2, RegNet, DPN, CSPNet, Swin Transformer, MaxViT, CoAtNet, ConvNeXt, and more
Official PyTorch implementation of the MICCAI 2024 paper (early accept, top 11%) "Mammo-CLIP: A Vision Language Foundation Model to Enhance Data Efficiency and Robustness in Mammography"
Final Year Project: Predicting Charpy impact test data from microstructure data using a machine learning model.
A PyTorch implementation of EfficientDet faithful to the original Google implementation, with ported weights
EfficientNet Hyperparameter Sweep Tuning automates the hyperparameter tuning process for EfficientNet models using Optuna. It features dynamic model selection, MLflow integration for logging, and is easily extendable. Ideal for researchers and developers looking to optimize their EfficientNet models efficiently and effectively.
Code to detect rain/inundation using CCTV images, estimate affected area/depth and store data in MySQL. Image processing & ML for efficient flood monitoring & management.
Python toolkit for speech processing
🫁 Chest X-ray abnormalities localization via ensemble of deep convolutional neural networks
Code for advanced AI/ML architectures built from scratch using PyTorch.
Sign Language Recognition
Food Vision Big™, using all of the data from the Food101 dataset. Beats the DeepFood paper: https://www.researchgate.net/publication/304163308_DeepFood_Deep_Learning-Based_Food_Image_Recognition_for_Computer-Aided_Dietary_Assessment
PyTorch Volume Models for 3D data
Pretrained EfficientNet, EfficientNet-Lite, MixNet, MobileNetV3 / V2, MNASNet A1 and B1, FBNet, Single-Path NAS
The Multimodal Model for Vietnamese Visual Question Answering (ViVQA)
Insightface Keras implementation
🍔🍟🍗 Food analysis baseline with Theseus. Integrate object detection, image classification and multi-class semantic segmentation 🍞🍖🍕