[ICCV2019] Robust Multi-Modality Multi-Object Tracking
-
Updated
Dec 7, 2019 - Python
[ICCV2019] Robust Multi-Modality Multi-Object Tracking
Final project for the course LT2318 Artificial Intelligence: Cognitive Systems. The project concerns multimodal hate speech detection in memes.
Official code for WACV 2021 paper - Compositional Learning of Image-Text Query for Image Retrieval
[ICCV 2021] Official implementation of the paper "TRAR: Routing the Attention Spans in Transformers for Visual Question Answering"
CrossCLR: Cross-modal Contrastive Learning For Multi-modal Video Representations, ICCV 2021
Simple command line tool for text to image generation using OpenAI's CLIP and Siren (Implicit neural representation network). Technique was originally created by https://twitter.com/advadnoun
Unifying Voxel-based Representation with Transformer for 3D Object Detection (NeurIPS 2022)
Queen's University - Data Mining (CISC 873)
This repo contains the official code of our work SAM-SLR which won the CVPR 2021 Challenge on Large Scale Signer Independent Isolated Sign Language Recognition.
Repository for the journal article, 'FedSepsis: A Federated Multi-Modal Deep Learning-Based Internet of Medical Things Application for Early Detection of Sepsis from Electronic Health Records Using Raspberry Pi and Jetson Nano Devices', Mahbub Ul Alam, Rahim Rahmani. Sensors 23, no. 2: 970, https://doi.org/10.3390/s23020970.
Official Implementation for Pre-CoFactv2 (AAAI-23 DeFactify2.0 Workshop 1st Place)
Cross-Modality Mutual Learning for Smart Contract Vulnerability Detection
[Pattern Recognition] The implementation of MoCA
Repository for the conference paper 'COVID-19 detection from thermal image and tabular medical data utilizing multi-modal machine learning', Mahbub Ul Alam, Jaakko Hollmén and Rahim Rahmani. IEEE 36th International Symposium on Computer-Based Medical Systems (CBMS), 2023, pp. 646-653.
Seed, Code, Harvest: Grow Your Own App with Tree of Thoughts!
An open-source cloud-native of large multi-modal models (LMMs) serving framework.
Collaborative Diffusion (CVPR 2023)
Visual Entities Empowered Zero-Shot Image-to-Text Generation Transfer Across Domains
Repository for the journal article, 'Federated Semi-Supervised Multi-Task Learning to Detect COVID-19 and Lungs Segmentation Marking Using Chest Radiography Images and Raspberry Pi Devices: An Internet of Medical Things Application', Mahbub Ul Alam, Rahim Rahmani. Sensors 21, no. 15: 5025, https://doi.org/10.3390/s21155025.
🏄 Scalable embedding, reasoning, ranking for images and sentences with CLIP
Add a description, image, and links to the multi-modality topic page so that developers can more easily learn about it.
To associate your repository with the multi-modality topic, visit your repo's landing page and select "manage topics."