Public repository of our assessment work in missing views for EO applications
Updated Jun 4, 2024 - Python
Implementation of CLIP model with a reduced capacity. For self-educational purposes only.
Implementation for the paper "InstructionGPT-4: A 200-Instruction Paradigm for Fine-Tuning MiniGPT-4" (https://arxiv.org/abs/2308.12067)
PyTorch implementation of "Multi-domain translation between single-cell imaging and sequencing data using autoencoders" (https://www.nature.com/articles/s41467-020-20249-2) with custom models.
Modality Translation through Conditional Encoder-Decoder (2023-1 Machine Learning for Visual Understanding Team project)
[AAAI24] Learning Invariant Inter-pixel Correlations for Superpixel Generation
🌈 Official Code for **Spatio-Temporal Fuzzy-oriented Multi-modal Meta-learning for Fine-grained Emotion Recognition**
Under review. [IROS 2024] PGA: Personalizing Grasping Agents with Single Human-Robot Interaction
auto_labeler - An all-in-one library to automatically label vision data
Code for the ICML 2024 paper: "EMC^2: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence"
(BMVC23) Paper on 3D visual question answering at the lab of Prof. Dr. Niessner at Technical University of Munich.
[ICLR 2024 Spotlight] Deep Symbolic Regression with Multimodal Pretraining
PyTorch implementation of the paper: All For One: Multi-modal Multi-Task Learning
Public repository of our work in the search for an optimal multi-view crop classifier (considering encoder architectures and fusion strategies)
Adaptive Confidence Multi-View Hashing
The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"
The code of the paper: M. Karami, D. Schuurmans, "Deep Probabilistic Canonical Correlation Analysis" AAAI 2021
Code for ACL SustaiNLP 2023 paper "Is a Video worth n × n Images? A Highly Efficient Approach to Transformer-based Video Question Answering"
Code for ACL SRW 2023 paper "Semantic-aware Dynamic Retrospective-Prospective Reasoning for Event-level Video Question Answering"
PyTorch code for the paper "Complementarity is the king: A multi-modal and multi-grained hierarchical semantic enhancement network for cross-modal retrieval"