multi-modal-learning

Pytorch implementation of "Multi-domain translation between single-cell imaging and sequencing data using autoencoders" (https://www.nature.com/articles/s41467-020-20249-2) with custom models.

multi-domain single-cell multi-modal single-cell-rna-seq shared-embedding multi-view-learning single-cell-omics multi-view data-alignment multi-modal-learning multi-domain-adaptation

Updated Oct 13, 2021
Python

Hleephilip / MLVU-project

Star

Modality Translation through Conditional Encoder-Decoder (2023-1 Machine Learning for Visual Understanding Team project)

multi-modal-learning latent-diffusion

Updated Jun 13, 2023
Python

rookiie / CDSpixel

Star

[AAAI24] Learning Invariant Inter-pixel Correlations for Superpixel Generation

superpixel domain-generalization multi-modal-learning aaai2024

Updated Mar 27, 2024
Python

WangJingyao07 / ST-F2M

Star

🌈 Official Code for **Spatio-Temporal Fuzzy-oriented Multi-modal Meta-learning for Fine-grained Emotion Recognition**

fuzzy-rules spatio-temporal-analysis meta-learning multi-modal-learning fine-grained-emotion-recognition

Updated Mar 5, 2024
Python

JHKim-snu / PGA

Star

Under review. [IROS 2024] PGA: Personalizing Grasping Agents with Single Human-Robot Interaction

personalization semi-supervised-learning vision-and-language robotic-manipulation visual-grounding multi-modal-learning

Updated Mar 30, 2024
Python

mailcorahul / auto_labeler

Star

auto_labeler - An all-in-one library to automatically label vision data

deep-learning image-classification deep-learning-library object-detection text-to-image instance-segmentation pseudo-labeling multi-modal-learning

Updated May 13, 2024
Python

amazon-science / contrastive_emc2

Star

Code the ICML 2024 paper: "EMC^2: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence"

machine-learning deep-neural-networks machine-learning-algorithms multi-modal multi-modal-learning mcmc-sampling contrastive-learning

Updated May 22, 2024
Python

MunzerDw / Gen3DQA

Star

(BMVC23) Paper on 3D visual question answering at the lab of Prof. Dr. Niessner at Technical University of Munich.

reinforcement-learning deep-learning multi-modal-learning 3d-question-answering 3d-vision-language-understanding

Updated Nov 22, 2023
Python

deep-symbolic-mathematics / Multimodal-Symbolic-Regression

Star

[ICLR 2024 Spotlight] Deep Symbolic Regression with Multimodal Pretraining

transformers symbolic-regression multi-modal-learning latent-space-interpolation equation-discovery ai4science ai4math

Updated Jun 8, 2024
Python

itsShnik / allForOne

Star

PyTorch implementation of the paper: All For One: Multi-modal Multi-Task Learning

deep-learning sentiment-classification multi-task-learning visual-question-answering vision-and-language multi-modal-learning

Updated Jul 17, 2020
Python

fmenat / optimal-multiview-crop-classifier

Star

Public repository of our work in the search for an optimal multi-view crop classifier (considering encoder architectures and fusion strategies)

deep-learning sensor-fusion multi-view-learning neural-network-architectures cropland-mapping crop-classification multi-modal-learning

Updated Jun 5, 2024
Python

HackerHyper / ACMVH

Star

Adaptive Confidence Multi-View Hashing

multi-view-learning multi-modal-learning multi-modal-fusion

Updated Dec 13, 2023
Python

kyegomez / MegaVIT

Sponsor

Star

The open source implementation of the model from "Scaling Vision Transformers to 22 Billion Parameters"

computer-vision artificial-intelligence multi-modal vision-and-language multi-modal-learning vision-transformer gpt4 multi-modal-fusion

Updated May 17, 2024
Python

Karami-m / Deep-Probabilistic-Multi-View

Star

The code of the paper: M. Karami, D. Schuurmans, "Deep Probabilistic Canonical Correlation Analysis" AAAI 2021

deep-learning deep dnn generative-model vae canonical-correlation-analysis multi-view-learning multi-modal-learning

Updated Mar 29, 2022
Python

lyuchenyang / Efficient-VideoQA

Star

Code for ACL SustaiNLP 2023 paper "Is a Video worth n × n Images? A Highly Efficient Approach to Transformer-based Video Question Answering"

machine-learning natural-language-processing deep-learning artificial-intelligence video-question-answering multi-modal-learning

Updated Jul 4, 2023
Python

lyuchenyang / Semantic-aware-VideoQA

Star

Code for ACL SRW 2023 paepr "Semantic-aware Dynamic Retrospective-Prospective Reasoning for Event-level Video Question Answering"

machine-learning natural-language-processing deep-learning artificial-intelligence video-question-answering multi-modal-learning

Updated Jul 4, 2023
Python

Boreas-pxl / M2HSE

Star

PyTorch code for the paper "Complementarity is the king: A multi-modal and multi-grained hierarchical semantic enhancement network for cross-modal retrieval"

deep-learning pytorch cross-modal-retrieval multi-modal-learning

Updated Dec 10, 2022
Python

Improve this page

Add a description, image, and links to the multi-modal-learning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the multi-modal-learning topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

multi-modal-learning

Here are 73 public repositories matching this topic...

fmenat / missingviews-study-EO

mattroz / miniCLIP

MIFA-Lab / InstructionGPT-4

talipucar / DomainTranslation

Hleephilip / MLVU-project

rookiie / CDSpixel

WangJingyao07 / ST-F2M

JHKim-snu / PGA

mailcorahul / auto_labeler

amazon-science / contrastive_emc2

MunzerDw / Gen3DQA

deep-symbolic-mathematics / Multimodal-Symbolic-Regression

itsShnik / allForOne

fmenat / optimal-multiview-crop-classifier

HackerHyper / ACMVH

kyegomez / MegaVIT

Karami-m / Deep-Probabilistic-Multi-View

lyuchenyang / Efficient-VideoQA

lyuchenyang / Semantic-aware-VideoQA

Boreas-pxl / M2HSE

Improve this page

Add this topic to your repo