Deeplearning utils for multimodal research
Code and Models for Binding Text, Images, Graphs, and Audio for Music Representation Learning
Project to transform a natural language description into an image using Generative Adversarial Networks.
Accepted at The Web Conference 2024.
Learning a common representation space from speech and text for cross-modal retrieval given textual queries and speech files.
This code is part of the paper: "A Deep Dive Into Neural Synchrony Evaluation for Audio-visual Translation" published at ACM ICMI 2022.
Semi-Supervised Learning (SSL)
Using a 3D Nearby Self-Attention Transformer to leverage the spatiotemporal nature of video for representation learning.
API for automated disease detection and report generation from medical images.
Kedro pipelines for preprocessing text and tabular data for multi-modal ML in TensorFlow.
Adding Bottlenecked Fusion to [ACL'19] Multimodal Transformer
Visual Question Answering (VQA) Model
Demo for Binding Text, Images, Graphs, and Audio for Music Representation Learning
Part of my work for my Bachelor's Thesis Project on Counterfactual Reasoning for Videos.
Utilizing a multimodal architecture to predict the appropriate speaker turn in a dialogue.
Classifying multimodal health data with LSTMs
A Discord chatbot built on the Mistral and LLaVA models.
Repository for context-based emotion recognition.