-
Updated
Jul 8, 2020 - Python
multimodal
Here are 668 public repositories matching this topic...
🤖 A framework for building AI Agents with LLMs, integrating multimodal generative AI technologies including voice, images, videos, and digital humans 🌈💎✨
-
Updated
Jul 31, 2023
A notebook to learn about ML for astronomy through BTSbot.
-
Updated
Feb 7, 2024 - Jupyter Notebook
Visuo-haptic integration during texture exploration
-
Updated
Jan 12, 2024 - Processing
In this course, you’ll select open source models from Hugging Face Hub to perform NLP, audio, image and multimodal tasks using the Hugging Face transformers library.
-
Updated
Mar 22, 2024 - Jupyter Notebook
Here we will track the latest AI Multimodal Models, including Multimodal Foundation Models, LLM, Audio, Image, Video, Music and 3D content. 🔥
-
Updated
May 24, 2024
Collaborative generation of unique audiovisual experiences using NFC identity cards
-
Updated
Jan 20, 2021 - TypeScript
Todo o conteúdo produzido para a unidade curricular PF (Projeto FEUP), para o curso em Engenharia Informática e Computação na FEUP
-
Updated
Oct 11, 2021
Multitasking multimodal AI material that focus on human interaction and assistance
-
Updated
Apr 29, 2023 - PureBasic
Utilizing a multimodal architecture to predict the appropriate speaker turn in a dialogue.
-
Updated
Feb 21, 2024 - Python
Lightweight and Interpretable ML Model for Speech Emotion Recognition and Ambiguity Resolution (trained on IEMOCAP dataset)
-
Updated
Apr 14, 2021 - Jupyter Notebook
A paper presentation of Multimodal Neurons in Artificial Neural Networks by Goh et. al.
-
Updated
Apr 12, 2023 - HTML
Daily calories tracker based on user's goal - Multi Module + Clean Architecture + Use cases + Jetpack compose + Room + Retrofit + Testing
-
Updated
Feb 9, 2023 - Kotlin
A Python package housing a collection of deep-learning multi-modal data fusion method pipelines! From data loading, to training, to evaluation - fusilli's got you covered 🌸
-
Updated
Feb 1, 2024 - Python
-
Updated
Mar 10, 2024 - Python
Multimodal version of SlowFast (Vision + Audio inputs)
-
Updated
Apr 1, 2024 - Python
Code for the Paper "Tensor Decomposition for Compression of Multimodal Dual-Encoder Deep Neural Networks"
-
Updated
Sep 1, 2023 - Python
Telegram bot that lets you interact with ChatGPT with additional context and multimodal features.
-
Updated
May 19, 2024 - Python
This repo collects Multi-modal Machine Learning papers.
-
Updated
Jul 15, 2020
Improve this page
Add a description, image, and links to the multimodal topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the multimodal topic, visit your repo's landing page and select "manage topics."