🧠 | Multimodal Integration of Oncology Data System
Updated May 31, 2024 · JavaScript
TerraWatch is a proof-of-concept system developed during the TUM AI Hackathon 2024 that detects deforestation from satellite images and reasons about its causes and potential environmental effects using computer vision models and multimodal large language models.
Web-Based Exercise Posture Evaluation and AI Voice Feedback System
This is a simple application that generates scripts for the user to read aloud. Based on the recorded audio, the application provides a score for the user's pronunciation and suggests possible ways to improve it.
Our project enhances Trulens analytics through two key initiatives: an interactive visual node that integrates into Jupyter notebooks, and a comprehensive RAG framework over the Trulens documentation. These efforts aim to simplify and enrich the user experience with Trulens, making advanced data analysis more accessible and intuitive.
A three-level multimodal emotion recognition framework that detects emotions by combining inputs in different formats.
Shows how you can add semantic search to your applications: this sample uses a multimodal model to find images that are semantically similar to a text query. A blog post on it is coming soon.
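The sample's exact stack isn't shown here; as a rough illustration of the idea, here is a minimal sketch using the open-source sentence-transformers CLIP model (the model name and image files are assumptions, not necessarily what the sample uses):

```python
from PIL import Image
from sentence_transformers import SentenceTransformer, util

# CLIP maps images and text into the same embedding space,
# so cosine similarity measures semantic relatedness.
model = SentenceTransformer("clip-ViT-B-32")  # assumed model choice

# Hypothetical local files standing in for an image collection.
image_paths = ["cat.jpg", "beach.jpg", "city.jpg"]
image_embeddings = model.encode([Image.open(p) for p in image_paths])

# Embed the text query and rank images by cosine similarity.
query_embedding = model.encode("a sunny day at the seaside")
scores = util.cos_sim(query_embedding, image_embeddings)[0]
best = scores.argmax().item()
print(f"Best match: {image_paths[best]} (score={scores[best].item():.3f})")
```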
Build and explore multimodal web interactives with pieces of paper!
[ICCV2021 Workshop] Multi-Modal Video Reasoning and Analyzing Competition
Amazon Alexa Skill - "Alexa, ask Fork On The Road"
Turn yourself into a Halloween-styled character and get an original roast with the power of AI.
A vision-assistance multimodal application built on top of Google Gemini Pro Vision.
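For context on what such an app's core call might look like, here is a minimal, hypothetical sketch using Google's google-generativeai Python SDK (the prompt and input image are assumptions; the actual app's code may differ):

```python
import google.generativeai as genai
from PIL import Image

genai.configure(api_key="YOUR_API_KEY")  # placeholder; supply your own key

# Gemini Pro Vision accepts mixed text + image content in one request.
model = genai.GenerativeModel("gemini-pro-vision")

image = Image.open("scene.jpg")  # hypothetical frame from the user's camera
response = model.generate_content(
    ["Describe this scene for a visually impaired user.", image]
)
print(response.text)
```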
[CVPR2021] SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events
Employee Productivity GenAI Assistant Example is an innovative code sample and architecture pattern designed to enhance the efficiency of writing tasks using AWS serverless technologies and Amazon Bedrock's generative AI models.
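The sample's full serverless architecture isn't reproduced here; as a rough sketch of the core call, this is how a Lambda-style handler might invoke a Bedrock-hosted model via boto3 (the model ID and request shape follow the Claude Messages API on Bedrock and are assumptions about what the sample uses):

```python
import json
import boto3

bedrock = boto3.client("bedrock-runtime", region_name="us-east-1")

def draft_text(task: str) -> str:
    """Ask a Bedrock-hosted Claude model to help with a writing task."""
    body = {
        "anthropic_version": "bedrock-2023-05-31",
        "max_tokens": 512,
        "messages": [{"role": "user", "content": f"Improve this draft:\n{task}"}],
    }
    response = bedrock.invoke_model(
        modelId="anthropic.claude-3-sonnet-20240229-v1:0",  # assumed model choice
        body=json.dumps(body),
    )
    payload = json.loads(response["body"].read())
    return payload["content"][0]["text"]

print(draft_text("our team done good work this quarter"))
```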
React component library for crafting user-friendly and engaging conversational experiences
Sample skill that demonstrates the new Alexa Presentation Language (APL). The multimodal skill's functionality matches the Alexa Fact Skill template: when invoked, it selects a fact at random and tells it to the user, and it is compatible with devices that have a display.
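To give a sense of what an APL response involves, here is a minimal, hypothetical handler fragment using the Python ASK SDK that attaches a RenderDocument directive alongside the spoken fact (the APL document layout is an illustrative assumption, not the template's actual design):

```python
from ask_sdk_core.utils import get_supported_interfaces
from ask_sdk_model.interfaces.alexa.presentation.apl import RenderDocumentDirective

# Minimal APL document: a single Text component showing the fact on screen.
APL_DOCUMENT = {
    "type": "APL",
    "version": "1.8",
    "mainTemplate": {
        "parameters": ["payload"],
        "items": [{"type": "Text", "text": "${payload.fact}", "fontSize": "40dp"}],
    },
}

def build_response(handler_input, fact: str):
    builder = handler_input.response_builder.speak(fact)
    # Only send the directive to devices that actually support APL displays.
    if get_supported_interfaces(handler_input).alexa_presentation_apl is not None:
        builder.add_directive(
            RenderDocumentDirective(
                token="factToken",
                document=APL_DOCUMENT,
                datasources={"payload": {"fact": fact}},
            )
        )
    return builder.response
```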
A simple "Be My Eyes" web app with a llama.cpp/llava backend
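For orientation, a llama.cpp server started with a LLaVA model and its multimodal projector exposes an HTTP /completion endpoint that accepts base64 image data; the request shape below follows that server's API as documented, though flags and fields may have changed between versions:

```python
import base64
import requests

# Assumes a local llama.cpp server, started roughly like:
#   ./server -m llava-v1.5-7b.Q4_K.gguf --mmproj mmproj-model-f16.gguf
with open("photo.jpg", "rb") as f:
    image_b64 = base64.b64encode(f.read()).decode()

response = requests.post(
    "http://localhost:8080/completion",
    json={
        # The [img-12] tag tells the server where to splice in image id 12.
        "prompt": "USER: [img-12] Describe what is in front of me.\nASSISTANT:",
        "image_data": [{"data": image_b64, "id": 12}],
        "n_predict": 128,
    },
)
print(response.json()["content"])
```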