- China
Lists (1)
Sort Name ascending (A-Z)
Stars
A curated list of publications on image and video segmentation leveraging Multimodal Large Language Models (MLLMs), highlighting state-of-the-art methods, innovative applications, and key advanceme…
A Conversational Speech Generation Model
🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 2h!
The official gpt4free repository | various collection of powerful language models | o3 and deepseek r1, gpt-4.5
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / DeepSeek / Qwen), Knowledge Base (file upload / knowledge managemen…
✨ 易上手的多平台 LLM 聊天机器人及开发框架 ✨ 平台支持 QQ、QQ频道、Telegram、微信、企微、飞书 | MCP 服务器、OpenAI、DeepSeek、Gemini、硅基流动、月之暗面、Ollama、OneAPI、Dify 等。附带 WebUI。
HuixiangDou: Overcoming Group Chat Scenarios with LLM-based Technical Assistance
Integrate the DeepSeek API into popular softwares
✍ WeChat Markdown Editor | 一款高度简洁的微信 Markdown 编辑器:支持 Markdown 语法、色盘取色、多图上传、一键下载文档、自定义 CSS 样式、一键重置等特性
BiomedParse: A Foundation Model for Joint Segmentation, Detection, and Recognition of Biomedical Objects Across Nine Modalities
Generating diverse and realistic datasets for computer vision training using AI.
Implement a ChatGPT-like LLM in PyTorch from scratch, step by step
Multimodal Whole Slide Foundation Model for Pathology
WWW2025 Multimodal Intent Recognition for Dialogue Systems Challenge
Clean, Robust, and Unified PyTorch implementation of popular Deep Reinforcement Learning (DRL) algorithms (Q-learning, Duel DDQN, PER, C51, Noisy DQN, PPO, DDPG, TD3, SAC, ASL)
This repository is based on shouxieai/tensorRT_Pro, with adjustments to support YOLOv8.
使用OpenCV部署yolov8检测人脸和关键点以及人脸质量评价,包含C++和Python两个版本的程序,只依赖opencv库就可以运行,彻底摆脱对任何深度学习框架的依赖。
Transcriptomics-guided Slide Representation Learning in Computational Pathology - CVPR 2024
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision, llama-3.2-vision, qwen-vl, qwen2-vl, phi3-v etc.
Virtual whiteboard for sketching hand-drawn like diagrams
Orchestrate zero-shot computer vision models
Implementation of Attention-based Deep Multiple Instance Learning in PyTorch
A general-purpose foundation model for computational pathology - Nature Medicine
Personal Project: MPP-Qwen14B & MPP-Qwen-Next(Multimodal Pipeline Parallel based on Qwen-LM). Support [video/image/multi-image] {sft/conversations}. Don't let the poverty limit your imagination! Tr…