SuperFeHanHan

Follow

SuperFeHanHan SuperFeHanHan

Follow

3 followers · 6 following

Achievements

Achievements

Stars

huggingface / open-r1

Fully open reproduction of DeepSeek-R1

Python 22,316 2,000 Updated Mar 7, 2025

opendatalab / MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具，将PDF转换成Markdown和JSON格式。

Python 27,608 2,125 Updated Mar 7, 2025

ischintsan / cuda_by_example

GPU高性能编程CUDA实战随书代码

C 33 7 Updated May 24, 2022

microsoft / onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,876 3,083 Updated Mar 7, 2025

datawhalechina / easy-rl

强化学习中文教程（蘑菇书🍄），在线阅读地址：https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 10,503 1,957 Updated Feb 20, 2025

Bin-Huang / chatbox

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 32,957 3,128 Updated Mar 4, 2025

jingyaogong / minimind

🚀🚀 「大模型」2小时完全从0训练26M的小参数GPT！🌏 Train a 26M-parameter GPT from scratch in just 2h!

Python 14,689 1,634 Updated Feb 23, 2025

wyf3 / llm_related

记录大模型相关的一些知识和方法

Jupyter Notebook 953 154 Updated Feb 21, 2025

modelscope / ms-swift

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 6,080 521 Updated Mar 7, 2025

yh-hust / PDF-Wukong

【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

113 4 Updated Oct 18, 2024

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

14,153 908 Updated Mar 5, 2025

illuin-tech / colpali

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 1,568 133 Updated Mar 7, 2025

sgl-project / sglang

SGLang is a fast serving framework for large language models and vision language models.

Python 11,552 1,171 Updated Mar 7, 2025

gokayfem / awesome-vlm-architectures

Famous Vision Language Models and Their Architectures

Markdown 690 35 Updated Feb 24, 2025

BMPixel / moffee

moffee: Make Markdown Ready to Present

Python 1,097 49 Updated Nov 22, 2024

roboflow / notebooks

This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…

Jupyter Notebook 7,227 1,139 Updated Feb 24, 2025

FuxiaoLiu / MMC

[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning

Python 95 4 Updated Jan 7, 2025

mlfoundations / MINT-1T

MINT-1T: A one trillion token multimodal interleaved dataset.

802 20 Updated Jul 31, 2024

FuxiaoLiu / VisualNews-Repository

[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning

Jupyter Notebook 93 9 Updated Jul 18, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,554 764 Updated Aug 12, 2024

QwenLM / Qwen2-Audio

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,562 124 Updated Aug 13, 2024

opendatalab / PDF-Extract-Kit

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,963 474 Updated Jan 3, 2025

CosmosShadow / gptpdf

Using GPT to parse PDF

Python 3,295 238 Updated Aug 7, 2024

libukai / Awesome-ChatTTS

官方推荐的 ChatTTS 资源汇总项目，整理了全网相关资源和常见问题 || Officially recommended ChatTTS resource collection project

1,526 92 Updated Jul 3, 2024

lukas-blecher / LaTeX-OCR

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 13,755 1,093 Updated Jan 18, 2025

ucaslcl / Fox

official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"

Python 139 8 Updated May 31, 2024

Ucas-HaoranWei / Vary

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,816 146 Updated Dec 30, 2024

idootop / mi-gpt

🏠 将小爱音箱接入 ChatGPT 和豆包，改造成你的专属语音助手。

TypeScript 10,066 1,228 Updated Mar 1, 2025

VikParuchuri / marker

Convert PDF to markdown + JSON quickly with high accuracy

Python 22,146 1,351 Updated Mar 7, 2025

CircleRadon / Osprey

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Python 800 42 Updated Feb 27, 2025