View SuperFeHanHan's full-sized avatar

Fully open reproduction of DeepSeek-R1

Python 22,316 2,000 Updated Mar 7, 2025

A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool.

Python 27,608 2,125 Updated Mar 7, 2025

Companion code for the book "GPU High-Performance Programming: CUDA in Action" (GPU高性能编程CUDA实战)

C 33 7 Updated May 24, 2022

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator

C++ 15,876 3,083 Updated Mar 7, 2025

Chinese-language reinforcement learning tutorial (the "Mushroom Book" 🍄); read online at https://datawhalechina.github.io/easy-rl/

Jupyter Notebook 10,503 1,957 Updated Feb 20, 2025

User-friendly Desktop Client App for AI Models/LLMs (GPT, Claude, Gemini, Ollama...)

TypeScript 32,957 3,128 Updated Mar 4, 2025

🚀🚀 "Large models": train a small 26M-parameter GPT entirely from scratch in just 2h! 🌏

Python 14,689 1,634 Updated Feb 23, 2025

Notes on knowledge and methods related to large models

Jupyter Notebook 953 154 Updated Feb 21, 2025

Use PEFT or Full-parameter to finetune 450+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, Baichuan2, DeepSeek-R1, ...) and 150+ MLLMs (Qwen2.5-VL, Qwen2-Audio, Llama3.2-Vision, Llava, I…

Python 6,080 521 Updated Mar 7, 2025

【ArXiv】PDF-Wukong: A Large Multimodal Model for Efficient Long PDF Reading with End-to-End Sparse Sampling

113 4 Updated Oct 18, 2024

✨✨Latest Advances on Multimodal Large Language Models

14,153 908 Updated Mar 5, 2025

The code used to train and run inference with the ColVision models, e.g. ColPali, ColQwen2, and ColSmol.

Python 1,568 133 Updated Mar 7, 2025

SGLang is a fast serving framework for large language models and vision language models.

Python 11,552 1,171 Updated Mar 7, 2025

Famous Vision Language Models and Their Architectures

Markdown 690 35 Updated Feb 24, 2025

moffee: Make Markdown Ready to Present

Python 1,097 49 Updated Nov 22, 2024

This repository offers a comprehensive collection of tutorials on state-of-the-art computer vision models and techniques. Explore everything from foundational architectures like ResNet to cutting-e…

Jupyter Notebook 7,227 1,139 Updated Feb 24, 2025

[NAACL 2024] MMC: Advancing Multimodal Chart Understanding with LLM Instruction Tuning

Python 95 4 Updated Jan 7, 2025

MINT-1T: A one trillion token multimodal interleaved dataset.

802 20 Updated Jul 31, 2024

[EMNLP'21] Visual News: Benchmark and Challenges in News Image Captioning

Jupyter Notebook 93 9 Updated Jul 18, 2024

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 7,554 764 Updated Aug 12, 2024

The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.

Python 1,562 124 Updated Aug 13, 2024

A Comprehensive Toolkit for High-Quality PDF Content Extraction

Python 6,963 474 Updated Jan 3, 2025

Using GPT to parse PDF

Python 3,295 238 Updated Aug 7, 2024

Officially recommended ChatTTS resource collection project, gathering related resources from across the web along with frequently asked questions

1,526 92 Updated Jul 3, 2024

pix2tex: Using a ViT to convert images of equations into LaTeX code.

Python 13,755 1,093 Updated Jan 18, 2025

Official code for "Fox: Focus Anywhere for Fine-grained Multi-page Document Understanding"

Python 139 8 Updated May 31, 2024

[ECCV 2024] Official code implementation of Vary: Scaling Up the Vision Vocabulary of Large Vision Language Models.

Python 1,816 146 Updated Dec 30, 2024

🏠 Connect the Xiao Ai (小爱) smart speaker to ChatGPT and Doubao (豆包), turning it into your own personal voice assistant.

TypeScript 10,066 1,228 Updated Mar 1, 2025

Convert PDF to markdown + JSON quickly with high accuracy

Python 22,146 1,351 Updated Mar 7, 2025

[CVPR2024] The code for "Osprey: Pixel Understanding with Visual Instruction Tuning"

Python 800 42 Updated Feb 27, 2025