Skip to content
View zyds's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report zyds

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

📦 Repomix (formerly Repopack) is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your codebase to Large Language Models (LLMs) o…

TypeScript 13,863 595 Updated Mar 29, 2025

Official Repository for SIGIR2024 Demo Paper "An Integrated Data Processing Framework for Pretraining Foundation Models"

Python 74 12 Updated Aug 27, 2024

Training Turn-by-Turn Verifiers for Dialogue Tutoring Agents: The Curious Case of LLMs as Your Coding Tutors

Python 68 8 Updated Feb 26, 2025

🐳 Efficient Triton implementations for "Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention"

Python 601 30 Updated Mar 19, 2025

[TMLR] A curated list of language modeling researches for code (and other software engineering activities), plus related datasets.

2,313 144 Updated Mar 25, 2025

Fully open reproduction of DeepSeek-R1

Python 23,472 2,137 Updated Mar 29, 2025

System 2 Reasoning Link Collection

817 70 Updated Mar 16, 2025

A collection of LLM papers, blogs, and projects, with a focus on OpenAI o1 🍓 and reasoning techniques.

6,606 365 Updated Mar 26, 2025

APOLLO: SGD-like Memory, AdamW-level Performance

Python 195 7 Updated Mar 8, 2025

[arXiv preprint] Official Repository for "Evaluating Language Models as Synthetic Data Generators"

Jupyter Notebook 34 3 Updated Dec 13, 2024

Continual Learning of Large Language Models: A Comprehensive Survey

376 17 Updated Mar 3, 2025

Representation Engineering: A Top-Down Approach to AI Transparency

Jupyter Notebook 811 94 Updated Aug 14, 2024

收集各大AndroidTV的apk应用,可免费看vip和国外电影电视。如大家有也可以贡献一下。

4,104 356 Updated Mar 29, 2025
Python 400 33 Updated Mar 26, 2025

Rigourous evaluation of LLM-synthesized code - NeurIPS 2023 & COLM 2024

Python 1,421 144 Updated Jan 6, 2025

Efficient Triton Kernels for LLM Training

Python 4,746 286 Updated Mar 28, 2025

📰 Must-read papers and blogs on LLM based Long Context Modeling 🔥

1,372 47 Updated Mar 27, 2025

Tools for merging pretrained large language models.

Python 5,487 521 Updated Mar 28, 2025

LLM101n: Let's build a Storyteller

32,990 1,804 Updated Aug 1, 2024

Memory optimization and training recipes to extrapolate language models' context length to 1 million tokens, with minimal hardware.

Python 706 48 Updated Sep 27, 2024

The official Meta Llama 3 GitHub site

Python 28,558 3,338 Updated Jan 26, 2025

Reformatted Alignment

JavaScript 115 7 Updated Sep 23, 2024

基于DPO算法微调语言大模型,简单好上手。

Python 34 1 Updated Jul 3, 2024

Minimalistic large language model 3D-parallelism training

Python 1,733 167 Updated Mar 28, 2025

心理健康大模型、LLM、The Big Model of Mental Health、Finetune、InternLM2、InternLM2.5、Qwen、ChatGLM、Baichuan、DeepSeek、Mixtral、LLama3、GLM4、Qwen2、LLama3.1

Python 1,333 178 Updated Mar 23, 2025

Netease Youdao's open-source embedding and reranker models for RAG products.

Python 1,690 114 Updated Feb 5, 2025
Python 318 16 Updated Jul 16, 2024

Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024]

Python 544 29 Updated Dec 9, 2024

A curated list of Large Language Model (LLM) Interpretability resources.

1,277 95 Updated Dec 21, 2024

OpenChat: Advancing Open-source Language Models with Imperfect Data

Python 5,317 411 Updated Sep 13, 2024
Next
Showing results