Skip to content
View alexxchen's full-sized avatar

Block or report alexxchen

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Pretraining LLM from scratch

Python 1 Updated Mar 17, 2025

Use PEFT or Full-parameter to finetune 500+ LLMs (Qwen2.5, InternLM3, GLM4, Llama3.3, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Llama3.2-Vision, Llava…

Python 6,638 567 Updated Mar 28, 2025

Model interpretability and understanding for PyTorch

Python 5,152 512 Updated Mar 30, 2025

A demo to apply simultaneous interpretation based on Azure

Python 2 Updated Oct 17, 2023

Reproduction of paper "Lateral interaction by Lapalcian-based graph smoothing for deep neural networks"

Python 1 Updated Mar 2, 2025

Reproduce R1 Zero on Logic Puzzle

Python 2,238 147 Updated Mar 20, 2025

Official Repo for Open-Reasoner-Zero

Python 1,689 81 Updated Mar 5, 2025

Democratizing Reinforcement Learning for LLMs

Python 2,160 188 Updated Feb 16, 2025

One-click start reproduction of multi-modal DeepSeek R1-Zero

Python 6 1 Updated Mar 16, 2025

Witness the aha moment of VLM with less than $3.

Python 3,432 271 Updated Mar 1, 2025

Integrate the DeepSeek API into popular softwares

30,628 3,330 Updated Mar 28, 2025

Biomedical Question Answering Datasets.

99 7 Updated Jul 11, 2023

Fully open reproduction of DeepSeek-R1

Python 23,495 2,141 Updated Mar 30, 2025

Clean, minimal, accessible reproduction of DeepSeek R1-Zero

Python 11,427 1,445 Updated Mar 10, 2025

The all-in-one Desktop & Docker AI application with built-in RAG, AI agents, No-code agent builder, and more.

JavaScript 41,951 4,042 Updated Mar 29, 2025

RAGFlow is an open-source RAG (Retrieval-Augmented Generation) engine based on deep document understanding.

TypeScript 47,146 4,357 Updated Mar 29, 2025
Jupyter Notebook 52 8 Updated May 24, 2024

Agent Laboratory is an end-to-end autonomous research workflow meant to assist you as the human researcher toward implementing your research ideas

Python 4,091 589 Updated Mar 27, 2025

AllenAI's post-training codebase

Python 2,843 368 Updated Mar 30, 2025

Reproducible, flexible LLM evaluations

Python 180 20 Updated Mar 25, 2025

Data and tools for generating and inspecting OLMo pre-training data.

Python 1,173 130 Updated Mar 12, 2025

prime is a framework for efficient, globally distributed training of AI models over the internet.

Python 689 67 Updated Mar 28, 2025

仅需Python基础,从0构建大语言模型;从0逐步构建GLM4\Llama3\RWKV6, 深入理解大模型原理

Jupyter Notebook 2,575 365 Updated Aug 15, 2024

Modeling, training, eval, and inference code for OLMo

Python 5,448 585 Updated Mar 28, 2025

A generative world for general-purpose robotics & embodied AI learning.

Python 24,598 2,150 Updated Mar 30, 2025

Official code for Coupled Oscillatory RNN (ICLR 2021, Oral)

Python 43 8 Updated Aug 26, 2021

phy: interactive visualization and manual spike sorting of large-scale ephys data

Python 343 165 Updated Aug 2, 2024

Fast spike sorting with drift correction

Python 514 260 Updated Mar 26, 2025

[NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"

Python 116 16 Updated Dec 13, 2019
Next
Showing results