-
SWE-bench Public
Forked from SWE-bench/SWE-benchSWE-bench [Multimodal]: Can Language Models Resolve Real-world Github Issues?
Python MIT License UpdatedFeb 25, 2025 -
SWE-agent Public
Forked from SWE-agent/SWE-agentSWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…
Python MIT License UpdatedFeb 24, 2025 -
-
-
-
-
-
-
Megatron-LM Public
Forked from NVIDIA/Megatron-LMOngoing research training transformer models at scale
Python Other UpdatedNov 14, 2024 -
FastChat Public
Forked from lm-sys/FastChatAn open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
-
Pai-Megatron-Patch Public
Forked from alibaba/Pai-Megatron-PatchThe official repo of Pai-Megatron-Patch for LLM & VLM large scale training developed by Alibaba Cloud.
Python Apache License 2.0 UpdatedOct 23, 2024 -
Megatron-LLM Public
Forked from epfLLM/Megatron-LLMdistributed trainer for LLMs
Python Other UpdatedOct 3, 2024 -
CodeGeeX Public
Forked from THUDM/CodeGeeXCodeGeeX: An Open Multilingual Code Generation Model (KDD 2023)
Python Apache License 2.0 UpdatedAug 13, 2024 -
-
-
alignment-handbook Public
Forked from huggingface/alignment-handbookRobust recipes to align language models with human and AI preferences
Python Apache License 2.0 UpdatedMay 6, 2024 -
lm-evaluation-harness Public
Forked from EleutherAI/lm-evaluation-harnessA framework for few-shot evaluation of language models.
Python MIT License UpdatedMay 3, 2024 -
vllm Public
Forked from vllm-project/vllmA high-throughput and memory-efficient inference and serving engine for LLMs
Python Apache License 2.0 UpdatedApr 29, 2024 -
trl Public
Forked from huggingface/trlTrain transformer language models with reinforcement learning.
Python Apache License 2.0 UpdatedMar 15, 2024 -
-
alpaca_eval Public
Forked from tatsu-lab/alpaca_evalAn automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
Jupyter Notebook Apache License 2.0 UpdatedFeb 1, 2024 -
tevatron Public
Forked from texttron/tevatronTevatron - A flexible toolkit for neural retrieval research and development.
-
transformers Public
Forked from huggingface/transformers🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.
-
LoRA Public
Forked from microsoft/LoRACode for "LoRA: Low-Rank Adaptation of Large Language Models"
Python MIT License UpdatedJul 30, 2021 -
python-sample-vscode-flask-tutorial Public
Forked from microsoft/python-sample-vscode-flask-tutorialSample code for the Flask tutorial in the VS Code documentation
HTML MIT License UpdatedOct 29, 2020 -
fine-tuning-GPT2 Public
Forked from itsuncheng/fine-tuning-GPT2Codebase for the Medium Article on Fine-tuning GPT2 for Text Generation
Python UpdatedJun 27, 2020 -
cs224u Public
Forked from cgpotts/cs224uCode for Stanford CS224u
TeX Apache License 2.0 UpdatedJun 3, 2020 -
runtime Public
Forked from dotnet/runtime.NET is a cross-platform runtime for cloud, IoT, and desktop apps.
C# MIT License UpdatedMar 9, 2020 -
Prototypical-Networks Public
Forked from Hsankesara/Prototypical-NetworksA novel method for few shot learning
-
scikit-learn Public
Forked from scikit-learn/scikit-learnscikit-learn: machine learning in Python