Pinned Loading
-
ScienceQA
ScienceQA PublicForked from lupantech/ScienceQA
Data and code for NeurIPS 2022 Paper "Learn to Explain: Multimodal Reasoning via Thought Chains for Science Question Answering".
Python
-
PromptPG
PromptPG PublicForked from lupantech/PromptPG
Data and code for the paper "Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning".
Python
-
CHATS-lab/KokoMind
CHATS-lab/KokoMind PublicKokoMind: Can LLMs Understand Social Interactions?
-
WebAgent-R1
WebAgent-R1 PublicForked from weizhepei/WebAgent-R1
WebAgent-R1: Training Web Agents via End-to-End Multi-Turn Reinforcement Learning
-
DFT
DFT PublicForked from Optimization-AI/DFT
Discriminative Fine-tuning of LLMs without reward models and human preference data
Python
If the problem persists, check the GitHub status page or contact support.