Skip to content

Popular repositories Loading

  1. Cherry_LLM Cherry_LLM Public

    [NAACL'24] Self-data filtering of LLM instruction-tuning data using a novel perplexity-based difficulty score, without using any other models

    Python 363 23

  2. Reflection_Tuning Reflection_Tuning Public

    [ACL'24] Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-Tuning

    Python 354 29

  3. HallusionBench HallusionBench Public

    [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

    Python 281 8

  4. Superfiltering Superfiltering Public

    [ACL'24] Superfiltering: Weak-to-Strong Data Filtering for Fast Instruction-Tuning

    Python 148 13

  5. MoE-Embedding MoE-Embedding Public

    Code for "Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free"

    Python 67 8

  6. MiP-Overthinking MiP-Overthinking Public

    Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

    Python 26 1

Repositories

Showing 10 of 17 repositories
  • ColorBench Public

    Official repo for ColorBench

    Python 10 Apache-2.0 0 0 0 Updated Apr 22, 2025
  • MiP-Overthinking Public

    Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill?

    Python 26 MIT 1 1 0 Updated Apr 10, 2025
  • C3PO Public

    Code for "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"

    Jupyter Notebook 14 Apache-2.0 1 0 0 Updated Apr 9, 2025
  • CoSTAR Public

    Cost-Sensitive Toolpath Agent for Multi-turn Image Editing

    Jupyter Notebook 19 BSD-3-Clause 0 0 0 Updated Mar 26, 2025
  • R2-T2 Public

    Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"

    Python 15 MIT 1 0 0 Updated Mar 10, 2025
  • MosT Public

    Code for "Many-objective multi-solution transport"

    Python 2 0 0 0 Updated Feb 28, 2025
  • Mosaic-IT Public

    Mosaic IT: Enhancing Instruction Tuning with Data Mosaics

    Python 17 3 0 0 Updated Feb 11, 2025
  • RuleR Public

    RuleR: Improving LLM Controllability by Rule-based Data Recycling

    Python 12 1 1 0 Updated Feb 11, 2025
  • HallusionBench Public

    [CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models

    Python 281 BSD-3-Clause 8 0 0 Updated Nov 13, 2024
  • DisCL Public

    Official repo for Diffusion Curriculum (DisCL)

    Python 10 0 2 0 Updated Oct 18, 2024

Top languages

Loading…

Most used topics

Loading…