Skip to content
@CLUEbenchmark

CLUE benchmark

Organization of Language Understanding Evaluation benchmark for Chinese: tasks & datasets, baselines, pre-trained Chinese models, corpus and leaderboard

Pinned Loading

  1. CLUE Public

    中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python 4.1k 546

  2. SuperCLUE Public

    SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

    3.1k 103

  3. SuperCLUE-Safety Public

    SC-Safety: 中文大模型多轮对抗安全基准

    127 9

  4. SuperCLUE-Auto Public

    汽车行业中文大模型测评基准,基于多轮开放式问题的细粒度评测

    33 3

  5. SuperCLUE-Agent Public

    SuperCLUE-Agent: 基于中文原生任务的Agent智能体核心能力测评基准

    83 3

  6. SuperCLUE-RAG Public

    中文原生检索增强生成测评基准

    112 3

Repositories

Showing 10 of 51 repositories
  • Math24o Public

    Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark

    Python 7 0 0 0 Updated Mar 20, 2025
  • 2024h1 Public

    中文大模型基准测评2024上半年度报告,Report of LLMs in Chinese, First Half of 2024

    1 0 1 0 Updated Jul 9, 2024
  • SuperCLUE-Video Public

    中文原生多层次文生视频测评基准

    17 1 0 0 Updated Jul 8, 2024
  • SuperCLUE-V Public

    中文原生多模态理解测评基准(测评方案)

    3 0 0 0 Updated Jul 8, 2024
  • SuperCLUE-Long Public

    中文原生长文本测评基准

    5 0 0 0 Updated Jul 8, 2024
  • SuperCLUE-Image Public

    中文原生文生图测评基准

    8 0 0 0 Updated Jul 8, 2024
  • SuperCLUE-Coder Public

    中文原生代码助手测评基准,产品级

    0 0 0 0 Updated Jul 8, 2024
  • SuperCLUElyb Public

    SuperCLUE琅琊榜:中文通用大模型匿名对战评价基准

    145 6 3 1 Updated Jun 19, 2024
  • SuperCLUE Public

    SuperCLUE: 中文通用大模型综合性基准 | A Benchmark for Foundation Models in Chinese

    3,127 103 36 0 Updated May 23, 2024
  • CLUE Public

    中文语言理解测评基准 Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python 4,092 546 78 2 Updated May 23, 2024