@CLUEbenchmark

CLUE benchmark

Organization for the Chinese Language Understanding Evaluation (CLUE) benchmark: tasks and datasets, baselines, pre-trained Chinese models, corpus, and leaderboard

Pinned

  1. CLUE Public

    Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python · 4.1k stars · 545 forks

  2. SuperCLUE Public

    SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese

    3.1k stars · 102 forks

  3. SuperCLUE-Safety Public

    SC-Safety: a multi-turn adversarial safety benchmark for Chinese large language models

    125 stars · 9 forks

  4. SuperCLUE-Auto Public

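    Evaluation benchmark for Chinese large language models in the automotive industry, with fine-grained evaluation based on multi-turn open-ended questions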

    33 stars · 3 forks

  5. SuperCLUE-Agent Public

    SuperCLUE-Agent: a benchmark for evaluating core agent capabilities on native Chinese tasks

    82 stars · 3 forks

  6. SuperCLUE-RAG Public

    Native Chinese retrieval-augmented generation (RAG) evaluation benchmark

    111 stars · 3 forks

Repositories

Showing 10 of 50 repositories
  • 2024h1 Public

    Benchmark Evaluation Report of LLMs in Chinese, First Half of 2024

    1 star · 0 forks · 1 issue · 0 pull requests · Updated Jul 9, 2024
  • SuperCLUE-Video Public

    Native Chinese multi-level text-to-video generation evaluation benchmark

    17 stars · 1 fork · 0 issues · 0 pull requests · Updated Jul 8, 2024
  • SuperCLUE-V Public

    Native Chinese multimodal understanding evaluation benchmark (evaluation plan)

    3 stars · 0 forks · 0 issues · 0 pull requests · Updated Jul 8, 2024
  • SuperCLUE-Long Public

    Native Chinese long-text evaluation benchmark

    5 stars · 0 forks · 0 issues · 0 pull requests · Updated Jul 8, 2024
  • SuperCLUE-Image Public

    Native Chinese text-to-image generation evaluation benchmark

    8 stars · 0 forks · 0 issues · 0 pull requests · Updated Jul 8, 2024
  • SuperCLUE-Coder Public

    Native Chinese code assistant evaluation benchmark, product-level

    0 stars · 0 forks · 0 issues · 0 pull requests · Updated Jul 8, 2024
  • SuperCLUElyb Public

    SuperCLUE Langya Leaderboard (琅琊榜): anonymous head-to-head battle evaluation benchmark for Chinese general-purpose large models

    145 stars · 6 forks · 3 issues · 1 pull request · Updated Jun 19, 2024
  • SuperCLUE Public

    SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese

    3,113 stars · 102 forks · 37 issues · 0 pull requests · Updated May 23, 2024
  • CLUE Public

    Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus and leaderboard

    Python · 4,080 stars · 545 forks · 78 issues · 2 pull requests · Updated May 23, 2024
  • SuperCLUE-Fin Public

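    Evaluation benchmark for Chinese financial large language models: 25 tasks in six categories, with graded ratings; domestic (Chinese) models achieved grade A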

    8 stars · 0 forks · 0 issues · 0 pull requests · Updated May 6, 2024