@CLUEbenchmark

CLUE benchmark

Organization for the Chinese Language Understanding Evaluation (CLUE) benchmark: tasks and datasets, baselines, pre-trained Chinese models, corpus, and leaderboard

Pinned

  1. CLUE Public

    Chinese Language Understanding Evaluation Benchmark: datasets, baselines, pre-trained models, corpus, and leaderboard

    Python · 4.1k stars · 545 forks

  2. SuperCLUE Public

    SuperCLUE: A Comprehensive Benchmark for General-Purpose Foundation Models in Chinese

    3.2k stars · 107 forks

  3. SuperCLUE-Safety Public

    SC-Safety: A Multi-Round Adversarial Safety Benchmark for Chinese Large Language Models

    135 stars · 10 forks

  4. SuperCLUE-Auto Public

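    An evaluation benchmark for Chinese large language models in the automotive industry, offering fine-grained assessment based on multi-turn, open-ended questions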

    33 stars · 3 forks

  5. SuperCLUE-Agent Public

    SuperCLUE-Agent: A Benchmark for Evaluating Core Agent Capabilities on Native Chinese Tasks

    88 stars · 4 forks

  6. SuperCLUE-RAG Public

    A native-Chinese benchmark for evaluating retrieval-augmented generation (RAG)

    117 stars · 3 forks

Repositories

Showing 10 of 51 repositories
