Skip to content
@PKU-DAIR

DAIR Lab

Data and Intelligence Research (DAIR) Lab @ Peking University

Pinned Loading

  1. Hetu Hetu Public

    Forked from Hsword/Hetu

    A high-performance distributed deep learning system targeting large-scale and automated distributed training.

    Python 310 34

  2. open-box open-box Public

    Forked from thomas-young-2013/open-box

    Generalized and Efficient Blackbox Optimization System

    Python 413 55

  3. Hetu-Galvatron Hetu-Galvatron Public

    Forked from AFDWang/Hetu-Galvatron

    Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

    Python 159 10

  4. SGL SGL Public

    A scalable graph learning toolkit for extremely large graph datasets. (WWW'22, 🏆 Best Student Paper Award)

    Python 153 22

  5. mindware mindware Public

    Forked from thomas-young-2013/mindware

    An efficient open-source AutoML system for automating machine learning lifecycle, including feature engineering, neural architecture search, and hyper-parameter tuning.

    Python 56 9

  6. Starter-Guide Starter-Guide Public

    A comprehensive guide for beginners in the field of data management and artificial intelligence.

    328 12

Repositories

Showing 10 of 41 repositories
  • Hetu-Galvatron Public Forked from AFDWang/Hetu-Galvatron

    Galvatron is an automatic distributed training system designed for Transformer models, including Large Language Models (LLMs).

    PKU-DAIR/Hetu-Galvatron’s past year of commit activity
    Python 159 Apache-2.0 13 1 0 Updated Jul 2, 2025
  • DataFlow Public Forked from OpenDCAI/DataFlow

    Easy Data Preparation with latest LLMs-based Operators and Pipelines.

    PKU-DAIR/DataFlow’s past year of commit activity
    Python 2 Apache-2.0 25 0 0 Updated Jun 29, 2025
  • PKU-DAIR/DAIR_Portal_FE’s past year of commit activity
    Vue 1 0 1 0 Updated Jun 19, 2025
  • SAS-Bench Public

    Benchmarking large language models for short answer grading in a fine-grained, multi-subject, and human-aligned setting.

    PKU-DAIR/SAS-Bench’s past year of commit activity
    Python 67 Apache-2.0 3 0 0 Updated May 15, 2025
  • Hetu Public Forked from Hsword/Hetu

    A high-performance distributed deep learning system targeting large-scale and automated distributed training.

    PKU-DAIR/Hetu’s past year of commit activity
    Python 310 Apache-2.0 53 0 0 Updated Apr 21, 2025
  • Starter-Guide Public

    A comprehensive guide for beginners in the field of data management and artificial intelligence.

    PKU-DAIR/Starter-Guide’s past year of commit activity
    328 12 1 (1 issue needs help) 0 Updated Apr 8, 2025
  • A-Tune-Online Public

    [ICDE 2025] A-Tune-Online: Efficient and QoS-aware Online Configuration Tuning for Dynamic Workloads.

    PKU-DAIR/A-Tune-Online’s past year of commit activity
    Python 1 1 0 0 Updated Mar 28, 2025
  • CAFE Public Forked from HugoZHL/CAFE

    [SIGMOD 2024] CAFE: Towards Compact, Adaptive, and Fast Embedding for Large-scale Recommendation Models

    PKU-DAIR/CAFE’s past year of commit activity
    Python 3 MIT 6 0 0 Updated Mar 19, 2025
  • FreeHGC Public Forked from GooLiang/FreeHGC
    PKU-DAIR/FreeHGC’s past year of commit activity
    Python 3 1 0 0 Updated Dec 6, 2024
  • PKU-DAIR/Noisy-LLM-Oracle’s past year of commit activity
    Python 0 1 0 0 Updated Nov 30, 2024