Skip to content
View shjwudp's full-sized avatar

Organizations

@BaguaSys

Block or report shjwudp

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. c4-dataset-script Public

    Inspired by google c4, here is a series of colossal clean data cleaning scripts focused on CommonCrawl data processing. Including Chinese data processing and cleaning methods in MassiveText.

    Python 122 14

  2. megabyte Public

    A PyTorch implementation of MEGABYTE. This multi-scale transformer architecture has the excellent features of tokenization-free and sub-quadratic attention. The paper link: https://arxiv.org/abs/23…

    Python 5 4

  3. BaguaSys/bagua Public

    Bagua Speeds up PyTorch

    Python 878 81

  4. BaguaSys/bagua-net Public archive

    High performance NCCL plugin for Bagua.

    Rust 15 4

  5. shu Public

    中文书籍收录整理, Collection of Chinese Books

    Python 182 36

  6. blueprint-trainer Public

    Scaffolding for sequence model training research.

    Python 1