    Repositories list

    • kenotron

      Public
      Experimental fork of Nanotron: minimalistic large language model 4D-parallelism training
      Python
      Apache License 2.0
      0201 Updated Dec 5, 2025
    • Spack packages to use korovod libraries
      Python
      0000 Updated Dec 5, 2025
    • DeepEP

      Public
      DeepEP: an efficient expert-parallel communication library
      Cuda
      MIT License
      1.1k000 Updated Dec 4, 2025
    • Efficient asynchronous checkpointing using CUDA copy engines
      C++
      MIT License
      6000 Updated Jul 18, 2025
    • nanotron

      Public archive
      Minimalistic large language model 3D-parallelism training
      Python
      Apache License 2.0
      288000 Updated Apr 18, 2025
    • Asynchronously move optimizer states to NVMe storage
      C++
      0000 Updated Mar 16, 2025
    • neomem

      Public
      Torch rehearsal backend to mitigate catastrophic forgetting with a focus on performance, written in C++
      C++
      1000 Updated Jan 14, 2025
    • khorovod

      Public
      Distributed training framework for TensorFlow, Keras, and PyTorch. Experimental fork of Horovod.
      Python
      Other
      2.3k000 Updated May 16, 2024