Skip to content
@korovod

korovod

HPC for ML

Pinned Loading

  1. nanotron Public

    Forked from huggingface/nanotron

    Experimental fork of Nanotron, a minimalistic large language model 3D-parallelism training

    Python 1

  2. datastates Public

    Forked from DataStates/datastates-llm

    Efficient asynchronous checkpointing using CUDA copy engines

    C++

Repositories

Showing 7 of 7 repositories
  • nanotron Public Forked from huggingface/nanotron

    Experimental fork of Nanotron, a minimalistic large language model 3D-parallelism training

    Python 1 Apache-2.0 168 0 0 Updated Mar 23, 2025
  • korovod-spack-packages Public

    Spack packages to use korovod libraries

    Python 0 0 0 0 Updated Mar 23, 2025
  • datastates Public Forked from DataStates/datastates-llm

    Efficient asynchronous checkpointing using CUDA copy engines

    C++ 0 MIT 3 0 0 Updated Mar 22, 2025
  • DeepEP Public Forked from deepseek-ai/DeepEP

    DeepEP: an efficient expert-parallel communication library

    Cuda 0 MIT 676 0 0 Updated Mar 18, 2025
  • async-optimizer-states Public

    Asynchronously move optimizer states to NVMe storage

    C++ 0 0 0 0 Updated Mar 16, 2025
  • neomem Public Forked from thomas-bouvier/neomem

    Torch rehearsal backend to mitigate catastrophic forgetting with a focus on performance, written in C++

    C++ 0 1 0 0 Updated Jan 14, 2025
  • khorovod Public Forked from horovod/horovod

    Distributed training framework for TensorFlow, Keras, and PyTorch. Experimental fork of Horovod.

    Python 0 2,283 0 0 Updated May 16, 2024

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…