Unified Framework for Large-Scale Auto-Distributed Training and Inference | Memory-Compute-Control Decoupled Architecture | Multi-Language SDK & Heterogeneous Hardware Support
high-performance-computing deep-learning-framework model-serving distributed-training heterogeneous-computing model-inference cuda-acceleration simd-optimization
Updated Mar 20, 2025 - C++