LLaMA-Factory FP8 training environment for NVIDIA Hopper GPUs. Fixes common configuration issues causing 2x slowdown with FP8 mixed precision.
deep-learning pytorch nvidia hopper performance-optimization fp8 h100 llama-factory gh200 transformer-engine
-
Updated
Nov 16, 2025 - Python