
[docs] training on specific hardware #44799

Open
stevhliu wants to merge 6 commits into huggingface:main from stevhliu:hardware

Conversation

@stevhliu
Member

updates the Hardware section of the docs for training:

  • combine CPU/Distributed CPU into a single doc
  • add more info to the Gaudi doc (mixed precision, torch.compile, distributed training)
  • add more info to the MPS doc (mixed precision, model loading + device selection)
  • remove the GPU doc since all that info is covered elsewhere now, making it redundant
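The MPS device-selection point above can be sketched in a few lines. This is an illustrative snippet, not text from the PR; it assumes PyTorch is installed and uses the common fall-back-to-CPU pattern:

```python
import torch

# Pick Apple's MPS backend when it's available, otherwise fall back to CPU.
# torch.backends.mps.is_available() returns False on non-Apple hardware,
# so this snippet runs anywhere.
device = torch.device("mps" if torch.backends.mps.is_available() else "cpu")
print(device.type)
```

A model would then be moved to the selected device with `model.to(device)` before training.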

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

@@ -15,23 +15,64 @@ rendered properly in your Markdown viewer.

# Intel Gaudi
Member Author


@regisss, would you mind taking a look here please? 🙏

Contributor


Thanks for this work @stevhliu, I just left one comment :)

@stevhliu stevhliu requested a review from SunMarc March 17, 2026 18:38
Member

@pcuenca pcuenca left a comment


Took a quick look at the mps section, it looks good. Happy to take a look at the rest if you need it @stevhliu!

@stevhliu
Member Author

thanks @pcuenca! happy to get your feedback on the rest if you don't mind/have the time!

Comment on lines -176 to -177
- local: perf_train_cpu_many
  title: Distributed CPUs
Member


Should we redirect to perf_train_cpu?

Member Author


I think it's better to merge the two CPU docs rather than redirect. The single perf_train_cpu doc is already quite thin, and perf_train_cpu_many doesn't really fit the other docs in the section, which are more focused on methods rather than hardware

Member


Yes, I agree! I'm talking about avoiding a 404 when users visit https://huggingface.co/docs/transformers/en/perf_train_cpu_many after it's gone.

So adding an entry here.
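For context, the Hugging Face doc builder supports a redirects file that maps old doc names to new ones; a hypothetical entry (the exact file name and key/value format should follow the repo's existing redirects file, which is assumed here) would look like:

```yaml
# hypothetical redirect entry: old doc name -> merged doc name
perf_train_cpu_many: perf_train_cpu
```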

Member Author


Ohhh yes, my bad, I misunderstood!

Refer to the [Gaudi docs](https://docs.habana.ai/en/latest/index.html) for more details.

## Mixed precision

All Gaudi generations support bf16 natively. Only Gaudi 2 and Gaudi 3 support fp16.
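The bf16 behavior discussed above can be exercised with `torch.autocast`. This is an illustrative sketch, not from the PR: it targets `device_type="cpu"` so it runs anywhere, whereas on Gaudi hardware the equivalent would target the HPU device through Habana's PyTorch bridge:

```python
import torch

# bf16 autocast sketch: matmuls inside the context run in bfloat16.
# On Gaudi, the device type would be "hpu" (via Habana's framework),
# which is not assumed to be available here.
x = torch.randn(4, 4)
with torch.autocast(device_type="cpu", dtype=torch.bfloat16):
    y = x @ x
print(y.dtype)  # torch.bfloat16
```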
Contributor


fp16 is not supported on any Gaudi generation 😁

Member Author


Ah thanks, I must've gotten confused here!

Member


lol is that a bug then?

Contributor


Good catch! I'm going to take a look at it and open a PR in Transformers :)
