Skip to content

set_double_shard_weights_config(...) now supports a seq_axis_names arg.#236

Merged
ruomingp merged 1 commit intoapple:mainfrom
tgunter:double_shard_supports_seq
Dec 10, 2023
Merged

set_double_shard_weights_config(...) now supports a seq_axis_names arg.#236
ruomingp merged 1 commit intoapple:mainfrom
tgunter:double_shard_supports_seq

Conversation

@tgunter
Copy link
Copy Markdown
Contributor

@tgunter tgunter commented Dec 8, 2023

When provided, this will force linear layer activations to be sharded over the named axes.

@tgunter tgunter requested review from markblee and ruomingp December 8, 2023 20:53
@ruomingp ruomingp added this pull request to the merge queue Dec 10, 2023
Merged via the queue into apple:main with commit 77165cf Dec 10, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants