Combining Manual Pipeline Parallelism & Automatic SPMD Parallelism #46

zhuohan123 · 2021-06-30T08:44:48Z

No description provided.

merrymercy · 2021-07-08T23:43:21Z

tests/test_auto_sharding_attention.py

@@ -94,7 +94,7 @@ def loss_func(params):

        hidden_states = jnp.ones((batch_size, seq_len, hidden_size), dtype=jnp.float32)
        attention_mask = jnp.ones((batch_size, seq_len), dtype=jnp.int32)
-        label = jnp.ones((batch_size, seq_len, hidden_size), dtype=jnp.float32)
+        label = jnp.ones((batch_size, seq_len, hidden_size), dtype=jnp.float32) * 23.0 * np.arange(hidden_size)[None, None, :]


What's this?
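
For reference, the new label expression relies on broadcasting: `np.arange(hidden_size)[None, None, :]` has shape `(1, 1, hidden_size)`, so element `[b, s, h]` of the product is `23.0 * h`, and the label now varies along the hidden dimension instead of being constant. The PR does not state why this was done. The sketch below uses plain NumPy in place of `jnp` and small made-up sizes, purely to show the resulting values.

```python
import numpy as np

# Illustrative sizes only (assumption); the test in the PR uses its own values.
batch_size, seq_len, hidden_size = 2, 3, 4

ones = np.ones((batch_size, seq_len, hidden_size), dtype=np.float32)

# Shape (1, 1, hidden_size): broadcasts over the batch and sequence axes.
scale = 23.0 * np.arange(hidden_size)[None, None, :]

# Same arithmetic as the changed line in the diff, with np instead of jnp.
label = ones * scale

print(label.shape)   # shape of the broadcast result
print(label[0, 0])   # one row along the hidden dimension
```

Every `(batch, seq)` position receives the same row, so the label is constant across batch and sequence positions but not across hidden units.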

merrymercy · 2021-07-08T23:53:49Z

parax/pipeline_custom_call/README.md

@@ -0,0 +1,11 @@
+# XLA Pipeline Marker Custom Call


Adding a custom call is not as simple as I thought. How can we simplify this? For example, would moving the code into tensorflow-parax simplify the compilation process?

merrymercy · 2021-07-08T23:55:37Z

parax/device_mesh.py


        xla_computation = xla_client.XlaComputation(hlo_proto)
        num_devices = np.prod(strategy_config.logical_mesh_shape)
        assert num_devices == len(self.backend.devices())

-        compiled = compile_with_given_strategy(
+        compiled = compile_without_auto_sharding(


This is not always correct. When we are not using 3D parallelism, we pass an unoptimized HLO proto, and in that case we need to call compile_with_given_strategy. I can fix this for you later.
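
One way to read the comment above is that the call site should choose between the two compile functions depending on whether the incoming HLO has already been optimized. The sketch below is only an illustration of that dispatch: both compile functions are hypothetical stand-ins that return a tag string (the real signatures in `parax` are not shown in this PR), and the `hlo_is_optimized` flag is an assumed name.

```python
from typing import Callable


def compile_with_given_strategy(hlo: str) -> str:
    """Hypothetical stand-in for the real function; only tags its input."""
    return f"compile_with_given_strategy({hlo})"


def compile_without_auto_sharding(hlo: str) -> str:
    """Hypothetical stand-in for the real function; only tags its input."""
    return f"compile_without_auto_sharding({hlo})"


def select_compile_fn(hlo_is_optimized: bool) -> Callable[[str], str]:
    """Pick a compile path based on the state of the incoming HLO.

    Assumption taken from the review comment: an unoptimized HLO proto
    (the case without 3D parallelism) must go through
    compile_with_given_strategy.
    """
    if hlo_is_optimized:
        return compile_without_auto_sharding
    return compile_with_given_strategy


# Usage: the case without 3D parallelism passes an unoptimized proto.
compile_fn = select_compile_fn(hlo_is_optimized=False)
result = compile_fn("hlo_proto")
print(result)
```

How the real code would detect an optimized proto is left open here; the flag could equally be a property of the strategy config or of the caller.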

zhuohan123 and others added 30 commits May 17, 2021 16:23

add a simple test graph

4bd6e5d

add prints

572d259

copy everything from swap

5cca331

test call pipeline_marker

a5301ce

modify kernel.h

cbf995f

fix small bug

7dfc0b0

add pipeline marker python fun

25f7a64

fix input proto

0149b6b

add flattened_shape_byte_sizes

1877e31

fix flattened_shape_byte_sizes

8ef0d47

fix flattened_shape_byte_sizes

4ff6e2a

make flattened_shape_byte_sizes return np array

d783c6e

check sizes

88dcddb

make pipelinemarker do copy

05a4169

fix bug

4dcc4db

add readme in playground

90a02fa

Merge branch 'master' into pipeline-xla-marker

3de3433

add xla pipeline marker

33974ad

use xla_marker for jax primitive

40282d9

fix bugs

bd0a8cf

merge multiple jax stages to one big jaxpr

aa90b57

add compile to xla

73ab277

fix small bugs

f906e80

fix bugs & add prints

36a276a

mark global and local inputs

da23c07

fix bug

7f36c30

fix namedtuple

9d71384

fix literal

e3ddcd6

fix bugs

acea1f5

fix gensym

c74fc8e

zhuohan123 and others added 25 commits June 13, 2021 23:09

Merge branch 'master' into pipeline-xla-marker

8691b86

enable tests

d016934

add a python hook to get hlo module

ff1e1da

print get_last_auto_sharded_hlo_module

082e835

add stage compilation option

1a7f5d2

fix stage compilation option

2996161

build xla computation for stages

0d92915

remove redundant prints

3641bf6

[WIP] use physical device mesh to run auto sharded stages

a68584f

fix some bugs

2f56e9c

fix donated_invars

785900b

fix _physical_meshes

f0f155e

remove literal in jaxpr

830031d

make invars same order as pipeline markers

af6e795

fix dropvar

503851c

fix dropvar

f464ab7

add a temp bert test

b92e428

fix several tests

d3a11c6

Merge branch 'master' into pipeline-xla-marker

7908ef3

fix bug

a4f4b79

Merge branch 'master' into pipeline-xla-marker

b195fb6

Fix bugs for merge

0dc30f3

fix 3d attention tests

bda4be5

fix 3d attention tests

fda0ad2

fix readme

878f768

zhuohan123 merged commit f2be873 into master Jul 8, 2021

merrymercy mentioned this pull request Jul 8, 2021

Split and Merge Hlo #27

Closed

merrymercy reviewed Jul 8, 2021

merrymercy mentioned this pull request Jul 10, 2021

Support embedding & Test auto-sharding on the whole BERT model & Refine auto-sharding interface #49

Merged

merrymercy deleted the pipeline-xla-marker branch August 22, 2021 09:12
