
Conversation


@lsy323 lsy323 commented May 17, 2024

Make the flag config cleaner

  • Move common configs to jetstream_pt/config.py
  • Remove the unused create_config in config.py

fixes #83

@lsy323 lsy323 requested review from FanhaiLu1, bhavya01, qihqi and wang2yn84 and removed request for FanhaiLu1, bhavya01 and qihqi May 17, 2024 23:46
@lsy323 lsy323 force-pushed the lsiyuan/refactor-flags branch from 10e1c71 to 75d7fc3 on May 17, 2024 23:51

def define_common_flags():
  """Add common config flags to global FLAG."""
  flags.DEFINE_string(
Collaborator

Just define those at the top level, and then whoever imports it will have those flags. see: https://source.corp.google.com/search?q=flags.DEFINE_string

Collaborator Author

Ok, done. The intention was to let scripts optionally import all common flags.
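
A minimal sketch of the module-level pattern being suggested; the flag names, defaults, and help strings below are illustrative, not the actual contents of jetstream_pt/config.py:

# jetstream_pt/config.py (illustrative sketch)
from absl import flags

FLAGS = flags.FLAGS

# Defined at import time at module scope; any script that imports this
# module picks the flags up automatically, without calling a helper.
flags.DEFINE_string("tokenizer_path", None, "Path to the tokenizer model.")
flags.DEFINE_string("sharding_config", None, "Path to the sharding config file.")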

run_server.py Outdated
devices = server_lib.get_devices()
print(f"devices: {devices}")
sharding_config_path = _SHARDING_CONFIG.value
engine = jetstream_pt.create_pytorch_engine(
Collaborator

why not use create_engine_from_flags here?

Collaborator Author

Missed this one, done
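
For reference, a rough sketch of the swap being requested; the helper's module path and signature here are assumptions for illustration only:

# run_server.py (illustrative sketch)
from jetstream_pt import config  # assumed location of the flag-based helper

# Instead of wiring each flag value into jetstream_pt.create_pytorch_engine
# by hand, one helper reads the common flags and builds the engine.
engine = config.create_engine_from_flags()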

run_server.py Outdated
)
server_config = ServerConfig(
-    interleaved_slices=(_PLATFORM.value,),
+    interleaved_slices=(FLAGS.platform,),
Collaborator

let's get rid of this:

let's do f"tpu={len(jax.devices())}" here.

Collaborator Author

Done
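
A sketch of what the suggestion amounts to; the ServerConfig import path and the assumption that only this field is set here are illustrative:

# run_server.py (illustrative sketch)
import jax
from jetstream.core.config_lib import ServerConfig  # assumed import path

server_config = ServerConfig(
    # Derive the slice spec from the visible devices rather than from a
    # platform flag, e.g. "tpu=8" on a host with 8 TPU chips.
    interleaved_slices=(f"tpu={len(jax.devices())}",),
)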

define_common_flags()
define_profiling_flags()

_PORT = flags.DEFINE_integer("port", 9000, "port to listen on")
Collaborator

we should leave the thread / port flags in this file instead of in the config file.

Collaborator Author

@lsy323 lsy323 May 18, 2024

Those 2 are still defined in this file, on lines 32 and 33; I just used a different way to define them so the module-level globals can be avoided. The flag values are accessible via FLAGS.port, and FLAGS is an existing global from absl.flags.
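
A small sketch of that pattern with absl.flags; the "threads" flag name and both default values are illustrative:

# illustrative sketch
from absl import app, flags

FLAGS = flags.FLAGS

# Define the flags without keeping the returned holders in module-level
# globals like _PORT; values are read back through FLAGS instead.
flags.DEFINE_integer("port", 9000, "port to listen on")
flags.DEFINE_integer("threads", 64, "number of worker threads")

def main(argv):
  del argv
  print(f"serving on port {FLAGS.port} with {FLAGS.threads} threads")

if __name__ == "__main__":
  app.run(main)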

@lsy323 lsy323 requested a review from qihqi May 18, 2024 00:17
Collaborator

@FanhaiLu1 FanhaiLu1 left a comment

Thanks for refactoring the flags!

@FanhaiLu1 FanhaiLu1 merged commit 0fe239b into AI-Hypercomputer:main May 20, 2024
@lsy323 lsy323 deleted the lsiyuan/refactor-flags branch May 20, 2024 17:39
wang2yn84 pushed a commit that referenced this pull request May 21, 2024
* refactor flags

* clean up:

* fix run_server

* move common flags to global

* format

* update

* update readme

* update run_interactive
wang2yn84 pushed a commit that referenced this pull request May 23, 2024
* refactor flags

* clean up:

* fix run_server

* move common flags to global

* format

* update

* update readme

* update run_interactive
wang2yn84 added a commit that referenced this pull request May 23, 2024
* Stable version of ragged attention.

* Converts the attention output types the same as q.

* Fixes the typo for the ragged attention.

* Provides the default value for partition_by_axis.

* Provides mesh to the shard_map.

* Fixes typo.

* Fixes typo, should be start instead of start_pos.

* Should use "//" instead of "/" to get int results.

* Use block size // 2 as the starting current position for better initial performance. Fix the typo that should use jax.lax.div instead of jnp.div

* Updates the run_interactive script to use the correct result token processing API from JetStream.

* Fix typo, should use token_utils.process_result_token.

* Fix typo.

* Fixes the sampled tokens list.

* Use text_tokens_to_str to convert the output tokens.

* Reshape the precomputed grid indices to 1D. Removes
dense_attention_quantized and uses an option to control
whether it's quantized or not. Use the new torch_xla2 API.

* Should check if X is None instead of if X

* Fix the dense_attention not returning data.

* Reshape the kv scaler to 3 dim for ragged attention.

* Cannot stop the input_pos counter from increasing since we are using a ring buffer. Will cause error.

* Adds starting_position and profiling_prefill for better testing and benchmarking.

* Move flags in scripts to a common function (#92)

* refactor flags

* clean up:

* fix run_server

* move common flags to global

* format

* update

* update readme

* update run_interactive

* Stable version of ragged attention.

* Fix the merge conflicts

* Fixes the missing pieces after merging conflicts. Adds couple of new flags for debugging and performance tuning.

* Integrates ragged attention to Gemma too.

* Somehow have some local changes to run_interactive, reverting them to align with main.

* Set the default value for the newly added parameters.

* Adds more descriptions to the ragged attention index precomputation function.

* Merges the quantized ragged attention kernel with the non quantized version.

* Moves the attention calculation to attention.py for better code structure.

* Fix run issues refactoring.

* Fix the quantized version for ragged attention.

* Fix test_attention by adding default value for the newly added arguments. The error message is missing positional arguments.

* Fixes unit tests, changes the Transformer model call argument order (input_pos) back to the original to avoid unnecessary issues.

* Format attention_kernel.py

* Add descriptions to ragged attention outputs.

* Fix quantization tests by adding default value to quantization kernel class.

* Reformat attention_kernel.py. Formatting with black doesn't comply with the pylint rules.

* Ignores R0913: Too many arguments lint error for ragged attention kernel. Fix other lint errors.

* Ignore R0903: Too few public methods. Fix lint errors.

* Fix the rest of the lint errors.

---------

Co-authored-by: Siyuan Liu <lsiyuan@google.com>
