[Profiling] Model and device specific folders for profiled data, documentation for profiling #7

Merged (26 commits) on May 14, 2024


@AgrawalAmey (Collaborator) commented May 10, 2024

Description

This PR restructures the profiling data so that adding a new model or device follows a Bring Your Own File model. The new structure is described here. Several bugs have also been fixed, most notably the hard-coded number of blocks in attention profiling, which has been removed.
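As a rough illustration of what a Bring Your Own File layout makes possible, here is a minimal sketch of resolving profiled data by device and model. The root directory, the `<device>/<model>/<kind>.csv` nesting, and the helper names `profiling_file` and `missing_profiles` are all assumptions made for this example; the actual structure is the one in the documentation linked above.

```python
from pathlib import Path

# Hypothetical root directory, not taken from this PR.
PROFILING_ROOT = Path("data/profiling")


def profiling_file(device: str, model: str, kind: str,
                   root: Path = PROFILING_ROOT) -> Path:
    """Build the path where a profiled CSV for (device, model) would live.

    Assumes a <root>/<device>/<model>/<kind>.csv layout for illustration only.
    """
    return root / device / model / f"{kind}.csv"


def missing_profiles(device: str, model: str, kinds: list[str],
                     root: Path = PROFILING_ROOT) -> list[str]:
    """Return the profile kinds that have no file yet for this device/model."""
    return [k for k in kinds
            if not profiling_file(device, model, k, root).is_file()]
```

With a layout like this, supporting a new model or device would mean dropping the profiled files into the matching folder, and a check such as `missing_profiles("h100", "example-model", ["mlp", "attention"])` would report which files still need to be supplied.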

Test Plan

All of the profiling data is new, so generating it was a real test 🧪 of running every profiling script.
Also verified fidelity for the 34b model on the arxiv trace with both the vllm and sarathi schedulers.

Next steps

  1. Add mlp profiling data for H100.
  2. Add cpu_overhead profiling data for all models.

@nitinkedia7 self-requested a review May 12, 2024 06:33
@nitinkedia7 changed the title from "Update readme" to "Model and device specific folders for profiled data, documentation for profiling" May 14, 2024
@nitinkedia7 changed the title from "Model and device specific folders for profiled data, documentation for profiling" to "[Profiling] Model and device specific folders for profiled data, documentation for profiling" May 14, 2024
@nitinkedia7 merged commit bee57da into main May 14, 2024
1 of 2 checks passed