Skip to content

Prevent double serialization inside Flask server#3653

Merged
tdene merged 19 commits intoNVIDIA:mainfrom
tdene:tde/fix_double_serialize
Mar 12, 2026
Merged

Prevent double serialization inside Flask server#3653
tdene merged 19 commits intoNVIDIA:mainfrom
tdene:tde/fix_double_serialize

Conversation

@tdene
Copy link
Contributor

@tdene tdene commented Mar 2, 2026

What does this PR do ?

⚠️ For major changes (either in lines of code or in its impact), please make sure to first share a design doc with the team. If you're unsure what's the best way to do so, contact the @mcore-oncall.

Contribution process

flowchart LR
    A[Pre-checks] --> B[PR Tests]
    subgraph Code Review/Approval
        C1[Expert Review] --> C2[Final Review]
    end
    B --> C1
    C2 --> D[Merge]
Loading

Pre-checks

  • I want this PR in a versioned release and have added the appropriate Milestone (e.g., Core 0.8)
  • I have added relevant unit tests
  • I have added relevant functional tests
  • I have added proper typing to my code Typing guidelines
  • I have added relevant documentation
  • I have run the autoformatter.sh on my PR

Code review

The following process is enforced via the CODEOWNERS file for changes into megatron/core. For changes outside of megatron/core, it is up to the PR author whether or not to tag the Final Reviewer team.

For MRs into `main` branch

Feel free to message or comment the @mcore-oncall to help accelerate your merge into main. The less complex your PR is, the faster it will be approved and merged!

(Step 1): Add PR label Expert Review

(Step 2): Collect the expert reviewers reviews

  1. Attach the Expert Review label when your PR is ready for review.
  2. GitHub auto-assigns expert reviewers based on your changes. They will get notified and pick up your PR soon.

⚠️ Only proceed to the next step once all reviewers have approved, merge-conflict are resolved and the CI is passing.
Final Review might get declined if these requirements are not fulfilled.

(Step 3): Final Review

  1. Add Final Review label
  2. GitHub auto-assigns final reviewers based on your changes. They will get notified and pick up your PR soon.

(Optional Step 4): Cherry-pick into release branch

If this PR also needs to be merged into core_r* release branches, after this PR has been merged, select Cherry-pick to open a new PR into the release branch.

For MRs into `dev` branch The proposed review process for `dev` branch is under active discussion.

MRs are mergable after one approval by either eharper@nvidia.com or zijiey@nvidia.com.

Merging your PR

Any member of core-adlr and core-nemo will be able to merge your PR.

@copy-pr-bot
Copy link

copy-pr-bot bot commented Mar 2, 2026

Auto-sync is disabled for draft pull requests in this repository. Workflows must be run manually.

Contributors can view more details about this message here.

@tdene tdene force-pushed the tde/fix_double_serialize branch from 97dd320 to beac7eb Compare March 2, 2026 11:18
@tdene tdene force-pushed the tde/fix_double_serialize branch from beac7eb to 52e3d10 Compare March 2, 2026 12:32
@tdene tdene marked this pull request as ready for review March 2, 2026 18:28
@tdene tdene requested review from a team as code owners March 2, 2026 18:28
@svcnvidia-nemo-ci svcnvidia-nemo-ci requested a review from a team March 2, 2026 18:29
@svcnvidia-nemo-ci svcnvidia-nemo-ci added this to the Core 0.16 milestone Mar 2, 2026
@tdene tdene added the Expert Review [deprecated] Apply this label to indicate that your PR is ready for expert review. label Mar 2, 2026
@tdene tdene force-pushed the tde/fix_double_serialize branch from 23160db to f3077ee Compare March 6, 2026 16:16
@svcnvidia-nemo-ci svcnvidia-nemo-ci added the Final Review PR is in the "final review" stage label Mar 6, 2026
)
# Unwrap ("tensor", [...]) tuples from serialize() into plain lists.
result = {
k: v[1] if isinstance(v, (list, tuple)) and len(v) == 2 and v[0] == "tensor" else v
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This conversion seems like it's potentially error-prone, can we move this to a small helper function like _unwrap_completed_request so that it will be a bit easier to debug if necessary?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Addressed!

Comment on lines +138 to +140
prompt_tokens = (
torch.arange(len(prompt.split())) if isinstance(prompt, str) else torch.tensor(prompt)
)
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can you add a comment that you're doing this to mock tokenization? Looked wrong to me at first glance but then I released this was a test file

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Addressed!

@tdene
Copy link
Contributor Author

tdene commented Mar 11, 2026

/claude review

Copy link
Contributor

@claude claude bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@svcnvidia-nemo-ci svcnvidia-nemo-ci added Approved All necessary approvals have been made and removed Final Review PR is in the "final review" stage labels Mar 11, 2026
@tdene tdene enabled auto-merge March 12, 2026 14:22
@tdene tdene added this pull request to the merge queue Mar 12, 2026
@svcnvidia-nemo-ci
Copy link

🔄 Merge queue validation started!

You can track the progress here: https://github.com/NVIDIA/Megatron-LM/actions/runs/23010112598

Merged via the queue into NVIDIA:main with commit e08dc9d Mar 12, 2026
52 of 53 checks passed
@tdene tdene deleted the tde/fix_double_serialize branch March 12, 2026 16:17
chtruong814 added a commit to chtruong814/Megatron-LM that referenced this pull request Mar 13, 2026
This reverts commit e08dc9d.

Signed-off-by: Charlie Truong <chtruong@nvidia.com>
chtruong814 added a commit to chtruong814/Megatron-LM that referenced this pull request Mar 13, 2026
…DIA#3653)""

This reverts commit 37280fa.

Signed-off-by: Charlie Truong <chtruong@nvidia.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Approved All necessary approvals have been made Expert Review [deprecated] Apply this label to indicate that your PR is ready for expert review.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

6 participants