Skip to content

DeepSeek V3.1 announcement & Update benchmark tooling script#2817

Merged
copybara-service[bot] merged 1 commit intomainfrom
rbierneni-mmlu
Dec 16, 2025
Merged

DeepSeek V3.1 announcement & Update benchmark tooling script#2817
copybara-service[bot] merged 1 commit intomainfrom
rbierneni-mmlu

Conversation

@Rohan-Bierneni
Copy link
Copy Markdown
Collaborator

@Rohan-Bierneni Rohan-Bierneni commented Dec 11, 2025

Description

DeepSeek V3.1 has been verified in maxtext and this pr is to update MaxText to make the announcement. The files that were updated in the homepage Readme and the DeepSeek Readme. The steps taken to verify the V3.1 checkpoint in MaxText and command run are documented here: b/450580441.

Also, when running MMLU benchmark for DeepSeek V3.1, there was a small bug in a script for Maxtext api_server, the tool used to run benchmarks efficiently. The issue is when running multi-host via xpk, the script port_forward_xpk.sh fails with the message "PODS: unbound variable". This pr has the fix with the right filtering method to find the correct pod when running the xpk workload.

If the change fixes a bug or a Github issue, please include a link, e.g.,:
FIXES: b/450580441

Tests

The steps taken to verify the V3.1 checkpoint in MaxText and command run are documented here.

Checklist

Before submitting this PR, please make sure (put X in square brackets):

  • I have performed a self-review of my code. For an optional AI review, add the gemini-review label.
  • I have necessary comments in my code, particularly in hard-to-understand areas.
  • I have run end-to-end tests tests and provided workload links above if applicable.
  • I have made or will make corresponding changes to the doc if needed, including adding new documentation pages to the relevant Table of Contents (toctree directive) as explained in our documentation.

Copy link
Copy Markdown
Collaborator

@RissyRan RissyRan left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks! A few minor comments.

Comment thread README.md
Comment thread end_to_end/tpu/deepseek/Run_DeepSeek.md Outdated
Comment thread end_to_end/tpu/deepseek/Run_DeepSeek.md Outdated
Comment thread end_to_end/tpu/deepseek/Run_DeepSeek.md Outdated
Comment thread end_to_end/tpu/deepseek/Run_DeepSeek.md Outdated
Comment thread end_to_end/tpu/deepseek/Run_DeepSeek.md Outdated
Comment thread end_to_end/tpu/deepseek/Run_DeepSeek.md Outdated
Comment thread README.md Outdated
Updated Readme with Deepseek V3.1 news

Update Deepseek Readme with V3.1 info

Updated port forward script

Clean up script

Resolve comments

Update with R1 model in deepseek readme

update doc since fp8 is now supported
@Rohan-Bierneni
Copy link
Copy Markdown
Collaborator Author

Manually adding pull_review tag since all tests have passed

@copybara-service copybara-service Bot merged commit 84f67ec into main Dec 16, 2025
101 of 134 checks passed
@copybara-service copybara-service Bot deleted the rbierneni-mmlu branch December 16, 2025 20:00
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants