Upgrade TFX stack to 1.21.x and remove numpy/pandas bounds #39163

Open
shunping wants to merge 2 commits into master from resolution-too-deep

shunping commented Jun 30, 2026

Collaborator
  • Upgraded tensorflow_transform, tensorflow-metadata, and tfx-bsl to >=1.21.0,<1.22.0 in mltransform_generate_vocab_requirements.txt, and removed the numpy<2 and pandas<2 upper bounds.
  • Appended the unique ${{env.NOW_UTC}} timestamp to both --artifact_location and --output_vocab arguments in the "run MLTransform Generate Vocab Batch workflow" step to prevent concurrent runs from failing due to folder overwrite restrictions in GCS.
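
As a rough illustration of the second bullet, the sketch below shows why a per-run timestamp suffix keeps concurrent runs from writing to the same GCS folder. The bucket paths, the helper name `unique_run_args`, the `-` separator, the `--flag=value` form, and the timestamp format are all assumptions for illustration; the actual workflow takes the value from `${{env.NOW_UTC}}` in the GitHub Actions step.

```python
from datetime import datetime, timezone


def unique_run_args(artifact_location: str, output_vocab: str, now_utc: str) -> list[str]:
    """Return CLI arguments whose GCS paths carry a per-run timestamp suffix."""
    return [
        f"--artifact_location={artifact_location}-{now_utc}",
        f"--output_vocab={output_vocab}-{now_utc}",
    ]


# Hypothetical base paths; the real values live in the workflow file.
ARTIFACTS = "gs://example-bucket/mltransform/artifacts"
VOCAB = "gs://example-bucket/mltransform/vocab"

# Assumed timestamp format; NOW_UTC in the workflow may be formatted differently.
stamp = datetime.now(timezone.utc).strftime("%m%d%H%M%S")
print(unique_run_args(ARTIFACTS, VOCAB, stamp))
```

Two runs that start at different times get different suffixes, so neither run targets a folder the other has already created.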

See #38782 (comment) for more details.

fixes #38782

@gemini-code-assist

Contributor

Summary of Changes

Hello, I'm Gemini Code Assist! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!

This pull request updates the TFX stack dependencies to version 1.21.x in the MLTransform example requirements and removes the upper version bounds on numpy and pandas to resolve potential compatibility issues.

Highlights

  • TFX Stack Upgrade: Upgraded tensorflow_transform, tensorflow-metadata, and tfx-bsl dependencies from version 1.14.x to 1.21.x.
  • Dependency Constraint Removal: Removed the numpy<2 and pandas<2 upper bounds to improve dependency flexibility.
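
To make the new pin concrete, here is a minimal sketch of which versions fall inside the `>=1.21.0,<1.22.0` range named in the PR description. It is a simplified check on plain dotted release numbers, not pip's actual resolver logic (which follows PEP 440 and also handles pre-releases); the helper names `parse_release` and `in_tfx_range` are hypothetical, and the sample versions are illustrative inputs rather than a list of real releases.

```python
def parse_release(version: str) -> tuple[int, ...]:
    """Parse a plain dotted release such as '1.21.0' into a tuple of ints."""
    parts = tuple(int(part) for part in version.split("."))
    # Pad short versions like '1.21' with zeros so they compare as '1.21.0'.
    return parts + (0,) * (3 - len(parts))


def in_tfx_range(version: str) -> bool:
    """True if version satisfies >=1.21.0,<1.22.0 (plain releases only)."""
    return (1, 21, 0) <= parse_release(version) < (1, 22, 0)


# Illustrative inputs only.
for candidate in ("1.14.0", "1.21.0", "1.21.3", "1.22.0"):
    print(candidate, in_tfx_range(candidate))
```

The upper bound is exclusive, so any 1.21.x patch release is accepted while 1.22.0 is rejected.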

@gemini-code-assist gemini-code-assist Bot left a comment

Contributor

Code Review

This pull request updates the dependency requirements in mltransform_generate_vocab_requirements.txt by upgrading tensorflow_transform, tensorflow-metadata, and tfx-bsl to the >=1.21.0,<1.22.0 range, and removing the constraints on numpy and pandas. There are no review comments, and I have no additional feedback to provide.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

@github-actions

Contributor

Assigning reviewers:

R: @damccorm for label python.

Note: If you would like to opt out of this review, comment assign to next reviewer.

Available commands:

  • stop reviewer notifications - opt out of the automated review tooling
  • remind me after tests pass - tag the comment author after tests pass
  • waiting on author - shift the attention set back to the author (any comment or push by the author will return the attention set to the reviewers)

The PR bot will only process comments in the main thread (not review comments).

@Amar3tto

Collaborator

Can we also add this?

@github-actions github-actions Bot added build and removed build labels Jun 30, 2026

shunping commented Jun 30, 2026

Collaborator Author

Rerunning the failed workflow: https://github.com/apache/beam/actions/runs/28448359077

Development

Successfully merging this pull request may close these issues.

The Inference Python Benchmarks Dataflow job is flaky
