Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

generate_geocoding_data: ensure reproducibility of the output code #2921

Closed
wants to merge 1 commit into from

Conversation

delroth
Copy link

@delroth delroth commented May 5, 2023

The output of the generate_geocoding_data tool is dependent on the filesystem iteration order, which can differ across systems / build invocations.

Explicitly sort returned directory entries returned by ListDirectory to prevent this problem, and document that this should be done for reproducibility.

@google-cla
Copy link

google-cla bot commented May 5, 2023

Thanks for your pull request! It looks like this may be your first contribution to a Google open source project. Before we can look at your pull request, you'll need to sign a Contributor License Agreement (CLA).

View this failed invocation of the CLA check for more information.

For the most up to date status, view the checks section at the bottom of the pull request.

874af79 introduced an accidental
regression by replacing std::map (sorted container) with
absl::btree_map. This makes the output of the generate_geocoding_data
tool dependent on the filesystem iteration order, which can differ
across system.

Explicitly sort returned directory entries returned by ListDirectory to
prevent this problem, and document that this should be done for
reproducibility.
@delroth
Copy link
Author

delroth commented Jan 17, 2024

No interest in waiting months to get someone to look at simple patches - dropping.

@delroth delroth closed this Jan 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
1 participant