[Google Blockly] strip random block ids when saving xml #56299

mikeharv · 2024-02-05T17:13:33Z

Recently, there was concern about the potential for storing identical solutions in the level_sources table in DB-backed levels for Google Blockly labs.

@breville said:

The level_sources table is designed to only save one copy of each unique solution. The saves us from writing tons of duplicate entries. However, if each solution has some random IDs in it, we suddenly are saving every copy of what's essentially the same source.

He added:

I think x/y is probably not such a big issue for levels where students are unlikely (or unable) to drag the parent block to a new position. However, if IDs are truly unique for every use, then they seem like a more concerning issue.

Currently, this issue impacts DB-backed levels using Flappy, Bounce, and Dance. As it happens, Sprite Lab/Poetry are never DB-backed, always using S3 for project storage. Minecraft also uses the database for many levels, so it makes sense to address this before migrating that lab.

Stripping block ids at the time of saving is a reasonable step forward, however there are times when blocks need to have explicitly set ids. For example, block ids are used with callouts, automated tests, and possibly some older forms of Blockly code validation.

This branch creates a new property on the Blockly Wrapper that stores a list of block ids that were explicitly set in the level (start blocks or toolbox). Any other ids found can safely be considered randomly generated by Blockly and discarded.

Links

Testing story

To test this change, I solved a Minecraft puzzle the same way twice. I logged the XML before and after the stripping and observed the following:

Before stripping, each program contained with completely different ids, except for a when_run block with id whenRun.
All random/unique ids were detected and removed.
After stripping, each program's XML was identical.

I had to get creative with the way the appOptions.level object was passed and accessed due to failing automated tests. While appOptions is typically available globally, this didn't prove true in the context of some are tests that needed to traverse these code changes.

PR Checklist:

Tests provide adequate coverage
Privacy and Security impacts have been assessed
Code is well-commented
New features are translatable or updates will not break translations
Relevant documentation has been added or updated
User impact is well-understood and desirable
Pull Request is labeled appropriately
Follow-up work items (including potential tech debt) are tracked and linked

mikeharv · 2024-02-05T17:13:35Z

This PR is a continuation of #56192, which was mis-closed by the Git LFS migration (#55759).

Previous Comments:

Previous Reviews:

molly-moen · 2024-02-05T17:51:21Z

apps/src/blockly/addons/cdoXml.js

+function removeIdsFromBlocks(element) {
+  if (element.nodeName === 'block') {
+    const id = element.getAttribute('id');
+    if (id && !Blockly.levelBlockIds.includes(id)) {


this function is only used in the context of google blockly right?

Correct. And true of everything in this file!

molly-moen

Nice!

mikeharv · 2024-02-05T18:15:10Z

I just added one more commit: c9c9475

The sharepage.feature UI test was failing: https://drone.cdn-code.org/code-dot-org/code-dot-org/39546

The reason for this is that loads in some block XML containing block ids which are needed to check the arrangement of the blocks.

https://github.com/code-dot-org/code-dot-org/blob/mike/strip-random-block-ids/dashboard/test/ui/features/step_definitions/flappy_steps.rb#L17-L21

In practice, these blocks would have random ids which are now getting stripped out. To preserve the ids for the test, we can just add them to the toolbox and start blocks XML.

ebeastlake · 2024-02-05T19:12:02Z

🎉 Should we hold off on merging this until after the Sprite Lab migration?

mikeharv · 2024-02-05T19:13:41Z

🎉 Should we hold off on merging this until after the Sprite Lab migration?

@ebeastlake I don't think it matters. We'll only be saving JSON for Sprite Lab (outside of levelbuilder functions), and never writing Sprite Lab solutions to the database.

Edit: Let's hold this until after the migration!

breville

Thanks for doing this!

breville · 2024-02-06T22:24:57Z

It might be good to check the production database before and after this is merged just to verify that we are indeed saving fewer unique solutions!

mikeharv · 2024-02-13T13:58:13Z

@breville Agreed! That is the plan.

mikeharv added 4 commits February 1, 2024 13:58

[Google Blockly] strip random block ids when saving xml

bdcb245

Update utils.js

b49fcd1

Update utils.js

b819066

pass in appOptions

580ec8d

Merge branch 'staging' into mike/strip-random-block-ids

9f52f3f

mikeharv requested a review from a team February 5, 2024 17:39

molly-moen reviewed Feb 5, 2024

View reviewed changes

molly-moen approved these changes Feb 5, 2024

View reviewed changes

add ids to level for sharepage ui test

c9c9475

breville approved these changes Feb 6, 2024

View reviewed changes

mikeharv merged commit dc71950 into staging Feb 13, 2024

mikeharv deleted the mike/strip-random-block-ids branch February 13, 2024 13:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Google Blockly] strip random block ids when saving xml #56299

[Google Blockly] strip random block ids when saving xml #56299

Uh oh!

mikeharv commented Feb 5, 2024 •

edited

Loading

Uh oh!

mikeharv commented Feb 5, 2024

Uh oh!

molly-moen Feb 5, 2024

Uh oh!

mikeharv Feb 5, 2024

Uh oh!

molly-moen left a comment

Uh oh!

mikeharv commented Feb 5, 2024

Uh oh!

ebeastlake commented Feb 5, 2024

Uh oh!

mikeharv commented Feb 5, 2024 •

edited

Loading

Uh oh!

breville left a comment

Uh oh!

breville commented Feb 6, 2024

Uh oh!

mikeharv commented Feb 13, 2024

Uh oh!

Uh oh!

[Google Blockly] strip random block ids when saving xml #56299

[Google Blockly] strip random block ids when saving xml #56299

Uh oh!

Conversation

mikeharv commented Feb 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Links

Testing story

PR Checklist:

Uh oh!

mikeharv commented Feb 5, 2024

Previous Comments:

Previous Reviews:

Uh oh!

molly-moen Feb 5, 2024

Choose a reason for hiding this comment

Uh oh!

mikeharv Feb 5, 2024

Choose a reason for hiding this comment

Uh oh!

molly-moen left a comment

Choose a reason for hiding this comment

Uh oh!

mikeharv commented Feb 5, 2024

Uh oh!

ebeastlake commented Feb 5, 2024

Uh oh!

mikeharv commented Feb 5, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

breville left a comment

Choose a reason for hiding this comment

Uh oh!

breville commented Feb 6, 2024

Uh oh!

mikeharv commented Feb 13, 2024

Uh oh!

Uh oh!

mikeharv commented Feb 5, 2024 •

edited

Loading

mikeharv commented Feb 5, 2024 •

edited

Loading