Perform granular updates if reloading textual files #366

krassowski · 2025-11-29T09:29:18Z

Add a test with expectation of granular reload and append for files
Fix Do not reload text document if text was apened out-of-band? #365

krassowski · 2025-11-29T10:29:36Z

jupyter_ydoc/yunicode.py

+
+            # for very different strings, just replace the whole content;
+            # this avoids generating a huge number of operations
+            if matcher.ratio() < 0.6:


Using 0.6 because:

As a rule of thumb, a ratio() value over 0.6 means the sequences are close matches

As per https://docs.python.org/3/library/difflib.html#sequencematcher-examples

Instead of ratio() we could use quick_ratio() or real_quick_ratio(). In case if the ratio is above the threshold this does not matter because we would end up computing matching_blocks anyways (this is computed and cached by a call to either raito() or get_opcodes(); in that case calling quick_ratio heuristic could actually end up taking more time as we end up doing more work. It all boils down to whether we think that we are more likely to see smaller updates or larger updates. One very cheap heuristic could be using len of both strings which is what real_quick_ratio does.

davidbrochart

That's great, I just have minor comments.

davidbrochart · 2025-12-01T08:04:59Z

jupyter_ydoc/yunicode.py

+                operations = matcher.get_opcodes()
+                offset = 0
+                for tag, i1, i2, j1, j2 in operations:
+                    if tag == "replace":


Since we require python 3.10+, maybe use match?

Done in b61975ef55b4e8011016e7fbdd83f601e5ae84df.

davidbrochart · 2025-12-01T08:08:04Z

jupyter_ydoc/yunicode.py

+                        del self._ysource[i1 + offset : i2 + offset]
+                        offset -= i2 - i1
+                    elif tag == "insert":
+                        self._ysource[i1 + offset : i2 + offset] = value[j1:j2]


Why is this not using Text.insert?

Changed to Text.insert in ca32120.

as `__setitem__` also checks if index is a number or slice and then checks the range of the slice; we can skip those knowing that `i1 == i2` in the `insert` opcode.

Add test with expectation of granular reload and append

a4b662d

krassowski added the enhancement New feature or request label Nov 29, 2025

github-actions bot assigned krassowski Nov 29, 2025

Use stdlib sequence matcher to perform granular text updates

ba21026

krassowski marked this pull request as ready for review November 29, 2025 10:20

krassowski commented Nov 29, 2025

View reviewed changes

Use real_quick_ratio to fast-reject very dissimilar updates

7e0627e

davidbrochart approved these changes Dec 1, 2025

View reviewed changes

krassowski added 2 commits December 1, 2025 09:33

Use insert() which skips some of the checks

ca32120

as `__setitem__` also checks if index is a number or slice and then checks the range of the slice; we can skip those knowing that `i1 == i2` in the `insert` opcode.

Use match-case instead of elif

b61975e

davidbrochart merged commit 87e2052 into jupyter-server:main Dec 1, 2025
22 checks passed

krassowski deleted the granular-file-reload branch December 1, 2025 10:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Perform granular updates if reloading textual files #366

Perform granular updates if reloading textual files #366

Uh oh!

krassowski commented Nov 29, 2025 •

edited

Loading

Uh oh!

krassowski Nov 29, 2025

Uh oh!

davidbrochart left a comment

Uh oh!

davidbrochart Dec 1, 2025

Uh oh!

krassowski Dec 1, 2025

Uh oh!

davidbrochart Dec 1, 2025

Uh oh!

krassowski Dec 1, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Perform granular updates if reloading textual files #366

Perform granular updates if reloading textual files #366

Uh oh!

Conversation

krassowski commented Nov 29, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

krassowski Nov 29, 2025

Choose a reason for hiding this comment

Uh oh!

davidbrochart left a comment

Choose a reason for hiding this comment

Uh oh!

davidbrochart Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

krassowski Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

davidbrochart Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

krassowski Dec 1, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

krassowski commented Nov 29, 2025 •

edited

Loading