@jahorton
Contributor

@jahorton jahorton commented Oct 1, 2025

#14790 introduced the method that determines how incoming input will transform context tokenizations when applied. This PR adds the method that actually performs that transformation to produce the new (batched) tokenization. It operates on subsets of inputs that all apply with the same properties, which promotes batching and efficiency for our correction-search goals.
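
As a rough illustration of the batching idea only: the sketch below groups inputs by a shared "edit profile" so each group can be applied to the tokenization in one pass. The `EditProfile` shape, its two fields, and `groupByProfile` are hypothetical names chosen for this example; the properties actually compared in this PR may differ.

```typescript
// Hypothetical summary of how one input would change a tokenization.
// These two fields are assumptions for illustration, not the real criteria.
interface EditProfile {
  removedTokens: number;  // tokens the input would erase
  insertedTokens: number; // tokens the input would add
}

// Groups inputs whose profiles match, so each batch can be applied together.
function groupByProfile<T>(
  inputs: T[],
  profileOf: (input: T) => EditProfile
): Map<string, T[]> {
  const batches = new Map<string, T[]>();
  for (const input of inputs) {
    const profile = profileOf(input);
    // Inputs with identical profiles share a key and land in the same batch.
    const key = `${profile.removedTokens}:${profile.insertedTokens}`;
    const batch = batches.get(key);
    if (batch) {
      batch.push(input);
    } else {
      batches.set(key, [input]);
    }
  }
  return batches;
}
```

Keying on a string keeps the grouping simple; any stable encoding of the compared properties would work equally well.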

Per #14876, note that this method drops any previously-existing tokens that were erased by the incoming input. If the input transforms have an insert string that results in a replacement, new tokens take the place of the erased ones.
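
A minimal sketch of that drop-and-replace behavior, assuming a much-simplified model: tokens are plain strings (alternating word and whitespace runs) and a transform has only `deleteLeft` and `insert`. `SimpleTransform` and `applyToTokens` are hypothetical names for this example and are not the types or method added in this PR.

```typescript
// Hypothetical, simplified transform: delete from the end of context, then insert.
interface SimpleTransform {
  deleteLeft: number; // UTF-16 code units removed from the end of the context
  insert: string;     // text appended after the deletion
}

// Applies one transform to a tokenized context and returns the new token list.
// Assumes tokens are non-empty and contain either only whitespace or no whitespace.
function applyToTokens(tokens: string[], transform: SimpleTransform): string[] {
  const result = [...tokens];
  let remaining = transform.deleteLeft;

  // Walk back from the tail, dropping every token the deletion fully erases.
  while (remaining > 0 && result.length > 0) {
    const tail = result[result.length - 1];
    if (tail.length <= remaining) {
      remaining -= tail.length;
      result.pop();
    } else {
      // Partially erased token: keep what is left of it.
      result[result.length - 1] = tail.slice(0, tail.length - remaining);
      remaining = 0;
    }
  }

  // Inserted text either extends the tail token or starts a replacement token.
  for (const ch of transform.insert) {
    const isSpace = /\s/.test(ch);
    const tail = result[result.length - 1];
    if (tail !== undefined && /^\s/.test(tail) === isSpace) {
      result[result.length - 1] = tail + ch;
    } else {
      result.push(ch);
    }
  }
  return result;
}
```

For example, under these assumptions, applying `{ deleteLeft: 3, insert: 'dog' }` to `['the', ' ', 'cat']` erases the `'cat'` token entirely and a new `'dog'` token takes its place.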

Relates-to: #14679

Build-bot: skip build:web
Test-bot: skip

@keymanapp-test-bot

keymanapp-test-bot bot commented Oct 1, 2025

User Test Results

Test specification and instructions

User tests are not required

Test Artifacts

  • Web
    • KeymanWeb Test Home - build : all tests passed (no artifacts on BuildLevel "build")

@keymanapp-test-bot keymanapp-test-bot bot changed the title from "feat(web): add method that evaluates precomputed tokenizations" to "feat(web): add method that evaluates precomputed tokenizations 🚂" Oct 1, 2025
@keymanapp-test-bot keymanapp-test-bot bot added this to the A19S13 milestone Oct 1, 2025
@jahorton jahorton force-pushed the feat/web/evaluate-precomputed-tokenization branch from 8d9bbdb to fad631a Compare October 1, 2025 16:41
@jahorton jahorton changed the base branch from feat/web/merge-context-tokens to feat/web/compute-removed-token-count October 1, 2025 16:41
@jahorton jahorton force-pushed the feat/web/compute-removed-token-count branch from 90b7607 to b7a1f8d Compare October 2, 2025 16:34
@jahorton jahorton force-pushed the feat/web/evaluate-precomputed-tokenization branch from fad631a to 68c8c0c Compare October 2, 2025 16:36
@jahorton jahorton force-pushed the feat/web/evaluate-precomputed-tokenization branch from 68c8c0c to 7ca6f27 Compare October 2, 2025 16:40
jahorton added a commit that referenced this pull request Oct 2, 2025
While handy, this method did not adequately account for 'split' / 'merge' edit cases and also required special handling to avoid certain degenerate edit-path cases.  The newer method put in place (see #14874) handles such edits and avoids the degeneracy problem that resulted when relying on edit paths for the edited portion of context.

Build-bot: skip build:web
Test-bot: skip
jahorton added a commit that referenced this pull request Oct 3, 2025
jahorton added a commit that referenced this pull request Oct 7, 2025
@jahorton jahorton marked this pull request as ready for review October 8, 2025 16:50
@darcywong00 darcywong00 modified the milestones: A19S13, A19S14 Oct 11, 2025
Base automatically changed from feat/web/compute-removed-token-count to epic/autocorrect October 16, 2025 14:09
@jahorton jahorton merged commit 750a9ac into epic/autocorrect Oct 16, 2025
7 of 8 checks passed
@github-project-automation github-project-automation bot moved this from Todo to Done in Keyman Oct 16, 2025
@jahorton jahorton deleted the feat/web/evaluate-precomputed-tokenization branch October 16, 2025 14:49