Look through the JDSW and break it up into a key: value store, where each key is the unbroken sequence of characters preceding an annotation and the value is that annotation.
For each key: value pair:
a. Look through the source text (same chapter) and find the first instance of the key (unbroken) that occurs after the previous annotation (annotations must be sequential)
b. If that key is found and it's in the source text (not a commentary), leave it alone in the JDSW
c. If that key is found and it's in the commentary (indicated in SBCK editions in brackets), drop it from the JDSW
d. If that key isn't found at all, log it along with the previous and next annotations so that @GDRom can investigate manually
Assumption: If LDM annotates two successive characters, the second annotation refers to the instance of that character that is closest in the source text to the previous character.
This will produce a version of the JDSW that leaves out any annotations referring to commentaries, which we can later align to the 正文 (main text) versions.
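The steps above could be sketched roughly as follows. This is only a minimal illustration under several assumptions that are not stated in the notes: the JDSW has already been parsed into an ordered list of `(key, annotation)` pairs, the chapter's source text is a single string, and commentary is delimited by `[` and `]` (the actual bracket characters in the SBCK files may differ). The names `commentary_mask`, `filter_jdsw`, and the status labels `kept`, `dropped`, and `unresolved` are hypothetical.

```python
from typing import List, NamedTuple, Optional, Tuple

Pair = Tuple[str, str]  # (key, annotation); assumed input shape


class Result(NamedTuple):
    key: str
    annotation: str
    status: str  # "kept", "dropped", or "unresolved" (hypothetical labels)


def commentary_mask(source: str, open_br: str = "[", close_br: str = "]") -> List[bool]:
    """Return one flag per character: True if it lies inside bracketed commentary."""
    mask, depth = [], 0
    for ch in source:
        if ch == open_br:
            depth += 1
        mask.append(depth > 0)  # the brackets themselves count as commentary
        if ch == close_br and depth > 0:
            depth -= 1
    return mask


def filter_jdsw(pairs: List[Pair], source: str,
                open_br: str = "[", close_br: str = "]"):
    """Classify each JDSW entry by where its key first occurs after the previous match."""
    mask = commentary_mask(source, open_br, close_br)
    results: List[Result] = []
    unresolved: List[Tuple[Optional[Pair], Pair, Optional[Pair]]] = []
    cursor = 0  # annotations must be sequential, so the search only moves forward
    for i, (key, annotation) in enumerate(pairs):
        # Step a: first unbroken occurrence of the key at or after the cursor.
        pos = source.find(key, cursor) if key else -1
        if pos == -1:
            # Step d: record the entry with its neighbours for manual review.
            prev_pair = pairs[i - 1] if i > 0 else None
            next_pair = pairs[i + 1] if i + 1 < len(pairs) else None
            unresolved.append((prev_pair, (key, annotation), next_pair))
            results.append(Result(key, annotation, "unresolved"))
            continue
        # Steps b and c: keep main-text matches, drop commentary matches.
        status = "dropped" if mask[pos] else "kept"
        results.append(Result(key, annotation, status))
        cursor = pos + len(key)
    return results, unresolved


if __name__ == "__main__":
    # Invented toy data, not taken from any edition.
    source = "天地玄黃[天者顛也]宇宙洪荒"
    pairs = [("天", "note-1"), ("顛", "note-2"), ("洪荒", "note-3"), ("日月", "note-4")]
    results, unresolved = filter_jdsw(pairs, source)
    for r in results:
        print(r.key, r.status)
```

Because the cursor only advances, a repeated character is matched to its next occurrence after the previous key, which is one way to approximate the assumption about successive annotations. Unresolved entries leave the cursor where it was, so a single missing key does not derail the entries that follow.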
- Ensure we don't accidentally collapse repeat annotations
- Allow overwriting the input JDSW file with flag
- Add column to file indicating annotation status
See #10
The procedure described above is copied/adapted from notes from the 5/27 meeting.