-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
pw revisions based on AB version(s), continued #102
Comments
Working directory for this issue: pwkissues/issue102. temp_pw_17a.txt -- 4 changes from temp_pw_ab_17.txt above. see changes_17a.txt. |
@Andhrabharati |
Yes I do, @funderburkjim ! BTW, I see that the transcoder file is giving some errors now on the revised file (and outputs a file just upto the first metaline, but not inclusive, only!!), which I wanted to use for "proofing" the pwkvn file once--
Could you pl. tell why the problem is occurring? |
Anyway, here is the pwkvn file to "adopt" for cdsl usage-- |
Now, coming to my pw_AB v.2 file, I had already mentioned about it earlier. I see that the original typed text also has the volume-page notation, as seen at the text given by Thomas recently, while commenting about the das. abbreviation. The dot between volume & page got missed in the present pw.txt, and probably Jim might not mind bringing it back. Otherwise, I shall revert the correction in my v.2 file, for giving it out (to start the next phase of corrections in cdsl pw.text) |
I do not find this problem when I run locally for pwkvn. This may be a python version problem. And your error message shows Python312. (version 3.12). Can you use version 3.9 of python? Also note -- While my local conversion of pwkvn gave no error, I DID find an |
I think that '.' in The '.' is not part of pw.txt, nor of the display of pw. Nor has it been previously. So there is nothing to 'bring back'. |
Even I got confused with this at times; so looked around and thought of changing the pc as v-p-c, as in other cdsl works. Even if not to "bring back", would you mind changing it @funderburkjim ? |
Uninstalled Python 3.12 and installed Python 3.9; but still the same error appears for me--
Would you pl. give me the converted pwkvn_deva file for now, so that I can start proofing the same? |
BTW, where did you find |
I see that the 'pwkvn' file also uses the 'v-page-col' form for 'pc'. When I get to the task of integrating pwkvn into pw, then maybe will be the time to change |
|
Good to hear this! So, shall I post my v.2 file as is now? [along with the steps involved in converting ab_17 to that form] |
Version confusion! In the comments above, we've mentioned both pw and pwkvn.
You asked
|
My copy pwkvn_AB_v.1.txt |
I had made v.2 (long back, over 2 months ago) from my v.1 file that was posted initially for the abbr. work. And I have been updating the same with your successive steps from 1 to 16, so far. Shall post the file tomorrow, as I had just shutdown my system and on my mobile now. |
|
why the conversion problem?The python errors above are occurring at line 74 of transcoder.py the et_example folder contains a published simple example of using ET.parse. @Andhrabharati If you try this example on your local system, Does it work? |
Here is the result--
|
ab_17 to ab_17a (adjustments)Step-1: merging the separate [Pagexxx] lines "into" the other lines. (a) Step-2: merging consecutive
Step-3: removing the italic terminations around [Pagexxx]
Step-4: Changing the page & column numbers after (a) Insert a '-' after the first (volume) digit. temp_pw_ab_17a.zip and pw (AB v2).zip The majority of changes are-- |
@Andhrabharati I'll switch to pwkvn now (#103) before further investigation of your changes in temp_pw_AB_v2.txt |
All the 20 "[Pagexxx]" changes mentioned in your addl. corrections file above were "included" in the Step-4 in my notes. And then there is one mistake (!?) in my file, which you have noted at
;; AB note |
Resolve the 7000 differencesThe work is done in the issue102/step2 directory. change_v2_1.txt (39 changes) documents the further changes to AB version temp_pw_v2_0.txt. change_v1_1.txt (7252 changes) documents the further changes to cdsl version temp_pw_v1_0.txt. Respectively applying these changes yields temp_pw_v1_1.txt and temp_pw_v2_1.txt. diff temp_pw_v1_1.txt temp_pw_v2_1.txt | wc -l This is the version pushed to csl-orig repository at the commit mentioned in above comment. Other repositories also required some change for xml-validation and proper behavior of the displays. Notably, the new ![]() |
@Andhrabharati I think this issue may now be closed. Agree? |
Out of the 32 Misc. changes done in the AB version, I've noticed 5 corrections (at 36207, 36215, 324234, 339882 and 426957)-- You may correct these in the cdsl text also. The rest are mostly related to italic marking, to which I deliberately didn't pay much attention earlier (having thought of doing a full text reading once; and I would probably take this up quite soon). Glad that the CDSL and AB versions are now tallying!! I have just returned home from a long journey (and too tired), and shall look at the rest of the actions (that you had taken) tomorrow. |
@Andhrabharati @maltenth has been working on two types of corrections
Let's defer your further work (including your small 'corrections' file) until this work with Thomas is finished. BTW, I am doubtful of the 'print change' suggestions of your corrections file, but we can discuss further in another issue. |
This issue continues the revisions of PW digitization at #88, based upon work done by @Andhrabharati.
We start with AB's temp_pw_ab_17.zip
The text was updated successfully, but these errors were encountered: