-
Notifications
You must be signed in to change notification settings - Fork 5
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
MWS accent correction, continue phase 3 #142
Comments
Recall from #141, that changes for page 1-59 are in issue141/change_mw_6.txt, of 10-17-2022. The current work is done in mwissues/issue142 directory,. This table is a log of progress.
|
So far about 48800 changes in 499 pages, i.e. almost 100/page on average [in this issue alone]. Guess @funderburkjim feels this exercise is worthy enough of his time, and would be alloting further time to continue the work in the remaining pages. |
Yes, currently at page 617. Almost half-way. Probably 5-7 weeks to end (about 20 pages/day). |
Today's correction file (06) is dated as 20th, instead of 12th, by error. |
So until end of 2022. When should we plan for our yearly call? Right after? |
Corrected. |
You forgot to add the last commit (1100-1199) in the 'progress log table' above. Also pl. see my post at sanskrit-lexicon/SKD#16 (comment) reg. the annexure pages. Hope you would be finishing this commendable task before this Christmas; this indeed is the most worthy exercise (in my view) in the last 25+ years of MW work at CDSL. |
Updated progress log table. Thanks for mentioning. |
Regarding annexure pages accent review, I was hoping you would cover that when I finish the body text accent review. However, if you decide not to do that, then I will consider it later. |
Sure, I can resume my unfinished task (referred above) once you're done with your work and give me the updated iast file. |
Good to know. |
accent review completed.Review ends at page 1308. |
two accents.There are a relatively few cases where an entry headword is marked with two accents. These should be reviewed manually sometime. |
a few extraBased on notes made during the accent review, several additional entries were reviewed. |
I bow to the way you deal with issues.
@Andhrabharati would you be willing to look at the 200 lines? |
@funderburkjim hasn't made room for my stepping in, @gasyoun ; he wants to do something still (look at his prev. post). |
A crude statistic shows that the primary difference between the original and final versions of MW in this exercise
|
15001 changes? |
See revision of comment above. Roughly 58000+ metalines changed. |
Closing this issue. |
See what pwk says on this word-- I would again request you to make a page like https://sanskrit-lexicon.uni-koeln.de/scans/csl-apidev/pwkvn/03/ (option 3 of https://sanskrit-lexicon.uni-koeln.de/scans/csl-apidev/pwkvn/03/), for MW, PWG and pwk+pwkvn; this definitely would be helpful to easily track such queries, as MW is heavily depending on those two works. |
Such a display with MW is not as easy as the pwkvn/03/ page, because there are differences in spelling conventions between MW and Boehtlingk. e.g. kar (pw) vs kf (mw) [slp1 spelling of root 'to do']. Without handling these spelling differences, the pwkvn/03 display could be adapted, but would sometimes stumble. So getting a perfected display is non-trivial. My solution when working with the mw accents and consulting occasionally pw or pwg has been to |
@funderburkjim kar (pw) vs kf (mw) - with the acceneted words it will not become an issue at all. @Andhrabharati is not asking for a universal tool for all cases.
Yeah, it's not even reachable from homepage. Hope it can get some love in 2023.
Three open tabs is 2 tabs too much for me.
Thanks, so the last one. Is it still a single |
yes, it is a dvandvasamAsa. |
But why only a small part of them have two accents at once? Archaic ones? |
Almost every 'dual' category entry that I came across is with double accent; just wait till I finish reading through the MW entries. |
In #141, the last comments pertained began what was called phase 3. This is a page-by-page, column-by-column comparison of the scanned images with the cologne digitization of mw.txt. This comparison focuses primarily on the accents in the metaline and headline portion (before the broken bar) of the digitization.
This issue continues that task.
The text was updated successfully, but these errors were encountered: