Problem converting DSL file (Papyros Greek) #265
Comments
The DSL file contains html tags! Have you tested it with DSL Compiler? |
I thought DSL was supposed to be a non-HTML format, which is why PyGlossary escaped the tags. I can't see any mention of HTML tags being allowed in the official documentation. |
No, it has tags, just not html-tags. This is the same article from the decompressed dsl-file:
So you got the [m1], [b] and [c] tags right, missed the [i] tags and converted the [p] tags to <i class="p"> tags. |
They are given here: http://lingvo.helpmax.net/en/troubleshooting/dsl-compiler/dsl-tags/ |
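For anyone following along, here is a rough sketch of the kind of mapping being discussed for the [m1], [b], [i], [p] and [c] tags. It is not PyGlossary's actual code: the `dsl_to_html` function, the `SIMPLE_TAGS` table, and the HTML chosen for [m] and [c] are assumptions made for illustration. Only the `<i class="p">` output for [p] comes from this thread.

```python
import html
import re

# Hypothetical mapping for illustration; not PyGlossary's real table.
SIMPLE_TAGS = {
    "b": ("<b>", "</b>"),
    "i": ("<i>", "</i>"),
    "p": ('<i class="p">', "</i>"),
}

# Matches [tag], [/tag], [m1] and [c colorname]
TAG_RE = re.compile(r"\[(/?)([a-z]+)(\d*)(?: ([^\]]*))?\]")


def dsl_to_html(text):
    # Escape first, so literal <, > and & in the article stay plain text
    text = html.escape(text, quote=False)

    def repl(match):
        closing, name, digits, arg = match.groups()
        if name in SIMPLE_TAGS and not digits and arg is None:
            return SIMPLE_TAGS[name][1 if closing else 0]
        if name == "m" and arg is None:
            if closing:
                return "</div>"
            # [m1], [m2], ... become an indent of that many em (assumed)
            return '<div style="margin-left:%sem">' % (digits or "0")
        if name == "c" and not digits:
            if closing:
                return "</span>"
            if arg:
                return '<span style="color:%s">' % arg
            return '<span class="c">'
        # Unknown tag: leave it untouched so it is easy to spot
        return match.group(0)

    return TAG_RE.sub(repl, text)


print(dsl_to_html("[m1][b]word[/b] [i]adj.[/i] [p]n[/p][/m]"))
```

Leaving unknown tags untouched is a deliberate choice in this sketch: it makes unsupported tags visible in the output instead of silently dropping them.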
Checking the current dictionary:
Tags not found in current dictionary:
|
I pushed a change. |
👍 Will do. Thanks! |
Quick check with Koreader, for the same entry, second page has a beautiful result: I'll check further by running it through my script to convert it to Pocketbook format and ask ichnilatis for some more Goldendict dictionaries for testing. Ichnilatis is a mobileread forum member who wanted the Goldendict dictionaries converted to Pocketbook. The forum thread is here. EDIT: You even got rid of the quotes around the dictionary name. Nice! |
Great. |
I received one English-Greek dictionary and two Greek-Greek ones. Will the En-Gr one work for your unit tests? The dictionaries were uploaded here. |
Target language is important for testing, so anything to English would be good. |
Ichnilatis uploaded a Russian-English dictionary for you here. It converts without errors. I haven't tested it further. (Can't read Russian either. :) ) |
Thanks |
The same dictionary is used as in #264. Both pyglossary 4.00 and 4.07 generate this error.
These images are based on the dictionary generated with pyglossary:
![Screenshot from 2020-11-12 15-58-55](https://user-images.githubusercontent.com/5269101/98956967-f4124d80-2500-11eb-9c5b-3b1b7e93a673.png)
![Screenshot from 2020-11-12 15-59-14](https://user-images.githubusercontent.com/5269101/98956970-f4aae400-2500-11eb-9d7d-e4ce27185919.png)
Looking at the offending article in the xml-format generated with stardict-editor:
Looking at the csv-output of pyglossary:
So it starts okay, but messes up when the line gets too long.