long lines in the top level of bullets are truncated before they appear on "Front" in Anki #78

Closed
EntropyOrSloth opened this issue Jul 15, 2020 · 6 comments
@EntropyOrSloth

Please follow the template below to report your issue.

  • [https://pastebin.com/zA9iSkuL] Raw text
  • [none] Error report provided by Anki
  • [MacOS X Catalina 10.15.5] Operating system
  • [html] Original file type

Please include the following information:

Any general information

I exported a bullet list from MS Word as HTML and imported it using this add-on. It contained items like this: "animate noun 2nd declension masculine nominative singular". After the import, most of the "Fronts" were truncated. For example, the item above became "animate noun 2nd declension masculine" instead of the original "animate noun 2nd declension masculine nominative singular".

The only way I found to fix this is to edit each card in Anki, which defeats the purpose of this add-on.

These are the input files: russian declensions - nominative, accusative, genitive.zip

These include both the original MS Word file as well as the HTML file created from it which was imported into Anki via this add-on.

Raw text of the file you tried to upload

The PasteBin text can probably be used to recreate my file, but it would be better to use the original input files given above: PasteBin URL link.

Error report from the popup

There were no errors reported by Anki.

What is your operating system

MacOS X Catalina 10.15.5

What was the original file type

An MS Word *.docx file, which was then saved as *.html. The HTML file was then imported into Anki.

@c-okelly
Owner

Hey,

I will have a look into this and get back to you. I'm not sure why that is happening.

I have been meaning to remove all mentions of support for Microsoft Word in general, though. Especially on Windows machines, it is very hard to predict exactly what type of output Word is going to generate.

I would appreciate it if you kept your issue reporting to GitHub rather than the reviews for the add-on, but feel free to write what you think in the reviews one way or another.

@EntropyOrSloth
Author

EntropyOrSloth commented Jul 15, 2020 via email

@EntropyOrSloth
Author

EntropyOrSloth commented Jul 15, 2020 via email

c-okelly added the bug label Jul 16, 2020
@c-okelly
Owner

Thanks for all the info. I'm currently looking into the issue, though it might take a day or two to get to the bottom of it.

@c-okelly
Owner

c-okelly commented Jul 20, 2020

Hey,

Thanks for sending on all the information, especially the actual HTML files.

The short and long of the issue is that Word just does not produce predictable HTML when you save a Word document. It appears the superscript symbols actually break each line, and the parser stops looking for information after this.
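To illustrate the kind of failure described here, below is a minimal sketch using Python's standard `html.parser`. It is not the add-on's actual parser, and the sample HTML is a simplified stand-in rather than real Word output. The class names `FirstTextOnly` and `AllText` and the helper `extract` are hypothetical, made up for this example. The first parser stops reading a list item at the first nested tag (such as `<sup>`), which truncates the text. The second keeps collecting text through nested inline tags.

```python
from html.parser import HTMLParser

# Simplified stand-in for a list item containing a superscript.
# Real Word-generated HTML is far messier than this.
SAMPLE = "<ul><li>noun 2<sup>nd</sup> declension masculine</li></ul>"


class FirstTextOnly(HTMLParser):
    """Naive extractor: stops reading an <li> at its first nested tag."""

    def __init__(self):
        super().__init__()
        self.items = []
        self._in_li = False
        self._stopped = False

    def handle_starttag(self, tag, attrs):
        if tag == "li":
            self._in_li = True
            self._stopped = False
            self.items.append("")
        elif self._in_li:
            # Any nested tag (e.g. <sup>) ends collection for this item.
            self._stopped = True

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_li = False

    def handle_data(self, data):
        if self._in_li and not self._stopped:
            self.items[-1] += data


class AllText(HTMLParser):
    """Collects all text inside an <li>, including text in nested inline tags.

    Nested lists are not handled; this only covers a flat list.
    """

    def __init__(self):
        super().__init__()
        self.items = []
        self._in_li = False

    def handle_starttag(self, tag, attrs):
        if tag == "li":
            self._in_li = True
            self.items.append("")

    def handle_endtag(self, tag):
        if tag == "li":
            self._in_li = False

    def handle_data(self, data):
        if self._in_li:
            self.items[-1] += data


def extract(parser_cls, html):
    """Run the given parser class over html and return the list items found."""
    parser = parser_cls()
    parser.feed(html)
    parser.close()
    return parser.items


print(extract(FirstTextOnly, SAMPLE))  # ['noun 2']
print(extract(AllText, SAMPLE))        # ['noun 2nd declension masculine']
```

The truncated result from the first parser mirrors the symptom in the report: everything after the superscript is lost.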

Unfortunately, after having done a bit of work with the Word HTML files, I have found they are actually too difficult to support consistently across systems.

To close this ticket I'm going to do the following:

  • Remove all Word references from the docs

  • Add Word to LibreOffice migration docs

TLDR
Word won't work but there is a simple migration path to fix your error:

  1. Install LibreOffice (free / open-source Word equivalent)
  2. Open the file in LibreOffice
  3. Save as an HTML file
  4. Upload file to Anki
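
For anyone with many files to convert, steps 2 and 3 can probably be scripted instead of done through the LibreOffice GUI. The sketch below wraps LibreOffice's headless `--convert-to html` command line in Python. The function names `build_convert_command` and `convert_to_html` are hypothetical, and it assumes the `soffice` binary is on your PATH. On macOS it usually lives inside the LibreOffice.app bundle, so you may need to pass its full path. The HTML produced this way has not been tested against this add-on and may differ from what "Save as" in the GUI produces.

```python
import shutil
import subprocess
from pathlib import Path


def build_convert_command(docx_path, out_dir, soffice="soffice"):
    """Build the LibreOffice headless command that converts a document to HTML."""
    return [
        soffice,
        "--headless",            # run without opening the GUI
        "--convert-to", "html",  # target format
        "--outdir", str(out_dir),
        str(docx_path),
    ]


def convert_to_html(docx_path, out_dir, soffice=None):
    """Convert docx_path to HTML in out_dir and return the expected output path.

    Assumes LibreOffice is installed. If soffice is not given, it is looked up
    on PATH.
    """
    soffice = soffice or shutil.which("soffice")
    if soffice is None:
        raise FileNotFoundError("soffice not found; pass its full path explicitly")
    subprocess.run(build_convert_command(docx_path, out_dir, soffice), check=True)
    return Path(out_dir) / (Path(docx_path).stem + ".html")
```

Usage would look like `convert_to_html("declensions.docx", "out")`, where the file name is only an example. The resulting HTML file is then imported into Anki as in step 4.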

@c-okelly
Owner

Let me know if that doesn't work!
