Description
Using to_markdown, the results have the following issues:
Slides 1-3 show duplicated headings, inconsistent Markdown heading formats, and generic Markdown formatting without customization.
Slide 4 shows a loss of hierarchical structure: the way text was extracted from the table makes the order of the information unclear.
Slide 5 shows tables that are not properly structured when converted to Markdown. Columns are misaligned and data is scattered, making interpretation difficult.
Slide 6 shows indentation issues that cause bullet points to blend into plain text, losing structural clarity.
The overall Markdown file has no slide number dividers, which makes it hard to tell which extracted text came from which slide and to pinpoint what information was lost.
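
To illustrate the kind of slide dividers meant here, below is a minimal sketch of a post-processing step. It assumes the Markdown for each slide is already available as a separate string; the function name `join_slides` and the divider format (an HTML comment plus a `---` rule) are hypothetical and not part of the library.

```python
def join_slides(slide_markdown: list[str]) -> str:
    """Join per-slide Markdown, labelling each slide with its number.

    Hypothetical helper: assumes one Markdown string per slide, in order.
    """
    parts = []
    for number, text in enumerate(slide_markdown, start=1):
        # An HTML comment marks the slide without affecting rendered output.
        parts.append(f"<!-- Slide {number} -->\n\n{text.strip()}")
    # Blank lines around '---' keep it a thematic break, not a setext heading.
    return "\n\n---\n\n".join(parts) + "\n"


if __name__ == "__main__":
    print(join_slides(["# Title slide", "- first point\n- second point"]))
```

With output like this, each block of extracted text can be traced back to its slide, which would make the other issues above easier to pinpoint.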