Splitting text into separate columns from PDF #985
PiotrKrosniak
started this conversation in
Ask for help with specific PDFs
Replies: 1 comment
-
Hi @PiotrKrosniak Appreciate your interest in the library. The "text" strategy for vertical lines does not always give you the desired results as it is more of a best guess implementation. You can use the explicit strategy to do the same. It can be done by using a table settings like {
"vertical_strategy": "explicit",
"explicit_vertical_lines": [100, 200, 300], # These are the X coordinates where you want the vertical line separator to be present
"horizontal_strategy": "text",
"snap_y_tolerance": 6,
} You can learn more about them here. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I'm extracting tables from PDF document and my problem is that the script is splitting sentences into separate columns. How I can adjust table settings to force text to be in single column.
Beta Was this translation helpful? Give feedback.
All reactions