Skip to content
This repository has been archived by the owner on Jun 18, 2019. It is now read-only.

PDF Parsing? #62

Closed
2 tasks
Drakulix opened this issue May 31, 2017 · 2 comments
Closed
2 tasks

PDF Parsing? #62

Drakulix opened this issue May 31, 2017 · 2 comments
Assignees

Comments

@Drakulix
Copy link
Owner

Drakulix commented May 31, 2017

  • Evaluate Libraries for PDF parsing
  • Maybe writeup small (backend independent) example, if it is easy to do.
@DubbleClick
Copy link
Collaborator

I've had a look and messed around with pdf parsing (extracing text) using PDFMiner (open source, not stl), I am however not entirely sure what our use case with that information would be as the result is unformatted.
Save it within a project json?

@Drakulix
Copy link
Owner Author

Drakulix commented Jun 6, 2017

no, just pipe it into elastic. this is mainly a research topic for later, nothing that needs to be fleshed out now.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Projects
None yet
Development

No branches or pull requests

2 participants