Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Crawling JSON #601

Open
coprisanu opened this issue May 10, 2019 · 1 comment
Open

Crawling JSON #601

coprisanu opened this issue May 10, 2019 · 1 comment

Comments

@coprisanu
Copy link

Hi,

   We need to crawl a JSON file and to split its content into smaller documents to be indexed in Elasticsearch. We have noticed there are already implementations like CVSSplitter, DOMSplitter or PDFsplitter, is there one for JSON?

Thank you

@essiembre
Copy link
Contributor

No, there are currently none. Good idea though. I will mark as a feature request. In the meantime, if you know your Java, you can implement your own solution by extending AbstractDocumentSplitter (feel free to share).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants