Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Page numbers returned in response #57

Open
stevelizcano opened this issue Mar 2, 2024 · 5 comments
Open

Page numbers returned in response #57

stevelizcano opened this issue Mar 2, 2024 · 5 comments
Assignees
Labels
enhancement New feature or request

Comments

@stevelizcano
Copy link

For some indexing into our VectorDB, it's very helpful to know the page numbers if possible.

Is this feature possible now, or to add?

Either way, love the library. Great work!

@anoopshrma
Copy link
Collaborator

Currently page number is not returned in the metadata. But page content is separated using "-----------". You can separate the text based on the string pattern and provide page number for that by yourself.

PS: Adding more info in metadata is being implemented and will be available soon.

@hexapode hexapode added the enhancement New feature or request label Mar 4, 2024
@hexapode
Copy link
Contributor

hexapode commented Mar 4, 2024

We will provide a more reliable way to get page number (JSON output) in a future release

@hexapode hexapode self-assigned this Mar 4, 2024
@Laktus
Copy link

Laktus commented Apr 25, 2024

@hexapode When would such a release come? Until then is @anoopshrma suggestion a valid solution? Can we just replace each occurence of "-----------" with its respective page number?

Would be nice if you could approve this @hexapode

@anoopshrma
Copy link
Collaborator

Hey @Laktus , json mode is out for quite some time now.
You can give it a try: https://github.com/run-llama/llama_parse/blob/main/examples/demo_json_parsing.ipynb

@Laktus
Copy link

Laktus commented Apr 26, 2024

@anoopshrma Hi, is there also LlamaIndexTS support? Can i somewhere see a list of all supported modes? (Markdown, JSON, any more?). Thanks.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

4 participants