Querying on Subset of Document_IDs #83
Comments
Hey, this is a great suggestion and has already been requested! It's doable without too many code changes because ColBERT natively supports querying only a subset of passage IDs. @anirudhdharmarajan can talk a bit more about this but right now we do have a mapping of
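For anyone following along, here is a rough sketch of the idea in the comment above: turn the requested document IDs into the set of passage IDs that a search would be restricted to. This is only an illustration, not the project's or ColBERT's actual code. The `pid_to_doc_id` dictionary and both function names are hypothetical, and the post-filter on an already ranked list is a stand-in for whatever native pid restriction ColBERT provides.

```python
from typing import Dict, Iterable, List, Tuple


def pids_for_documents(
    pid_to_doc_id: Dict[int, str], document_ids: Iterable[str]
) -> List[int]:
    """Return the passage IDs whose parent document is in document_ids.

    pid_to_doc_id is an assumed mapping (passage ID -> document ID);
    the real index may store this differently.
    """
    wanted = set(document_ids)
    return sorted(pid for pid, doc_id in pid_to_doc_id.items() if doc_id in wanted)


def filter_ranked(
    ranked: List[Tuple[int, float]], allowed_pids: Iterable[int], k: int
) -> List[Tuple[int, float]]:
    """Keep only allowed passages from a ranked (pid, score) list, then take the top k.

    This is a post-filter fallback for illustration; restricting the
    candidate set inside the search itself would avoid scoring unwanted passages.
    """
    allowed = set(allowed_pids)
    return [(pid, score) for pid, score in ranked if pid in allowed][:k]


if __name__ == "__main__":
    # Toy data: documents "a" and "c" are the subset we want to query.
    mapping = {0: "a", 1: "a", 2: "b", 3: "c"}
    subset = pids_for_documents(mapping, ["a", "c"])
    ranked = [(2, 0.9), (1, 0.8), (3, 0.7), (0, 0.1)]
    print(subset)
    print(filter_ranked(ranked, subset, k=2))
```

Building the pid list up front keeps the user-facing argument in terms of document IDs while the search layer only ever sees passage IDs.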
I looked a bit at the pids when reviewing @anirudhdharmarajan's pull request for document_metadata, so I'm a little familiar with what you're describing. I'll take a rough swing this week, but it's possible I'm out of my depth. Will look for feedback when I have a plan.
Deleted my previous comments because they had the wrong approach. Will send a pull request later today, hopefully.
Pull request complete.
Are you referring to something similar to this issue in ColBERT: stanford-futuredata/ColBERT#304?
@hehuan2363
Thank you! Closing this issue as it's been merged.
Would love to be able to pass an array of document_IDs as an argument to the query function, representing the subset of documents to query. I'm not familiar enough with the inner workings of the technology to propose a resolution myself. Would gladly take some guidance from someone more senior so I can produce a pull request myself.