Extract doc lengths #791

Merged
merged 11 commits into from Sep 6, 2019


@Chriskamphuis
Contributor

commented Sep 6, 2019

Adds a script that produces a TSV file containing, for every document, its length, its unique term count, and the lossy unique term count as used in the BM25Similarity class.
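
To make the columns concrete, here is a minimal sketch of how one row could be computed from a tokenized document. It is not the script in this PR: the class name `DocLengthSketch`, the column order, and the leading docid column are assumptions for illustration. The `lossy` method is a simplified stand-in that keeps only the 4 most significant bits of a count; it is not Lucene's exact encoding. Real code would round-trip the value through Lucene's `SmallFloat.intToByte4` and `SmallFloat.byte4ToInt`, which is the encoding BM25Similarity uses for norms.

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;

public class DocLengthSketch {

    // Simplified stand-in for a lossy count encoding: keep only the 4 most
    // significant bits of n. This is NOT Lucene's exact scheme; real code
    // would call SmallFloat.byte4ToInt(SmallFloat.intToByte4(n)) instead.
    public static int lossy(int n) {
        if (n < 16) {
            return n; // values that fit in 4 bits survive unchanged
        }
        int shift = (32 - Integer.numberOfLeadingZeros(n)) - 4;
        return (n >>> shift) << shift;
    }

    // Builds one TSV row (hypothetical layout):
    // docid, document length, unique term count, lossy unique term count.
    public static String row(String docid, List<String> tokens) {
        int length = tokens.size();
        int unique = new HashSet<>(tokens).size();
        return String.join("\t",
            docid,
            Integer.toString(length),
            Integer.toString(unique),
            Integer.toString(lossy(unique)));
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList("to", "be", "or", "not", "to", "be");
        System.out.println(row("doc1", tokens));
    }
}
```

For small documents the lossy and exact counts match; they only diverge once a count needs more bits than the encoding keeps, which is why it can be useful to dump both.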

Add date filter to background linking reranker
Date is saved as an additional field
Date filter can be toggled using command line argument
Utility script dumping out doc lengths
fix #787
Fixed typo in SearchCollection.java
@lintool
lintool approved these changes Sep 6, 2019
@lintool

Member

commented Sep 6, 2019

Please add license header, update branch, then go ahead and merge.

Chriskamphuis and others added 3 commits Sep 6, 2019

@lintool lintool merged commit 61f6f20 into castorini:master Sep 6, 2019

1 check passed

continuous-integration/travis-ci/pr The Travis CI build passed