Calculate the length of matched sections for the bill-to-bill page #104

Closed
aih opened this issue Dec 22, 2020 · 3 comments

Comments

@aih
Collaborator

aih commented Dec 22, 2020

The 'similarity' between sections depends on both the length of the sections and the amount of overlap. For the bill-to-bill page, it would be useful to show the length of each matched section; if possible, the length of each of the top 10 sections for each bill.

In a separate series of issues we'll look at creating a 'compare bills' page, which dynamically matches sections and compares them across both whole bills (not just the top 10 sections, and not using Elasticsearch).

@aih
Collaborator Author

aih commented Dec 22, 2020

Related to #85

@aih
Collaborator Author

aih commented Dec 24, 2020

We no longer store the section_text, so this would require a recalculation of bill_similarity. Instead of storing the section text, we would calculate the length of the text and save that. E.g. 'This is the section text. It may be very short or very long'.length = 59
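A minimal sketch of that idea in Python, assuming each matched section is available as a dict with `section_id` and `section_text` keys while bill_similarity is being recalculated. The record shape and the names `section_length` and `with_lengths` are hypothetical, not taken from the project code.

```python
from typing import Dict, List, Union


def section_length(section_text: str) -> int:
    """Return the number of characters in a section's text."""
    return len(section_text)


def with_lengths(
    matched_sections: List[Dict[str, str]],
) -> List[Dict[str, Union[str, int]]]:
    """Keep each section's id and store its length instead of its text.

    The input and output record shapes are assumptions for illustration.
    """
    return [
        {
            "section_id": section["section_id"],
            "section_length": section_length(section["section_text"]),
        }
        for section in matched_sections
    ]


example = [
    {
        "section_id": "s1",
        "section_text": "This is the section text. It may be very short or very long",
    }
]
print(with_lengths(example))  # [{'section_id': 's1', 'section_length': 59}]
```

Note that Python's `len` counts Unicode code points, while JavaScript's `.length` counts UTF-16 code units, so the two can differ for text containing characters outside the Basic Multilingual Plane.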

@aih
Collaborator Author

aih commented Feb 11, 2021

Possible future enhancement. Not a priority at the moment.

aih closed this as completed Feb 11, 2021