Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

remove rank entries for continuous spans on import #317

Closed
thomaskrause opened this issue May 6, 2014 · 0 comments
Closed

remove rank entries for continuous spans on import #317

thomaskrause opened this issue May 6, 2014 · 0 comments
Assignees
Milestone

Comments

@thomaskrause
Copy link
Member

For each spanning relation (thus a span covering a token) there is an entry in the rank table with the edge. This information is never used in the query generation, but when re-constructing the graph.

For continuous spans the graph can be re-constructed without this explicit storage of the edges (using the left/right_token_index). We should remove the unnecessary rank entries on import.

This will reduce the size of the facts table and it's indexes dramatically on corpora that only contain spans.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant