Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Handle lists of more than 2^32 triples #136

Closed
wants to merge 1 commit into from

Conversation

Arkanosis
Copy link

This is required but probably not enough to handle big datasets such
as Wikidata (4.65 B triples).

This should partially address issue #135.

This is required but probably not enough to handle big datasets such
as Wikidata (4.65 B triples).

This should partially address issue rdfhdt#135.
@wouterbeek
Copy link
Contributor

@Arkanosis Thank you for working on the important topic of scalability.

Can you rebase this against develop-64 (the branch for 64-bit support)?

@v4ss4llo
Copy link

As far as I understand, HDT allows triples to be specified with any number of bits (to be defined in a HDT header), so my guess is that they used 32bits by default, and for larger datasets more work is needed to parse the header correctly and support an arbitrary number of bits.

@v4ss4llo
Copy link

The branch structure of this repository is really confusing...

@LaurensRietveld
Copy link
Member

We've merged the develop-64 branch to our develop branch. Let me know if you have any other issues. I'm assuming we can close this PR

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

4 participants