-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add a version number to the .hdt.index files #7
Comments
See also LinkedDataFragments/HDT-Node#1. |
Just encountered |
My proposal for a versioning strategy: Given that the version number is x.y.z: Original release: 1.0.0 Index files can, for instance, be named Any thoughts? |
Makes sense to me (I'd just put the |
Has anyone perchance made progress on this front in recent months? |
Would it be acceptable to put the version number in the makefile? |
Nah, I guess that won't work because of other build strategies |
Simply put it in an include file? |
Sounds reasonable. Probably this will have to happen in the Java version as well for compatibility? |
Fixed with merge of #36 |
Excellent. Shall we publish and tag a v1.2.0 soon then? |
depends. Did the index change in a breaking way? Else it's 1.1.2 :) We should include the versioning strategy in the readme |
Not sure I agree:
|
Sorry, I might be going off in a very different direction from what you've been discussing here... I would very much prefer the index and HDT file to be one and the same. It is technically trivial to do so. IIUC then the only reason why index and HDT are not one and the same file is because you can save the size of the index file when using HDTs as a transmission format. However, the size of the index is not so big so this size benefit is not so large. If we use the HDT as a storage format, then the size difference does not matter at all (because disk is so cheap these days). Having 1 file with no versioning/synchronization overhead between HDT and index would significantly simplify handling HDTs. |
@RubenVerborgh keeping them separate just seems confusing to me. But if this is common practice, by all means. @wouterbeek from my experience, indexes can be quite large. But I have to admit, it would simplify things. |
@mielvds The index files are between 10% and 40% of the size of the HDT file. There may be exceptions, but this is the ballpark figure IINM. Having a versioning system is of course better than the current situation where we have to do the bookkeeping ourselves. |
@mielvds What I suggested is semver, which becomes more and more common. But I don't mind too much in this case; I'd just want a new release somewhere soon. @wouterbeek Not sure I follow the argument of an index to be the same everywhere. We recently had a commit in which the index was improved for certain lookups. And as you know, the index file is not information by itself, as it can be computed in its entirety from the HDT file. In almost all cases, it will be faster to generate it than to download it. |
@RubenVerborgh sounds reasonable, but we'll need to add that distinction to the code then. @wouterbeek I couldn't remember the argument against it, and @RubenVerborgh found it. Some indexes are optional, and we don't know which ones will be added in the future. |
Different .hdt.index files for the same .hdt file are incompatible with each other; this causes problems if different applications read them. Maybe they should receive some kind of version number.
But this then creates the problem: how to find the index?
Perhaps, instead of supplying the .hdt file name, we can provide the index file name.
The text was updated successfully, but these errors were encountered: