-
Notifications
You must be signed in to change notification settings - Fork 293
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Textual licence formatting is broken #1076
Comments
Moving this issue to the License-List-XML repo. @uiopaubo changing from XML to a different format is a rather large effort. The SPDX legal team would be the group to decide if we change format. Note the related enhancement request spdx/license-list-data#44 which is currently being worked on. |
I took a look at the source XML for the GPL license. The space and line break are caused by the license-list-XML/src/GPL-3.0-or-later.xml Line 838 in 531e5b5
I can think of 2 solutions - we could remove the If someone was willing to volunteer to update the license XML to use the spacing attribution (e.g. |
I'm not advocating anyone changing the format of anything. As the referenced issue points out, the upstream licence files should be the canonical form of the licence in those formats. Otherwise, one has to verify that the content has not been modified semantically, which is far more work than just comparing data byte for byte. Indeed, I can imagine these reformatted text versions causing all sorts of problems by appearing superficially different to the widely distributed forms of these licences. One thing I would also be worried about is the authenticity of marked up versions of licences which have not originated from the authors of those licences. Although one might argue that one has the freedom to make such content available in any form, I seem to recall that some organisations (the FSF, for example) retain strict control over the use of licence texts in order to avoid the proliferation of lookalike licences. |
Hi @goneall, I'm happy to update the |
@swinslow @uiopaubo The license text file issue may already be resolved by spdx/LicenseListPublisher#83 This can be confirmed by reviewing the text files in the license-list-data repo (e.g. GPL-3.0-or-later.txt). There is still an issue with the HTML formatting, however (reference https://spdx.org/licenses/GPL-3.0-or-later.html the end of which contains I'm thinking we should open a separate issue to track the HTML formatting and close this one as resolved. I have not yet implemented the |
@swinslow I think we should close this issue since there are no more specific pull requests and issues logged to track the solution implementation. |
Thanks @goneall! |
While double-checking the output of the REUSE tool (https://reuse.software/), I noticed that the licence texts it downloads in order to populate a licensing manifest did not match those traditionally distributed with software. At least the GPL3 licence text differs in formatting from the more established form:
https://github.com/spdx/license-list-data/blob/master/text/GPL-3.0-or-later.txt
In this downloaded document, indentation appears to have been stripped, and the text appears to have been reformatted. Such reformatting actually introduces errors such as in the final URL of the above document:
<https://www.gnu.org/
licenses /why-not-lgpl.html>.
It is broken across two lines and includes an erroneous space. Neither of these things are present in the established, canonical version.
Although this issue may seem trivial, discovering superficial differences required me to investigate whether the actual content of the downloaded document was correct. By reformatting, any comparison with existing documents is made difficult and, to be certain that no mistakes or tampering have occurred with the published documents, requires more effort instead of the anticipated reduced effort in using such a centralised licence repository in the first place.
May I suggest that the canonical textual forms of these licences be used instead? I have read about the use of XML forms of licences as the origin of published licence data, but this is arguably the wrong decision or one that needed to be taken with the fidelity of the textual output in mind.
The text was updated successfully, but these errors were encountered: