Drop empty arrays (sets) and empty lists in expansion #220

lanthaler · 2013-02-17T12:07:33Z

Now that we remove free-floating values and nodes during expansion, shouldn't we also drop empty arrays (sets) and empty lists?

For example:

{
  "@context": {
    "name": "http://xmlns.com/foaf/0.1/name",
    "homepage": {
      "@id": "http://xmlns.com/foaf/0.1/homepage",
      "@type": "@id"
    }
  },
  "@id": "http://me.markus-lanthaler.com/",
  "name": "Markus Lanthaler",
  "homepage": [ ]
}

Shouldn't we drop the homepage property when expanding?

[
  {
    "@id": "http://me.markus-lanthaler.com/",
    "http://xmlns.com/foaf/0.1/name": [ { "@value": "Markus Lanthaler" } ]
  }
]

PROPOSAL 1: Drop empty arrays (sets) when expanding

PROPOSAL 2: Drop empty lists when expanding

Lists are a bit special here since an empty list is actually a value. So it might make sense to keep them but drop empty sets.

The text was updated successfully, but these errors were encountered:

lanthaler · 2013-02-17T12:08:28Z

PROPOSAL 1: +1
PROPOSAL 2: -0.5

gkellogg · 2013-02-17T15:58:55Z

PROPOSAL 1: Drop empty arrays (sets) when expanding

+0

PROPOSAL 2: Drop empty lists when expanding

-1 as you note, an empty list does express information, and is consistent with every other RDF serialization.

dlongley · 2013-02-18T15:24:03Z

PROPOSAL 1 and 2:

-1. I would find this annoying when working with JSON.

lanthaler · 2013-02-18T15:56:46Z

More annoying than the fact that properties which are not mapped to an IRI are dropped? More annoying than the fact that free-floating nodes are dropped? More annoying than the fact the null is dropped? :-)
.
The point is, it doesn’t mean anything. Actually we can’t even represent such data with our data model. It’s a subject-predicate tuple -- there's no object.

dlongley · 2013-02-18T16:22:06Z

More annoying than the fact that properties which are not mapped to an IRI are dropped? More annoying than the fact that free-floating nodes are dropped? More annoying than the fact the null is dropped? :-)

Yes, no, ...yeah.

I'm willing to live with dropping null because if null is a value you expect to see for a property in your application it's not that much more work to deal with its non-existence (sometimes the check is exactly the same). Dropping free-floating nodes is not an issue at all with me (that I can think of). Dropping properties that are not mapped to an IRI is perfectly fine ... my application won't be looking at them anyway.

Now dropping properties that my application wants to see ... and where it is expecting an array, I find that annoying should they disappear. Now I have to permit validators to accept input that is missing the property and then either re-add it myself or do another check for its existence. Why? What does that buy anyone? Suggesting that several other layers of software could alleviate this issue is also a non-starter for me. I don't understand the utility of removing the properties.

lanthaler · 2013-02-18T16:37:55Z

Now I have to permit validators to accept input that is missing the property and then either re-add it myself or do another check for its existence. Why? What does that buy anyone?

Effectively the property doesn't exist if it has no value. There's no arc in the graph because there's no node it could point to. As soon as you round-trip to RDF you would loose it (I know, you are not concerned about that) exactly because of to that reason.

What if the incoming data doesn't contain the property? Is that data then invalid according your validator? Is there somewhere a must contain this property even if it has no value requirement? If there isn't, you need both checks, property not there and empty array. If you require that the property exists and has a value, you also need more checks: property exists and value != empty array instead of just, property exists.

dlongley · 2013-02-18T17:02:51Z

I can reject inputs that don't have the property if I want to, yes. Maybe I want to ensure people are very explicit when they say they don't have any values for property X... and if they aren't, I won't accept their data.

JSON-LD is, primarily, about JSON, IMO. I would expect that most applications that consume JSON but use JSON-LD to preprocess their data do the following:

Only consider those terms that are in the default JSON-LD context for their application. This means that dropping values that aren't mapped to properties likely isn't an issue.
Have situations where the existence of certain properties is expected, and if they don't exist, the application raises an error at the validation layer. This means that having to detect that a property doesn't exist vs. it had no values is an extra step that has to be considered if a JSON-LD processor drops the property. I'd prefer to avoid having to add this extra step because I don't see the utility of dropping the property.

This seems like a case where consistency between processing output and the graph in the abstract is a bad idea. There is extra meaning that is useful to applications that needn't be dropped just for consistency's sake.

lanthaler · 2013-02-18T17:23:59Z

Maybe I want to ensure people are very explicit when they say they don't have any values for property X... and if they aren't, I won't accept their data.

They are not saying that - they say nothing. They would need to express it explicitly by using something like owl:Nothing.

Have situations where the existence of certain properties is expected, and if they don't exist, the application raises an error at the validation layer. This means that having to detect that a property doesn't exist vs. it had no values is an extra step that has to be considered if a JSON-LD processor drops the property. I'd prefer to avoid having to add this extra step because I don't see the utility of dropping the property.

Linked Data (RDF) and thus also JSON-LD are based on the open world assumption. Thus, the scenario you explain makes semantically absolutely no sense, IMO at least. We had discussions quite some time ago about what null means and we decided that it means nothing. If we wouldn't have made that decision, you could use it to explicitly express that that property has no value. But we didn't. An empty array is the same as null. It means nothing.

So, why is it that "homepage": null is dropped but "homepage": [ ] isn't? That is clearly inconsistent IMO and we should resolve it. If you really need that property to be there, frame your data and give it a default value.

dlongley · 2013-02-18T19:38:48Z

They are not saying that - they say nothing. They would need to express it explicitly by using something like owl:Nothing.

This is the point at which JSON developers stop using JSON-LD.

IMO, there is a very large group of developers that JSON-LD can meet the needs of and bring into the linked data world. When we start bringing in esoteric concepts like owl:Nothing instead of letting people use homepage: [], we quickly whittle away the size of that group. While consistency between processing output and the abstract data model is important, I believe it is trumped by usefulness and adoptability. Obviously, we don't want to introduce glaring inconsistencies, but I really don't think leaving properties w/empty arrays in the output does that. I do think that requiring JSON developers to now grasp owl:Nothing begins to impose too onerous a learning curve.

This isn't about getting the exact semantics correct for "really really has no value" vs. "i didn't specify anything", this is about getting people to use linked data in JSON without giving themselves a headache.

"When I don't have any values for a property in JSON, I use an empty array. If I want to preserve that array when I run it through a JSON-LD processor I have to link to the "owl" vocabulary and use the owl:Nothing property? Forget it, I'll just use JSON."

dlongley · 2013-02-18T19:43:49Z

I'll just say this -- I think if we take this sort of direction, we're going to end up requiring framing everywhere. When that's coupled with the fact that we didn't include framing in version 1.0 of the API (not questioning that decision right now, btw) it seems problematic to me ... like a potential barrier to adoption.

dlongley · 2013-02-18T20:01:15Z

We have several different ways of saying "nothing" in the JSON-LD syntax. It seems to me that we ought to pick the one that is most advantageous to JSON developers in our processing output. There's nothing incorrect about that, it only has an upside, IMO.

lanthaler · 2013-02-18T20:16:05Z

Well, I see it differently. I don't have a reasonable explanation at hand for the fact that we drop properties with a value that equals null and we don’t do the same for properties whose value is an empty array. It becomes even more confusing when you consider the fact that we compact arrays containing just one element... but it stays an array if there’s no element.

We don’t need to bring in "esoteric concepts like owl:Nothing" at all to explain the behavior. I just mentioned it to illustrate the difference here in this discussion. All we have to say is that properties without value are dropped just as free-floating values (values that are not connected by a property to another value). Draw it as graph and it's even easier to understand.

We have several different ways of saying "nothing" in the JSON-LD syntax. It seems to me that we ought to pick the one that is most advantageous to JSON developers in our processing output. There's nothing incorrect about that, it only has an upside, IMO.

The more ways you have to express something, the more checks you need to find out what has been said.

msporny · 2013-02-18T20:18:09Z

PROPOSAL 1: -1 (we may have to fix the data model, which seems to be broken in this regard)
PROPOSAL 2: -1

@lanthaler I don't think that specifying the empty set is meaningless. I definitely don't think that we should make developers use owl:Nothing. I also agree with @dlongley that we're skirting dangerously close to forcing JSON developers to do something that is very strange in JSON.

So, I think that being able to express empty sets is almost as important as being able to express empty lists in JSON-LD. Empty sets were supported in the original RDF/XML Grammar Event Matching Notation. I don't have a strong opinion yet on whether it should round-trip to RDF or not. I don't think it has to, or if we think it has to, we might want to generate a blank node to represent the set that is an rdfs:Container. We can fix the JSON-LD data model by allowing an object to be the empty set, which I'd expect is still aligned with RDF because you can do _:subject predicate _:object . _:object a rdfs:Container . We could also use rdf:Bag, but I think that's been deprecated.

lanthaler · 2013-03-26T15:26:02Z

RESOLVED: Do not drop empty arrays (sets) and empty lists in expansion and compaction.

lanthaler · 2013-03-26T16:09:37Z

We discussed this in today's telecon and decided to not change the current behavior, i.e., to keep empty arrays (representing sets & lists) when expanding/compacting a JSON-LD document.

Unless I hear objections, I will close the issue in 24 hours.

garpinc · 2022-10-14T13:06:23Z

I am not understanding the resolution here. What if I do want empty lists to be turned into owl:Nothing? Is there I hook I can add to enable this translation?

Debugging code it seems that https://github.com/jsonld-java/jsonld-java/blob/master/core/src/main/java/com/github/jsonldjava/core/RDFDataset.java provides no hook to change what you do when values is an empty list where it should either via JsonLdOptions or otherwise. There is also no clear way to provide your own implementation of RDFDataset so you can do what u want. it seems to me that providing a callback via JsonLdOptions for the implementation of com.github.jsonldjava.core.RDFDataset.graphToRDF(String, Map<String, Object>) would be the way to go.

lanthaler closed this as completed Mar 27, 2013

adlerfaulkner mentioned this issue Jun 3, 2022

multivalued and required slot does not allow empty array when converted to OWL linkml/linkml#826

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Drop empty arrays (sets) and empty lists in expansion #220

Drop empty arrays (sets) and empty lists in expansion #220

lanthaler commented Feb 17, 2013

lanthaler commented Feb 17, 2013

gkellogg commented Feb 17, 2013

dlongley commented Feb 18, 2013

lanthaler commented Feb 18, 2013

dlongley commented Feb 18, 2013

lanthaler commented Feb 18, 2013

dlongley commented Feb 18, 2013

lanthaler commented Feb 18, 2013

dlongley commented Feb 18, 2013

dlongley commented Feb 18, 2013

dlongley commented Feb 18, 2013

lanthaler commented Feb 18, 2013

msporny commented Feb 18, 2013

lanthaler commented Mar 26, 2013

lanthaler commented Mar 26, 2013

garpinc commented Oct 14, 2022 •

edited

Drop empty arrays (sets) and empty lists in expansion #220

Drop empty arrays (sets) and empty lists in expansion #220

Comments

lanthaler commented Feb 17, 2013

lanthaler commented Feb 17, 2013

gkellogg commented Feb 17, 2013

dlongley commented Feb 18, 2013

lanthaler commented Feb 18, 2013

dlongley commented Feb 18, 2013

lanthaler commented Feb 18, 2013

dlongley commented Feb 18, 2013

lanthaler commented Feb 18, 2013

dlongley commented Feb 18, 2013

dlongley commented Feb 18, 2013

dlongley commented Feb 18, 2013

lanthaler commented Feb 18, 2013

msporny commented Feb 18, 2013

lanthaler commented Mar 26, 2013

lanthaler commented Mar 26, 2013

garpinc commented Oct 14, 2022 • edited

garpinc commented Oct 14, 2022 •

edited