WeakKeyDict should not convert keys as otherwise it can drop them #24941

mauro3 · 2017-12-06T11:33:16Z

Consider:

julia> wkh = WeakKeyDict{Vector{Int}, Any}()
WeakKeyDict{Array{Int64,1},Any} with 0 entries

julia> v = Float64[1:10^5;];

julia> wkh[v] = 1
1

julia> gc()

julia> keys(wkh)
Base.KeyIterator for a WeakKeyDict{Array{Int64,1},Any} with 0 entries

I would argue that this is not the intended behavior and instead an error should be thrown on wkh[v] = 1. Although, maybe there is a use for this which escapes me, e.g. hash consing, see #3002.

Open questions:

is whether this strict behavior should also be enforced for getindex, i.e. no key conversion on getters either.
one inconsistency with non-conversion is that k1==k2 does not imply wkh[k1]==wkh[k2]. Not sure whether this is bad or not (or worse than above example).

I'm not sure whether this issue should be classified as breaking (I guess that depends whether the behavior is a bug or a feature).

I have a branch in which I changed this here, which I could turn into a PR. Alternatively, if this is a feature and not a bug, a test should be added. Could above example be used or is that too fragile due to gc()?

The text was updated successfully, but these errors were encountered:

mauro3 · 2018-07-11T08:48:55Z

This should get the collections label and probably the bug label.

StefanKarpinski · 2018-07-11T18:15:53Z

WeakKeyDict should really be WeakKeyIdDict, anything else is pretty broken.

StefanKarpinski · 2018-07-17T15:59:08Z

I realize it's very late in the game, but is it fundamentally broken for WeakKeyDict not to do egal-based key comparison? @JeffBezanson

JeffBezanson · 2018-07-17T16:43:27Z

No, I don't think it's fundamentally broken. The weakness controls when keys get deleted, and the comparison controls which keys can be found. For example if [1] is a key, other [1] arrays will match it but the key will only be removed when the original [1] is freed. That might not be the behavior you want, but it doesn't seem totally broken to me. But on the whole I agree WeakKeyIdDict makes a bit more sense.

mauro3 · 2018-07-17T18:40:14Z

I totally agree that the ID-version makes more sense.

With the non-ID version, I cannot imagine a situation when conversion of the key upon insertion is the desired outcome. Do you have an example when that could be useful and not a bug? (I'm ok with comparison doing a "conversion")

StefanKarpinski · 2018-07-17T19:32:30Z

Ok, the current version isn't "fundamentally broken" in the sense that it crashes or gives incorrect behavior, but I cannot imagine a situation where someone is using a WeakKeyDict and a WeakKeyIdDict (or WeakIdDict?) would not actually be more correct. Which I think is roughly the same as what @mauro3 is saying.

mlhetland · 2018-07-17T19:37:00Z

A recent data point in favor of that here.

mauro3 · 2018-07-18T08:46:28Z

I implemented the using-object-id change in PR #28161, maybe this can help the triage-call tomorrow.

JeffBezanson · 2018-07-18T16:11:32Z

See #3002 --- Distributed uses the ==-based hashing behavior of WeakKeyDict. For now we should maybe just remove the conversion, and add WeakKeyIdDict later.

mauro3 · 2018-07-19T10:35:25Z

I found three occurences of WeakKeyDict in stdlib (and none in base/):

julia/stdlib/Distributed/src/clusterserialize.jl

Line 30 in d556784

const object_numbers = WeakKeyDict()
julia/stdlib/Distributed/src/remotecall.jl

Line 11 in cdd4e84

const client_refs = WeakKeyDict{Any, Nothing}() # used as a WeakKeySet
julia/stdlib/Serialization/src/Serialization.jl

Line 361 in f104ea4

const object_numbers = WeakKeyDict()

As per Jeff's comment, the first two need ==-based hashing. The third looks suspiciously similar. So, if julia itself only needs the == version of WeakKeyDict, then the Id version should probably go to the DataStructures.jl package. However, procrastinating I coded a WeakKeyDict+WeakKeyIdDict over in PR #28182, so you can see how a possible implementation would look like.

It would probably be good though to add tests which test the ==-needing features of above three WeakKeyDict occurrences (as my PR #28161 shows, changing to ===-hashing does not lead to any test failures in their tests).

StefanKarpinski · 2018-07-19T18:42:26Z

Thanks for the investigation, @mauro3! The conclusion from discussion on the triage call is that we should just make the minimal change here and make WeakKeyDict not do autoconversion anymore.

mauro3 · 2018-07-19T19:22:47Z

Cool, I can prepare a PR tomorrow. Did you discuss whether the no-conversion applies to getters as well?

vtjnash · 2018-07-19T19:36:56Z

We said getters should continue to convert – it's just setters that should type-assert.

mauro3 · 2018-07-19T20:43:09Z

get! is a bit odd then:

a = [1]
wkd = WeakKeyDict(a=>1)
get!(wkd, [1.0], 1) # works
get!(wkd, [2.0], 1) # errors

but I guess that is ok. Or maybe more consistent to throw on the first too? (The latter would be easier to implement ;-)

Fixes JuliaLang#24941

StefanKarpinski · 2018-07-19T21:30:19Z

I would favor erroring on the first one too.

mauro3 · 2018-07-20T11:33:04Z

For those following along at home and still wondering when the == behavior is needed. It's only used here where it stores RemoteRefs which are compared/hashed with the methods here. The other two WeakKeyDicts (here and here) take objects which fall back to === comparison/hashing. Sadly I couldn't come up with a test case for the RemoteRefs case which tests the behavior needing ==.

…liaLang#28198)

StefanKarpinski added the triage This should be discussed on a triage call label Jul 17, 2018

mauro3 mentioned this issue Jul 18, 2018

WIP/RFC: Converted WeakKeyDict to hash & compare with object-id #28161

Closed

4 tasks

mauro3 mentioned this issue Jul 19, 2018

WeakKeyIdDict implementation (keeping WeakKeyDict) #28182

Open

3 tasks

Keno added this to the 0.7 milestone Jul 19, 2018

JeffBezanson removed the triage This should be discussed on a triage call label Jul 19, 2018

mauro3 added a commit to mauro3/julia that referenced this issue Jul 19, 2018

Also update get! (which was in fact broken before)

1db307f

Fixes JuliaLang#24941

mauro3 mentioned this issue Jul 19, 2018

WeakKeyDict with no conversion on key-insertion (#24941) #28198

Merged

mauro3 added a commit to mauro3/julia that referenced this issue Jul 19, 2018

Also update get! (which was in fact broken before)

c1eb6c0

Fixes JuliaLang#24941

StefanKarpinski closed this as completed in #28198 Jul 22, 2018

StefanKarpinski pushed a commit that referenced this issue Jul 22, 2018

WeakKeyDict with no conversion on key-insertion (#24941) (#28198)

41e749f

Liozou pushed a commit to Liozou/julia that referenced this issue Jul 23, 2018

WeakKeyDict with no conversion on key-insertion (JuliaLang#24941) (Ju…

feb5b87

…liaLang#28198)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WeakKeyDict should not convert keys as otherwise it can drop them #24941

WeakKeyDict should not convert keys as otherwise it can drop them #24941

mauro3 commented Dec 6, 2017

mauro3 commented Jul 11, 2018

StefanKarpinski commented Jul 11, 2018

StefanKarpinski commented Jul 17, 2018

JeffBezanson commented Jul 17, 2018

mauro3 commented Jul 17, 2018

StefanKarpinski commented Jul 17, 2018

mlhetland commented Jul 17, 2018

mauro3 commented Jul 18, 2018

JeffBezanson commented Jul 18, 2018

mauro3 commented Jul 19, 2018

StefanKarpinski commented Jul 19, 2018

mauro3 commented Jul 19, 2018

vtjnash commented Jul 19, 2018

mauro3 commented Jul 19, 2018 •

edited

StefanKarpinski commented Jul 19, 2018

mauro3 commented Jul 20, 2018

WeakKeyDict should not convert keys as otherwise it can drop them #24941

WeakKeyDict should not convert keys as otherwise it can drop them #24941

Comments

mauro3 commented Dec 6, 2017

mauro3 commented Jul 11, 2018

StefanKarpinski commented Jul 11, 2018

StefanKarpinski commented Jul 17, 2018

JeffBezanson commented Jul 17, 2018

mauro3 commented Jul 17, 2018

StefanKarpinski commented Jul 17, 2018

mlhetland commented Jul 17, 2018

mauro3 commented Jul 18, 2018

JeffBezanson commented Jul 18, 2018

mauro3 commented Jul 19, 2018

StefanKarpinski commented Jul 19, 2018

mauro3 commented Jul 19, 2018

vtjnash commented Jul 19, 2018

mauro3 commented Jul 19, 2018 • edited

StefanKarpinski commented Jul 19, 2018

mauro3 commented Jul 20, 2018

mauro3 commented Jul 19, 2018 •

edited