You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I can confirm and can reproduce this. My best guess is that this is a serialization error or we're calling a stringify or smth too eagerly.
basically the list is cast to a tuple and then stringified somewhere, i.e. ["a"] becomes "('a',)". Passing a tuple directly or the literal works as expected.
If I had to guess, this looks suspicious but I don't fully understand what's going on there. Maybe @rjzamora ?
Thanks for raising @ayushdg ! I ran into this bug yesterday, but didn't get a chance to raise and issue and investigate yet. @fjetter is correct that this is likely another HLG-serialization edge case (probably having to do with msgpack not distinguishing lists/tuples). I will try to figure out a fix, but also look forward to something like dask/distributed#6028 avoiding these problems altogether :)
Describe the issue:
The broadcast merge codepath for dataframes throws an error when passing a list of columns in the
on
argument with akeyError
Minimal Complete Verifiable Example:
Anything else we need to know?:
Same issue persists when joining on multiple columns
Environment:
The text was updated successfully, but these errors were encountered: