Better handle unicode column names #558

maartenbreddels · 2020-01-21T19:41:22Z

Creating a dataframe with unicode chars like this:

df = vaex.from_dict({'ÚÚ': vaex.vrange(0, 100)})

Will replace the two characters with two underscores, which effectively hides the column (column names beginning with __ are hidden).
This was mentioned here:
https://stackoverflow.com/questions/59738879/python-vaex-how-to-create-dataframe-from-a-csv-file

The text was updated successfully, but these errors were encountered:

markkoob · 2020-01-21T19:55:51Z

I think this will be obvious (to you!) but since I am here anyway:

df = vaex.from_dict({'./label': vaex.vrange(0,100)})

In our case these characters have special meaning for us, so mutating the string is pretty inconvenient. I'm hoping the solution involves preserving the original string!

maartenbreddels · 2020-01-21T20:13:42Z

Thanks for sharing that.

To give some understanding, vaex uses the column names as variable names, thus they have to be valid identifiers (like variable names), since it's all built on valid Python expressions. In #370 we improved this to provide automatic translations, but it seems we need to do some more work.

maartenbreddels added bug enhancement labels Jan 21, 2020

maartenbreddels mentioned this issue Feb 26, 2020

Fix invalid name rewrites #617

Merged

maartenbreddels closed this as completed in #617 Apr 15, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Better handle unicode column names #558

Better handle unicode column names #558

maartenbreddels commented Jan 21, 2020

markkoob commented Jan 21, 2020

maartenbreddels commented Jan 21, 2020

Better handle unicode column names #558

Better handle unicode column names #558

Comments

maartenbreddels commented Jan 21, 2020

markkoob commented Jan 21, 2020

maartenbreddels commented Jan 21, 2020