Remove zero-width chars from key, before matches #243

egregors · 2023-02-23T10:03:32Z

As a last way to compare keys, remove all zero-width characters from the key.

Solved #242

pikanezi · 2023-02-23T10:49:13Z

Isn't it normal that "\u200Bdate" does not match "date"? I'm not sure if it is up to gocsv to normalize your data.

Can't you fix your data before using gocsv?

egregors · 2023-02-23T11:44:38Z

Isn't it normal that "\u200Bdate" does not match "date"? I'm not sure if it is up to gocsv to normalize your data.

But for some reason, you're already doing it. I didn't see a big difference between strings.TrimSpace(key) == k which already in matchesKey method and cleaning zw-chars. Why “\u200Bdate” does not match “date” is normal, but “ date“ does not match “date” isn't?

A user-story here is pretty clear.
Let's say I'm a consumer of the lib. And I'd like to parse some csv.
So, I open a csv file and see titles row: data;a;b;c;;d which contains some \u200B. Obviously, I don't see any zero-width chars in titles, and write csv annotation as I see it in the raw doc.

But, after Unmarshal call, I am getting invalid result. Some cols are not parsed.

For me, it looks, like libs responsibility to maintain expected behavior. As far as you're already doing some normalization in fieldInfo.matchesKey method.

pikanezi · 2023-02-23T11:55:38Z

Good point, thanks

Remove zero-width chars from key, before matches

ca4a2f7

pikanezi merged commit dc4ee9d into gocarina:master Feb 23, 2023

egregors mentioned this pull request Feb 23, 2023

getCSVFieldPosition works wrong if column name contains zero width characters #242

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove zero-width chars from key, before matches #243

Remove zero-width chars from key, before matches #243

egregors commented Feb 23, 2023

pikanezi commented Feb 23, 2023

egregors commented Feb 23, 2023

pikanezi commented Feb 23, 2023

Remove zero-width chars from key, before matches #243

Remove zero-width chars from key, before matches #243

Conversation

egregors commented Feb 23, 2023

pikanezi commented Feb 23, 2023

egregors commented Feb 23, 2023

pikanezi commented Feb 23, 2023