Fix np.genfromtxt field name handling when dtype != None #4649

alanbriolat · 2014-04-29T10:32:51Z

np.genfromtxt validates field names twice: once in genfromtxt itself and once in easy_dtype. The name-validation arguments to genfromtxt (deletechars, replace_space, etc.) are used in the first validation, but they aren't passed to easy_dtype, which is only called when dtype != None. In that case the default validation (strip non-alphanumeric characters, replace spaces) is applied on the second pass, silently overriding genfromtxt's arguments.

This patch adds failing tests for the issue and fixes genfromtxt by passing the appropriate arguments onwards to easy_dtype.
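A minimal sketch of the behaviour described above, assuming a NumPy release that includes this fix. The sample header and the `field_names` helper are made up for illustration; `deletechars` and `replace_space` are existing genfromtxt keyword arguments. On an unpatched NumPy, the second call would be expected to fall back to the default name sanitisation instead of keeping the names as written.

```python
import io

import numpy as np

# Illustrative input: header names containing '.', ' ', '(' and ':'.
text = "A.A, B (B), C:C\n1, 2, 3"


def field_names(**kwargs):
    """Return the field names genfromtxt produces for the sample text."""
    arr = np.genfromtxt(io.StringIO(text), delimiter=",", names=True, **kwargs)
    return arr.dtype.names


# Explicit dtype with the default name validation.
default_names = field_names(dtype=int)

# Explicit dtype, asking genfromtxt not to delete characters or replace spaces.
kept_names = field_names(dtype=int, deletechars="", replace_space="")

print(default_names)
print(kept_names)
```

With the fix in place, the second call should honour the arguments and keep the header names unchanged, which is the same result you would already get with dtype=None.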

This is probably the least invasive way to fix the issue. In my opinion, the whole thing is a nest of poorly-defined responsibilities between genfromtxt, easy_dtype and NameValidator, especially since the latter two are only used in the former. I'm willing to take the time to try and clean that up if it's likely to be well-received.

jaimefrio · 2014-09-21T06:07:12Z

It took me some time to realize that, if the current behavior was some form of intentional safeguard, it is very easily circumvented by simply setting dtype=None. If no one has any strong opposition, I would like to merge this. I will give it a day or two for people to complain.

My only nitpick is whether the tests, rather than comparing against hardcoded values for the array and its dtype, should compare against the result of running np.genfromtxt on the same input, and with all arguments set identically, except for dtype=None.
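A rough sketch of the alternative test style suggested here, not the test that was actually merged. The sample text and the `options` dict are assumptions for illustration, and it presumes a NumPy release that includes this fix. Only the field names and values are compared, since the inferred integer type could differ from an explicit one on some platforms.

```python
import io

import numpy as np

# Illustrative input and options (not taken from the PR's tests).
text = "A.A, B (B), C:C\n1, 2, 3"
options = dict(delimiter=",", names=True, deletechars="", replace_space="")

# Same call twice; the only difference is whether dtype is given explicitly.
explicit = np.genfromtxt(io.StringIO(text), dtype=int, **options)
inferred = np.genfromtxt(io.StringIO(text), dtype=None, **options)

names_match = explicit.dtype.names == inferred.dtype.names
values_match = explicit.tolist() == inferred.tolist()
print(names_match, values_match)
```

The trade-off is that such a test checks consistency between the two code paths rather than pinning the exact names, so a bug affecting both paths equally would go unnoticed.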

… aeronet variable names (e.g. 'Datetime(dd-mm-yy)' was becoming 'Datetimeddmmyy'). See this incredibly frustrating bug: numpy/numpy#4649

charris · 2015-01-23T22:02:41Z

Squashed commits and rewrote commit message in #5495. Thanks @alanbriolat.

charris · 2015-01-23T22:05:15Z

As you say, genfromtxt is a bit of a mess. Any work you want to do cleaning it up is welcome.

BUG: Fix genfromtext NameValidator arguments passed to easy_dtype.

alanbriolat added 2 commits April 29, 2014 10:58

Add genfromtxt tests to show broken field names when dtype!=None

096f195

Pass NameValidator arguments to easy_dtype for consistency

235fe87

alanbriolat mentioned this pull request Apr 29, 2014

genfromtxt strips brackets from names (Trac #1916) #2509

Closed

charris added the component: numpy.lib label Jan 23, 2015

charris mentioned this pull request Jan 23, 2015

BUG: Fix genfromtext NameValidator arguments passed to easy_dtype. #5495

Merged

charris closed this Jan 23, 2015

charris added the 00 - Bug label Jan 23, 2015

charris referenced this pull request Jan 24, 2015

Merge pull request #5495 from charris/cleanup-gh-4649

e1ff626
