API: Deprecate skip_footer in read_csv #13386

gfyoung · 2016-06-07T15:09:41Z

Title is self-explanatory.

Closes gh-13349 and partially undoes this commit back in v0.9.0. With such a massive API now, having duplicate arguments makes managing it way less practical.

gfyoung · 2016-06-07T15:12:15Z

In light of removing duplicate arguments and stripping down the API a bit, here are a couple of others I would like to propose (perhaps not in this PR but for discussion nonetheless):

Deprecate skipfooter in read_excel (it's allowed as an alternative to skip_footer, though it is not surfaced in the signature but rather encompassed in a **kwargs argument)?
Should we choose between sep and delimiter in read_csv?

jreback · 2016-06-07T15:41:08Z

the problem is that we also have skiprows. I would actually deprecate skip_footer instead.

gfyoung · 2016-06-07T16:06:22Z

Hmmm...that's a flip-flop AFAICT from what you said earlier here. However, deprecating skip_footer seems a little strange since we use skip_footer all over the place in the code (internally especially). That's why I chose to deprecate skipfooter, since the code impact is not very significant as you can tell. In addition, we use the _ in a lot of the other arguments, so I'd rather side with the majority.

jreback · 2016-06-07T16:10:43Z

just change it internally to the correct kw. I think they diverged at some point. It looks (just a quick skim), that skipfooter is more consistent (as we have skiprows, usecols) etc. excel should be fixed to be the same as well.

gfyoung · 2016-06-07T16:18:28Z

@jreback : let's actually do the count of multi-word argument names (excluding the skip(_)footer ones) using the documentation as of 0.18.1 here:

Arguments that use _:
filepath_or_buffer
index_col
mangle_dupe_cols
true_values
false_values
na_values
keep_default_na
na_filter
skip_blank_lines
error_bad_lines
warn_bad_lines
delim_whitespace
as_recarray
compact_ints
use_unsigned
low_memory
buffer_lines
memory_map
float_precision

Arguments that don't use _:
usecols
skipinitialspace
skiprows
chunksize
lineterminator
doublequote
quotechar
escapechar

Even excluding ones that have been recently deprecated (compact_ints, buffer_lines, use_unsigned, as_recarray), arguments that use _ vastly outnumber those that don't.

jreback · 2016-06-07T16:22:55Z

@gfyoung I am trying to minimize back-compat pain.

usecols and skiprows are prob some of the most used options. I agree we should just deprecate and move to _ separated, but that would be in 0.19.0 if we go whole hog.

codecov-io · 2016-06-07T16:26:56Z

Current coverage is 85.23% (diff: 100%)

Merging #13386 into master will increase coverage by <.01%

@@             master     #13386   diff @@
==========================================
  Files           140        140          
  Lines         50420      50419     -1   
  Methods           0          0          
  Messages          0          0          
  Branches          0          0          
==========================================
  Hits          42975      42975          
+ Misses         7445       7444     -1   
  Partials          0          0

Powered by Codecov. Last update cc216ad...d21345f

gfyoung · 2016-06-07T22:24:58Z

@jreback : I have no issues waiting until 0.19.0 and going full hog. This duplicate argument is not breaking, and it will then give us the time to properly deprecate and rename arguments with _.

shoyer · 2016-07-14T16:04:22Z

I would lean against this change -- I don't think deprecating or even adding aliases with underscores for these keywords is worth it. Yes, consistency is nice, but these are keywords that are mostly either (a) copied straight from the API docs or an example or (b) auto-completed. Omitting underscores between lower case words is also common enough in PEP8 compliant code that it doesn't hurt my eyes in the way that camelCase would, for example. I would save changes like this for next time somebody writes a DataFrame library ;). There's just very little upside to the change at this point.

chris-b1 · 2016-07-14T16:37:43Z

I'd also be inclined to not do this.

Consistency is great, but to me, big breaking (or warning triggering) changes for consistency are only worth it to the extent they actually improve the user experience. E.g. resample, the new sort_... api, or a closer example, when the inconsistency between the rows and index keyword was resolved in pivot_table / pivot - #5505

This doesn't seem to meet that hurdle. Like @shoyer said, these argument would be typically tab-completed or looked up, or even if not, the inconsistency doesn't cause any confusion.

gfyoung · 2016-07-14T19:23:11Z

To put things in perspective, this whole discussion was generated because I wanted to remove the duplicate 'skipfooter/skip_footer' argument in the signature. It really should only be one of the two IMO.

The consistency part came up because we weren't sure which way to go i.e do we take the underscore version or the non-underscore?

I perfectly understand if we don't want to make such a change as drastic as it has become, though some input on the initial goal would be good then.

chris-b1 · 2016-07-14T20:49:34Z

I don't have a strong opinion on that, but given that all the docs / SO answers / etc in the wild use skipfooter, I'd think deprecating skip_footer makes more sense?

gfyoung · 2016-07-15T05:47:28Z

In light of the push back, I will rollback my changes and just deprecate skip_footer as @chris-b1 indicated. If skipfooter is being used in the wild, let's stick with that one then.

gfyoung · 2016-07-15T07:44:08Z

@jreback , @jorisvandenbossche : Rolled back my changes successfully (i.e. Travis is passing) to just deprecating skip_footer for the reason that @chris-b1 outlined above. Ready to merge if there are no other concerns.

jreback · 2016-07-15T10:40:34Z

lgtm. @jorisvandenbossche

gfyoung · 2016-07-19T14:38:08Z

@jorisvandenbossche : any updates on this?

jorisvandenbossche · 2016-07-19T22:58:46Z

What do we do other read_ functions for the skipfooter case?

gfyoung · 2016-07-19T23:21:30Z

Same as here pre-PR (i.e. it just accepts both of them). However, I will handle all other cases similarly if this one is merged.

gfyoung · 2016-07-26T01:14:52Z

@jorisvandenbossche : Any updates on this?

jreback · 2016-07-27T10:31:32Z

doc/source/io.rst

@@ -1351,7 +1351,7 @@ back to python if C-unsupported options are specified. Currently, C-unsupported
 options include:

 - ``sep`` other than a single character (e.g. regex separators)
- ``skip_footer``
+- ``skipfooter``


did the doc-string get changed (to add DEPRECATED)?

Hmmm...not sure where those changes went. Added them back.

gfyoung · 2016-07-28T03:40:59Z

@jreback : added the documentation back about the deprecation, and Travis is still happy after rebase. Ready to merge if there are no other concerns.

jreback · 2016-07-29T00:25:31Z

thanks @gfyoung

Deprecated back in 0.19.0. xref pandas-devgh-13386.

Deprecated back in 0.19.0. xref gh-13386.

gfyoung force-pushed the deprecate-dup-skipfooter branch from 23c07a2 to b563179 Compare June 7, 2016 15:10

jreback added API Design IO CSV read_csv, to_csv labels Jun 7, 2016

gfyoung force-pushed the deprecate-dup-skipfooter branch from b563179 to 4560c12 Compare June 7, 2016 22:24

jreback added this to the 0.19.0 milestone Jun 7, 2016

gfyoung changed the title ~~DEPR: deprecate skipfooter in read_csv~~ API: Use underscore in read_* arguments Jun 8, 2016

gfyoung force-pushed the deprecate-dup-skipfooter branch 8 times, most recently from fdb6f97 to 85a58f5 Compare June 9, 2016 00:55

gfyoung changed the title ~~API: Use underscore in read_* arguments~~ API: Use underscore in read_csv arguments Jun 9, 2016

gfyoung force-pushed the deprecate-dup-skipfooter branch 7 times, most recently from ec04105 to 5ae47e9 Compare June 15, 2016 09:29

gfyoung force-pushed the deprecate-dup-skipfooter branch from f675a02 to e6a72c8 Compare July 14, 2016 07:58

gfyoung changed the title ~~API: Use underscore in read_csv arguments~~ API: Deprecate skip_footer in read_csv Jul 15, 2016

gfyoung force-pushed the deprecate-dup-skipfooter branch from e6a72c8 to 1fdc1ae Compare July 15, 2016 07:04

gfyoung force-pushed the deprecate-dup-skipfooter branch from 1fdc1ae to 3208f02 Compare July 27, 2016 00:54

jreback reviewed Jul 27, 2016
View reviewed changes

gfyoung force-pushed the deprecate-dup-skipfooter branch from 3208f02 to 9ddb3bf Compare July 28, 2016 02:37

API: Deprecate skip_footer in read_csv

d21345f

gfyoung force-pushed the deprecate-dup-skipfooter branch from 9ddb3bf to d21345f Compare July 28, 2016 02:38

jsexauer mentioned this pull request Jul 29, 2016

DEPR: Clean up list of deprecations from prior versions #6581

Closed

1 task

jreback closed this in aa88215 Jul 29, 2016

gfyoung deleted the deprecate-dup-skipfooter branch July 29, 2016 02:29

gfyoung added a commit to forking-repos/pandas that referenced this pull request Dec 11, 2017

CLN: Drop the skip_footer parameter in read_csv

be127de

Deprecated back in 0.19.0. xref pandas-devgh-13386.

gfyoung mentioned this pull request Dec 11, 2017

CLN: Drop the skip_footer parameter in read_csv #18724

Merged

jreback mentioned this pull request Dec 11, 2017

DEPR: deprecations log for removed issues #13777

Closed

gfyoung added a commit to forking-repos/pandas that referenced this pull request Dec 11, 2017

CLN: Drop the skip_footer parameter in read_csv

cefc2e6

Deprecated back in 0.19.0. xref pandas-devgh-13386.

gfyoung added a commit that referenced this pull request Dec 11, 2017

CLN: Drop the skip_footer parameter in read_csv (#18724)

4091f64

Deprecated back in 0.19.0. xref gh-13386.

jorisvandenbossche mentioned this pull request Sep 12, 2018

API: Standardize usage of underscores in multi-word kwargs #22639

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

API: Deprecate skip_footer in read_csv #13386

API: Deprecate skip_footer in read_csv #13386

gfyoung commented Jun 7, 2016 •

edited

Loading

gfyoung commented Jun 7, 2016 •

edited

Loading

jreback commented Jun 7, 2016

gfyoung commented Jun 7, 2016 •

edited

Loading

jreback commented Jun 7, 2016

gfyoung commented Jun 7, 2016 •

edited

Loading

jreback commented Jun 7, 2016

codecov-io commented Jun 7, 2016 •

edited

Loading

gfyoung commented Jun 7, 2016 •

edited

Loading

shoyer commented Jul 14, 2016

chris-b1 commented Jul 14, 2016

gfyoung commented Jul 14, 2016

chris-b1 commented Jul 14, 2016

gfyoung commented Jul 15, 2016

gfyoung commented Jul 15, 2016 •

edited

Loading

jreback commented Jul 15, 2016

gfyoung commented Jul 19, 2016

jorisvandenbossche commented Jul 19, 2016

gfyoung commented Jul 19, 2016 •

edited

Loading

gfyoung commented Jul 26, 2016

jreback Jul 27, 2016

gfyoung Jul 28, 2016

gfyoung commented Jul 28, 2016

jreback commented Jul 29, 2016

API: Deprecate skip_footer in read_csv #13386

API: Deprecate skip_footer in read_csv #13386

Conversation

gfyoung commented Jun 7, 2016 • edited Loading

gfyoung commented Jun 7, 2016 • edited Loading

jreback commented Jun 7, 2016

gfyoung commented Jun 7, 2016 • edited Loading

jreback commented Jun 7, 2016

gfyoung commented Jun 7, 2016 • edited Loading

jreback commented Jun 7, 2016

codecov-io commented Jun 7, 2016 • edited Loading

Current coverage is 85.23% (diff: 100%)

gfyoung commented Jun 7, 2016 • edited Loading

shoyer commented Jul 14, 2016

chris-b1 commented Jul 14, 2016

gfyoung commented Jul 14, 2016

chris-b1 commented Jul 14, 2016

gfyoung commented Jul 15, 2016

gfyoung commented Jul 15, 2016 • edited Loading

jreback commented Jul 15, 2016

gfyoung commented Jul 19, 2016

jorisvandenbossche commented Jul 19, 2016

gfyoung commented Jul 19, 2016 • edited Loading

gfyoung commented Jul 26, 2016

jreback Jul 27, 2016

Choose a reason for hiding this comment

gfyoung Jul 28, 2016

Choose a reason for hiding this comment

gfyoung commented Jul 28, 2016

jreback commented Jul 29, 2016

gfyoung commented Jun 7, 2016 •

edited

Loading

gfyoung commented Jun 7, 2016 •

edited

Loading

gfyoung commented Jun 7, 2016 •

edited

Loading

gfyoung commented Jun 7, 2016 •

edited

Loading

codecov-io commented Jun 7, 2016 •

edited

Loading

gfyoung commented Jun 7, 2016 •

edited

Loading

gfyoung commented Jul 15, 2016 •

edited

Loading

gfyoung commented Jul 19, 2016 •

edited

Loading