-
Notifications
You must be signed in to change notification settings - Fork 135
Closed
Description
I've switched to tibble::set_tidy_names() now in the dev version of readxl. And frequently readxl::read_excel() does a lot of name repair. Especially before you really dial in the range or skip values that isolate the data rectangle. I'm beginning to wonder if the messaging around new names should have some upper limit, after which you get a summary. Below is a very modest example. It can get much worse.
readxl::read_excel("investigations/Medicare Part D 2015 Plan Report 03182015.xls")
#> New names:
#> -> ..2
#> -> ..3
#> -> ..4
#> -> ..5
#> -> ..6
#> -> ..7
#> -> ..8
#> -> ..9
#> -> ..10
#> -> ..11
#> -> ..12
#> -> ..13
#> -> ..14
#> -> ..15
#> -> ..16
#> -> ..17
#> -> ..18
#> -> ..19
#> -> ..20
#> -> ..21
#> -> ..22
#> -> ..23
#> -> ..24
#> -> ..25
#> -> ..26
#> # A tibble: 16,665 x 26
#> `2015 Plan and P… ..2 ..3 ..4 ..5 ..6 ..7 ..8 ..9 ..10
#> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr> <chr>
#> 1 Alabama to Monta… <NA> <NA> <NA> <NA> <NA> <NA> <NA> <NA> <NA>
#> 2 * See next works… <NA> <NA> <NA> <NA> <NA> <NA> <NA> <NA> <NA>
#> 3 State Coun… Orga… Plan… Cont… Plan… Segm… Orga… Plan… Spec…
#> 4 Alabama (All… Aetn… Aetn… S5810 182 0 PDP Medi… No
#> 5 Alabama (All… Aetn… Aetn… S5810 46 0 PDP Medi… No
#> 6 Alabama (All… Blue… Blue… S1030 6 0 PDP Medi… No
#> 7 Alabama (All… Blue… Blue… S1030 1 0 PDP Medi… No
#> 8 Alabama (All… Cign… Cign… S5617 220 0 PDP Medi… No
#> 9 Alabama (All… Cign… Cign… S5617 182 0 PDP Medi… No
#> 10 Alabama (All… Cign… Cign… S5617 257 0 PDP Medi… No
#> # ... with 16,655 more rows, and 16 more variables: ..11 <chr>,
#> # ..12 <chr>, ..13 <chr>, ..14 <chr>, ..15 <chr>, ..16 <chr>,
#> # ..17 <chr>, ..18 <chr>, ..19 <chr>, ..20 <chr>, ..21 <chr>,
#> # ..22 <chr>, ..23 <chr>, ..24 <chr>, ..25 <chr>, ..26 <chr>Created on 2018-04-28 by the reprex package (v0.2.0).
Metadata
Metadata
Assignees
Labels
No labels