Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Master list of most-requested issues #3189

Open
13 of 61 tasks
MichaelChirico opened this issue Dec 6, 2018 · 1 comment
Open
13 of 61 tasks

Master list of most-requested issues #3189

MichaelChirico opened this issue Dec 6, 2018 · 1 comment

Comments

@MichaelChirico
Copy link
Member

@MichaelChirico MichaelChirico commented Dec 6, 2018

Redirecting from post at /orgs/.../discussions. Inspired by happening to notice an issue getting more support (through GitHub reactions) and borrowing the idea to scrape the GH API from this nice article over at tidyverse. Here are all open issues with at least 5 positive reactions:

  • #635 |[| - Delete rows by reference
  • #1828 |NEW| - [Request] data.table spark interface
  • #2273 |NEW| - compatibility with sf library
  • #3992 |nafill| - nafill, setnafill for character, factor and other types
  • #1639 |on| - Support for on-the-fly columns in on
  • #1336 |NEW| - File-backed data.tables
  • #795 | .SD | - names(.SD) := ... should work
  • #3957 | NEW | - Need long-vector support
  • #1063 |NEW| - Set up rowwise() and colwise() functions for use in .SD
  • #657 | i | - [R-Forge #1611] Allow 2 column i matrix to return a list() (or vector if same type)
  • #2778 | froll | - rolling functions, rolling aggregates, sliding window, moving average
  • #2146 | NEW | - Create an efficient unnest function
  • #3467 | NEW | - Add psum?
  • #4258 | NEW | - Shouldn't fcase() recycle?
  • #788 | [ | - add a 'having' parameter to [.data.table
  • #4358 |setcolorder| - Additional arguments for setcolorder: before, after
  • #4348 |INSTALLATION| - support OpenMP with system macOS toolchain
  • #3739 |uniqueN| - uniqueN() is very slow compared to length(unique())
  • #3721 | JOINS | - foverlaps() should be easier to do point-to-interval join
  • #3060 | NEW | - progress indicator/bar for "by" operations
  • #2879 |NEW| - collect more statistics about the data
  • #2655 | META | - improvement on programmatically substituting columns in expressions
  • #499 | JOINS | - [R-Forge #2461] Faster version of Reduce(merge, list(DT1,DT2,DT3,...)) called mergelist (a la rbindlist)
  • #4697 |ALTREP| - Be less conservative about ALTREP
  • #4152 | NEW | - "not in" operator %!in%
  • #3977 | GForce | - GForce optimize weighted.mean
  • #3670 |INTERNAL| - What about data.table powered by GPU along with CPU?
  • #3241 | froll | - rolling funs / shift could support logical window
  • #3002 |JOINS| - Timeline / interest for not yet implemented foverlaps() functionality
  • #2219 | NEW | - Expose setlevels() and document
  • #1717 |JOINS| - allow cross join in [.data.table
  • #1414 |GForce| - GForce should be able to work with := as well.
  • #1120 | GForce | - implement guniqueN
  • #935 | JOINS | - Should be able to refer to i's .SD during a join.
  • #857 |JOINS| - Expand nomatch to accept any value (control fills, rolls, omits during joins)
  • #4677 |JOINS| - output from merge
  • #4066 |nafill| - nafill new type: approx
  • #3795 |:=| - Add an alias for functional version of :=
  • #3104 |NEW| - Share data.table among R sessions by reference
  • #3009 | JOINS | - allow.cartesian could allow .N on 2^31+ rows to finish
  • #1700 |JOINS| - join column naming with on syntax
  • #1086 |NEW| - Allow locking or protecting a data.table from any modifications
  • #2020 |fwrite| - fwrite ignores options(scipen = 999)
  • #1886 |transpose| - feature request : transpose(keep.rownames = T)?
  • #2016 - Support for writing to gzip (.gz) with fwrite
  • #1878 - patterns in .SDcols
  • #3042 - [FR] grouping in parallel
  • #1708 - shift should accept negative n
  • #1770 |fwrite| - fwrite UTF8
  • #4247 | DOCUMENTATION | - Vignette for programmatic data.table (eval, get, etc)
  • #3396 | RESHAPING | - melt.data.table should offer variable to match on the name, rather than the number

Separately for fread since there are many:

  • #2431 - Automatic detection of dec=',' in Europe
  • #583 - [R-Forge #2605] add filtering option to fread so it can load part of a file
  • #856 - Implement comment.char argument in fread
  • #1162 - [question] when will sep2 in fread be implemented?
  • #711 - fread option to skip lines with format errors (with warning)
  • #561 - [R-Forge #4931] Support file connections for fread
  • #4329 - fread tries to map memory for the entire file when using nrows
  • #2691 - fread fails with uneven number of columns when max columns in final row (with fill=TRUE and col.names set)
  • #2714 - fread not reading in column of all character "NA" as character with na.strings = ""
  • #1812 - fread problem with different number of columns

Please note that upvoting/reacting to this issue will make the universe collapse, probably.

Code to reproduce:

library(gh)
library(data.table)
# reactions API is "in development" so requires special header:
react_header = 'application/vnd.github.squirrel-girl-preview+json'

repo = 'Rdatatable/data.table'
issue_query_fmt = "/repos/%s/issues?state=open&per_page=100&page=%d"
issues = list()
page = 1L
while (all(nzchar(resp <- gh(sprintf(issue_query_fmt, repo, page),
                             .send_headers = c(Accept = react_header))))) {
  cat(sprintf("Reading page %d\n", page))
  issues = c(issues, resp)
  page = page + 1L
}
    
# filter pull requests
issues = issues[sapply(issues, function(issue) is.null(issue$pull_request))]
names(issues) = as.character(sapply(issues, `[[`, 'number'))

reactions = rbindlist(lapply(issues, `[[`, 'reactions'))

reactions[ , issue := gsub('.*issues/([0-9]+)/reactions', '\\1', url)]
reactions[ , url := NULL]
reactions[ , positive := total_count - `-1` - confused]
reactions[ , setdiff(names(reactions), c('issue', 'positive')) := NULL]

reactions[ , title := sapply(issues[issue], `[[`, 'title')]

# don't include this issue...
reactions[order(-positive)][positive >= 5 & issue != '3189']
@sritchie73 sritchie73 pinned this issue Dec 21, 2018
@sritchie73

This comment has been hidden.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Linked pull requests

Successfully merging a pull request may close this issue.

None yet
2 participants