Data profiling #51

Closed
davidc92 opened this issue Jan 22, 2021 · 2 comments

Comments

@davidc92
Contributor

A function to carry out data profiling as typically required by data management. What I mean by that is:

  • A summary of the completeness/missingness of each variable in a dataset
  • If a variable is a date, what is the range?
  • If a variable is a factor, the frequencies of each factor level

I strongly suspect this ask may be fulfilled by another package, but if not I don't think this should be too tricky a function to write.
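
To make the three points above concrete, here is a rough base R sketch of what such a function could look like. The name `profile_data()`, the output columns, and the example data are all hypothetical, made up for illustration; this is not an existing function in this package or any other.

```r
# Hypothetical helper: returns one row per variable with its missingness,
# plus the range for Date columns and level counts for factor columns.
profile_data <- function(df) {
  stopifnot(is.data.frame(df))
  rows <- lapply(names(df), function(nm) {
    x <- df[[nm]]
    detail <- NA_character_
    if (inherits(x, "Date") && any(!is.na(x))) {
      # Date variable: report earliest and latest non-missing value
      rng <- range(x, na.rm = TRUE)
      detail <- paste(format(rng[1]), "to", format(rng[2]))
    } else if (is.factor(x)) {
      # Factor variable: report the frequency of each level (NA excluded)
      counts <- table(x)
      detail <- paste0(names(counts), " (", as.integer(counts), ")",
                       collapse = ", ")
    }
    data.frame(
      variable = nm,
      class = class(x)[1],
      n_missing = sum(is.na(x)),
      pct_missing = round(100 * mean(is.na(x)), 1),
      detail = detail,
      stringsAsFactors = FALSE
    )
  })
  do.call(rbind, rows)
}

# Made-up example data
example <- data.frame(
  visit = as.Date(c("2021-01-05", NA, "2021-03-20", "2021-02-11")),
  sex = factor(c("F", "M", "F", NA)),
  score = c(1.5, 2.0, NA, NA)
)
profile <- profile_data(example)
print(profile)
```

Returning a plain data frame keeps the result easy to filter or export; other column types (character, numeric) only get the missingness summary in this sketch.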

@davidc92 added the enhancement (New feature or request) label on Jan 22, 2021
@jvillacampa
Contributor

As you say, there are a couple of packages out there for this purpose. See this one:
https://towardsdatascience.com/simple-fast-exploratory-data-analysis-in-r-with-dataexplorer-package-e055348d9619
And functions for this type of job in different packages:
https://www.r-bloggers.com/2018/08/exploratory-data-analysis-in-r-introduction/
Things like your second and third points can be done in base R (range(df$date); table(df$factor)).
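
As a quick illustration of those two base R calls, assuming a made-up data frame `df` with a Date column called `date` and a factor column called `factor`:

```r
# Made-up data for illustration only
df <- data.frame(
  date = as.Date(c("2020-12-01", "2021-01-22", NA)),
  factor = factor(c("a", "b", "a"))
)

# na.rm = TRUE is needed here, otherwise a single missing date
# makes both ends of the range NA
date_range <- range(df$date, na.rm = TRUE)

# useNA = "ifany" adds an NA count only when missing values are present
level_counts <- table(df$factor, useNA = "ifany")

print(date_range)
print(level_counts)
```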

There might be specific jobs for which developing a new function is required, but perhaps it's more a matter of having a guide on how to do this type of thing?

@alice-hannah
Contributor

This function might be similar to what you're after too - https://github.com/jackhannah95/jafun/blob/master/R/prop_missing.R

@Moohan closed this as not planned on Jul 2, 2024