Support list columns #9
Comments
I am having a hard time coming up with ideas for relevant checks and summaries to perform on (all) lists. The very core idea of dataMaid is to perform a standard suite of checks for each variable class. Do you have any suggestions of your own for relevant checks on lists? Or did you perhaps have a specific example in mind when you opened this issue?
If you have a list column inside a data frame, you typically want each element to have the same form. For example, if you call strsplit(), then the output is a list of character vectors, and you might want to store this as a field in a data frame. So some useful checks on list columns are "Does each element have the same class/typeof/length/dim?".
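To make the suggestion concrete, here is a minimal sketch of such a homogeneity check in base R. The function name `check_list_homogeneity` is hypothetical and is not part of dataMaid; it only illustrates the "same class/typeof/length/dim" idea and makes no assumption about how dataMaid registers its checks.

```r
# Hypothetical helper (not a dataMaid function): for a list column, report
# whether all elements agree on class, typeof, length and dim.
check_list_homogeneity <- function(x) {
  stopifnot(is.list(x))
  # Turn one property of every element into a string so values can be compared.
  describe <- function(f) {
    vapply(x, function(el) paste(deparse(f(el)), collapse = ""), character(1))
  }
  props <- list(class = class, typeof = typeof, length = length, dim = dim)
  # TRUE for a property when at most one distinct value occurs across elements.
  vapply(props, function(f) length(unique(describe(f))) <= 1L, logical(1))
}

# Example list column built with strsplit(), as described above.
df <- data.frame(id = 1:3)
df$tokens <- strsplit(c("a b", "c d e", "f"), " ")
check_list_homogeneity(df$tokens)
```

For this example the elements share class, typeof and dim (all are character vectors without dimensions), while the lengths differ, so only the length entry of the result is FALSE.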
I do see the point in your concrete example, but I'm concerned that other people would use lists differently in datasets. Personally, I would usually choose to store something in a list (rather than a vector) exactly because the entries were of different data types or varying lengths, and even though that does not instantly generalize to the role of lists in I will consider implementing a
List columns can cause errors in clean().