Support explicit column types. #33

Open
ericemc3 opened this issue Dec 2, 2020 · 3 comments
Labels
enhancement New feature or request

Comments


ericemc3 commented Dec 2, 2020

First, congratulations on this impressive work, which I consider, as an R user and a D3 fan, a huge step forward for live and sexy data visualisation and dataflows!

It looks like Arquero is able to infer column types (Date, Numeric, String...) when reading CSV, for instance: we can see this in the view() display (columns are right- or left-aligned) or by testing value types with typeof. Could that information be exposed on the table object, making it possible, for instance, to test for numeric columns only?

A cool feature, starting from this, could be for instance:
select(1, aq.isNumeric())
or groupby(v1).rollup(*...here sum all numeric variables keeping same name...*)

I am used to this convenient R/dplyr syntax: summarise_if(is.numeric, sum) or summarise(across(where(is.numeric), sum))
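To make the request concrete, here is a minimal sketch in plain JavaScript of what "select numeric columns" and "sum all numeric variables per group, keeping the same names" could mean. It deliberately does not use Arquero: the helpers `isNumericColumn`, `numericColumns`, and `sumNumericBy` are hypothetical names invented for this illustration, and the table is assumed to be a plain object of equal-length arrays keyed by column name.

```javascript
// Hypothetical helpers, NOT part of Arquero.
// Assumption: a "table" is a plain object of equal-length arrays keyed by column name.

// A column counts as numeric if every non-missing value has typeof 'number'.
function isNumericColumn(values) {
  let seen = false;
  for (const v of values) {
    if (v == null) continue;                 // skip null / undefined (missing values)
    if (typeof v !== 'number') return false;
    seen = true;
  }
  return seen;                               // an all-missing column is not treated as numeric
}

// Names of all numeric columns, in their original order.
function numericColumns(columns) {
  return Object.keys(columns).filter(name => isNumericColumn(columns[name]));
}

// Sum every numeric column within each group of `key`, keeping the column names.
function sumNumericBy(columns, key) {
  const names = numericColumns(columns).filter(name => name !== key);
  const groups = new Map();
  columns[key].forEach((g, i) => {
    if (!groups.has(g)) {
      groups.set(g, Object.fromEntries(names.map(n => [n, 0])));
    }
    const acc = groups.get(g);
    for (const n of names) acc[n] += columns[n][i] ?? 0;  // missing values count as 0
  });
  return [...groups].map(([g, sums]) => ({ [key]: g, ...sums }));
}

const data = {
  v1: ['a', 'b', 'a'],
  x: [1, 2, 3],
  y: [10, null, 30],
  label: ['p', 'q', 'r']
};

const numericNames = numericColumns(data);
const summed = sumNumericBy(data, 'v1');
```

Scanning values with typeof is the simplest possible test; treating missing values as 0 in the sum is a choice made for this sketch, not a statement about how Arquero aggregates.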

Member

jheer commented Dec 2, 2020

Arquero is largely type-agnostic by design, so it may take a while for column-type-specific features to develop. That said, more fine-grained type inference is necessary for binary serialization (e.g., to Apache Arrow columns as in #31), so that should help push this forward.

jheer changed the title from "column types" to "Support explicit column types." on Dec 4, 2020
jheer added the "enhancement" (New feature or request) label on Dec 4, 2020
Member

jheer commented Dec 4, 2020

I'm marking this as an enhancement / feature request, though the exact form it might take is not yet clear. See also #2.

@bmschmidt

There might be some logic to making this feature exist specifically when using Arrow columns/vectors as the backend data store, because then you could piggyback off the Arrow types rather than have to work out some logic about mixed column types inside JS arrays.
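As a rough illustration of the "logic about mixed column types inside JS arrays" mentioned above, the sketch below scans a plain array and reports a single type only when all non-missing values agree. The function names `valueType` and `inferColumnType` and the labels `'mixed'` and `'unknown'` are assumptions made for this example, not Arquero or Arrow API. With an Arrow-backed column the type is declared up front, so no such scan would be needed.

```javascript
// Hypothetical sketch, NOT Arquero or Arrow API.

// Classify one value; Date objects are reported separately from generic objects.
function valueType(v) {
  if (v instanceof Date) return 'date';
  return typeof v;
}

// Infer a column type from a plain JS array by scanning every value.
// Returns the single observed type, 'mixed' if values disagree,
// or 'unknown' if the column holds only null / undefined.
function inferColumnType(values) {
  const types = new Set();
  for (const v of values) {
    if (v == null) continue;   // missing values carry no type information
    types.add(valueType(v));
  }
  if (types.size === 0) return 'unknown';
  if (types.size === 1) return [...types][0];
  return 'mixed';
}
```

The scan costs one pass over the column and still has to pick a policy for mixed and empty columns, which is the bookkeeping a typed backend would avoid.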
