Extractors need chunking ability #5

conorkcorbin · 2023-02-07T00:14:36Z

STARR extractors take cohort tables and join them to specific tables within our STARR data extract on bigquery to create a timeline of features for each ML example.

When the number of rows in the provided cohort table is too large, bigquery complains.

TODO: implement chunking so that extractors can join chunks of a cohort table iteratively so that bigquery does not complain. Needed for all extractors.

conorkcorbin · 2023-02-19T02:37:26Z

@jyx-su did you end up implementing something like this for your project?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Extractors need chunking ability #5

Extractors need chunking ability #5

conorkcorbin commented Feb 7, 2023

conorkcorbin commented Feb 19, 2023

Extractors need chunking ability #5

Extractors need chunking ability #5

Comments

conorkcorbin commented Feb 7, 2023

conorkcorbin commented Feb 19, 2023