0.3.5 #345

sigmafelix · 2024-07-16T14:46:51Z

Base learner targets (draft; should be changed in accordance with Big Data Considerations #325)
Download targets
Post-checkout hooks
Utility functions for the new targets

- attach_xy: Retaining all columns - generate_cv_index: Retaining all columns

- following amadeus changes (process_blackmarble) - New GPU-enabled R torch library binding

- path setting is changed to if-else - pipeline error failing mode to "abridge"

- removing irrelevant arguments passed to terra::rast

- Internal functions are not exported - attach_xy logic fix: join to the "grand" data rather than the leaned one - fit_base_brulee and fit_base_xgb: added device to manually distribute GPU workload

- Branching base learner fitting

- shared interface for branching cv set generation

- Pipeline base learner: 3 CV strategies and hyperparameter tuning targets were added - fit_base_* function get `return_best` and `tune_bayes_iter` for `workflow` compatibility - prepare_cvindex assigns fold ids using function names when spatialsample functions are used - restore_rset_full: as speed-up and disk saving measures - rset objects are generated based on essential coordinates only; this function restores full data for subsequent steps - TODO: size issues persist in hyperparamter tuning and identifying the best model as the entire workflow should be saved. rsample always saves training/test data. - TODO: duplicates in features; identify where the duplicates come

- Data size reduction for memory / storage management: added trim_resamples argument in fit_base_* - make_subdata: bootstrapping (currently 30%) - restore_fit_best: restore full data with CV rsample rset objects, extract the best tuned results, then fit the data - Dealing with nested list in tibbles from tidymodels workflow/hyperparameter tuning - TODO/Q: Do we save fitted model object or just keep predictions?

- set_args_download and feature_raw_download are written - targets_download.R is revised to reflect the structure of the two functions

- Base learner: lightgbm - Added dependencies: bonsai, lightgbm - Explicit definition of hyperparameter search controls for each

- LICENSE file gets 554

- set_args_download update - _targets.R update to generate arglist_download

- README.md update - setup_hook.sh is capable of immediately activating permission change

- targets_download.R: duplicate target names - Roxygen2 documentation typo fix in fit_base_lightgbm

kyle-messier · 2024-07-16T15:10:56Z

@sigmafelix It looks like main is one ahead of pipeline-compact and needs to be merged that way. It's probably a minor README change. After that, I'll review ASAP. Thanks! 🚀

sigmafelix · 2024-07-16T15:28:05Z

@kyle-messier I merged main into pipeline-compact. Most functions are still in nocov status, but I will work on improving actual coverage soon.

sigmafelix added 21 commits June 18, 2024 10:47

download part dev in progress

dba0b4a

Bug fix: cv utility functions

ceb8633

- attach_xy: Retaining all columns - generate_cv_index: Retaining all columns

update

e8dcc9d

- following amadeus changes (process_blackmarble) - New GPU-enabled R torch library binding

set_args_calc

5a2129a

- path setting is changed to if-else - pipeline error failing mode to "abridge"

path fix + inject_geos fix

12c67f6

- removing irrelevant arguments passed to terra::rast

impute_all update

29bda22

0.3.2

8e9f6ce

- Internal functions are not exported - attach_xy logic fix: join to the "grand" data rather than the leaned one - fit_base_brulee and fit_base_xgb: added device to manually distribute GPU workload

0.3.2 update

955e975

- Branching base learner fitting

typo fix

0fcdba4

wrapper: prepare_cvindex

1f943e5

- shared interface for branching cv set generation

0.3.4 update

5e9757f

- set_args_download and feature_raw_download are written - targets_download.R is revised to reflect the structure of the two functions

0.3.5

15b7fff

- Base learner: lightgbm - Added dependencies: bonsai, lightgbm - Explicit definition of hyperparameter search controls for each

set_args_download update + write permission test

136720b

- LICENSE file gets 554

post-checkout hook

3f1b2da

- set_args_download update - _targets.R update to generate arglist_download

another try

8a33177

hook update

68ed2a6

Instruction to setup hooks

c7f6757

- README.md update - setup_hook.sh is capable of immediately activating permission change

Update the list of users with write permission

716ede2

Lint and typo fix

24e7376

- targets_download.R: duplicate target names - Roxygen2 documentation typo fix in fit_base_lightgbm

sigmafelix requested a review from kyle-messier July 16, 2024 15:05

sigmafelix added 2 commits July 16, 2024 11:16

Merge remote-tracking branch 'origin/main' into pipeline-compact

1331561

base learners to lightgbm + merged main

89b0e4d

kyle-messier approved these changes Jul 16, 2024

View reviewed changes

sigmafelix merged commit a5d13bb into main Jul 16, 2024
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

0.3.5 #345

0.3.5 #345

sigmafelix commented Jul 16, 2024

kyle-messier commented Jul 16, 2024

sigmafelix commented Jul 16, 2024

0.3.5 #345

0.3.5 #345

Conversation

sigmafelix commented Jul 16, 2024

kyle-messier commented Jul 16, 2024

sigmafelix commented Jul 16, 2024