Model training and optimization 🎯 #7

KarelZe · 2022-10-07T13:17:44Z

Decide whether I want to treat option trade classification as a proabilistic classification problem or not
Bundle training in a docker contrainer that can run in any pod or on scc infrastructure
Add parametrized script to run studies
Get access to BwUniCluster2.0
Write script to start studies
Free memory with gc.collect()
Set-up pre-commit hooks e. g., mypy
Implement gradient boosting using catboost
Implement TabTransformer using TabSurvey
Try out TabTransformer closer to original paper without einops and default PyTorch implementations for attention. (see. e. g., here.
Write custom DataSet for PyTorch
Improve training performance of TabTransformer
Look into data pipes / data loader 2 for PyTorch
Implement save callback for PyTorch models
PyTorch use multiple GPUs. (dataparallel) See here.
Implement FTTransformer
Add timing code
Implement TabNet using TabSurvey
Set up weights and biases integration as shown here or here
Set up test cases for gbms
Set up test cases for classical classifier
Set up test cases for TabTransformer.
Set up test cases for FTTransformer. See here
Set up test cases for TabNet
Simplify objective code through train() and test() method as done here.
Add weighted loss for neural networks
Adjust early stopping of neural networks to work with accuracy
Migrate heroku db
Save completed study objects to gcs and track in wandb
Track saved models in wandb
Visualize learning curves for best CatBoost model
Research, if early stopping in neural nets should also be done based on the accuracy
Add code to obtain feature importances from trained models
Add shared embeddings to TabTransformer. See paper or here.
Adress problem how to adress high dimensionality of categorical variables
Research if there is a broader theory / concept to decay e. g. exponential smooting or weighted regression etc.
Visualize decay parameter and find optimal decay factor
Study samples / probabilites where the prediction is wrong
Fully understand how the target value should be like for gradient boosting -1/1 or 0/1 like in neural nets?
Add code to study learning curves e. g., in wandb
Add code for pre-training
Experiment with learning rate scheduling
Add code for attention visualization. First viz done in Interpretability with SHAP and attention maps 🐇 #85, but still have to research best approach to combine maps over different attention heads and layers. Finally decided for a method proposed by Chefer et. al. See here and here.
Get slurm running
go through Google playbook
Define rounds for what I want to optimize
Do searches more systematically. See this article here. and here. and here
Extend experiment tracking as shown here.
normalize data
Set up configuration for training
Set up concrete action plan how to improve training
pytorch 2.0 integration
Differentiate into exploration and exploitation phase
Think about folding the validation set into the training set and retrain the best configuration
Check out retraining
change early stopping criterion
Set up option to fix some hyperparams through a config or so
Verify hyperparameter search space
Use batch size finder (Implemented in Automatically find maximum batch size🥐 #125)
Add some option to generate results fast
Implement a retraining
Replace early stopping with checkpointing?
Check if logits is the right word in code
What samples does the model get wrong?
Add training curves to wandb
Check in wandb if the hyperparameter search space is chosen optimal
Rerun studies with different initializations. See how it affects the results
Add code to average results from different initializations
Add code for visualizations e. g., hyperparameter search space, influence of randomness etc.
Do code review with @pheusel or @lxndrblz

The text was updated successfully, but these errors were encountered:

- adds docker image based on 'runpod/stack' image - adds gitignore - adds shell script `start.sh` Adresses #7

Relates to #7

Adresses #7

- use both double quotes and env variables Relates to #7

Adresses #7

Adresses #7.

Relates to #7.

Adresses #7

- add login to gcloud - add project config - add wandb code Adresses #7

Adresses #7.

- to resume a study see optuna docs Adresses #7

Adresses #7

Adresses Model training and optimization 🎯 #7

- updated dependencies - removed broken Adresses Model training and optimization 🎯 #7

Adresses Model training and optimization 🎯 #7

relates to Model training and optimization 🎯 #7

Relates to Model training and optimization 🎯 #7

- finished reviewing errors from mypy pre-commit hook - reviewed errors in `check_formalia.py` Adresses Model training and optimization 🎯 #7

- refactored constants - adjusted search space - removed redundant code - adjusted gitignore Adresses Model training and optimization 🎯 #7

- added `typing` support - added new pre-commit hooks - resolved code issues Relates to Model training and optimization 🎯 #7

Relates to Model training and optimization 🎯 #7

Adresses #7

Required for `shap` implementation, as shap (and probably other tools) pass np.array to `predict()`. Adresses #7.

* Remove outdated files ❎ * Start `TabTransformerClassifier` implementation 🦜 * Rewrite callback 👽 * Allow `np.arrays` in classical classifier 👽 Required for `shap` implementation, as shap (and probably other tools) pass np.array to `predict()`. Adresses #7. * Add support for `np.ndarray` 🦜 * Simplified objective code 🐫 * Fixed tests 👽 * Fixed some `pre-commit` tests 👽 * Add `predict_proba` 🍕 * Refactored `TransformerClassifier` to own class 🏠 * Add tests for `predict_proba` 🍕 * Improved checks in ClassicalClassifier 🧪 * Remove redundant params ❎ * Simplify tests 🍝 * Restore two files 💯 * Ran `pre-commit` hooks 🪝 * Fixed doc strings 🍩 * Add aditional checks `TransformerClassifier` 🤖 * Fix failing test 🍩 * Enhanced sklearn compatiblity🍓 * Update typehint 📣 * Added tests for `TransformerClassifier` 🤖 * Free up cache + smaller batches 🍕

Adresses #7 .

Relates to #7.

Relates to #7

Adresses #7 .

Relates to #7 .

Relates to #7.

Relates to #7

Adresses #7 .

Relates to #7 .

* investigate cylical encoding * fixed cylical encoding * Finalize feature engineering script 🪄 * Add sample weighting to `TransformerClassifier` 🏋️ (#100) Relates to #7. * Early stopping based on accuracy for `TransformerClassifier`🧁 (#102) Relates to #7 * Improve robustness and tests of `TabDataset` 🚀 (#101) Adresses #7 . * Add instructions on using `SLURM` 🐧 (#103) Relates to #7 . * rerun feature generation notebook * Add refs to wandb 🪄 * Renamed notebooks for consistency 🍫 * Simplify notebook 🍫 * Add log-transform and encode day ⏰ * Add aversarial validation after feature engineering 🪄 * Update `build_features.py` to new features 🪄 * Update notes to feature set definition * Update `Feature Sets.md` Adresses #30.

Adresses #7.

Adresses #7

Adresses #7.

Adresses #7 and #10.

KarelZe · 2023-06-23T06:45:17Z

Most done by now. Won't work on remaining tasks. 🚀

KarelZe self-assigned this Oct 7, 2022

KarelZe changed the title ~~Model training and optimization~~ Model training and optimization 🎯 Oct 25, 2022

KarelZe added this to the Implementation milestone Oct 25, 2022

KarelZe added the code Everything related to code label Nov 6, 2022

KarelZe added a commit that referenced this issue Nov 10, 2022

Added basic docker image 🐳

83f1bd6

- adds docker image based on 'runpod/stack' image - adds gitignore - adds shell script `start.sh` Adresses #7

KarelZe mentioned this issue Nov 10, 2022

Add basic docker support 🐳 #28

Merged

KarelZe added a commit that referenced this issue Nov 11, 2022

Add .netrc to dockerfile 🐳

32b0a78

Relates to #7

KarelZe added a commit that referenced this issue Nov 11, 2022

Added updated comments 🧃

35f6a58

Adresses #7

KarelZe added a commit that referenced this issue Nov 11, 2022

Moved contents from dockerfile to start.sh 🐳

7e0cbbd

Adresses #7

KarelZe added a commit that referenced this issue Nov 14, 2022

Fixed wrong escaping of json🪲

d35293d

- use both double quotes and env variables Relates to #7

KarelZe added a commit that referenced this issue Nov 15, 2022

Added notes on optuna + pytorch 🧜‍♂️

d197000

Adresses #7

KarelZe added a commit that referenced this issue Nov 15, 2022

Added callback for saving 📦

86c86c4

Adresses #7.

KarelZe added a commit that referenced this issue Nov 15, 2022

Add GradientBoostingObj. to objective.py 🦊

a290eb4

Relates to #7.

KarelZe added a commit that referenced this issue Nov 15, 2022

Added pruning callback 🌵

d8fb782

Adresses #7

KarelZe added a commit that referenced this issue Nov 15, 2022

Load tracked files only 🌵

4dfdf26

- add login to gcloud - add project config - add wandb code Adresses #7

KarelZe added a commit that referenced this issue Nov 15, 2022

Fixed some typos in ClassicalClassifier 🪲

3b06be5

Adresses #7.

KarelZe added a commit that referenced this issue Nov 15, 2022

Log unfinished studies to storage 🌵

95e784f

- to resume a study see optuna docs Adresses #7

KarelZe added a commit that referenced this issue Nov 15, 2022

Added saving of GBM to GCS 🐈

0e20ebc

Adresses #7

KarelZe added a commit that referenced this issue Nov 16, 2022

Improved saving strategy 🛟

0f8eadb

Adresses Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 16, 2022

Save plot and summary to W& B 🌵

0a7bc09

Adresses Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 16, 2022

Updated pre-commit config 🌵

2e5bdfc

- updated dependencies - removed broken Adresses Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 16, 2022

Added cmd-args to script 🌵

45581af

Adresses Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 16, 2022

Added additional logging 🗝️

d820bf1

Adresses Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 16, 2022

Fixed some errors from Pre-commit hooks 🪲

11aff2a

Adresses Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 16, 2022

Fixed several issues from pre-commit hooks 🪝

dff13a0

relates to Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 16, 2022

Added missing typehints 🌵

4cd467b

Relates to Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 17, 2022

Added typehints to ClassicalClassifier 🦄

5efdf48

- finished reviewing errors from mypy pre-commit hook - reviewed errors in `check_formalia.py` Adresses Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 17, 2022

Simplified training scripts 🎍

ef4e951

- refactored constants - adjusted search space - removed redundant code - adjusted gitignore Adresses Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 17, 2022

Formatted files to fullfil pre-commit tests 🐙

01765f1

- added `typing` support - added new pre-commit hooks - resolved code issues Relates to Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Nov 17, 2022

Added basic implementation from Borisov paper 🤖

a96c04b

Relates to Model training and optimization 🎯 #7

KarelZe added a commit that referenced this issue Dec 22, 2022

[WIP] Improve gpu utilization 🚂 (#87)

85c44bc

Adresses #7

KarelZe mentioned this issue Dec 23, 2022

Create sklearn-compatible estimators 🦜 #93

Merged

KarelZe added a commit that referenced this issue Dec 23, 2022

Allow np.arrays in classical classifier 👽

89b126e

Required for `shap` implementation, as shap (and probably other tools) pass np.array to `predict()`. Adresses #7.

KarelZe added a commit that referenced this issue Jan 4, 2023

Interpretability with SHAP and attention maps 🐇 (#85)

98484ea

Adresses #7 .

KarelZe added a commit that referenced this issue Jan 5, 2023

Add sample weighting to TransformerClassifier 🏋️ (#100)

2766bcc

Relates to #7.

KarelZe mentioned this issue Jan 5, 2023

Early stopping based on accuracy for TransformerClassifier🧁 #102

Merged

KarelZe added a commit that referenced this issue Jan 5, 2023

Early stopping based on accuracy for TransformerClassifier🧁 (#102)

aea8df6

Relates to #7

KarelZe added a commit that referenced this issue Jan 5, 2023

Improve robustness and tests of TabDataset 🚀 (#101)

d41383a

Adresses #7 .

KarelZe added a commit that referenced this issue Jan 6, 2023

Add instructions on using SLURM 🐧 (#103)

5e6371f

Relates to #7 .

KarelZe added a commit that referenced this issue Jan 6, 2023

Add sample weighting to TransformerClassifier 🏋️ (#100)

af09fcb

Relates to #7.

KarelZe added a commit that referenced this issue Jan 6, 2023

Early stopping based on accuracy for TransformerClassifier🧁 (#102)

85b9e21

Relates to #7

KarelZe added a commit that referenced this issue Jan 6, 2023

Improve robustness and tests of TabDataset 🚀 (#101)

08e3039

Adresses #7 .

KarelZe added a commit that referenced this issue Jan 6, 2023

Add instructions on using SLURM 🐧 (#103)

7f9bcfc

Relates to #7 .

KarelZe added a commit that referenced this issue Jan 19, 2023

Restore softlinks to files 🔗 (#120)

579c2fa

Adresses #7.

KarelZe added a commit that referenced this issue Jan 19, 2023

Add current results⚡ (#121)

c6a8027

Adresses #7.

This was referenced Jan 20, 2023

Issues from code review 🐛 #116

Closed

Change from code review 🧼 #124

Merged

KarelZe added a commit that referenced this issue Jan 21, 2023

Shared embeddings and pre-norm in TabTransformer 🤖 (#118)

b1bf06f

Adresses #7

KarelZe added a commit that referenced this issue Jan 22, 2023

Automatically find maximum batch size🥐 (#125)

547d10a

Adresses #7

KarelZe added a commit that referenced this issue Jan 23, 2023

Feature engineering for very large dataset 🌌 (#126)

477a414

Adresses #7

KarelZe added a commit that referenced this issue Jan 24, 2023

Add retraining for gradient boosting [+ 2 %] 🍾 (#130)

32689eb

Adresses #7

KarelZe added a commit that referenced this issue Jan 24, 2023

Improve accuracy of TabTransformer [+ 5 % from prev.]🪅 (#129)

df8e5ae

Adresses #7

KarelZe mentioned this issue Jan 24, 2023

Fix cardinalities of Transformer implementation🪲 #132

Merged

KarelZe added a commit that referenced this issue Jan 24, 2023

Fix CUBLAS error in TabTransformer implementation🪲 (#132)

489041b

Adresses #7.

KarelZe mentioned this issue Feb 26, 2023

Add notes, code, tests, and chapter on effective spread🍕 #184

Merged

KarelZe added a commit that referenced this issue Feb 26, 2023

Add notes, code, tests, and chapter on effective spread🍕 (#184)

12041e7

Adresses #7 and #10.

KarelZe closed this as completed Jun 23, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Model training and optimization 🎯 #7

Model training and optimization 🎯 #7

KarelZe commented Oct 7, 2022 •

edited

Loading

KarelZe commented Jun 23, 2023

Model training and optimization 🎯 #7

Model training and optimization 🎯 #7

Comments

KarelZe commented Oct 7, 2022 • edited Loading

KarelZe commented Jun 23, 2023

KarelZe commented Oct 7, 2022 •

edited

Loading