ENH Rework narrative of GBDT notebook #763

ArturoAmorQ · 2024-02-21T14:15:11Z

Originally I wanted to rework only the wording, as well explaining the GBDT algo is the cornerstone for understanding HGBT. But then I decided to factorize the code by introducing a helper function to keep the focus on the narrative other than the code.

glemaitre

Here are some comments.

python_scripts/ensemble_gradient_boosting.py

glemaitre · 2024-04-26T14:20:56Z

python_scripts/ensemble_gradient_boosting.py


 # %%
 import pandas as pd
 import numpy as np

-# Create a random number generator that will be used to set the randomness
-rng = np.random.RandomState(0)
+rng = np.random.RandomState(0)  # Create a random number generator


Let's move the generator next to the data generation. We should avoid showing a pattern where people would use the generator across different function and estimators.
So the best practice is just to show it next to the data generation. we could even slightly change the code and have:

def generate_data(n_samples=50, seed=0): rng = np.random.default_rng(seed) x = rng.normal(size=(n_samples,)) * ... noise = rng.normal(size=(s_samples,)) * 0.3

using default_rng should be encourage nowadays.

python_scripts/ensemble_gradient_boosting.py

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

Co-authored-by: ArturoAmorQ <arturo.amor-quiroz@polytechnique.edu> Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> 31bfaaf

ENH Rework narrative of GBDT notebook

3083498

glemaitre self-requested a review April 26, 2024 14:14

glemaitre reviewed Apr 26, 2024

View reviewed changes

ArturoAmorQ and others added 7 commits April 29, 2024 16:21

Apply suggestions from code review

b274fd2

Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com>

Use default_rng as good practice

eb359b2

Adapt sample of interest to new rng

2e86f09

Add docstring to helper function

99bed3b

Solve conflicts

d357154

Wording

16c35ad

Tweak

32407a5

ArturoAmorQ mentioned this pull request May 13, 2024

ENH Rework bagging notebook #778

Merged

glemaitre merged commit 31bfaaf into INRIA:main May 17, 2024
3 checks passed

github-actions bot pushed a commit that referenced this pull request May 17, 2024

[ci skip] ENH Rework narrative of GBDT notebook (#763)

1531294

Co-authored-by: ArturoAmorQ <arturo.amor-quiroz@polytechnique.edu> Co-authored-by: Guillaume Lemaitre <g.lemaitre58@gmail.com> 31bfaaf

ArturoAmorQ deleted the gbdt_wording branch May 17, 2024 09:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

ENH Rework narrative of GBDT notebook #763

ENH Rework narrative of GBDT notebook #763

ArturoAmorQ commented Feb 21, 2024

glemaitre left a comment

glemaitre Apr 26, 2024

ENH Rework narrative of GBDT notebook #763

ENH Rework narrative of GBDT notebook #763

Conversation

ArturoAmorQ commented Feb 21, 2024

glemaitre left a comment

Choose a reason for hiding this comment

glemaitre Apr 26, 2024

Choose a reason for hiding this comment