fix: minor basic stats quality fixes #2521

ethanglaser · 2025-06-09T18:53:26Z

Description

A few small corrections

PR completeness and readability

I have reviewed my changes thoroughly before submitting this pull request.
I have commented my code, particularly in hard-to-understand areas.
I have updated the documentation to reflect the changes or created a separate PR with update and provided its number in the description, if necessary.
Git commit message contains an appropriate signed-off-by string (see CONTRIBUTING.md for details).
I have added a respective label(s) to PR if I have a permission for that.
I have resolved any merge conflicts that might occur with the base branch.

Testing

I have run it locally and tested the changes extensively.
All CI jobs are green or I have provided justification why they aren't.
I have extended testing suite if new functionality was introduced in this PR.

codecov · 2025-06-09T19:41:04Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Flag	Coverage Δ
azure	`79.93% <100.00%> (+0.01%)`	⬆️
github	`71.62% <100.00%> (+<0.01%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
onedal/basic_statistics/basic_statistics.py	`96.29% <100.00%> (ø)`

... and 2 files with indirect coverage changes

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

icfaust

requires a 'black' fix. otherwise good to go

icfaust · 2025-06-09T22:45:28Z

onedal/basic_statistics/basic_statistics.py

@@ -157,7 +157,7 @@ def fit(self, data, sample_weight=None, queue=None):
        data_table, weights_table = to_table(data, sample_weight, queue=queue)

        dtype = data_table.dtype
-        raw_result = raw_result = self._compute_raw(
+        raw_result = self._compute_raw(


david-cortes-intel · 2025-06-10T06:26:46Z

examples/sklearnex/basic_statistics_spmd.py

@@ -48,7 +48,7 @@ def generate_data(par, size, seed=777):

 params_spmd = {"ns": 19, "nf": 31}

-data, weights = generate_data(params_spmd, size)
+data, weights = generate_data(params_spmd, rank)


Shouldn't this be made to generate different data for each rank?

Yes - that was the original mistake here. size is the same for every rank, so the same data is generated. rank is different on every rank, so different data is generated.

But it's still being generated in a loop where each rank contains the data from the previous one.

Although maybe that would be reflected in the seed parameter and this should be tweaked further. The data generation function here is pretty wonky. I'll take a closer look tomorrow.

examples/sklearnex/basic_statistics_spmd.py

* fix: minor basic stats quality fixes * blacked * vary seed by rank instead of size

fix: minor basic stats quality fixes

9c50bb6

ethanglaser requested review from Alexsandruss and icfaust as code owners June 9, 2025 18:53

ethanglaser added the enhancement New feature or request label Jun 9, 2025

icfaust reviewed Jun 9, 2025

View reviewed changes

blacked

3235902

david-cortes-intel reviewed Jun 10, 2025

View reviewed changes

ethanglaser marked this pull request as draft June 10, 2025 18:34

ethanglaser commented Jun 10, 2025

View reviewed changes

examples/sklearnex/basic_statistics_spmd.py Outdated Show resolved Hide resolved

vary seed by rank instead of size

d3065e3

ethanglaser marked this pull request as ready for review June 10, 2025 23:28

ethanglaser requested review from icfaust and david-cortes-intel June 10, 2025 23:29

Alexsandruss approved these changes Jun 16, 2025

View reviewed changes

Alexsandruss merged commit b742d86 into uxlfoundation:main Jun 16, 2025
27 of 28 checks passed

icfaust mentioned this pull request Jun 16, 2025

[CI] fix spelling mistakes in the codebase by adding codespell #2550

Merged

13 tasks

david-cortes-intel pushed a commit to david-cortes-intel/scikit-learn-intelex that referenced this pull request Jun 18, 2025

fix: minor basic stats quality fixes (uxlfoundation#2521)

237e75e

* fix: minor basic stats quality fixes * blacked * vary seed by rank instead of size

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix: minor basic stats quality fixes #2521

fix: minor basic stats quality fixes #2521

Uh oh!

ethanglaser commented Jun 9, 2025 •

edited

Loading

Uh oh!

codecov bot commented Jun 9, 2025 •

edited

Loading

Uh oh!

icfaust left a comment

Uh oh!

icfaust Jun 9, 2025

Uh oh!

david-cortes-intel Jun 10, 2025

Uh oh!

ethanglaser Jun 10, 2025

Uh oh!

david-cortes-intel Jun 10, 2025

Uh oh!

ethanglaser Jun 10, 2025

Uh oh!

Uh oh!

Uh oh!

Uh oh!

fix: minor basic stats quality fixes #2521

fix: minor basic stats quality fixes #2521

Uh oh!

Conversation

ethanglaser commented Jun 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

codecov bot commented Jun 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

icfaust left a comment

Choose a reason for hiding this comment

Uh oh!

icfaust Jun 9, 2025

Choose a reason for hiding this comment

Uh oh!

david-cortes-intel Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

ethanglaser Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

david-cortes-intel Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

ethanglaser Jun 10, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ethanglaser commented Jun 9, 2025 •

edited

Loading

codecov bot commented Jun 9, 2025 •

edited

Loading