Add dataset split and percentage selection for local explanations, update KernelShap sampling to use fractions by Irozuku · Pull Request #313 · DashAISoftware/DashAI

Irozuku · 2025-09-29T18:47:20Z

This pull request introduces support for selecting a dataset split (train, test, validation, or all) and a percentage of that split to use when generating local explanations in the DashAI platform. This allows users to control which subset of the data is used for explainability, improving flexibility and efficiency, especially for large datasets. The changes span both the backend and frontend, including schema updates, API modifications, UI enhancements, and logic for handling splits and percentages in the explanation process.

Backend changes:

Added a new scope field (as a JSON object) to the local explainer schema, database model, API endpoint, and job logic to specify which split and percentage of the dataset to use. [1] [2] [3]
Updated the local explainer job logic to use the specified split and percentage from scope, including validation and sample selection. [1] [2] [3]

Frontend changes:

Added a SplitSelector UI component that allows users to choose the dataset split and percentage, displaying the number of rows selected. [1] [2]
Integrated the split/percentage selection into the local explainer creation flow, including default values and state management. [1] [2] [3] [4] [5]
Updated the API calls and modal logic to send the scope field when creating a local explainer. [1] [2]

KernelShap explainer improvements:

Changed background data sampling parameters from an absolute number (n_background_samples) to a fraction (background_fraction), making the sampling proportional to the dataset size and more intuitive for users. [1] [2] [3] [4] [5] [6] [7] [8]

…ion and improve parameter descriptions

…ction functionality

…ows and splits

…unction

…l explainers

Irozuku added 7 commits September 26, 2025 12:28

fix: update KernelShapSchema to use float for background sample fract…

149aad2

…ion and improve parameter descriptions

feat: add scope parameter to local explainer and implement split sele…

45da6ef

…ction functionality

feat: enhance SplitSelector to display selected rows based on total r…

175fbed

…ows and splits

refactor: clean up unused imports and improve code structure

3c0f4db

refactor: remove unnecessary console logs from createLocalExplainer f…

e3e60a4

…unction

feat: add support for datasets with different amount of rows for loca…

6fee6cc

…l explainers

fix: add scope parameter to local explainer tests for split selection

da84e28

cristian-tamblay approved these changes Oct 2, 2025

View reviewed changes

cristian-tamblay merged commit d9f25ea into develop Oct 2, 2025
5 checks passed

cristian-tamblay deleted the fix/explainability-module branch October 2, 2025 14:55

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add dataset split and percentage selection for local explanations, update KernelShap sampling to use fractions#313

Add dataset split and percentage selection for local explanations, update KernelShap sampling to use fractions#313
cristian-tamblay merged 7 commits into
developfrom
fix/explainability-module

Irozuku commented Sep 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Irozuku commented Sep 29, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants