Upgrade dataset analysis and visualization with extended metadata and interactive UI#377
Merged
Conversation
…meric, Correlations, Quality, Header, StatBox, MetricRow, IssueCard)
…er responsiveness
…tVisualization, OverviewTab, and QualityTab
cristian-tamblay
approved these changes
Nov 11, 2025
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.This suggestion is invalid because no changes were made to the code.Suggestions cannot be applied while the pull request is closed.Suggestions cannot be applied while viewing a subset of changes.Only one suggestion per line can be applied in a batch.Add this suggestion to a batch that can be applied as a single commit.Applying suggestions on deleted lines is not supported.You must change the existing code in this line in order to create a valid suggestion.Outdated suggestions cannot be applied.This suggestion has been applied or marked resolved.Suggestions cannot be applied from pending reviews.Suggestions cannot be applied on multi-line comments.Suggestions cannot be applied while the pull request is queued to merge.Suggestion cannot be applied right now. Please check back later.
This pull request introduces a major upgrade to the dataset analysis and visualization features in DashAI. The backend now computes and stores extended metadata for datasets, including detailed statistical summaries and quality indicators. The frontend has been refactored to present this information in a more interactive, tabbed interface, allowing users to explore different aspects of their datasets such as overview, numeric and categorical analysis, data quality, and correlations. Additionally, the frontend adds new dependencies and UI improvements for enhanced visualization.
Backend: Extended Metadata Computation and Storage
nan_per_columnis replaced bycompute_metadata, which now calculates and stores comprehensive metadata for each dataset, including NaN counts, column types, numeric/categorical/text stats, quality indicators, and correlations, all inself.splitsfor frontend visualization.compute_metadatainstead of the old NaN calculation method, ensuring all new metadata is available after dataset processing.Frontend: Visualization Refactor and UI Enhancements
Tabs,Tab), with dedicated tabs for Overview, Numerical Analysis, Categorical, Data Quality, and Correlations, each powered by the new backend metadata.rechartsfor charts, new MUI icons and controls) to support richer data presentation and interactivity.Video_251111121229.mp4