
feat: consolidate DataSpark web/core/cli capabilities #9

Merged
markhazleton merged 14 commits into main from 001-dataspark-consolidation on Mar 31, 2026
Conversation

@markhazleton
Owner

Summary

  • add SQLite database management flow in web UI (upload, analyze schema/table, export, generate DTOs)
  • harden AI analysis flow with graceful degradation when OpenAI settings are missing
  • extend analysis features with bivariate SVG endpoint and unified analysis result model
  • refactor console app commands into dedicated discover/export/schema/generate command modules
  • add DataTables SearchPanes/filter export support and WebSpark branding updates
  • update implementation checklist progress in spec tasks file
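The hardened AI flow above can be sketched as a guard that degrades gracefully instead of throwing when no OpenAI key is configured. This is a minimal sketch, not the DataSpark implementation: the `AiAnalysisResult` type, member names, and messages are hypothetical.

```csharp
using System;

// Hypothetical result type: either AI-generated insights or a fallback notice.
public record AiAnalysisResult(bool AiAvailable, string Message);

public static class AiAnalysisGuard
{
    // Returns a fallback result when the API key is missing or blank, so callers
    // never hit an exception from an unconfigured OpenAI client.
    public static AiAnalysisResult Analyze(string? openAiApiKey, string csvSummary)
    {
        if (string.IsNullOrWhiteSpace(openAiApiKey))
        {
            return new AiAnalysisResult(
                AiAvailable: false,
                Message: "AI insights are disabled: no OpenAI API key is configured. " +
                         "Statistical analysis remains available.");
        }

        // A real implementation would call the OpenAI API here.
        return new AiAnalysisResult(true, $"AI analysis of: {csvSummary}");
    }
}
```

The point of the pattern is that the non-AI features keep working and the UI can surface the fallback message, rather than surfacing a configuration exception.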

Validation

  • dotnet build DataSpark.sln
  • dotnet test DataSpark.Tests/DataSpark.Tests.csproj (119 passed)

@markhazleton markhazleton self-assigned this Mar 31, 2026
@github-actions

$(cat coverage-summary.txt 2>/dev/null || echo "Coverage unavailable")

@markhazleton markhazleton merged commit 5ae3a75 into main Mar 31, 2026
2 checks passed
@markhazleton markhazleton deleted the 001-dataspark-consolidation branch March 31, 2026 20:33
markhazleton added a commit that referenced this pull request Mar 31, 2026
* feat: Add feature specification for DataSpark platform consolidation

- Introduced a comprehensive feature specification document outlining user stories, acceptance criteria, functional requirements, and success criteria for the consolidation of sql2csv, DataAnalysisDemo, and related repositories into a unified DataSpark platform.
- Document includes detailed user scenarios for data upload, exploratory analysis, chart creation, pivot tables, SQLite database tools, AI insights, statistical analysis, CLI automation, and more.
- Established clear requirements for data ingestion, analysis, visualization, and API integration, ensuring a robust foundation for the DataSpark platform.

docs: Create consolidation recommendation document for DataSpark

- Added a detailed analysis and recommendation document for the consolidation of sql2csv and DataAnalysisDemo repositories.
- Provided an executive summary, repository comparison, feature overlap matrix, and a phased approach for rebranding and feature porting.
- Outlined unique features from DataAnalysisDemo worth porting and recommended a unified architecture for the DataSpark solution.

* docs(spec): add DataSpark consolidation plan, tasks, and design artifacts

- plan.md: technical context, constitution check (PASS), project structure
- research.md: 8 topics resolved (namespace rename, SQL fix, API auth, samples)
- data-model.md: 10 entities with fields, types, constraints, validation rules
- contracts/web-api.md: 10 REST API endpoints with request/response schemas
- contracts/cli.md: 4 CLI commands (discover, export, schema, generate)
- quickstart.md: developer getting-started guide
- tasks.md: 112 tasks across 14 phases, organized by 11 user stories
- Updated copilot agent context with tech stack from plan

Feature: 001-dataspark-consolidation
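The four CLI verbs listed in contracts/cli.md (discover, export, schema, generate) imply a small dispatch layer. A sketch under stated assumptions: the handler bodies are placeholders, and the `dataspark` executable name in the usage string is assumed, not confirmed by this PR.

```csharp
using System;
using System.Collections.Generic;

public static class CliDispatcher
{
    // Maps each verb from contracts/cli.md to a handler; handlers return an exit code.
    private static readonly Dictionary<string, Func<string[], int>> Handlers = new()
    {
        ["discover"] = args => { Console.WriteLine("scanning for databases..."); return 0; },
        ["export"]   = args => { Console.WriteLine("exporting tables to CSV..."); return 0; },
        ["schema"]   = args => { Console.WriteLine("writing schema report..."); return 0; },
        ["generate"] = args => { Console.WriteLine("generating DTO classes..."); return 0; },
    };

    public static int Run(string[] args)
    {
        if (args.Length == 0 || !Handlers.TryGetValue(args[0], out var handler))
        {
            Console.Error.WriteLine("usage: dataspark <discover|export|schema|generate> [options]");
            return 1; // unknown or missing verb
        }
        return handler(args[1..]); // pass remaining arguments to the verb handler
    }
}
```

The commit text mentions dedicated discover/export/schema/generate command modules; in the real console app each lambda would presumably live in its own module rather than inline here.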

* docs(spec): apply analysis remediation across spec, plan, tasks, contracts

Addresses all 16 findings from /speckit.analyze:

CRITICAL:
- C1: Add T113 (integration tests FR-043) + T114 (CI coverage gate) to tasks.md
- C2: Add T115 (constitution.md rename post Phase 1) to tasks.md

HIGH:
- H1: FR-014 updated to 5 formats (add JPEG); T048 updated to match
- H2: plan.md corrective action #2 corrected from UnifiedAnalysisService ->
      DatabaseAnalysisService line ~729 (both Gate Result and Principle VI note)
- H3: T041/T099 scope split clarified; T099 now depends on T041

MEDIUM:
- M1: FR-003 'first rows' -> 'first 10 rows' (matches edge case spec)
- M2: Add T117 (GitHub archival for DataAnalysisDemo) to tasks.md
- M3: Add T118 (pivot 50K perf test SC-005) to tasks.md
- M4: Add T119 (CLI batch test SC-006) to tasks.md
- M5: plan.md Principle V note corrected: only Sql2Csv.Tests missing TreatWarningsAsErrors
- M6: FR-007 now defines data quality score; T037 updated to verify it
- M7: Add T116 (GitHub repo rename) to tasks.md

LOW:
- L1: T028 tightened to 'controller actions'; T103 tightened to 'view forms'
- L2: T007/T009 ordering dependency noted; T009 excludes RootNamespace/AssemblyName
- L3: T017 now covers both .github/copilot-instructions.md files
- L4: T065 specifies ZIP output; contracts/web-api.md adds GET /api/Database/export-all

Total tasks: 112 -> 119
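Finding M6 above notes that FR-007 now defines a data quality score. The spec's actual formula is not reproduced on this page, but a completeness-based score (share of non-empty cells, scaled to 0–100) is one common shape; the sketch below is illustrative only and may differ from FR-007's definition.

```csharp
using System;
using System.Linq;

public static class DataQuality
{
    // Completeness-based score in [0, 100]: the percentage of cells that are
    // neither null nor whitespace. Illustrative; FR-007 may weigh other factors.
    public static double Score(string?[][] rows)
    {
        var cells = rows.SelectMany(r => r).ToArray();
        if (cells.Length == 0) return 0;
        int filled = cells.Count(c => !string.IsNullOrWhiteSpace(c));
        return 100.0 * filled / cells.Length;
    }
}
```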

* Add unit tests for SchemaService and implement API key authentication middleware

- Created SchemaServiceTests to validate the functionality of SchemaService methods, including GetTablesAsync, GetTableNamesAsync, and GenerateSchemaReportAsync.
- Added tests for handling null and invalid connection strings, cancellation tokens, and ensuring correct row counts and schema report formats.
- Implemented ApiKeyAuthMiddleware to enforce API key authentication on API routes, including error handling for missing or invalid keys.
- Added a solution file to organize the project structure and included a strong name key file for assembly signing.
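The core of an `ApiKeyAuthMiddleware` like the one described above can be reduced to a header check. The sketch below is a standalone simplification, not the actual middleware: the `X-Api-Key` header name and the 401/403 split for missing versus invalid keys are assumptions.

```csharp
using System;
using System.Collections.Generic;

public static class ApiKeyValidator
{
    // Simplified core of an API-key middleware: compares the presented header
    // value against the configured key with an ordinal comparison.
    public static (int StatusCode, string? Error) Check(
        IReadOnlyDictionary<string, string> headers, string configuredKey)
    {
        if (!headers.TryGetValue("X-Api-Key", out var presented) ||
            string.IsNullOrEmpty(presented))
            return (401, "Missing API key");

        if (!string.Equals(presented, configuredKey, StringComparison.Ordinal))
            return (403, "Invalid API key");

        return (200, null);
    }
}
```

A real ASP.NET Core middleware would read the header from `HttpContext.Request.Headers` and would preferably use a constant-time comparison (for example `CryptographicOperations.FixedTimeEquals` over UTF-8 bytes) to avoid timing side channels.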

* feat(dataspark): implement database tools, AI hardening, analytics, and CLI consolidation
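The analytics work here includes the bivariate SVG endpoint mentioned in the PR summary. A minimal scatter renderer conveys the idea; this is a sketch only, and the real endpoint's markup, scaling, and styling are not shown in this PR.

```csharp
using System;
using System.Linq;
using System.Text;

public static class ScatterSvg
{
    // Renders (x, y) pairs as circles in a fixed-size SVG, scaling both axes to
    // fit inside a 10px margin. Axis ticks, labels, and styling are omitted.
    public static string Render((double X, double Y)[] points, int width = 400, int height = 300)
    {
        var sb = new StringBuilder();
        sb.Append($"<svg xmlns=\"http://www.w3.org/2000/svg\" width=\"{width}\" height=\"{height}\">");
        if (points.Length > 0)
        {
            double minX = points.Min(p => p.X), maxX = points.Max(p => p.X);
            double minY = points.Min(p => p.Y), maxY = points.Max(p => p.Y);
            double SpanOr1(double span) => span == 0 ? 1 : span; // avoid divide-by-zero

            foreach (var (x, y) in points)
            {
                double px = (x - minX) / SpanOr1(maxX - minX) * (width - 20) + 10;
                // SVG y grows downward, so flip the vertical axis.
                double py = height - ((y - minY) / SpanOr1(maxY - minY) * (height - 20) + 10);
                sb.Append(FormattableString.Invariant($"<circle cx=\"{px:F1}\" cy=\"{py:F1}\" r=\"3\"/>"));
            }
        }
        sb.Append("</svg>");
        return sb.ToString();
    }
}
```

Returning a string like this from a controller action with a `image/svg+xml` content type is the usual pattern for an SVG endpoint.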

* docs(spec): add pull request review for DataSpark consolidation with compliance assessment and recommendations

* fix(pr-9): address review findings for security, architecture, async, and logging

* fix(pr-9): close remaining security and architecture review findings

* docs(pr-review): update PR #9 follow-up review report

* fix(pr-9): address remaining findings and enhance security measures in implementation plan

* fix(pr-9): update review metadata and address async I/O discipline violations in DataSpark.Core services

* fix(core,web,tests): resolve PR #9 review findings

* fix(pr-9): update review metadata and improve assessment details in PR documentation

* feat(core): implement database discovery summary service and export packaging service with logging