[RFC/Experimental] Server-side Search & DataTables Integration by mattiabonzi · Pull Request #39 · lance-format/lance-data-viewer

mattiabonzi · 2026-04-06T21:18:34Z

This PR is a significant architectural experiment that replaces the static HTML table with jQuery DataTables and implements server-side processing. Previously, the app had no filtering capabilities; this change introduces global search, column ordering, and complex querying.

Key Changes:

Server-Side Engine: Rewrote the backend to support DataTables pagination and search protocols.
DuckDB Integration: Added DuckDB to the backend to lazily scan Lance datasets, enabling SQL-powered filtering and sorting without loading entire tables into memory.
Advanced Querying: Integrated DataTables SearchBuilder, allowing for complex visual construction of AND/OR logic.
UI Features: Added a "Wrap Text" toggle to handle large cell content and updated the sidebar for better column management.
Other frontend enhancement

Breaking Changes:
The backend logic has been largely rewritten. Specifically, the standard /rows endpoint is superseded by a new /datatables POST endpoint to handle the structured query payloads from the frontend.

Current Limitations:
This is an experimental build and requires further stabilization:

Query Guards: There are currently no guards in the SearchBuilder for vector fields; attempting to apply standard text filters to a vector column will cause the query to fail.
Type Coverage: While basic types are handled, the serialization for more obscure Arrow types needs more robust testing.

Feedback & Testing Request:
I am looking for feedback on whether this DataTables-driven direction aligns with the project goals. If you feel this aligns with the project, it would be great if you could provide or point me to a diverse test dataset containing a wide variety of supported field types (e.g., nested structs, different vector dimensions, and timestamps) to ensure the DuckDB-to-Arrow serialization is seamless.

Please let me know if you would like me to proceed with these changes. If this is not interesting for the project, I will just delete the PR. I've made this cause i currently need a way to filter my own dataset while developing.

mattiabonzi · 2026-04-06T21:21:21Z

@gordonmurray would love to hear your thoughts on this. As I mentioned, this is experimental and will need more work to be stable, but I believe it could leads to a better user experience overall.

gordonmurray · 2026-04-06T21:34:51Z

Hey @mattiabonzi, thanks for putting this together. The search and filtering need is real, and I can see the use case for this.

That said, I think this moves in a different direction from where the project is headed. The core goal is to stay a lightweight, zero-setup viewer: mount a folder, open a browser, browse your data.

Vanilla JS is a deliberate choice. The project avoids frameworks, bundlers, and external libraries by design. Adding jQuery, DataTables, and Select2 is a significant shift in that philosophy.
GET-only API. All existing endpoints are stateless GETs with no writes. The new POST /datatables endpoint changes that contract, and opening CORS to POST moves away from the read-only model.
DuckDB as a runtime dependency adds considerable weight to the container image and install footprint for what's meant to be a minimal tool.

I think filtering and search would be a genuinely useful addition, but ideally as something lighter: client-side sorting and filtering on the current page without new backend dependencies. Issue #30 covers column sorting as a starting point, and that could evolve from there.

If you're interested in contributing toward that lighter approach, I'd welcome it. And if you want to discuss a server-side search direction, opening an issue first to talk through the design would be a good next step.

Thanks again for the effort here, I hope this makes sense?

mattiabonzi · 2026-04-06T21:44:27Z

@gordonmurray Totally makes sense. I figured I'd share it since I’d already written the code for my own project, but I completely understand the intention of the project. Thanks for the feedback, closing the PR now!

Names the load-bearing design constraints that shape the project so that proposals touching them can be discussed against a written baseline rather than reconstructed in each thread. Covers: - Vanilla JS with no build step - GET-only, stateless API - No metadata database - No in-app authentication - Read-only access Also adds a short "proposing changes" section pointing to prior design discussions (lance-format#5, lance-format#29, lance-format#39) and a minimal development workflow snippet. Fixes lance-format#42

…ts (#43) Names the load-bearing design constraints that shape the project so that proposals touching them can be discussed against a written baseline rather than reconstructed in each thread. Covers: - Vanilla JS with no build step - GET-only, stateless API - No metadata database - No in-app authentication - Read-only access Also adds a short "proposing changes" section pointing to prior design discussions (#5, #29, #39) and a minimal development workflow snippet. Fixes #42

mattiabonzi added 6 commits February 20, 2026 14:05

fix: Enhance serialization and vector handling

83ba9d3

feat: manage nested vectors and complex objects in the UI

5b3980b

feat: add text wrapping

19b0dcc

feat: Add basic datatble support with filtering and search

a054a33

feat: full datatable integration, server-side processing and search

bef2494

docs: Update README and refresh the screenshot.

c45d742

mattiabonzi closed this Apr 6, 2026

This was referenced Apr 7, 2026

switching the project to typescript? #5

Open

docs: add CONTRIBUTING.md documenting design philosophy and constraints #42

Closed

gordonmurray mentioned this pull request Apr 7, 2026

docs: add CONTRIBUTING.md documenting design philosophy and constraints #43

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[RFC/Experimental] Server-side Search & DataTables Integration#39

[RFC/Experimental] Server-side Search & DataTables Integration#39
mattiabonzi wants to merge 6 commits intolance-format:mainfrom
TuchSoft:feat/datatable-integration

mattiabonzi commented Apr 6, 2026

Uh oh!

mattiabonzi commented Apr 6, 2026

Uh oh!

gordonmurray commented Apr 6, 2026

Uh oh!

mattiabonzi commented Apr 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

mattiabonzi commented Apr 6, 2026

Uh oh!

mattiabonzi commented Apr 6, 2026

Uh oh!

gordonmurray commented Apr 6, 2026

Uh oh!

mattiabonzi commented Apr 6, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants