update hightable and implement dataframe v2 #30

severo · 2025-07-31T06:12:38Z

@platypii do you want to review this one?

I'm not really sure about the dataframe I created for the iceberg files.

Three things to think about:

how to handle the case where the file contains less than the estimated numRows. In this implementation, I put -1 in the row number (as we only support numbers - it could also be NaN) and undefined in the cells
I'm not sure how icebergRead works, but I fear it has to fetch from the first row, even if we ask for [1000, 2000]. Is it right? If so, do you have an idea of how we could improve? maybe asking for [0, rowEnd] and cache everything to avoid later fetches (consumes more memory)
Currently, if we scroll slowly, we send a lot of icebergRead for one or two rows. We could improve by batching by 1000 rows (a bit like VirtualRowGroup in hyperparam-cli)

Re cache: we might want to implement hyparam/hightable#232, and use it here, to simplify the cache management.

platypii

Looks good. Honestly I'm not that worried about the iceberg functionality. Eventually I think we will need some support for variable number of rows, but this is good for now 👍

update hightable and implement dataframe v2

812cd96

severo requested a review from platypii July 31, 2025 06:12

platypii approved these changes Aug 1, 2025

View reviewed changes

severo merged commit 88d94f2 into master Aug 1, 2025
4 checks passed

severo deleted the update-hightable-in-icebird-demo branch August 1, 2025 19:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

update hightable and implement dataframe v2 #30

update hightable and implement dataframe v2 #30

Uh oh!

severo commented Jul 31, 2025 •

edited

Loading

Uh oh!

platypii left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

update hightable and implement dataframe v2 #30

update hightable and implement dataframe v2 #30

Uh oh!

Conversation

severo commented Jul 31, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

platypii left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

severo commented Jul 31, 2025 •

edited

Loading