fix: refactor get_dataset_file to obtain total rows from metadata by Irozuku · Pull Request #287 · DashAISoftware/DashAI

Irozuku · 2025-09-05T19:43:54Z

This pull request refactors how the total number of rows is determined in the get_dataset_file endpoint. Instead of incrementally counting rows while reading batches from the Arrow file, it now retrieves the total row count using the get_dataset_info function, which simplifies the code and fixes the count of rows

Dataset row count calculation:

Removed the on-the-fly calculation of total_rows during batch iteration, eliminating the need to incrementally count rows while processing the file. [1] [2]
Added a call to get_dataset_info to retrieve the total number of rows after batch processing, streamlining the logic and making the code easier to maintain.

Before

After

…set_info

fix: refactor get_dataset_file to calculate total_rows using get_data…

61ee147

…set_info

cristian-tamblay approved these changes Sep 8, 2025

View reviewed changes

cristian-tamblay merged commit ace328b into develop Sep 8, 2025
5 checks passed

cristian-tamblay deleted the fix/total-rows-table branch September 8, 2025 12:21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix: refactor get_dataset_file to obtain total rows from metadata#287

fix: refactor get_dataset_file to obtain total rows from metadata#287
cristian-tamblay merged 1 commit into
developfrom
fix/total-rows-table

Irozuku commented Sep 5, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Irozuku commented Sep 5, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants