fix: fix chunk not display in indexed document #33942
crazywoola merged 1 commit into langgenius:main
Summary of Changes (Gemini Code Assist)
This pull request resolves a bug that prevented chunks from being displayed in indexed documents. It also refactors the data source handling logic to accommodate different file formats and ensures that only valid file IDs are used for indexing parameter estimation, improving the reliability and accuracy of document processing.
Pull request overview
Fixes the document settings “preview chunks” panel for indexed documents by ensuring the correct file identifiers are passed through to the indexing-estimate request, including handling a newer data_source_info shape.
Changes:
- Filter out falsy/empty file IDs before sending `file_ids` in the indexing estimate request.
- Populate `files` in document settings when `data_source_info` provides `upload_file_id` (new format), restoring preview/estimate behavior for indexed documents.
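The first change can be sketched roughly as follows. This is a minimal illustration, not the code from the PR: `FileItem` and `collectFileIds` are hypothetical names, and the real types in `use-create-dataset.ts` may differ.

```typescript
// Hypothetical shape for illustration only; not the actual Dify type.
type FileItem = { id?: string | null; name: string }

// Keep only entries that carry a usable, non-empty ID, so the
// indexing estimate request never receives '' / null / undefined.
function collectFileIds(files: FileItem[]): string[] {
  return files
    .map(file => file.id)
    .filter((id): id is string => Boolean(id))
}

const fileIds = collectFileIds([
  { id: 'a1', name: 'guide.pdf' },
  { id: '', name: 'pending.docx' },
  { id: undefined, name: 'draft.md' },
])
console.log(fileIds) // only 'a1' survives the filter
```

The type predicate on `filter` lets TypeScript narrow the result to `string[]` without a cast.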
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated 3 comments.
| File | Description |
|---|---|
| web/service/knowledge/use-create-dataset.ts | Ensures file_ids excludes falsy IDs before calling the indexing estimate API. |
| web/app/components/datasets/documents/detail/settings/document-settings.tsx | Adds support for data_source_info.upload_file_id so settings mode can construct a usable files list for preview/estimate. |
Code Review
This pull request addresses an issue where document chunks were not being displayed. The fix involves updating the logic to correctly handle a new file identifier format (upload_file_id) in the document settings component. Additionally, the code for estimating indexing has been improved to filter out invalid file IDs, making it more robust. My review includes a suggestion to improve type safety and consistency in type assertions within the document settings component.
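One way to handle both `data_source_info` shapes without scattered type assertions is a type guard. The sketch below is an assumption-laden illustration, not the reviewer's actual suggestion or the merged code: only `upload_file_id` comes from this PR, while `LegacySourceInfo`, its `upload_file` field, and the helper names are hypothetical.

```typescript
// Hypothetical shapes; only `upload_file_id` is named in the PR.
type LegacySourceInfo = { upload_file: { id: string; name: string } }
type NewSourceInfo = { upload_file_id: string }
type DataSourceInfo = LegacySourceInfo | NewSourceInfo

// Type guard: true when the payload uses the newer flat format.
function hasUploadFileId(info: unknown): info is NewSourceInfo {
  return typeof info === 'object'
    && info !== null
    && typeof (info as Record<string, unknown>).upload_file_id === 'string'
}

// Resolve a usable file ID from either shape; empty IDs become undefined.
function resolveFileId(info: DataSourceInfo): string | undefined {
  if (hasUploadFileId(info))
    return info.upload_file_id || undefined
  return info.upload_file.id || undefined
}
```

Concentrating the single cast inside the guard keeps the calling code free of assertions, and the narrowed branches are checked by the compiler.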
Summary
Fixes #33941
Screenshots
before
Screen.Recording.2026-03-24.at.09.53.44.mov
after
Screen.Recording.2026-03-23.at.19.20.02.mov
Checklist
- I ran `make lint` and `make type-check` (backend) and `cd web && npx lint-staged` (frontend) to appease the lint gods