Skip to content

[comp] Production Deploy#2828

Merged
tofikwest merged 5 commits into
releasefrom
main
May 12, 2026
Merged

[comp] Production Deploy#2828
tofikwest merged 5 commits into
releasefrom
main

Conversation

@github-actions
Copy link
Copy Markdown
Contributor

@github-actions github-actions Bot commented May 12, 2026

This is an automated pull request to release the candidate branch into production, which will trigger a deployment.
It was created by the [Production PR] action.


Summary by cubic

Moves questionnaire parsing to an async pipeline with stronger content extraction and classification for PDFs, images, and Excel. Improves accuracy by extracting only answerable items and standardizes the upload flow to return a Trigger run for progress tracking.

  • New Features

    • Async parsing via @trigger.dev/sdk; upload now triggers parse-questionnaire and returns { runId, publicAccessToken } with progress metadata.
    • More reliable PDF extraction with Claude first and OpenAI fallback; splits large PDFs by page for stability.
    • Smarter Excel extraction that skips formulas, scoring columns, and placeholders; better header detection; XML fallback for edge cases.
    • Question parser now classifies content and returns only answerable items; saved answers are always null for auto-fill flows; deduplicates across chunks.
    • Tightened constants and concurrency for faster, safer classification; clearer errors (e.g., “Questionnaire with ID X not found”, S3 upload failures).
  • Migration

    • API change: uploadAndParse now returns { runId, publicAccessToken } instead of { questionnaireId, totalQuestions }. Use these to poll job status.
    • .xls uploads are rejected; accept .xlsx, .csv, .pdf, images, and .docx. UI updated to remove .xls.
    • Newly parsed items are created with answer = null and status = 'untouched' by design; do not rely on source-file answers during upload.

Written for commit 7a51f4c. Summary will update on new commits.

github-actions Bot and others added 5 commits May 12, 2026 15:05
Add a new function to handle PDF extraction that falls back to OpenAI when Claude's extraction fails. Update tests to cover this new behavior and refactor existing PDF extraction methods for improved clarity and functionality.
…naire-parse

[dev] [tofikwest] fix/improve-security-questionnaire-parse
@vercel
Copy link
Copy Markdown

vercel Bot commented May 12, 2026

The latest updates on your projects. Learn more about Vercel for GitHub.

Project Deployment Actions Updated (UTC)
app (staging) Building Building May 12, 2026 8:30pm
comp-framework-editor (staging) Building Building May 12, 2026 8:30pm
1 Skipped Deployment
Project Deployment Actions Updated (UTC)
portal (staging) Skipped Skipped May 12, 2026 8:30pm

Request Review

@tofikwest tofikwest merged commit 0309a11 into release May 12, 2026
10 of 12 checks passed
@claudfuen
Copy link
Copy Markdown
Contributor

🎉 This PR is included in version 3.51.0 🎉

The release is available on GitHub release

Your semantic-release bot 📦🚀

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants