Conversation
fixed the issue of PDF function used to insert bullet point for every line
…le between both modes
…dundant endpoints - Summary tab now correctly displays summary of transcript. - Job polling no longer runs infinitely after clicking on a transcript. - Removed redundant backend endpoints to debloat.
removed unnecessary console.log(summary) used for debugging
logging messages are more consistent
…tMemo into dev/final-touches
renamed buttons and changed logo
changed css to fit osix identity guidelines and colour palette
fixed error where meetings list used to display old colours
fixed issue where certain elements were still using the font Arial
Added another example file for `package.json` for Docker compose operations.
…tMemo into dev/final-touches
fixed issue where meetings font colour was too similar to background, using universal colour now
…tMemo into dev/final-touches
Not yet tested. For Yan Siong to test out on DGX after lunch.
…tMemo into dev/final-touches
This reverts commit 8f0dbd2.
This reverts commit 1fade98.
system + user prompts (predefined + userdefined)
feature: added the ability to allow users to define their own system and user prompts
adding padding to renaming functions and buttons to reduce crampedness
created a detailed docs that specifies the functions of the app, how to set up and use the app, and any common issues that may arise in the process.
added particles in bg to align with toolkit theme
There was a problem hiding this comment.
Summary of Changes
Hello @ThePrunerator, I'm Gemini Code Assist1! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!
This pull request introduces several key features and improvements to the transcription web application, focusing on enhancing user control over transcripts and summaries. It enables timestamping, speaker renaming, and custom summarization prompts, alongside new export options and a comprehensive documentation update, all while refining the user interface.
Highlights
- Transcript Timestamping: The backend's
format_resultfunction now includesstartandendtimestamps for each transcribed segment, and the frontend displays these timestamps alongside each transcript entry. - Speaker Tagging and Renaming: New API endpoints (
PATCH /jobs/{uuid}/speakers) and frontend functionality allow users to rename speakers within a transcript, making speaker identification more recognizable. The frontend also assigns unique IDs and colors to speakers. - Transcript Export to .txt: A new feature has been added to the frontend, enabling users to export the full diarized transcript into a
.txtfile for data management purposes. - User-Defined Prompts for Summarization: The summarization feature now supports custom user and system prompts, allowing for more tailored AI-generated summaries. The frontend provides input fields for these prompts.
- Meeting and Speaker Renaming: Users can now rename entire meetings via a new
PATCH /jobs/{uuid}/renameendpoint, which updates the job metadata, audio file, and transcript file names. Speaker names can also be edited directly in the transcript view. - Enhanced Documentation: A comprehensive
detailed_doc.mdfile has been added, providing in-depth information on features, architecture, prerequisites, installation, usage, API reference, configuration, troubleshooting, and development guidelines. - UI/UX Improvements: The frontend has undergone significant styling updates with new color themes, improved layout (transcript and summary in tabs), and visual enhancements like a
particles.jsbackground and updated fonts.
Using Gemini Code Assist
The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.
Invoking Gemini
You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.
| Feature | Command | Description |
|---|---|---|
| Code Review | /gemini review |
Performs a code review for the current pull request in its current state. |
| Pull Request Summary | /gemini summary |
Provides a summary of the current pull request in its current state. |
| Comment | @gemini-code-assist | Responds in comments when explicitly tagged, both in issue comments and review comments. |
| Help | /gemini help |
Displays a list of available commands. |
Customization
To customize Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.
Limitations & Feedback
Gemini Code Assist may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counter productive. You can react with 👍 and 👎 on @gemini-code-assist comments or fill out our survey to provide feedback.
You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.
Footnotes
-
Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩
There was a problem hiding this comment.
Code Review
This is a substantial pull request that adds several great features and a major UI refresh. The new capabilities for timestamping, speaker renaming, and custom summarization prompts are excellent additions. The backend code, particularly for the new endpoints, is well-structured and uses robust patterns for file I/O.
I've identified a few issues, including a critical one in the .gitignore configuration and a high-severity bug in the backend's file path logic. I've also provided some suggestions to improve code quality and documentation. Once these points are addressed, this PR will be in great shape. Fantastic work!
…ontainer logs When an error is encountered when generating summary, it now correctly flags up in the docker container logs instead of showing as 200 OK.
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
Co-authored-by: gemini-code-assist[bot] <176961590+gemini-code-assist[bot]@users.noreply.github.com>
- Compacted README to reduce confusion and simplify setup - Restricted backend origins to only frontend URL
…tMemo into dev/final-touches
Summary
Enabled multiple features for the transcription web-app, mainly:
.txtformat (for data management)FastAPI Application Checklist (Delete if PR is not relevant)
/healthendpoint is implemented and returns 200 OK/docsor/redocfeature/*,bugfix/*) — do not usedevdirectlyGeneral Checklist