Conversation
Merging feat/full stack into main
feat: Add CODEOWNERS file and configure branch protection to restrict access to dev branch
Revert "feat: Add CODEOWNERS file and configure branch protection to restrict access to dev branch"
Revert "Merging feat/full stack into main"
feat: Add CODEOWNERS file and configure branch protection to restrict access to dev branch
status code polling feature implemented, status text now more concise
fixed unbound variable "uuid" instances; now returns an error message when there are no more available UUIDs to assign
updated status codes for applicable functions for easier reference
docs: new sub-issue template
docs: new pull_request_template
style: removed version & formatting
Add frontend linting workflow (ESLint, Prettier, Stylelint)
feat: Configure React frontend for GitHub Pages deployment via gh-pag…
ci: Add auto-fix and push logic to frontend linting GitHub Actions wo…
ci: added workflow_dispatch
ci: Add Stylelint configuration file to support CSS linting in CI
fix: duplicate env declaration
refactor: move README back to root
fix: missing env
docs: improve clarity
Merge feat/full stack into main
…ry status codes: added extra status-code functions for functions beyond the basic ones; removed in-memory status_codes so status codes now persist in the CSV file
changed syntax to be more readable
combined all gitignore files into one in the root directory
Wrapped applicable functions in CSV_LOCK; fixed crashing, illogical code in CSV file reading
Feat/polling codes + condensed gitignore
backend endpoints now work as intended, with the exception of the "cuda" parameter; outputs are now more standardised and predictable, so fewer errors occur
dockerised backend + tested -> Invalid credentials in Authorization header
updated README, docker compose, and env
build: backend testing on venv + docker successful
Summary of Changes
Hello @ThePrunerator, I'm Gemini Code Assist¹! I'm currently reviewing this pull request and will post my feedback shortly. In the meantime, here's a summary to help you and other reviewers quickly get up to speed!
This pull request focuses on enhancing the project's core audio transcription capabilities by upgrading the backend to support GPU acceleration and implementing a robust job tracking system. Concurrently, the frontend has been simplified and restructured to provide a more focused user experience. Significant improvements to development tooling and documentation have also been made to streamline future contributions.
Highlights
- Backend Job Management: Implemented a robust CSV-based job tracking system for audio transcription, including status updates (processing, completed, error) and standardized UUID handling across API endpoints.
- GPU Acceleration: Configured the backend Docker environment to leverage NVIDIA GPUs for PyTorch-based audio processing, significantly enhancing transcription performance.
- Frontend Simplification & Restructuring: Refactored the frontend by removing `react-router-dom` and consolidating the user interface into a single `TextInterface` component for streamlined audio upload and transcription display.
- Improved Development Workflow & Documentation: Introduced new issue and pull request templates, refined `.gitignore` for better dependency management, added Stylelint for consistent CSS, and provided comprehensive `README.md` instructions for easier project setup and execution.
Using Gemini Code Assist
The full guide for Gemini Code Assist can be found on our documentation page, here are some quick tips.
Invoking Gemini
You can request assistance from Gemini at any point in your pull request via creating an issue comment (i.e. comment on the pull request page) using either /gemini <command> or @gemini-code-assist <command>. Below is a summary of the supported commands.
| Feature | Command | Description |
|---|---|---|
| Code Review | `/gemini review` | Performs a code review for the current pull request in its current state. |
| Pull Request Summary | `/gemini summary` | Provides a summary of the current pull request in its current state. |
| Comment | `@gemini-code-assist` | Responds in comments when explicitly tagged, both in issue comments and review comments. |
| Help | `/gemini help` | Displays a list of available commands. |
Customization
To customize the Gemini Code Assist for GitHub experience, repository maintainers can create a configuration file and/or provide a custom code review style guide (such as PEP-8 for Python) by creating and adding files to a .gemini/ folder in the base of the repository. Detailed instructions can be found here.
Limitations & Feedback
Gemini Code Assist is currently in preview and may make mistakes. Please leave feedback on any instances where its feedback is incorrect or counterproductive. You can react with 👍 and 👎 on @gemini-code-assist comments to provide feedback.
You can also get AI-powered code generation, chat, as well as code reviews directly in the IDE at no cost with the Gemini Code Assist IDE Extension.
Footnotes

1. Review the Privacy Notices, Generative AI Prohibited Use Policy, Terms of Service, and learn how to configure Gemini Code Assist in GitHub here. Gemini can make mistakes, so double check it and use code with caution. ↩
Code Review
This pull request introduces significant infrastructure and code changes, enhancing the project's setup, backend job management, and frontend architecture. Key improvements include updated .gitignore and README files, a GPU-enabled Dockerfile for the backend, and a refactored backend API for more robust job tracking via CSV. The frontend has been simplified to a single-page application with new linting and deployment scripts. While these changes generally improve maintainability and correctness, some areas could benefit from further optimization, such as the scalability of CSV operations and the configurability of environment-specific settings.
| uuid="" | ||
| used = set() | ||
|
|
||
| for i in range(10000): | ||
| if i not in used: | ||
| uuid = f"{i:04d}" | ||
| break | ||
| if not os.path.isfile(CSV_FILE): | ||
| with open(CSV_FILE, "w", newline="") as f: | ||
| writer = csv.DictWriter(f, fieldnames=FIELDNAMES) | ||
| writer.writeheader() | ||
|
|
||
| with open(CSV_FILE, "r") as f: | ||
| reader = csv.reader(f) | ||
| for row in reader: | ||
| try: | ||
| if row: | ||
| used.add(int(row[0])) | ||
| except (ValueError,IndexError): | ||
| continue | ||
| for i in range(10000): | ||
| if i not in used: | ||
| uuid = f"{i:04d}" | ||
| break | ||
| if uuid == "": | ||
| timestamp = get_timestamp() | ||
| file_name = file.filename | ||
| logging.error(f"{timestamp}: Error generating UUID for transcription request for file: {file_name}.wav") | ||
| return {"error": "No available UUIDs.", "file_name": file_name} |
The current UUID generation logic iterates up to a fixed limit of 10,000. If the number of jobs exceeds this, the system will fail to generate a new UUID. To ensure continuous operation and avoid this hard limit, consider generating the next UUID by finding the maximum existing UUID in the CSV and incrementing it. This approach scales better for integer-based IDs.
| uuid="" | |
| used = set() | |
| for i in range(10000): | |
| if i not in used: | |
| uuid = f"{i:04d}" | |
| break | |
| if not os.path.isfile(CSV_FILE): | |
| with open(CSV_FILE, "w", newline="") as f: | |
| writer = csv.DictWriter(f, fieldnames=FIELDNAMES) | |
| writer.writeheader() | |
| with open(CSV_FILE, "r") as f: | |
| reader = csv.reader(f) | |
| for row in reader: | |
| try: | |
| if row: | |
| used.add(int(row[0])) | |
| except (ValueError,IndexError): | |
| continue | |
| for i in range(10000): | |
| if i not in used: | |
| uuid = f"{i:04d}" | |
| break | |
| if uuid == "": | |
| timestamp = get_timestamp() | |
| file_name = file.filename | |
| logging.error(f"{timestamp}: Error generating UUID for transcription request for file: {file_name}.wav") | |
| return {"error": "No available UUIDs.", "file_name": file_name} | |
| uuid="" | |
| max_uuid = -1 | |
| if not os.path.isfile(CSV_FILE): | |
| with open(CSV_FILE, "w", newline="") as f: | |
| writer = csv.DictWriter(f, fieldnames=FIELDNAMES) | |
| writer.writeheader() | |
| with open(CSV_FILE, "r") as f: | |
| reader = csv.reader(f) | |
| for row in reader: | |
| try: | |
| if row and row[0].isdigit(): # Ensure row[0] is a digit string before converting | |
| current_uuid = int(row[0]) | |
| if current_uuid > max_uuid: | |
| max_uuid = current_uuid | |
| except (ValueError,IndexError): | |
| continue | |
| uuid = f"{max_uuid + 1:04d}" |
```python
def add_job(uuid: str, file_name: str, status_code: str) -> None:
    """
    Inserts a new job in the CSV.
    Reads all rows, adds the new one, sorts by numeric uuid,
    then rewrites the entire file.
    """
    with CSV_LOCK:
        rows = []
        if os.path.isfile(CSV_FILE):
            with open(CSV_FILE, "r", newline="") as f:
                reader = csv.DictReader(f)
                for row in reader:
                    rows.append(row)

        rows.append({
            "uuid": uuid,
            "file_name": file_name,
            "status_code": status_code
        })

        rows.sort(key=lambda r: int(r["uuid"]))

        with open(CSV_FILE, "w", newline="") as f:
            writer = csv.DictWriter(f, fieldnames=FIELDNAMES)
            writer.writeheader()
            writer.writerows(rows)


def update_status(uuid: str, new_status: str) -> None:
    """
    Read the existing CSV, update the status_code for the matching uuid,
    and write out to a temporary file which then replaces the original.
    """
    with CSV_LOCK:
        dir_name = os.path.dirname(CSV_FILE) or "."
        fd, temp_path = tempfile.mkstemp(dir=dir_name, text=True)
        try:
            with os.fdopen(fd, "w", newline="") as tmpf, open(CSV_FILE, "r", newline="") as csvf:
                reader = csv.DictReader(csvf)
                writer = csv.DictWriter(tmpf, fieldnames=FIELDNAMES)
                writer.writeheader()

                for row in reader:
                    if row["uuid"] == uuid:
                        row["status_code"] = new_status
                    writer.writerow(row)
            os.replace(temp_path, CSV_FILE)
        except Exception:
            os.remove(temp_path)
            raise
```
The add_job and update_status functions rewrite the entire CSV file for each operation. While this ensures data consistency and correctness (especially with the atomic update using temporary files in update_status), it can become inefficient for a large number of job entries. For a small-scale application, this might be acceptable, but for future scalability, consider using a more performant data storage solution (e.g., a simple embedded database like SQLite) if the number of jobs is expected to grow significantly.
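As a minimal sketch of the SQLite alternative mentioned above (the table and column names here simply mirror the CSV fields and are illustrative, not code from this PR):

```python
import sqlite3

def init_db(conn: sqlite3.Connection) -> None:
    # Hypothetical schema mirroring the CSV columns (uuid, file_name, status_code).
    conn.execute(
        "CREATE TABLE IF NOT EXISTS jobs ("
        "uuid TEXT PRIMARY KEY, file_name TEXT, status_code TEXT)"
    )
    conn.commit()

def add_job(conn: sqlite3.Connection, uuid: str, file_name: str, status_code: str) -> None:
    # Single-row insert; no full-file rewrite needed.
    conn.execute(
        "INSERT INTO jobs (uuid, file_name, status_code) VALUES (?, ?, ?)",
        (uuid, file_name, status_code),
    )
    conn.commit()

def update_status(conn: sqlite3.Connection, uuid: str, new_status: str) -> None:
    # In-place update; SQLite handles atomicity and file locking internally,
    # so the CSV_LOCK and temp-file dance become unnecessary.
    conn.execute("UPDATE jobs SET status_code = ? WHERE uuid = ?", (new_status, uuid))
    conn.commit()
```

Because `uuid` is the primary key, duplicate IDs are rejected at the database level rather than by scanning the whole file.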
```python
if not os.path.isfile(CSV_FILE):
    with open(CSV_FILE, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=FIELDNAMES)
        writer.writeheader()
```
The logic to check for and create the CSV file if it doesn't exist is duplicated across multiple functions (get_jobs, transcribe, delete_job, get_file_name, get_job_status). This can lead to inconsistencies and makes maintenance harder. Consider centralizing this initialization logic, perhaps in an application startup hook or a dedicated utility function that ensures the CSV file and its header are present before any operations are attempted.
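One way to centralize that initialization is a small idempotent helper that every endpoint (or a single application startup hook) can call. The function name `ensure_csv_exists` is hypothetical; `CSV_FILE` and `FIELDNAMES` are assumed module-level constants as in the reviewed code:

```python
import csv
import os

# Assumed constants, matching the reviewed snippet.
CSV_FILE = "jobs.csv"
FIELDNAMES = ["uuid", "file_name", "status_code"]

def ensure_csv_exists() -> None:
    """Create the CSV with its header if missing.

    Safe to call repeatedly, so it can replace the duplicated
    check-and-create blocks in each endpoint, or run once at startup.
    """
    if not os.path.isfile(CSV_FILE):
        with open(CSV_FILE, "w", newline="") as f:
            writer = csv.DictWriter(f, fieldnames=FIELDNAMES)
            writer.writeheader()
```

In a FastAPI app, for example, calling this from a startup event would guarantee the file exists before any request handler touches it.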
```yaml
- audiofiles:/app/audiofiles
- logs:/app/logs
- transcripts:/app/transcripts
```
The transcripts volume is mounted to the frontend container. Unless the frontend application directly reads or writes files from the /app/transcripts directory, this volume mount is unnecessary. Frontend applications typically interact with backend APIs to retrieve data, rather than directly accessing shared file system volumes. Consider removing this volume mount from the frontend service if it's not explicitly used, to maintain better separation of concerns.
```dockerfile
# Optional: install express and multer for the upload server
RUN npm install express multer
```
The frontend/src/components/Transcribe.js component directly sends audio files to the backend's /jobs endpoint. This implies that the frontend itself does not run an upload server. If express and multer are not used by the frontend application, installing them in the Dockerfile adds unnecessary dependencies and increases the image size. Consider removing this line if these packages are not actively utilized by the frontend.
```javascript
try {
  const formData = new FormData();
  formData.append("file", audioFile);
  const response = await fetch("http://localhost:8000/jobs", {
```
The backend API URL http://localhost:8000/jobs is hardcoded. This makes the application difficult to deploy to different environments (e.g., staging, production) without code changes. Consider making API URLs configurable, for example, by using environment variables that can be set during the build process or at runtime.
```diff
-const response = await fetch("http://localhost:8000/jobs", {
+const response = await fetch(process.env.REACT_APP_BACKEND_URL + "/jobs", {
```