feat: add github actions and fixes to speaker-recognition by 0xrushi · Pull Request #150 · SimpleOpenSoftware/chronicle

0xrushi · 2025-11-09T00:35:35Z

Add CI workflow to build/push Docker Compose services (incl. CUDA variants) with versioned tags
Add Parakeet streaming consumer + dedicated worker to match the new UI
Fix a bug where DEEPGRAM_KEY was not present and PARAKEET was enabled, it was still searching for DEEPGRAM
Fix a bug to include transcription text in the conversation page, instead of just the summary for parakeet
Refresh init docs and setup script tweaks.

Summary by CodeRabbit

New Features
- Added support for Parakeet ASR as an alternative transcription provider with automatic fallback detection
- Enhanced speaker recognition configuration with HuggingFace token validation during setup
Documentation
- Updated setup instructions with SSL certificate generation requirements for speaker recognition
- Improved containerized environment configuration guidance
Bug Fixes
- Workers now gracefully handle missing transcription providers instead of crashing on startup

…nd deployment

…Docker workflow

…kflow

… use 'docker images'

…rkflow

…r extras services

…b Actions workflow

…workflow

…ctions workflow

…selection and adding fallback to default runner

…mage pushing

…ndling in GitHub Actions workflow

…flow by using a structured array for Docker Compose services

…itHub Actions workflow

- Changed speaker service URL from `http://host.docker.internal:8085` to `http://127.0.0.1:8085` in `wizard.py` and updated related documentation. - Added validation for `HF_TOKEN` in the speaker-recognition setup, prompting the user if it's missing or invalid. - Introduced a new `ParakeetStreamConsumer` for handling audio streams with Parakeet, including graceful shutdown handling. - Updated `docker-compose.yml` and related files to ensure proper environment variable usage and service health checks. - Enhanced error handling in audio stream workers for better logging and user feedback.

coderabbitai · 2025-11-09T00:35:46Z

Important

Review skipped

Auto reviews are disabled on this repository.

Please check the settings in the CodeRabbit UI or the .coderabbit.yaml file in this repository. To trigger a single review, invoke the @coderabbitai review command.

You can disable this status message by setting the reviews.review_status to false in the CodeRabbit configuration file.

Walkthrough

This PR introduces Parakeet ASR as a dynamic transcription provider alongside Deepgram, adds conditional worker startup logic, updates Speaker Recognition networking to use localhost, and enhances the setup wizard with HF_TOKEN validation. Multiple backend controllers, models, and workers are updated to support dynamic provider selection and graceful fallback handling.

Changes

Cohort / File(s)	Change Summary
CI/CD Workflow `\.github/workflows/advanced-docker-compose-build\.yml`	New GitHub Actions workflow for manual and push-triggered Docker Compose builds with support for self-hosted and ubuntu-latest runners, iterative service building with version tagging, and CUDA variant handling
Documentation Updates `Docs/init-system\.md`, `extras/speaker-recognition/README\.md`	Speaker Recognition URL changed from host.docker.internal to 127.0.0.1; added SSL certificate generation prerequisites and troubleshooting guidance for nginx
Docker Compose & Templates `backends/advanced/docker-compose\.yml`, `extras/speaker-recognition/docker-compose\.yml`, `extras/speaker-recognition/\.env\.template`, `extras/speaker-recognition/nginx\.conf\.template`	Added PARAKEET_ASR_URL to friend-backend, expanded CORS_ORIGINS with additional localhost variants, updated Speaker Recognition service URLs from speaker-service to 127.0.0.1, changed nginx upstream and HTTP/2 configuration
Conversation Management `backends/advanced/src/advanced_omi_backend/controllers/conversation_controller\.py`, `backends/advanced/src/advanced_omi_backend/models/conversation\.py`	Added legacy transcript field population from active version, updated API output to include segments while excluding high-volume fields, removed redis_conn from queue_controller import
Transcription Provider Selection `backends/advanced/src/advanced_omi_backend/controllers/websocket_controller\.py`	Replaced hardcoded "deepgram" provider with dynamic inference logic: checks TRANSCRIPTION_PROVIDER env var, auto-detects based on PARAKEET_ASR_URL/OFFLINE_ASR_TCP_URI/DEEPGRAM_API_KEY, raises error if none configured
Parakeet Transcription Services `backends/advanced/src/advanced_omi_backend/services/transcription/parakeet\.py`, `backends/advanced/src/advanced_omi_backend/services/transcription/parakeet_stream_consumer\.py`	Added os module import for file cleanup; introduced ParakeetStreamConsumer class for Redis Streams-based transcription with per-result confidence calculation and error handling
Audio Stream Workers `backends/advanced/src/advanced_omi_backend/workers/audio_stream_deepgram_worker\.py`, `backends/advanced/src/advanced_omi_backend/workers/audio_stream_parakeet_worker\.py`	Deepgram worker now logs warnings and returns gracefully instead of crashing on missing API key; new Parakeet worker script with signal handling, Redis integration, and configurable buffer size
Transcription Job Processing `backends/advanced/src/advanced_omi_backend/workers/transcription_jobs\.py`	Added conditional speaker segment generation: use provided segments or build from transcript_text with computed timings; derives start/end times from word data or estimates from word count
Worker Management Script `backends/advanced/start-workers\.sh`	Conditional startup of Deepgram and Parakeet workers based on environment variables; guards process termination with PID existence checks; adapts status output to reflect which workers are running
Setup Wizard `wizard\.py`	Changed speaker URL from host.docker.internal to 127.0.0.1, introduced mandatory HF_TOKEN validation with user prompts, replaced prior token reuse logic with explicit validation flow

Sequence Diagram(s)

sequenceDiagram
    participant WebSocket as WebSocket<br/>Controller
    participant Env as Environment<br/>Variables
    participant Producer as Audio Stream<br/>Producer
    participant Consumer as Stream<br/>Consumer

    WebSocket->>Env: Check TRANSCRIPTION_PROVIDER
    alt Provider is "offline" or "parakeet"
        WebSocket->>Producer: Use "parakeet"
    else Provider is "deepgram"
        WebSocket->>Producer: Use "deepgram"
    else Auto-detect
        WebSocket->>Env: Check PARAKEET_ASR_URL /<br/>OFFLINE_ASR_TCP_URI
        alt Parakeet configured
            WebSocket->>Producer: Use "parakeet"
        else Check DEEPGRAM_API_KEY
            alt Deepgram configured
                WebSocket->>Producer: Use "deepgram"
            else No provider
                WebSocket->>WebSocket: Raise ValueError
            end
        end
    end
    Producer->>Consumer: init_session(provider)

sequenceDiagram
    participant Script as start-workers.sh
    participant Env as Environment
    participant Deep as Deepgram<br/>Worker
    participant Para as Parakeet<br/>Worker
    participant Monitor as Process<br/>Monitor

    Script->>Env: Check DEEPGRAM_API_KEY
    alt API key present
        Script->>Deep: Start deepgram worker
        Note over Script: AUDIO_STREAM_WORKER_PID set
    else Missing
        Note over Script: Skip deepgram<br/>AUDIO_STREAM_WORKER_PID empty
    end

    Script->>Env: Check PARAKEET_ASR_URL /<br/>OFFLINE_ASR_TCP_URI
    alt URL present
        Script->>Para: Start parakeet worker
        Note over Script: PARAKEET_STREAM_WORKER_PID set
    else Missing
        Note over Script: Skip parakeet<br/>PARAKEET_STREAM_WORKER_PID empty
    end

    Script->>Monitor: Wait for child processes
    Monitor-->>Script: Any worker exits
    Script->>Deep: Kill if PID set
    Script->>Para: Kill if PID set

Estimated code review effort

🎯 4 (Complex) | ⏱️ ~45 minutes

Areas requiring extra attention:

websocket_controller.py: Dynamic provider selection logic with multiple conditional paths; ensure all provider resolution branches are correctly tested and error cases are properly handled
parakeet_stream_consumer.py: New public class integrating Redis Streams, ParakeetProvider, and confidence calculation; verify Redis connection handling, error propagation, and async lifecycle management
transcription_jobs.py: Complex conditional logic for speaker segment generation with timing computation from words or estimates; validate edge cases (empty words, missing timings, text-only scenarios)
conversation_controller.py & conversation.py: Changes to data population and API output involving legacy field handling; ensure backward compatibility and verify segment inclusion doesn't break existing clients
start-workers.sh: Subtle control flow changes with conditional PID checks; test all combinations (no workers, single worker, both workers) to prevent zombie processes
wizard.py: HF_TOKEN validation flow introduces user interaction and error handling; verify all paths (missing token, invalid token, valid token) and ensure graceful recovery

Possibly related PRs

Refactor diarization configuration and unify transcription provider interfaces with networking fixes #89: Modifies Speaker Recognition networking stack (nginx and docker-compose) with related networking changes to speaker-service configuration
feat: Adds "closing conversation", beefs up and fixes Queue management page #144: Modifies start-workers.sh startup logic with overlapping conditional worker PID and worker composition changes
long audio fix #95: Modifies Parakeet/ASR transcription stack including backends/advanced/services/transcription/parakeet.py and ASR service integration

Suggested reviewers

thestumonkey

Poem

🐰 A whisker-twitching update hops into the fray,
With Parakeet and Deepgram dancing all day!
Providers now chosen with logic so divine,
Workers wake gently on their conditional line—
Adaptive, graceful, our streams all align! 🎙️

Pre-merge checks and finishing touches

❌ Failed checks (1 inconclusive)

Check name	Status	Explanation	Resolution
Title check	❓ Inconclusive	The title covers multiple distinct changes (GitHub Actions, Parakeet streaming, fixes to speaker-recognition) but doesn't specify the main focus or prioritize which change is primary.	Consider clarifying the primary change or refining the title to indicate whether the focus is the CI workflow, the Parakeet integration, or the speaker-recognition fixes.

✅ Passed checks (2 passed)

Check name	Status	Explanation
Description Check	✅ Passed	Check skipped - CodeRabbit’s high-level summary is enabled.
Docstring Coverage	✅ Passed	No functions found in the changed files to evaluate docstring coverage. Skipping docstring coverage check.

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

AnkushMalaker · 2025-11-09T10:09:48Z

@coderabbitai review

coderabbitai · 2025-11-09T10:09:57Z

✅ Actions performed

Review triggered.

Note: CodeRabbit is an incremental review system and does not re-review already reviewed commits. This command is applicable only when automatic reviews are paused.

coderabbitai

Actionable comments posted: 4

🧹 Nitpick comments (7)

backends/advanced/src/advanced_omi_backend/models/conversation.py (1)

162-173: Consider migration script for heavy data transformations.

The logic correctly populates missing legacy fields from the active transcript version. However, running this in clean_legacy_data means it executes on every model load, which could impact performance for large datasets.

For better performance, consider:

A one-time migration script to backfill these fields across existing documents

Using this validator only as a fallback for unmigrated data
backends/advanced/src/advanced_omi_backend/workers/transcription_jobs.py (1)
245-245: Remove unnecessary f-string prefix.

The string at line 245 has no placeholders and doesn't need the f prefix.
-        logger.info(f"📊 Created single segment from transcript text (no segments returned by provider)")
+        logger.info("📊 Created single segment from transcript text (no segments returned by provider)")
backends/advanced/src/advanced_omi_backend/services/transcription/parakeet_stream_consumer.py (1)
27-40: Consider explicit Optional type annotation.

The service_url parameter should use explicit Optional[str] or str | None to comply with PEP 484, improving type safety and IDE support.
-    def __init__(self, redis_client, service_url: str = None, buffer_chunks: int = 30):
+    def __init__(self, redis_client, service_url: Optional[str] = None, buffer_chunks: int = 30):
Don't forget to import Optional from typing at the top of the file:
from typing import Optional
backends/advanced/docker-compose.yml (1)
35-35: Review hardcoded IPs in CORS origins.

The CORS_ORIGINS includes a hardcoded IP 192.168.1.153, which appears to be a development/testing IP. Consider:

Moving this to an .env file variable for environment-specific configuration

Using a pattern like CORS_ORIGINS=${CORS_ORIGINS:-http://localhost:3010,...} to allow override

Documenting why specific IPs are needed
-      - CORS_ORIGINS=http://localhost:3010,http://localhost:8000,http://192.168.1.153:3010,http://192.168.1.153:8000,https://localhost:3010,https://localhost:8000,https://100.105.225.45,https://localhost
+      - CORS_ORIGINS=${CORS_ORIGINS:-http://localhost:3010,http://localhost:8000,https://localhost:3010,https://localhost:8000,https://localhost}
backends/advanced/src/advanced_omi_backend/controllers/websocket_controller.py (1)
316-339: Dynamic provider resolution looks good with minor suggestion.

The provider selection logic properly handles:

Explicit provider configuration via TRANSCRIPTION_PROVIDER

Auto-detection based on available credentials

Clear error when no provider is configured

One minor improvement: the error message on line 331 is inline with the exception, which static analysis flags as TRY003. Consider defining it as a constant if this pattern is used elsewhere.
+# Error messages
+NO_PROVIDER_ERROR = "No transcription provider configured (DEEPGRAM_API_KEY or PARAKEET_ASR_URL required)"
+
 # Determine transcription provider from environment
 transcription_provider = os.getenv("TRANSCRIPTION_PROVIDER", "").lower()
 if transcription_provider in ["offline", "parakeet"]:
     provider = "parakeet"
 elif transcription_provider == "deepgram":
     provider = "deepgram"
 else:
     # Auto-detect: prefer Parakeet if URL is set, otherwise Deepgram
     parakeet_url = os.getenv("PARAKEET_ASR_URL") or os.getenv("OFFLINE_ASR_TCP_URI")
     deepgram_key = os.getenv("DEEPGRAM_API_KEY")
     if parakeet_url:
         provider = "parakeet"
     elif deepgram_key:
         provider = "deepgram"
     else:
-        raise ValueError("No transcription provider configured (DEEPGRAM_API_KEY or PARAKEET_ASR_URL required)")
+        raise ValueError(NO_PROVIDER_ERROR)
extras/speaker-recognition/README.md (1)
71-73: Add language identifier to fenced code block.

The error message example should specify it's a shell/text output for better rendering.
-```
+```text
 cannot load certificate "/etc/nginx/ssl/server.crt": BIO_new_file() failed
</blockquote></details>
<details>
<summary>backends/advanced/src/advanced_omi_backend/workers/audio_stream_parakeet_worker.py (1)</summary><blockquote>

`1-1`: **Make file executable to match shebang.**

The file has a shebang line but isn't marked as executable. Either make it executable or remove the shebang if it's only run via `python -m`.



```shell
# If file should be directly executable:
chmod +x backends/advanced/src/advanced_omi_backend/workers/audio_stream_parakeet_worker.py
Or remove the shebang if it's not needed (since it's run via uv run python -m in start-workers.sh).

📜 Review details

Configuration used: CodeRabbit UI

Review profile: CHILL

Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 14f5242 and 257c7c1.

📒 Files selected for processing (17)

.github/workflows/advanced-docker-compose-build.yml (1 hunks)
Docs/init-system.md (2 hunks)
backends/advanced/docker-compose.yml (4 hunks)
backends/advanced/src/advanced_omi_backend/controllers/conversation_controller.py (3 hunks)
backends/advanced/src/advanced_omi_backend/controllers/websocket_controller.py (1 hunks)
backends/advanced/src/advanced_omi_backend/models/conversation.py (1 hunks)
backends/advanced/src/advanced_omi_backend/services/transcription/parakeet.py (1 hunks)
backends/advanced/src/advanced_omi_backend/services/transcription/parakeet_stream_consumer.py (1 hunks)
backends/advanced/src/advanced_omi_backend/workers/audio_stream_deepgram_worker.py (1 hunks)
backends/advanced/src/advanced_omi_backend/workers/audio_stream_parakeet_worker.py (1 hunks)
backends/advanced/src/advanced_omi_backend/workers/transcription_jobs.py (1 hunks)
backends/advanced/start-workers.sh (4 hunks)
extras/speaker-recognition/.env.template (1 hunks)
extras/speaker-recognition/README.md (4 hunks)
extras/speaker-recognition/docker-compose.yml (2 hunks)
extras/speaker-recognition/nginx.conf.template (2 hunks)
wizard.py (2 hunks)

🧰 Additional context used

📓 Path-based instructions (3)

**/*.py

📄 CodeRabbit inference engine (CLAUDE.md)

**/*.py: Format Python code with Black using a 100-character line length
Use isort to organize Python imports
Place all imports at the top of the Python file after the docstring; never import modules in the middle of functions or files
Use lazy imports only when absolutely necessary to resolve circular import issues
Group Python imports by: standard library, third-party, then local imports
Always raise errors rather than silently ignoring failures; use explicit exceptions
Understand data structures instead of adding defensive hasattr checks; prefer correct models/parsing over ad-hoc guards

Files:

backends/advanced/src/advanced_omi_backend/services/transcription/parakeet.py
backends/advanced/src/advanced_omi_backend/controllers/websocket_controller.py
wizard.py
backends/advanced/src/advanced_omi_backend/workers/audio_stream_deepgram_worker.py
backends/advanced/src/advanced_omi_backend/models/conversation.py
backends/advanced/src/advanced_omi_backend/controllers/conversation_controller.py
backends/advanced/src/advanced_omi_backend/workers/audio_stream_parakeet_worker.py
backends/advanced/src/advanced_omi_backend/workers/transcription_jobs.py
backends/advanced/src/advanced_omi_backend/services/transcription/parakeet_stream_consumer.py

{**/*.sh,**/Dockerfile}

📄 CodeRabbit inference engine (CLAUDE.md)

Use uv to run Python: replace direct python/python3 invocations with 'uv run python3 ...'

Files:

backends/advanced/start-workers.sh

**/*.sh

📄 CodeRabbit inference engine (CLAUDE.md)

**/*.sh: Use 'docker compose build' without --no-cache by default; only add --no-cache when explicitly needed
Do not use 'docker compose restart' to pick up code changes; rebuild images when src is not volume-mounted

Files:

backends/advanced/start-workers.sh

🪛 actionlint (1.7.8)

.github/workflows/advanced-docker-compose-build.yml

257-257: "steps" section is missing in job "build-default"

(syntax-check)

268-268: "steps" section must be sequence node but got alias node with "" tag

(syntax-check)

🪛 markdownlint-cli2 (0.18.1)

extras/speaker-recognition/README.md

71-71: Fenced code blocks should have a language specified

(MD040, fenced-code-language)

🪛 Ruff (0.14.3)

backends/advanced/src/advanced_omi_backend/controllers/websocket_controller.py

331-331: Avoid specifying long messages outside the exception class

(TRY003)

backends/advanced/src/advanced_omi_backend/workers/audio_stream_parakeet_worker.py

1-1: Shebang is present but file is not executable

(EXE001)

57-57: Unused function argument: frame

(ARG001)

59-59: Store a reference to the return value of asyncio.create_task

(RUF006)

backends/advanced/src/advanced_omi_backend/workers/transcription_jobs.py

245-245: f-string without any placeholders

Remove extraneous f prefix

(F541)

backends/advanced/src/advanced_omi_backend/services/transcription/parakeet_stream_consumer.py

27-27: PEP 484 prohibits implicit Optional

Convert to T | None

(RUF013)

40-40: Avoid specifying long messages outside the exception class

(TRY003)

🔇 Additional comments (21)

backends/advanced/src/advanced_omi_backend/services/transcription/parakeet.py (1)

10-10: LGTM!

The os import addition correctly supports the temporary file cleanup logic at lines 89-90.

backends/advanced/src/advanced_omi_backend/workers/audio_stream_deepgram_worker.py (1)

32-35: LGTM! Graceful startup behavior.

The change from error+exit to warning+return aligns well with the conditional worker startup pattern seen in start-workers.sh. This allows the system to continue operating with alternative transcription providers when Deepgram is not configured.

backends/advanced/src/advanced_omi_backend/controllers/conversation_controller.py (2)

140-146: LGTM! Ensures transcript data availability.

Calling _update_legacy_transcript_fields() before formatting ensures the API response includes transcript and segments from the active version. The explicit inclusion of segments in the output (while excluding heavy nested fields) aligns with the PR's goal to display transcription text on the conversation page.

355-355: LGTM! Clean import.

The import cleanup removes unused redis_conn from the import path, keeping the code clean and aligned with the decorator-based injection pattern noted at line 406.

backends/advanced/src/advanced_omi_backend/workers/transcription_jobs.py (1)

200-244: LGTM! Good fallback for segment-less transcription.

The conditional logic properly handles providers like Parakeet that return transcript text without pre-segmented speaker data. The duration calculation is robust:

Prefers word-level timestamps when available

Falls back to reasonable estimation (0.4s per word)

Defensive check ensures end > start (line 239)

This ensures consistent downstream processing regardless of provider capabilities.

backends/advanced/src/advanced_omi_backend/services/transcription/parakeet_stream_consumer.py (1)

59-75: LGTM! Robust confidence calculation.

The confidence averaging logic provides a sensible fallback (0.9) when word-level confidence is unavailable, and properly computes the mean when confidence scores are present. This ensures consistent transcription result formatting for downstream consumers.

.github/workflows/advanced-docker-compose-build.yml (1)

140-186: LGTM! Well-structured sequential build process.

The base services build loop properly:

Handles different compose files and project directories

Resolves built image IDs to avoid name mismatches

Tags with both version and latest

Includes cleanup of local tags

The error handling with continue on missing images is appropriate.

extras/speaker-recognition/.env.template (1)

31-32: LGTM! Localhost URL aligns with Docker networking.

The change to 127.0.0.1:8085 (localhost) instead of the hostname is appropriate for the Docker setup, and the added comment clearly explains the reasoning for different service variants.

extras/speaker-recognition/docker-compose.yml (2)

32-32: LGTM - Correct healthcheck endpoint.

The change from http://speaker-service:8085/health to http://localhost:8085/health is correct. Healthcheck commands run inside the container, so they should check the service via localhost rather than the service name.

99-100: LGTM - Proper service dependency.

Adding depends_on with service_healthy condition ensures nginx only starts after the web-ui is healthy and ready to serve requests. This prevents potential connection errors during startup.

extras/speaker-recognition/nginx.conf.template (1)

54-55: LGTM - Modern nginx HTTP/2 syntax.

The change from listen 443 ssl http2; to separate listen 443 ssl; and http2 on; directives follows the modern nginx configuration syntax introduced in newer versions.

backends/advanced/docker-compose.yml (2)

20-20: LGTM - Parakeet ASR URL configuration.

Adding PARAKEET_ASR_URL to the backend environment enables dynamic transcription provider selection, aligning with the changes in websocket_controller.py.

68-68: The file already has correct executable permissions; no changes needed.

The start-workers.sh file is already executable (permissions: rwxr-xr-x). Docker will successfully execute it without permission issues.

extras/speaker-recognition/README.md (2)

53-74: Excellent documentation of SSL requirements.

The new section clearly explains:

SSL certificates are required for nginx

How to generate them

What files are created

Expected errors when missing

This will help users avoid a common setup issue.

418-426: LGTM - Helpful troubleshooting guidance.

The new troubleshooting section provides clear steps to diagnose and fix SSL certificate issues, including verification commands and restart instructions.

backends/advanced/src/advanced_omi_backend/workers/audio_stream_parakeet_worker.py (1)

26-76: LGTM - Clean worker implementation.

The worker properly:

Checks for required environment variables before starting

Provides clear logging about why it's not starting if config is missing

Handles graceful shutdown

Cleans up Redis connection

This aligns well with the conditional startup logic in start-workers.sh.

backends/advanced/start-workers.sh (3)

80-87: LGTM - Conditional Deepgram worker startup.

The conditional logic properly:

Checks for DEEPGRAM_API_KEY before starting the worker

Provides clear logging when skipping

Sets AUDIO_STREAM_WORKER_PID to empty string when not started

This prevents unnecessary processes and aligns with the dynamic provider resolution in websocket_controller.py.

91-99: LGTM - Conditional Parakeet worker startup.

The conditional logic properly:

Checks for either PARAKEET_ASR_URL or OFFLINE_ASR_TCP_URI

Uses variable fallback pattern to handle both environment variables

Provides clear logging when skipping

Sets PARAKEET_STREAM_WORKER_PID to empty string when not started

44-45: LGTM - Safe PID cleanup with guards.

The shutdown logic properly guards all kill commands with non-empty checks [ -n "$PID" ], preventing errors when optional workers weren't started. This is a clean approach to handling conditional processes.

Also applies to: 109-114, 128-128

wizard.py (2)

161-161: Loopback URL aligns with the updated container topology.

Pointing the advanced backend at http://127.0.0.1:8085 keeps it consistent with the refreshed speaker-recognition compose/env defaults, so the services stay reachable without relying on host.docker.internal indirection. Nicely done.

177-204: HF token validation flow is solid.

Love that we now hard-stop on placeholders, prompt once, and propagate the validated token via --hf-token; this prevents silent setup failures and keeps the init script happy in one go.

.github/workflows/advanced-docker-compose-build.yml

backends/advanced/src/advanced_omi_backend/workers/audio_stream_parakeet_worker.py

Docs/init-system.md

extras/speaker-recognition/nginx.conf.template

…down handling

Caeddie2 · 2025-11-18T06:16:52Z

I am pretty sure this one would fix the issues, if you build your offline infrastructure from the start.

- Introduced a new `status.py` script for checking the health status of services, including container and HTTP health checks. - Added a `status.sh` script for easier execution of the health checker. - Updated `CLAUDE.md` to include instructions for setting up the test environment and running the health status checker. - Enhanced `setup-requirements.txt` by adding `requests` as a dependency. - Modified `.dockerignore` to include `Caddyfile` for better Docker management. - Updated service URL for speaker recognition in `wizard.py` to use Docker service name.

AnkushMalaker

Thank you for this :)
Looks great

feat: add github actions and fixes to speaker-recognition

0xrushi added 23 commits October 28, 2025 21:16

feat: add GitHub Actions workflow for advanced Docker Compose build a…

5a9c033

…nd deployment

fix: improve .env creation logic and ensure correct image tagging in …

2d211f6

…Docker workflow

secrets

c0186da

env

4a65381

space

5f8f498

dea

43c1f67

docker b

0eeea39

fix: enhance error handling and JSON validation in Docker Compose wor…

0c386b2

…kflow

refactor: update image retrieval method in Docker Compose workflow to…

5db1410

… use 'docker images'

refactor: update service names and tagging logic in Docker Compose wo…

7bdc71c

…rkflow

refactor: enhance .env copying logic and add Docker Compose builds fo…

777b87c

…r extras services

refactor: implement dynamic runner selection in GitHub Actions workflow

d47a5e1

fix: improve error handling for self-hosted runner selection in GitHu…

ed4d667

…b Actions workflow

fix: set working directory for runner display step in GitHub Actions …

da92136

…workflow

fix: update error logging for self-hosted runner fallback in GitHub A…

33355ad

…ctions workflow

refactor: streamline GitHub Actions workflow by consolidating runner …

3ad9280

…selection and adding fallback to default runner

fix: correct loop termination in GitHub Actions workflow for Docker i…

ecbd16d

…mage pushing

refactor: consolidate Docker Compose build steps and improve image ha…

c6c631c

…ndling in GitHub Actions workflow

refactor: simplify service build configuration in GitHub Actions work…

cd903f7

…flow by using a structured array for Docker Compose services

fix: update image tag removal logging and adjust build condition in G…

b58b781

…itHub Actions workflow

Merge branch 'main' of github.com:0xrushi/friend-lite into feat/actions

c17a3a1

cuda variants

257c7c1

coderabbitai bot reviewed Nov 9, 2025

View reviewed changes

0xrushi added 2 commits November 9, 2025 15:15

Refactor GitHub Actions workflow and enhance audio stream worker shut…

4d968e0

…down handling

Update service URLs for speaker recognition and Nginx configuration

e0ef7f4

Caeddie2 mentioned this pull request Nov 18, 2025

Constant restarting of "advanced workers" container because missing DEEPGRAM_API_KEY #159

Closed

AnkushMalaker approved these changes Nov 18, 2025

View reviewed changes

AnkushMalaker merged commit eeb4395 into SimpleOpenSoftware:main Nov 18, 2025
1 of 3 checks passed

thestumonkey pushed a commit to Ushadow-io/chronicle that referenced this pull request Nov 28, 2025

Merge pull request SimpleOpenSoftware#150 from 0xrushi/feat/actions

ad0963c

feat: add github actions and fixes to speaker-recognition

This was referenced Dec 14, 2025

Diar 5 #195

Closed

Fix deepgram/parakeet workers bug #199

Merged

This was referenced Dec 30, 2025

Enhance configuration management and add new setup scripts #235

Merged

Update configuration management and enhance file structure, add test-matrix #237

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Comments

feat: add github actions and fixes to speaker-recognition#150

feat: add github actions and fixes to speaker-recognition#150
AnkushMalaker merged 26 commits intoSimpleOpenSoftware:mainfrom
0xrushi:feat/actions

0xrushi commented Nov 9, 2025 •

edited by coderabbitai bot

Loading

Uh oh!

coderabbitai bot commented Nov 9, 2025 •

edited

Loading

Review skipped

Uh oh!

AnkushMalaker commented Nov 9, 2025

Uh oh!

coderabbitai bot commented Nov 9, 2025

Uh oh!

coderabbitai bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Caeddie2 commented Nov 18, 2025

Uh oh!

AnkushMalaker left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Comments

Conversation

0xrushi commented Nov 9, 2025 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary by CodeRabbit

Uh oh!

coderabbitai bot commented Nov 9, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Review skipped

Walkthrough

Changes

Sequence Diagram(s)

Estimated code review effort

Possibly related PRs

Suggested reviewers

Poem

Pre-merge checks and finishing touches

Uh oh!

AnkushMalaker commented Nov 9, 2025

Uh oh!

coderabbitai bot commented Nov 9, 2025

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Caeddie2 commented Nov 18, 2025

Uh oh!

AnkushMalaker left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

0xrushi commented Nov 9, 2025 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Nov 9, 2025 •

edited

Loading