Hush! - PCM Silence Detection & Preprocessing Engine

Project Description

Hush! is a high-performance C++ engine designed for real-time speech intelligence pipelines (such as Whisper or klama-whisper). It detects and removes silent segments from audio streams, so downstream STT (Speech-to-Text) systems have less audio to process and store.

Unlike traditional file-based tools, Hush! is built as a PCM-first streaming component, making it suitable for both batch file processing and live recording environments.

Core Features

PCM-Only Core Engine: Standalone, sample-rate-agnostic SilenceDetector that operates directly on raw S16_LE mono PCM buffers.
Real-Time Streaming: Support for chunked audio processing with state preservation and explicit flush mechanisms.
Professional FFI Bridge: A clean extern "C" API for seamless integration with Kotlin Native, Python, or other high-level languages.
Whisper Optimization: An internal normalization bridge forces 16 kHz mono S16 output, the input format that Whisper-style speech-to-text models commonly expect.
Legacy MP3 Support: Maintains a CLI bridge for processing existing MP3 files.
Efficient RMS Analysis: Optimized silence detection logic with timing-aware state machines and adjustable aggression levels.
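
The RMS analysis and the chunked, stateful processing listed above can be sketched roughly as follows. This is a minimal illustration, not Hush!'s actual SilenceDetector: the names rms_dbfs and SilenceGate, the fixed frame length, and the rule of keeping at most max_kept_silence quiet frames per silent run are all assumptions made for the example.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <cstdint>
#include <limits>
#include <vector>

// RMS level of an S16 mono buffer in dBFS (0 dBFS = amplitude 32768).
// Returns -infinity for an empty or all-zero buffer.
inline double rms_dbfs(const int16_t* samples, std::size_t count) {
    assert(samples != nullptr || count == 0);
    const double neg_inf = -std::numeric_limits<double>::infinity();
    if (count == 0) return neg_inf;
    double sum_sq = 0.0;
    for (std::size_t i = 0; i < count; ++i) {
        const double s = samples[i] / 32768.0;  // normalise to [-1, 1)
        sum_sq += s * s;
    }
    const double rms = std::sqrt(sum_sq / static_cast<double>(count));
    return rms > 0.0 ? 20.0 * std::log10(rms) : neg_inf;
}

// Hypothetical streaming gate (not the real SilenceDetector). It analyses
// fixed-size frames, keeps the first `max_kept_silence` quiet frames of each
// silent run, and drops the rest. State survives across process() calls.
class SilenceGate {
public:
    SilenceGate(double threshold_db, std::size_t frame_len,
                std::size_t max_kept_silence)
        : threshold_db_(threshold_db), frame_len_(frame_len),
          max_kept_(max_kept_silence) {
        assert(frame_len > 0);
    }

    // Appends the surviving samples of `chunk` to `out`. An incomplete
    // trailing frame is buffered until the next call or flush().
    void process(const std::vector<int16_t>& chunk, std::vector<int16_t>& out) {
        pending_.insert(pending_.end(), chunk.begin(), chunk.end());
        std::size_t pos = 0;
        while (pending_.size() - pos >= frame_len_) {
            emit_frame(pending_.data() + pos, frame_len_, out);
            pos += frame_len_;
        }
        pending_.erase(pending_.begin(),
                       pending_.begin() + static_cast<std::ptrdiff_t>(pos));
    }

    // Analyses whatever partial frame is still buffered, then clears it.
    void flush(std::vector<int16_t>& out) {
        if (!pending_.empty()) emit_frame(pending_.data(), pending_.size(), out);
        pending_.clear();
    }

private:
    void emit_frame(const int16_t* frame, std::size_t n,
                    std::vector<int16_t>& out) {
        const bool quiet = rms_dbfs(frame, n) < threshold_db_;
        quiet_run_ = quiet ? quiet_run_ + 1 : 0;
        if (!quiet || quiet_run_ <= max_kept_) out.insert(out.end(), frame, frame + n);
    }

    double threshold_db_;
    std::size_t frame_len_;
    std::size_t max_kept_;
    std::size_t quiet_run_ = 0;     // consecutive quiet frames seen so far
    std::vector<int16_t> pending_;  // samples not yet forming a full frame
};
```

Because the partial frame and the quiet-run counter live in the object, a silent stretch that spans several chunks is trimmed the same way as one that arrives in a single buffer. Calling flush() at end of stream ensures the buffered tail is not lost.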

Requirements

A C++17 compliant compiler
CMake (version 3.10 or higher)
FFmpeg development libraries (required for the CLI legacy bridge only).

Building the Project

mkdir build && cd build
cmake ..
make

Upon successful compilation, the following targets are created:

libhush_core.a: The standalone PCM engine.
libhush_ffi.so: The shared library for FFI integration.
hush_remover: The legacy CLI tool for file processing.

Usage (CLI)

The hush_remover tool remains available for processing files:

./hush_remover [options] <input_path> <output_path>

Options:

-t <dB>, --threshold <dB>: Silence threshold (default: -40.0).
-a <level>, --aggression <level>: Trimming sensitivity (default: 1.0).
--dry-run: Analyze reductions without writing output.
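
To give a feel for the threshold option, the sketch below converts a decibel value to a linear amplitude on the signed 16-bit scale. It assumes the threshold is expressed relative to full scale (dBFS), which the option description does not state explicitly; the function name dbfs_to_s16_amplitude is made up for this example.

```cpp
#include <cassert>
#include <cmath>

// Convert a dBFS level to a linear amplitude on the S16 scale, where
// 0 dBFS corresponds to 32768. Assumes the threshold is relative to full scale.
inline double dbfs_to_s16_amplitude(double db) {
    assert(db <= 0.0);  // dBFS levels cannot exceed full scale
    return 32768.0 * std::pow(10.0, db / 20.0);
}
```

Under that assumption, the default of -40.0 corresponds to an RMS amplitude of about 328 on the 16-bit scale, or 1% of full scale. Raising the threshold toward 0 treats louder audio as silence.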

FFI Integration

For speech intelligence pipelines, use the hush_api.h interface:

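// Values appear to be threshold (dB), aggression, and sample rate (Hz),
// matching the CLI defaults; check hush_api.h for the actual field order.
hush_config_t config = {-40.0, 1.0, 16000};
hush_engine_t* engine = hush_engine_create(config);

// Feed chunked PCM data
hush_engine_process(engine, input_ptr, samples, output_ptr, &out_samples);

// Get processing stats
hush_stats_t stats = hush_engine_get_stats(engine);
printf("Reduction: %.2f%%\n", stats.reduction_percentage);

// Release the engine when the stream is finished
hush_engine_destroy(engine);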
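
C++ callers may prefer to tie the create/destroy pair above to scope rather than call the destroy function by hand. The sketch below shows the pattern with std::unique_ptr and a custom deleter. So that it compiles on its own, it uses stand-in demo_* types and functions, which are hypothetical and not part of hush_api.h; with the real header you would substitute hush_engine_t, hush_engine_create, and hush_engine_destroy.

```cpp
#include <memory>

// Stand-ins so the sketch is self-contained. Everything prefixed demo_ is
// hypothetical and only mimics the shape of a C create/destroy API.
struct demo_engine_t { int unused; };
inline int& demo_live_count() { static int n = 0; return n; }
inline demo_engine_t* demo_engine_create() {
    ++demo_live_count();
    return new demo_engine_t{0};
}
inline void demo_engine_destroy(demo_engine_t* e) {
    --demo_live_count();
    delete e;
}

// Deleter that forwards to the C-style destroy function.
struct DemoEngineDeleter {
    void operator()(demo_engine_t* e) const noexcept { demo_engine_destroy(e); }
};

// Owning handle: destroy runs automatically when the handle leaves scope.
using DemoEnginePtr = std::unique_ptr<demo_engine_t, DemoEngineDeleter>;

inline DemoEnginePtr make_demo_engine() {
    return DemoEnginePtr(demo_engine_create());
}
```

The raw pointer for C calls is available through get(), and early returns or exceptions no longer leak the engine.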

Project Structure

core/: The canonical PCM processing logic (Silence detection, RMS).
ffi/: The extern "C" bridge for interoperability.
codecs/: Legacy FFmpeg bridge for MP3/WAV file I/O.
cli/: Command-line interface implementation.
