g729

Website: https://g729.huny.dev/. The site includes listening samples and a browser-side WebAssembly encoder/decoder demo built from the same pure-Go code.

Pure-Go, MIT-licensed G.729A-compatible speech codec for RTP G729/8000 send paths.

This project provides a clean-room Go implementation of a G.729A-compatible encoder and decoder with no cgo, no native dependencies, and no vendored codec source. It is intended for SIP/RTP, MRCP, TTS, IVR, and server-side media applications that need G729/8000 with annexb=no.

Status: v0.1.0-rc1. The outbound encoder/RTP send path is black-box tested against FFmpeg. The decoder is included for loopback, tooling, and limited inbound regression testing. Broad interoperability certification and ITU byte-exact conformance are not claimed.

Project summary

github.com/hunydev/g729 is an independent, clean-room, pure-Go implementation compatible with the ITU-T G.729 Annex A 8 kbps CS-ACELP speech codec, intended primarily as a server-side encoder/decoder for RTP payload type 18 (G729/8000) inside MRCP / TTS / VoIP deployments.

The codec was implemented from public ITU-T specifications and public textbooks only. No ITU reference C source, bcg729, Sipro Lab implementation, FFmpeg libavcodec/g729dec.c, or any other extant G.729 implementation source was consulted at any point. See the Clean-room statement below.

This module is the G.729 piece of a multi-codec deployment. G.711 (µ-law / A-law) and G.722 are intentionally out of scope for this module and should live in separate codec packages in a deployment stack.

Why this project exists

Many practical G.729 deployments still rely on native C libraries, GPL/commercial licensing, or platform-specific codec modules. This package aims to provide a small, dependency-free, MIT-licensed Go codec for server-side RTP media paths where cgo and native codec packaging are undesirable.

Supported codec scope

Capability	v0.1.0 status
`G729/8000` RTP payload type 18	Encoder/send path supported for `annexb=no`; decoder included with limited interoperability coverage
10 ms frame: 80 int16 samples ↔ 10 packed bytes	Supported
`ptime=10` (one frame per RTP packet)	Supported
`ptime=20` (two frames per RTP packet)	Supported (caller bundles two encoder outputs)
`annexb=no` SDP advertisement	Required
Single-stream `Encoder` / `Decoder`	Supported
Opt-in `DecodeFrameEnhanced` listening aid	Experimental; not a conformance claim
Streaming `Encoder.Write` / `Encoder.Flush`	Supported
Hot-path 0-allocation steady state	Verified
ITU reference byte-exact conformance	Not claimed (see Known limitations)
ITU vector full byte-EQ	Not claimed
G.729 Annex B (SID / CNG / DTX)	Not supported
G.729.1 (wideband / scalable)	Not supported
G.729D / G.729E	Not supported
ITU certified conformance claim	Not made

Installation

go get github.com/hunydev/g729

Module is pure Go (stdlib only). Go 1.22 or newer.

Usage

Minimal frame-at-a-time encode + decode:

package main

import (
    "github.com/hunydev/g729"
)

func main() {
    enc := g729.NewEncoder()
    dec := g729.NewDecoder()

    pcmIn := make([]int16, g729.FrameSamples) // 80 samples = 10 ms @ 8 kHz
    bits := make([]byte, g729.FrameBytes)     // 10 bytes
    pcmOut := make([]int16, g729.FrameSamples)

    // Fill pcmIn from your audio source (8 kHz mono int16) ...

    if err := enc.EncodeFrame(pcmIn, bits); err != nil {
        panic(err)
    }
    if err := dec.DecodeFrame(bits, pcmOut); err != nil {
        panic(err)
    }
}

See examples/ for fuller programs:

examples/encode_pcm — raw PCM int16 LE 8 kHz mono → G.729 frames
examples/decode_g729 — G.729 frames → raw PCM int16 LE 8 kHz mono
examples/streaming_encode — NewStreamingEncoder + Write + Flush
examples/rtp_packetize — illustrative RTP payload packetization
cmd/g729rtpcheck — raw payload / Ethernet IPv4 UDP RTP pcap validator
cmd/g729wasm — Go WebAssembly wrapper used by the project website demo

Each Encoder and each Decoder is single-threaded. Concurrent calls on the same instance are a data race; one instance per stream.

EncodeFrame and DecodeFrame are zero-allocation in steady state. DecodeFrameEnhanced is available as an opt-in, non-strict local listening aid. It is not used by the default decoder and is not an ITU conformance claim.

RTP packetization

G.729 frames are 10 ms each (80 samples / 10 bytes). RFC 3551 assigns static payload type 18 to G729/8000.

For ptime=10, one G.729 frame is the RTP payload (10 bytes).

For ptime=20, the sender concatenates two consecutive 10-byte encoder outputs into a single 20-byte RTP payload; the receiver hands them to two consecutive DecodeFrame calls.

This module does not implement RTP framing itself — the caller owns RTP header / sequence-number / timestamp generation. See examples/rtp_packetize/main.go for an illustrative payload builder and cmd/g729rtpcheck for black-box validation of raw payload streams or Ethernet/IPv4/UDP/RTP pcap captures.

# Validate raw one-frame-per-packet payload bytes.
go run ./cmd/g729rtpcheck -mode=payload -ptime=10 -in output.g729

# Validate payload type 18 packets in a pcap and check RTP continuity.
go run ./cmd/g729rtpcheck -mode=pcap -pt=18 -ptime=20 -strict-ts -in capture.pcap

SDP examples

ptime=10, single G.729 frame per RTP packet:

m=audio 49170 RTP/AVP 18
a=rtpmap:18 G729/8000
a=fmtp:18 annexb=no
a=ptime:10
a=maxptime:10

ptime=20, two G.729 frames bundled per RTP packet (20 bytes payload):

m=audio 49170 RTP/AVP 18
a=rtpmap:18 G729/8000
a=fmtp:18 annexb=no
a=ptime:20
a=maxptime:20

annexb=no MUST be advertised — this codec does not implement Annex B SID / CNG / DTX. Receiving SID frames is not supported in v0.1.0 and may return an error or produce invalid audio.

MRCP / TTS integration note

The codec's intended deployment target is the server-side audio egress path of MRCP-driven TTS and IVR systems: the TTS engine produces 8 kHz int16 PCM; this module produces RTP-shaped 10-byte G.729 frames; the MRCP/SIP framework wraps them in RTP packets and sends them to the SIP endpoint.

Decoder side is provided for inbound audio (e.g. ASR ingress), loopback testing, and tooling.

Current status: the outbound TTS/RTP send path now passes the binding FFmpeg black-box encoder quality gate for G729/8000 annexb=no payloads. This is not an ITU byte-exact or certification claim. The strict local decoder also passes the current FFmpeg black-box regression gates for this repository's local encoder payload and a local, non-redistributed Asterisk-origin .g729 payload sample. That is enough for current tooling and loopback confidence, but it is still not broad interoperability certification for every external G.729 sender.

An experimental DecodeFrameEnhanced path remains available for listening diagnostics. It is non-strict and is not used as evidence for the G729/8000 annexb=no product claim.

Known limitations

This release does not claim ITU byte-exact or certified G.729 conformance. The outbound encoder/RTP send path is now black-box gated against FFmpeg, and the strict local decoder is black-box gated against FFmpeg for the local encoder payload plus a local, non-redistributed Asterisk-origin payload sample.

Concretely:

FFmpeg black-box encoder gate passes. On the ITU SPEECH corpus, SPEECH.BIT -> ffmpeg tracks SPEECH.PST at about GlobalSNR=7.04 dB, SegSNR=4.39 dB, while SPEECH.IN -> our encoder -> ffmpeg currently measures about GlobalSNR=5.09 dB, SegSNR=3.05 dB. The deltas (-1.95 dB global, -1.34 dB segmental) pass the project-defined >= -2.00 dB release gate for outbound encoder quality.
Local decoder roundtrip gate passes against FFmpeg. On the local encoder's own SPEECH payload, our encoder -> local decoder now tracks our encoder -> ffmpeg at about GlobalSNR=13.78 dB, SegSNR=13.99 dB, and RMS ratio 0.991 local-vs-FFmpeg. The end-to-end source quality is still bounded by the outbound encoder gate, not by ITU byte-exact vector certification.
Local Asterisk payload decoder gate passes against FFmpeg. The strict local decoder now tracks FFmpeg on a local, non-redistributed Asterisk-origin .g729 payload sample at about GlobalSNR=14.64 dB, SegSNR=15.23 dB, corr=0.983, and RMS ratio 0.985. This is a useful inbound regression gate for MRCP/SIP integration, but it is not a blanket claim that arbitrary external G.729 payloads from every sender have been exhaustively qualified. The non-strict enhanced listening path is currently worse than strict on this gate and is not conformance evidence.
0 encoder byte-EQ expected failures in the conformance suite. The LSP vector, TAME byte-EQ, former Phase 2c closed-loop pitch, and Phase 2d FCB pins now pass as source-divergence diagnostics after clean-room numeric handoff audits. These measurements remain informational and are not sufficient to certify audio quality. Excluded from the default test suite via the conformance build tag.
5 decoder PSTdomain PASS-by-design FAIL pins (Phase 1o D-3, sample 40-41 drift). Documented; identical pre/post Phase 3. Excluded from the default test suite via the diagnostic build tag.
TestDiagnostic_SinglePulseChain is retained as a diagnostic-only instrumentation log and currently PASSes. Excluded from the default test suite via the diagnostic build tag.

FFmpeg black-box quality gate

The quality gate uses FFmpeg only as an external decoder executable; no external implementation source is inspected.

G729_FFMPEG_BLACKBOX_QUALITY=1 \
G729_REQUIRE_FFMPEG_BLACKBOX_QUALITY=1 \
go test -run TestExternalFFmpegBlackboxQuality_SPEECH -count=1 -v

The gate passes when the local encoder decode quality is within 2 dB of the SPEECH.BIT -> ffmpeg reference path on both global SNR and segmental SNR.

The inbound/local decoder Asterisk sample gate is intentionally separate from the outbound encoder claim:

G729_DECODER_ASTERISK_FFMPEG_QUALITY=1 \
G729_REQUIRE_DECODER_ASTERISK_FFMPEG_QUALITY=1 \
go test ./internal/decoder -run TestPhase3rAsteriskFFmpegQualityGate -count=1 -v

At this checkpoint the strict Asterisk sample gate passes. The enhanced Asterisk listening gate is non-strict and is not part of the default decoder/inbound conformance boundary:

G729_DECODER_ASTERISK_FFMPEG_QUALITY=1 \
G729_REQUIRE_ENHANCED_DECODER_ASTERISK_FFMPEG_QUALITY=1 \
go test ./internal/decoder -run TestPhase3rAsteriskFFmpegQualityGate -count=1 -v

The local decoder gate for the local encoder stream is also separate from the passing outbound encoder claim:

G729_FFMPEG_BLACKBOX_QUALITY=1 \
G729_REQUIRE_LOCAL_DECODER_FFMPEG_QUALITY=1 \
go test -run TestExternalFFmpegBlackboxLocalDecoderDelta_SPEECH -count=1 -v

At this checkpoint the strict local decoder gate passes. The enhanced local listening gate is non-strict and does not change the default decoder conformance boundary:

G729_FFMPEG_BLACKBOX_QUALITY=1 \
G729_REQUIRE_ENHANCED_LOCAL_DECODER_FFMPEG_QUALITY=1 \
go test -run TestExternalFFmpegBlackboxLocalDecoderDelta_SPEECH -count=1 -v

For a user-provided problem sample, run the opt-in external sample diagnostic. WAV/MP3 inputs are converted to 8 kHz mono signed 16-bit PCM through the local FFmpeg executable; raw .pcm, .raw, .sln, .s16le, and .in files are assumed to already be 8 kHz mono signed little-endian int16 PCM.

G729_EXTERNAL_SAMPLE_QUALITY=/path/to/input.wav \
go test -run TestExternalSampleQualityDiagnostic -count=1 -v

This prints input -> our encoder -> ffmpeg, input -> our encoder -> local, and local decoder vs ffmpeg on the same aligned SNR scale used by the web and release diagnostics.

Test suite layout

The repository ships multiple test layers, each with a distinct release gate role:

Suite	Invocation	Release gate role
Default (release)	`go test ./...`	Binding. Must PASS at the v0.1.0-rc1 tag commit.
FFmpeg quality (product)	`G729_FFMPEG_BLACKBOX_QUALITY=1 G729_REQUIRE_FFMPEG_BLACKBOX_QUALITY=1 go test -run TestExternalFFmpegBlackboxQuality_SPEECH -count=1 -v`	Binding for outbound G.729 encoder support. Currently PASSes.
Local decoder quality	`G729_FFMPEG_BLACKBOX_QUALITY=1 G729_REQUIRE_LOCAL_DECODER_FFMPEG_QUALITY=1 go test -run TestExternalFFmpegBlackboxLocalDecoderDelta_SPEECH -count=1 -v`	Binding for strict local decoder regression coverage. Currently PASSes against FFmpeg on the local encoder SPEECH payload.
Asterisk local decode quality	`G729_DECODER_ASTERISK_FFMPEG_QUALITY=1 G729_REQUIRE_DECODER_ASTERISK_FFMPEG_QUALITY=1 go test ./internal/decoder -run TestPhase3rAsteriskFFmpegQualityGate -count=1 -v`	Binding when a local non-redistributed Asterisk-origin inbound sample is present. PASSed during rc1 verification against FFmpeg; not broad sender certification.
Conformance (informational)	`go test -tags=conformance ./...`	Non-blocking. Currently expects 0 failures; new failures must be triaged.
Diagnostic (informational)	`go test -tags=diagnostic ./...`	Non-blocking. Currently expects 5 documented PSTdomain drift-monitoring FAILs.

The conformance and diagnostic suites do not block release; their expected-failure inventories are catalogued in docs/releases/v0.1.0-rc1-checklist.md.

Clean-room statement

This project maintains a clean-room constraint. No ITU reference C, bcg729, FFmpeg, Sipro, or other G.729 implementation source was used. Public specifications, test vectors, and independently written tests were used. v0.1.0 does not claim ITU byte-exact conformance. See IP_PROVENANCE.md for the distribution provenance record and THIRD_PARTY_NOTICES.md for the redistribution notice inventory.

Permitted reference materials, used during development:

ITU-T Recommendation G.729 (06/2012), main body PDF
ITU-T Recommendation G.729 Annex A (06/2012), PDF
Salami, Laflamme, Adoul, Massaloux (1998), Description of ITU-T Recommendation G.729 Annex A, IEEE Transactions on Speech and Audio Processing, §V.B
Kondoz (2004), Digital Speech, §6 (CS-ACELP)
Chu (2003), Speech Coding Algorithms, LP analysis chapter
Goldberg & Riek (2000), A Practical Handbook of Speech Coders
Quackenbush, Barnwell, Clements (1988), Objective Measures of Speech Quality
Oppenheim & Schafer, Discrete-Time Signal Processing (3rd ed.)

Forbidden sources, never consulted at any point:

ITU C reference source files (g729a.c, cb_search.c, dec_gain.c, any other reference distribution file)
bcg729 (Belledonne Communications)
Sipro Lab implementations
FFmpeg G.729 decoder (libavcodec/g729dec.c)
Any other extant G.729 implementation

Each diagnostic and design note in docs/superpowers/diagnostics/ and docs/superpowers/plans/ carries its own per-document I1 declaration with citation list.

License

MIT. See LICENSE. The repository includes an engineering provenance record in IP_PROVENANCE.md and a third-party notice inventory in THIRD_PARTY_NOTICES.md.

Development status

Phase 0 / 1 / 2 — encoder/decoder core implementation, completed. See docs/superpowers/plans/2026-05-02-phase2-encoder-plan.md (master plan).
Phase 3 — CLOSED-PARTIAL. Encoder and decoder diagnostics were closed enough to proceed to RC packaging, with the current public claim bounded by the FFmpeg black-box outbound encoder gate and the decoder limitations above. See Phase 3-final closure report.
Phase 4 — CLOSED. Release packaging cycle for v0.1.0-rc1. See Phase 4 plan.

This is a release candidate. The public API (Encoder, Decoder, NewEncoder, NewDecoder, NewStreamingEncoder, EncodeFrame, DecodeFrame, Reset, Write, Flush, sentinel errors, frame-shape constants) is intended to be stable across the v0.1.x line.

Name		Name	Last commit message	Last commit date
Latest commit History 432 Commits
cmd		cmd
docs		docs
examples		examples
internal		internal
testdata		testdata
.gitignore		.gitignore
AGENTS.md		AGENTS.md
IP_PROVENANCE.md		IP_PROVENANCE.md
LICENSE		LICENSE
README.md		README.md
THIRD_PARTY_NOTICES.md		THIRD_PARTY_NOTICES.md
decoder_root.go		decoder_root.go
decoder_root_test.go		decoder_root_test.go
distribution_audit_test.go		distribution_audit_test.go
doc.go		doc.go
encoder.go		encoder.go
encoder_closedloop_oracle_handoff_test.go		encoder_closedloop_oracle_handoff_test.go
encoder_encodeframe_test.go		encoder_encodeframe_test.go
encoder_streaming.go		encoder_streaming.go
encoder_streaming_test.go		encoder_streaming_test.go
encoder_test.go		encoder_test.go
errors.go		errors.go
errors_test.go		errors_test.go
external_payload_distribution_test.go		external_payload_distribution_test.go
external_sample_quality_test.go		external_sample_quality_test.go
ffmpeg_blackbox_quality_test.go		ffmpeg_blackbox_quality_test.go
ffmpeg_gain_predictor_sweep_test.go		ffmpeg_gain_predictor_sweep_test.go
frame.go		frame.go
frame_test.go		frame_test.go
go.mod		go.mod
itu_vector_helpers_test.go		itu_vector_helpers_test.go
lsp_itu_vector_test.go		lsp_itu_vector_test.go
oracle_artifact_test.go		oracle_artifact_test.go
oracle_handoff_integrity_test.go		oracle_handoff_integrity_test.go
oracle_handoff_write_guard_test.go		oracle_handoff_write_guard_test.go
pack.go		pack.go
pages_audio_golden_test.go		pages_audio_golden_test.go
phase2a_int1_d10_corpus_diagnostic_test.go		phase2a_int1_d10_corpus_diagnostic_test.go
phase2a_int1_d8_groundtruth_test.go		phase2a_int1_d8_groundtruth_test.go
phase2a_int2b_lpcstep_zeroalloc_test.go		phase2a_int2b_lpcstep_zeroalloc_test.go
phase2b_hcenter_hoq2_diagnostic_test.go		phase2b_hcenter_hoq2_diagnostic_test.go
phase2b_hcenter_hphase_diagnostic_test.go		phase2b_hcenter_hphase_diagnostic_test.go
phase2b_int0_openloop_wiring_test.go		phase2b_int0_openloop_wiring_test.go
phase2b_int2_openloop_zeroalloc_test.go		phase2b_int2_openloop_zeroalloc_test.go
phase2c_int0_closedloop_wiring_test.go		phase2c_int0_closedloop_wiring_test.go
phase2c_int1_pitch_byteeq_test.go		phase2c_int1_pitch_byteeq_test.go
phase2c_int2_closedloop_zeroalloc_test.go		phase2c_int2_closedloop_zeroalloc_test.go
phase2d_int0_fcb_wiring_test.go		phase2d_int0_fcb_wiring_test.go
phase2d_int1a_fcb_byteeq_test.go		phase2d_int1a_fcb_byteeq_test.go
phase2d_int2_fcb_zeroalloc_test.go		phase2d_int2_fcb_zeroalloc_test.go
phase2f_int1_per_vector_byteeq_test.go		phase2f_int1_per_vector_byteeq_test.go
phase2f_int1_tame_byteeq_test.go		phase2f_int1_tame_byteeq_test.go
phase2f_int2_public_api_alloc_test.go		phase2f_int2_public_api_alloc_test.go
phase2f_pack1_buildframe_test.go		phase2f_pack1_buildframe_test.go
phase2f_tame1_slot5_sweep_test.go		phase2f_tame1_slot5_sweep_test.go
phase3_roundtrip_quality_diag_test.go		phase3_roundtrip_quality_diag_test.go
phase3_roundtrip_quality_test.go		phase3_roundtrip_quality_test.go
phase3diag_01_unpacked_indices_test.go		phase3diag_01_unpacked_indices_test.go
pitch_itu_vector_test.go		pitch_itu_vector_test.go
tame_gain_taming_oracle_handoff_test.go		tame_gain_taming_oracle_handoff_test.go
throughput_bench_test.go		throughput_bench_test.go

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

g729

Project summary

Why this project exists

Supported codec scope

Installation

Usage

RTP packetization

SDP examples

MRCP / TTS integration note

Known limitations

FFmpeg black-box quality gate

Test suite layout

Clean-room statement

License

Development status

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

g729

Project summary

Why this project exists

Supported codec scope

Installation

Usage

RTP packetization

SDP examples

MRCP / TTS integration note

Known limitations

FFmpeg black-box quality gate

Test suite layout

Clean-room statement

License

Development status

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages