Multicast for the scaled network #134

jefflightweb · 2026-04-24T13:56:49Z

jefflightweb
Apr 24, 2026

I've begun building the multicast-in-multicast reference network topology.

More info: https://singulargrit.substack.com/p/multicast-within-multicast-anycast

To start, I've proposed a new raw wire frame format for transactions that dovetails onto BRC-12 (and probably the other extended formats also). BRC-124 was assigned after I had already started on BRC-122, hence the conflicting branch name.

NOTE: An early version of BRC-124 was committed to the BRC repo without peer review during some organizational cleanup. I've since updated the PR to incorporate hash-chain sequencing instead of numerical counters.

PR: #133

Here are some design details: https://github.com/lightwebinc/bitcoin-multicast/blob/main/DESIGN.md

Service implementations are available also. The proxy has previously been tested to 400K PPS before my old development environment starts dropping packets. I have tested listener functionality with proxy. I am currently working on testing retransmission functionality. Retransmissions require proper sequencing and source attribution in the transaction network header, and thus this frame format was submitted for new BRC.

jefflightweb · 2026-04-24T14:01:22Z

jefflightweb
Apr 24, 2026
Author

Fun things to think about, at 1 Billion TPS, every 1 byte in the network header is 1 Gigabyte per second of data. I have tried to reduce space as much as possible. There may be more optimization required. CRC32c is essentially free with hardware acceleration and so the source ID was able to be reduced down to 4 bytes from 16 as the number of actual senders on the multicast network should be limited, and also segregated via the temporal sequence ID.

0 replies

jefflightweb · 2026-05-04T22:59:04Z

jefflightweb
May 4, 2026
Author

I'm still working on improvements to the wire frame format. In testing, I learned of shortcomings in the design of the monotonic sequence numbers. I am shifting to a hash chain to solve this and will have further updates to the pull request soon.

0 replies

jefflightweb · 2026-05-05T16:50:58Z

jefflightweb
May 5, 2026
Author

Background on the retransmission work:

Multicast flows are UDP based, without reliable delivery guarantees. If a receiver host misses a packet, it needs an efficient way to determine such data is missing and request retransmission. The way this is achieved using BRC-124 is via a sender attributed, shard group bounded hash chain sequencing function applied to every packet.

The tooling within the Retry Endpoint [1] repo implements a caching retry endpoint to facilitate both multicast and unicast transmission. It includes a beacon discovery mechanism for consumers, NACK ACK/MISS signaling mechanisms, as well as tier and preference based hierarchical escalation configuration capabilities.

Shard Listener [2] demonstrates gap detection and requests.

Shard Proxy [3] demonstrates sequence stamping as well as shard group and sender ID attribution.

[1] Retry Endpoint
[2] Shard Listener
[2] Shard Proxy

0 replies

jefflightweb · 2026-05-06T18:47:07Z

jefflightweb
May 6, 2026
Author

Added PR BRC-126 for NACK-based retransmission protocol.

0 replies

jefflightweb · 2026-05-07T16:16:34Z

jefflightweb
May 7, 2026
Author

The gap sequence retransmission is a bit tricky, as listeners must track chain sequences across group indexes/shards, and across senders (both original sources and/or proxies). Identifying a chain is important for retry endpoint rate limiting. I'm doing some work here now.

1 reply

jefflightweb May 9, 2026
Author

The hardware-accelerated XXH64 PrevSeq/CurSeq hash chain with new chain PrevSeq=0 seems to be working well so far. Each chain is unique as it is composed of XXH64(Source IP || TXID shard bits || sequence) and stamped either by the source, or ingress proxy, and ensures uniqueness across shards and senders. The hash size is adequate even at high throughput for retransmission key index over a reasonable time horizon.

jefflightweb · 2026-05-09T14:58:26Z

jefflightweb
May 9, 2026
Author

I'll probably be making changes to the sharding procedure. 24 bits is probably too long for the current group addressing. I can't see network operators willingly choosing to break up the full transaction stream into more than say, 1024 total groups. Even that would probably be an administrative nightmare with a hard requirement for full configuration automation/debug.

The shard portion of the multicast group address should take up no more than 16 bits, maybe more if some sort of hash derivation is used. This leaves us 48 to use for possible subtree group addressing, if it should ever be needed, which would be stacked with the shard addressing, so the same address covers both. Think downstream networks connected by listeners that bridge the domain to the core sharded flows, but filtered by subtrees and re-transmitted multicast. The multicast group can then be determined by subtree group, and then shard group. In this way, the multicast group limits of switching gear are transcended through specialty subscription.

I'm working on a subtree group ID announcement protocol that links subtree IDs to subtree groups. The goal is to be able to encompass all subtrees associated with an arbitrary specialization or categorization of transaction flows over time. New subtrees are announced continuously, linking them to a particular group which downstream interested shard-listeners pick up. They can then filter packets for just the flows they are interested in. The flows may still be optionally sharded for load balancing.

Expect a few things to change. Work in progress.

0 replies

jefflightweb · 2026-05-10T01:45:55Z

jefflightweb
May 10, 2026
Author

BRC-127: Subtree group announcement protocol: #140

A method for transaction specialization filtering for network segments. Not every network needs the full transaction stream. Filter by groups of subtrees.

0 replies

jefflightweb · 2026-05-10T17:40:06Z

jefflightweb
May 10, 2026
Author

BRC-128: Multicast Extended Transaction Frame Format: #141

Tacks on EF payload to BRC-124 frames.

0 replies

jefflightweb · 2026-05-14T22:59:26Z

jefflightweb
May 14, 2026
Author

BRC-129: IPv6 Multicast Address Assignments: #143

Describes how to carve up the FF0X::B allocation assigned to BSV Association for Bitcoin SV Node Groups. See https://www.iana.org/assignments/ipv6-multicast-addresses/ipv6-multicast-addresses.xhtml for more detail.

The last 16 bits are available for use in the assignment. This scheme tries to take a pragmatic approach by limiting the transaction shard groups to no more than 4096 shards. It leaves approximately 56K addresses in the middle for future expansion, and leaves the top 2048 suffixes (ending in :FFFF) for important network control groups.

Regarding the shard group counts, this would mean a maximum 4096 group subscriptions, over 4096 or less network links PER miner, to take on a full transaction feed! Imagine the administration on that! At this point, I can not imagine that the administrative burden would be worth that level of load balancing, especially with 12 Terabit optical interfaces being in development by the network industry at this time (2026).

0 replies

jefflightweb · 2026-05-15T14:20:59Z

jefflightweb
May 15, 2026
Author

I'll be changing BRC-124 yet again, back to a hybrid of the original submission (that was merged without proper review) and the new format. The key fields are a hash key composed of a 16-bit XXH64(Source IP || Shard bits || Subtree ID || Sequence number), and then the 16 bit sequence number repeated as an integer after. This will allow for numerical gaps to be detected more simply than following a chain, and opens up a path for multi-frame retransmission sequences. This will work better at very high throughput levels than trying to walk an arbitrary length missing hash chain (even from both ends).

The frame format is 92 bytes with this arrangement, and I really don't want to make it any longer as that is already 92 GB/s at 1B TPS. I need to make sure XXH64 is a sound choice even at high throughput with a lot of retransmission keys being added continuously (1 per multicast frame).

More changes coming. Development is active.

1 reply

jefflightweb May 17, 2026
Author

This was fixed by moving back to a Hashkey identifier for the temporal transaction sequence flow, along with an integer counter. It will allow gaps to be detected and fixed faster, and opens up a path toward range-based frame retransmission also.

jefflightweb · 2026-05-15T17:56:38Z

jefflightweb
May 15, 2026
Author

A comment on the existing multicast work in the BRC repo, Ty and Project Babbage did great work here and I've read through BRC 80, 82, and 83 a few different times. These were written before the architecture was elucidated in the blog post I reference in the design document. This blog post described most of the details I needed to get started with an actual implementation that met the reference.

There are a lot of good ideas in BRC 80 and 83, as well as 82. The more I contemplated meeting the goals of the reference material, I realized that some of the concerns about MLDv2 were probably unfounded because source specific multicast is problematic in an environment where there can be many injection points or senders. The rudimentary group announcement frame types I've implemented follow with 80 and could probably be improved a lot. The existing BRCs were written in the period before Teranode source code and concrete implementation details around subtrees were publicly known.

I would like to reconcile the work I'm doing here with the architectural features expressed previously in 80, 82, and 83 because I feel that brain work is valuable. The architecture expressed in the reference material I consider a good starting point, because it's close enough to get an actual implementation going. I look forward to collaboration to bring a real, actual multicast network to deployment for the good of all network participants.

1 reply

jefflightweb May 15, 2026
Author

Oh, the other thing I found was IANA restriction on multicast group addressing. We have 16 bits. It kind-of changes the picture and I wasn't aware of it before. The Bitcoin SV Node Group allocation specified FF0X::B:[0-FFFF]

jefflightweb · 2026-05-16T19:46:43Z

jefflightweb
May 16, 2026
Author

More complication. I started to plan what an actual deployment would look like starting at the very bottom with ip6gre tunnels making up the fabric link. The internet today is built on 1500 MTU size. To build without direct links, we need to handle fragmentation of packets. IPV4 does a lot of this for you, but we're building on V6, which does not. We need to handle fragmentation of UDP packets in the application. This means an encoding, serialization, and fragmentation scheme, along with error correction and retransmission guarantees for all fragments.

BRC-124 and 128 are fine for payload of 1324 bytes for a basic test network using GRE6 tunnels. There is 84 byte overhead for the packets + tunnel, plus the 92 byte header.

It's either cap the size of accepted transactions at the ingress point, or engineer around it. My instinct says to engineer for this because even with 9200 byte via an end to end fabric supporting jumbo frame packets, we still would fragment to serve a 10MB transaction, which is the current upper limit in BSV as I understand it.

2 replies

jefflightweb May 16, 2026
Author

There's a lot more to learn from NORM than I first realized, especially the forward error correction design and how it can speed up the NACK retransmission process by correcting for multiple receivers at once. I should have studied this a little more closely before embarking upon the implementation stage.

jefflightweb May 17, 2026
Author

Added BRC-130: Multicast Transaction Frame Fragmentation Format -- It seems that fragmentation is not too big of an issue with the current retransmission mechanisms already implemented. NORM's design is around block-level data, and a 20% overhead for Reed-Solomon error correction is an acceptable tradeoff. In the case of Bitcoin, that's 20GB per second overall, continuously, when retransmissions can be a fraction of that figure. It may be something to re-visit later, but for now, this format is a good compromise.

Components have been updated, unit as well as integration testing already implemented. The edge components handle both BRC-124 and BRC-130. This allows senders to customize their flows based on the expected transaction payload size. For less than 1300 bytes, use BRC-124. For bigger, use BRC-130.

jefflightweb · 2026-05-18T22:22:18Z

jefflightweb
May 18, 2026
Author

Shipped BRC-131 and BRC-132

All test scenarios are passing now too. Did some work on a few issues and also expanded test coverage.

Ran: 28
Passed: 28
Failed: 0
Skipped: 2 (04-extended-dashboard 21-subtree-group-ramp)

PASSED
✓ 00-firewall
✓ 01-functional-all-shards
✓ 02-functional-shard-filter
✓ 03-functional-subtree-filter
✓ 05-mc-egress-bridge
✓ 06-functional-brc128
✓ 07-functional-brc128-mixed
✓ 08-nack-retransmit-brc128
✓ 09-listener-payload-verification
✓ 10-single-endpoint-ack
✓ 11-permanent-gap-miss
✓ 12-burst-gap-ratelimit
✓ 13-miss-escalation-tier
✓ 14-multi-endpoint-ratelimit
✓ 15-chain-ratelimit
✓ 16-group-ratelimit
✓ 20-subtree-group-announce
✓ 22-fragmentation-delivery
✓ 23-fragmentation-shard-filter
✓ 24-fragmentation-hash-verify
✓ 25-fragmentation-loss
✓ 26-fragmentation-throughput
✓ 30-block-announce-delivery
✓ 31-block-announce-retransmit
✓ 32-subtree-data-delivery
✓ 33-subtree-data-fragmentation
✓ 34-subtree-data-retransmit
✓ 99-nack-retransmit

All scenarios passed.

1 reply

jefflightweb Jun 9, 2026
Author

There are more than 50 tests now. Comprehensive coverage is being expanded continuously.

jefflightweb · 2026-05-22T14:57:53Z

jefflightweb
May 22, 2026
Author

I've started to lab up BGP Equal Cost Multi Path (ECMP) with BGP AnyCast advertisement over both IPV4 and IPV6 to demonstrate the horizontal scalability and distribution-potential of the ingress proxy. I'm using a combination of FRR and BIRD2 (separate routers) to setup the adjacency configurations. BIRD2 on the proxies themselves. FRR for their upstream router and the external ASN router.

Also, the BRCs were merged:

0 replies

jefflightweb · 2026-05-24T20:36:44Z

jefflightweb
May 24, 2026
Author

BGP AnyCast ingress works great. Load balancing is easy with the stateless design. Could also be done via hardware/software L4-L7 load balancer such as HAproxy or F5 also. The two methods can be combined as well for incredible scalability at the ingestion layer.

I'm now adding more end-to-end testing to try and cover as many scenarios as possible. To this end, I've added a Go test framework with full Docker containerization support including multicast bridging. It won't work in GitHub Actions using hosted runners, so I also implemented Dagger to pull the CI process out into the codebase. This will allow for automated e2e testing development while waiting for a self-hosted runner solution to be available. When one is, GitHub executes the same CI flow naturally.

I also shipped Helm charts for all the Go repos and am progressing towards full Kubernetes deployment capability also.

Reviews and feedback are desirable.

0 replies

jefflightweb · 2026-05-26T14:31:01Z

jefflightweb
May 26, 2026
Author

Ran into some problems with GitHub while trying to migrate repositories to a new organization (same name as old one). Hopefully I don't have to re-name all the repositories because of it. I have a support ticket open. 9 of them are in limbo currently. :-(

1 reply

jefflightweb May 27, 2026
Author

Migrated all repos to new names, but of course there are links from the old ones. Re-organized. Moving forward.

https://github.com/orgs/lightwebinc/repositories

jefflightweb · 2026-05-27T20:15:36Z

jefflightweb
May 27, 2026
Author

I'm going to evaluate the solution for moving to Source Specific Multicast, or at least baking in support for it at the component level.

I don't think it's manageable in an environment where miners can come and go and sender/receiver IP addresses can change. It requires each subscriber to subscribe to each shard group PLUS sender source. Multiply N sources by X shards and it only gets more complicated the bigger the network gets. The group manifest advertisement service I'm building could be used to share manifest lists and coordinate this, but given the potentially disconnected nature of senders and listeners (who aren't necessarily also senders), this can get complicated quickly. Coordination must be handled at the application level.

Unfortunately, RFC 8815 essentially deprecates Any Source Multicast (ASM) over inter-domain links, so I don't think there will be any hope of ISPs carrying the multicast group advertisements even if they were able to get over reluctance to run MP-BGP + PIM6 in the first place.

The way forward for the foreseeable future is privately peered specialty ASNs, focused on multicast delivery, using inter-domain MP-BGP with ASM group advertisements as standard if the network needs to grow beyond one private operator.

0 replies

jefflightweb · 2026-05-30T00:22:30Z

jefflightweb
May 30, 2026
Author

Current ingress proxy performance testing on a single system results in full saturation of 10 Gbit NIC interface. I am using an old 6-core Intel PC from about 2017 and I can get 400,000+ packets per second. The throughput and PPS varies with payload size. Still, the results are promising and I'm planning an upgrade to 25 Gbit NIC interface for further testing. I test using both software loopback (dummy interface) as well as hard loopback cable connected back-to-back on the dual-port NIC. There is a cross-over point where packet size determines the throughput ceiling on the two different configurations. Smaller packet sizes favor the dummy interface at this time, and jumbo MTU size benefits the NIC scenario, of course.

2 replies

jefflightweb Jun 4, 2026
Author

Update -- some efficiency-targeted work has resulted in breaking the 630,000 packet per second benchmark (at 256 byte packet sizes) on the old hardware. I am hopeful of even further improvements soon as work is on-going. I swapped to a 25 Gbit Mellanox Connect-X adapter for loopback baseline establishment. It's not yet hitting wireline speed as the proxy has to do some work and is currently CPU bound. Also, I'm using the same system as a UDP transaction generator source and this effectively doubles the UDP socket related operations. I'll be moving the source to an external host and put most of the strain on the NICs themselves soon. I hope to break the 1 million PPS barrier just on one single collapsed proxy+listener+retransmission system.

I'll be shipping another PR soon for BRC-139: Multicast Shard Manifest Announcement Protocol, which will directly support automated discovery and coordination of a Source Specific Multicast (SSM) topology, which is cleared for (possible) eventual inter-domain deployment between service providers or transit network peering points.

I did some other work including unified logging, metrics improvements, and also downstream NACK-recovery for listener egress in the event that a transmission is recorded by the listener but the packet, for whatever reason, doesn't make it out the egress interface. The downstream domain retry endpoint must be able to proxy a NACK request to the upstream multicast network.

jefflightweb Jun 4, 2026
Author

BRC-139: Multicast Shard Manifest Announcement Protocol

jefflightweb · 2026-06-05T23:23:03Z

jefflightweb
Jun 5, 2026
Author

Concrete implementation deployment testing in progress: https://1bsv.net

Beta participants welcome. Get in touch.

0 replies

Multicast for the scaled network #134

Uh oh!

Uh oh!

jefflightweb Apr 24, 2026

Replies: 19 comments · 9 replies

Uh oh!

jefflightweb Apr 24, 2026 Author

Uh oh!

jefflightweb May 4, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 5, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 6, 2026 Author

Uh oh!

jefflightweb May 7, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 9, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 9, 2026 Author

Uh oh!

jefflightweb May 10, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 10, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 14, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 15, 2026 Author

Uh oh!

jefflightweb May 17, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 15, 2026 Author

Uh oh!

jefflightweb May 15, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 16, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 16, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 17, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 18, 2026 Author

Uh oh!

jefflightweb Jun 9, 2026 Author

Uh oh!

Uh oh!

jefflightweb May 22, 2026 Author

Uh oh!

jefflightweb
Apr 24, 2026

Replies: 19 comments 9 replies

jefflightweb
Apr 24, 2026
Author

jefflightweb
May 4, 2026
Author

jefflightweb
May 5, 2026
Author

jefflightweb
May 6, 2026
Author

jefflightweb
May 7, 2026
Author

jefflightweb May 9, 2026
Author

jefflightweb
May 9, 2026
Author

jefflightweb
May 10, 2026
Author

jefflightweb
May 10, 2026
Author

jefflightweb
May 14, 2026
Author

jefflightweb
May 15, 2026
Author

jefflightweb May 17, 2026
Author

jefflightweb
May 15, 2026
Author

jefflightweb May 15, 2026
Author

jefflightweb
May 16, 2026
Author

jefflightweb May 16, 2026
Author

jefflightweb May 17, 2026
Author

jefflightweb
May 18, 2026
Author

jefflightweb Jun 9, 2026
Author

jefflightweb
May 22, 2026
Author