Thunderdome: Scale Up Experiments #18

iand · 2022-08-05T11:53:20Z

What Is It?

Enable the use of bigger virtual machine instances and replay significant fractions of live traffic to the gateways in an experiment.

Why Are We Doing It?

The previous phases were about building baseline functionality. This phase is about scaling Thunderdome so that experiments can be performed closer to production/live conditions. By hooking into the live gateway request logs we are more likely to manifest the types of bottlenecks and problems that are faced by gateway instances in production. The dynamic nature of the IPFS network makes a replay of live requests for current data better than a frozen corpus that goes stale over time. Also because we will be sending production level rates of requests to each gateway under test we need to be able to give them more resources to cope with the higher load.

Notes

This phase is all about enabling experiments that reflect performance similar to real world loads: better backend infrastructure, replaying significant fractions of live gateway logs

The log stream should permit scalably and promptly sending logs that permit high-fidelity playback to a large number of dealgood driven experiments.

We can use the existing logs - but we dont for example know POST payloads (if any), ranges for range requests, what request headers were set, etc - so we should probably make a separate log for this purpose

We should pick a (or trial several) log streaming or messaging system(s) to see if one meets our needs (bonus points if its hosted)

Project overview is on Notion

Tasks

iand added epic Overarching issue for an extended piece of work project/thunderdome labels Aug 5, 2022

iand assigned iand and thattommyhall Aug 5, 2022

iand changed the title ~~Thunderdome Phase 3: Scale Replay Stream~~ Thunderdome: Scale Replay Stream Aug 9, 2022

iand changed the title ~~Thunderdome: Scale Replay Stream~~ Thunderdome: Scale Up Experiments Aug 9, 2022

JesseXie mentioned this issue Aug 29, 2022

Thunderdome: Self Service Experiments #19

Open

iand closed this as completed Sep 2, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Thunderdome: Scale Up Experiments #18

Thunderdome: Scale Up Experiments #18

iand commented Aug 5, 2022 •

edited

Loading

Thunderdome: Scale Up Experiments #18

Thunderdome: Scale Up Experiments #18

Comments

iand commented Aug 5, 2022 • edited Loading

What Is It?

Why Are We Doing It?

Notes

Tasks

iand commented Aug 5, 2022 •

edited

Loading