Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Thunderdome: Scale Up Experiments #18

Closed
4 tasks done
iand opened this issue Aug 5, 2022 · 0 comments
Closed
4 tasks done

Thunderdome: Scale Up Experiments #18

iand opened this issue Aug 5, 2022 · 0 comments
Assignees
Labels
epic Overarching issue for an extended piece of work project/thunderdome

Comments

@iand
Copy link
Collaborator

iand commented Aug 5, 2022

What Is It?

Enable the use of bigger virtual machine instances and replay significant fractions of live traffic to the gateways in an experiment.

Why Are We Doing It?

The previous phases were about building baseline functionality. This phase is about scaling Thunderdome so that experiments can be performed closer to production/live conditions. By hooking into the live gateway request logs we are more likely to manifest the types of bottlenecks and problems that are faced by gateway instances in production. The dynamic nature of the IPFS network makes a replay of live requests for current data better than a frozen corpus that goes stale over time. Also because we will be sending production level rates of requests to each gateway under test we need to be able to give them more resources to cope with the higher load.

Notes

This phase is all about enabling experiments that reflect performance similar to real world loads: better backend infrastructure, replaying significant fractions of live gateway logs

The log stream should permit scalably and promptly sending logs that permit high-fidelity playback to a large number of dealgood driven experiments.

We can use the existing logs - but we dont for example know POST payloads (if any), ranges for range requests, what request headers were set, etc - so we should probably make a separate log for this purpose

We should pick a (or trial several) log streaming or messaging system(s) to see if one meets our needs (bonus points if its hosted)

Project overview is on Notion

Tasks

@iand iand added epic Overarching issue for an extended piece of work project/thunderdome labels Aug 5, 2022
@iand iand changed the title Thunderdome Phase 3: Scale Replay Stream Thunderdome: Scale Replay Stream Aug 9, 2022
@iand iand changed the title Thunderdome: Scale Replay Stream Thunderdome: Scale Up Experiments Aug 9, 2022
@iand iand closed this as completed Sep 2, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
epic Overarching issue for an extended piece of work project/thunderdome
Projects
None yet
Development

No branches or pull requests

2 participants