
proposal: Unified Scheduler #28307

Closed
wants to merge 23 commits

Conversation

@ryoqun (Member) commented Oct 9, 2022

Problem

As outlined in #23548, my idea improves replay time by 2x, as evidenced in #27666, but it still lacks a design doc for others to review and evaluate.

(My draft PR stack is too deep; hopefully I can pop the PRs off one by one, starting with this one.)

Summary of Changes

Context: #23548

@apfitzge self-requested a review October 10, 2022 14:08

## Current implementation problem

- Why isn't batching good?
Contributor

I can definitely see that this may be the case for replay, but is it for banking as well? I'm still learning all the different parts of banking, but I thought one of the main benefits of batching for banking is that we record to PoH less often, which is more efficient.

@carllin, any thoughts here?

@ryoqun (Member Author) Oct 10, 2022

> I thought one of the main benefits of batching for banking is that we record to PoH less often, which is more efficient.

That concern is valid; however:

  • The PoH recorder's lock contention can be reduced significantly (maybe even eliminated) once there is only a single unified thread recording to it (i.e., the scheduler thread).
  • Also, the scheduler can simply buffer txs to record to PoH until a lock-conflicting tx is encountered.

That said, I thought the main benefit of batching would be the reduction of AccountsDb locking overhead.
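
For concreteness, here is a minimal sketch of that buffering idea. All names here (`Tx`, `PohRecorder`, `record_batch`, `SchedulerPohBuffer`) are made up for illustration and are not the actual solana-runtime APIs; the point is just that a single scheduler thread can keep appending non-conflicting txs to a buffer and only issue one PoH record call once a lock-conflicting tx forces a flush:

```rust
use std::collections::HashSet;

// Hypothetical stand-ins for illustration only; not the actual solana-runtime types.
struct Tx {
    id: u64,
    write_locked_addresses: Vec<String>,
}

struct PohRecorder;

impl PohRecorder {
    // One record call per buffered batch instead of one per tiny batch.
    fn record_batch(&mut self, txs: &[Tx]) {
        let ids: Vec<u64> = txs.iter().map(|tx| tx.id).collect();
        println!("recording txs {ids:?} to PoH in a single call");
    }
}

#[derive(Default)]
struct SchedulerPohBuffer {
    buffered: Vec<Tx>,
    locked: HashSet<String>,
}

impl SchedulerPohBuffer {
    // Buffer the tx if it doesn't conflict; flush everything buffered so far
    // to PoH the moment a lock-conflicting tx shows up.
    fn schedule(&mut self, tx: Tx, poh: &mut PohRecorder) {
        let conflicts = tx
            .write_locked_addresses
            .iter()
            .any(|addr| self.locked.contains(addr));
        if conflicts {
            self.flush(poh);
        }
        self.locked.extend(tx.write_locked_addresses.iter().cloned());
        self.buffered.push(tx);
    }

    fn flush(&mut self, poh: &mut PohRecorder) {
        if !self.buffered.is_empty() {
            poh.record_batch(&self.buffered);
            self.buffered.clear();
            self.locked.clear();
        }
    }
}

fn main() {
    let mut poh = PohRecorder;
    let mut buffer = SchedulerPohBuffer::default();
    for tx in [
        Tx { id: 1, write_locked_addresses: vec!["A".into()] },
        Tx { id: 2, write_locked_addresses: vec!["B".into()] },
        Tx { id: 3, write_locked_addresses: vec!["A".into()] }, // conflicts, so 1 and 2 get recorded first
    ] {
        buffer.schedule(tx, &mut poh);
    }
    buffer.flush(&mut poh); // record whatever is left
}
```

With only the scheduler thread ever recording, the PoH recorder lock has a single writer, which is the contention reduction mentioned above.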

- determinism
- strict fairness, considering only priority_fee/FCFS
- approx. O(n), where n is the gross total of addresses in the transactions
- strict adherence to the local fee market
Contributor

This seems more fair from the user perspective, but it may not yield the best fee-collection results for validators. A greedy validator will not adhere to this, and we have no way to enforce it.

Just making a note, not saying it shouldn't be your goal; it may be the case that you can make some optimizations this way which do result in higher throughput and fee collection!

Member Author

> This seems more fair from the user perspective, but it may not yield the best fee-collection results for validators. A greedy validator will not adhere to this, and we have no way to enforce it.

Yep. A greedy validator will just seek MEV revenue rather than fee-collection maximization, considering the relatively cheap tx fees. cc: @buffalu

> Just making a note, not saying it shouldn't be your goal; it may be the case that you can make some optimizations this way which do result in higher throughput and fee collection!

Yep, I'll write that reasoning down later for scrutiny.

Contributor

MEV tries to maximize fees collected. I have a feeling that fully respecting priority payment won't maximize fees.

- strict adherence to the local fee market
- censorship resistant
- 100k scheduling/s
- a highly contended address with 1M pending txs doesn't affect overall performance at all
Contributor

I've talked with @taozhu-chicago about this; likely we'll want to move the QoS checks for account limits up into the scheduler:

  1. The scheduler will already be looking at the accounts locked by each tx, so it's easy to track per-account costs as we schedule.
  2. This could allow us to drop lower-priority txs early in the scheduling stage so we don't even add them into the scheduling structures (see the sketch after this comment).

Good goal to have, but if we include these changes we probably won't have to deal with 1M pending txs on the same address.
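
A rough sketch of what that could look like, with made-up names (`SchedulerQos`, `PER_ACCOUNT_BLOCK_COST_LIMIT`, `try_reserve`) standing in for whatever the real cost_model types and limits are: while the scheduler is already iterating a tx's locked accounts, it reserves the tx's cost against each writable account and rejects the tx up front if any account would exceed its per-block limit.

```rust
use std::collections::HashMap;

// Illustrative only: made-up types and limit, not the actual cost_model API.
const PER_ACCOUNT_BLOCK_COST_LIMIT: u64 = 12_000_000;

struct Tx {
    priority: u64,
    cost: u64,
    writable_accounts: Vec<String>,
}

#[derive(Default)]
struct SchedulerQos {
    // Cost already scheduled against each writable account in this block.
    per_account_cost: HashMap<String, u64>,
}

impl SchedulerQos {
    /// Returns true if the tx fits under every writable account's limit and
    /// reserves its cost; returns false so the caller can drop the tx before
    /// it ever enters the scheduling structures.
    fn try_reserve(&mut self, tx: &Tx) -> bool {
        // The scheduler is already iterating the tx's locked accounts here,
        // so the limit check is almost free.
        for account in &tx.writable_accounts {
            let used = self.per_account_cost.get(account).copied().unwrap_or(0);
            if used + tx.cost > PER_ACCOUNT_BLOCK_COST_LIMIT {
                return false;
            }
        }
        for account in &tx.writable_accounts {
            *self.per_account_cost.entry(account.clone()).or_insert(0) += tx.cost;
        }
        true
    }
}

fn main() {
    let mut qos = SchedulerQos::default();
    let tx = Tx {
        priority: 100,
        cost: 200_000,
        writable_accounts: vec!["hot_account".into()],
    };
    if qos.try_reserve(&tx) {
        println!("schedule tx with priority {}", tx.priority);
    } else {
        println!("drop lower-priority tx early");
    }
}
```

Whether a rejected tx is dropped outright or re-queued for a later block is a separate policy choice.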

Contributor

Ah, but this would conflict with your goal of "no estimator/prediction" 😅

## Further work
- Rework compute units; consider runtime account state transition verification / N addresses (i.e., scheduling cost)
- What's going on with the bankless?: meh
- Scrambled tx
Contributor
What does this mean?

@ryoqun (Member Author) Oct 10, 2022

This: #23837

Note that that proposal is way more futuristic than even this unified scheduler.

- single threaded
- determinism
- strict fairness, considering only priority_fee/FCFS
- approx. O(n), where n is the gross total of addresses in the transactions
Contributor

I'm not sure O(n) is going to be possible while guaranteeing priority order.
The simplest case I can think of is n txs all write-locking account A, received in a batch in random order. Even building a simple priority queue will be O(n log n).
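
To make the bound concrete, a toy version of that worst case: with n txs contending on one account, just establishing priority order with a binary heap is a comparison sort, i.e. O(n log n) overall.

```rust
use std::collections::BinaryHeap;

// Toy example: n txs all write-locking the same account, arriving in random order.
// Merely putting them into priority order costs O(n log n).
fn main() {
    let arrival_order_priorities: Vec<u64> = vec![7, 1, 42, 13, 5];
    let mut queue: BinaryHeap<u64> = BinaryHeap::new();
    for priority in arrival_order_priorities {
        queue.push(priority); // O(log n) per insert
    }
    while let Some(priority) = queue.pop() {
        // Highest priority first; each pop is also O(log n).
        println!("schedule tx with priority {priority}");
    }
}
```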


Also, increasing the number of replaying/banking threads doesn't scale linearly with the number of CPU cores.

## Present (and projected) transaction patterns
@ryoqun (Member Author) Oct 14, 2022

(1/2)

That means synthesized benchmark results should be taken with a grain of salt, because they tend to be overly uniform and do not reflect realistic usage.

## Redefined scheduler's problem space
Member Author

(2/2)

@github-actions bot added the stale label Dec 29, 2022
@github-actions bot closed this Jan 6, 2023
@ryoqun reopened this Jan 11, 2023
@github-actions bot removed the stale label Jan 12, 2023
@github-actions bot added the stale label Jan 27, 2023
@github-actions bot closed this Feb 6, 2023
@tao-stones reopened this Feb 6, 2023
@github-actions bot removed the stale label Feb 7, 2023
@Huzaifa696
"Hello! I have a question about transaction patterns. In the third paragraph of the document, It says, "when seen from the viewpoint of the on-chain state, a very few of its addresses can be highly contended." I was wondering if you could give me an estimate of what "a very few" means in terms of the total transactions that a particular leader is processing.

For instance, if a leader is handling 6,000 transactions per second that involve a total of 30,000 accounts (chosen arbitrarily), would it be reasonable to assume that, on average, around 300 of these accounts, or 0.1%, are in conflict with each other?"

@github-actions bot added the stale label Mar 6, 2023
@github-actions bot closed this Mar 13, 2023