Adaptive log range #911
Conversation
Resolves #776. The idea is to adapt on the fly if calls start failing due to too many logs. Caching the previous step is an idea, but I don't think it would be that helpful, and it would increase complexity.
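A minimal sketch of that approach (hypothetical helper names, with `String` standing in for the web3 log and error types): shrink the per-request range whenever the provider rejects a call for returning too many logs, and otherwise advance past the range just covered.

```rust
// Sketch only, not the PR's exact code.
const TOO_MANY_LOGS_FINGERPRINT: &str = "ServerError(-32005)";

fn scan_logs<F>(from: u64, to: u64, initial_step: u64, fetch: F) -> Result<Vec<String>, String>
where
    F: Fn(u64, u64) -> Result<Vec<String>, String>,
{
    let mut step = initial_step;
    let mut start = from;
    let mut logs = Vec::new();
    while start <= to {
        let end = start.saturating_add(step).min(to);
        match fetch(start, end) {
            Ok(mut batch) => {
                logs.append(&mut batch);
                start = end + 1; // advance past the range we just covered
            }
            // Provider rejected the range: shrink it and retry. Note that a
            // single-block request that keeps failing retries forever here.
            Err(e) if e.contains(TOO_MANY_LOGS_FINGERPRINT) => step /= 10,
            Err(e) => return Err(e),
        }
    }
    Ok(logs)
}
```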
@yanivtal Yes, it's specific to Infura.
Looking good!
```rust
// Code returned by Infura if a request returns too many logs.
// web3 doesn't seem to offer a better way of checking the error code.
const TOO_MANY_LOGS_FINGERPRINT: &str = "ServerError(-32005)";
```
```rust
let eth_get_logs = move |eth_adapter: Self,
```
I'd prefer if this remained in its own method, taking in the error fingerprint.
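The shape being suggested might look roughly like this (all names hypothetical, with placeholder types so the sketch stands alone): the fetch stays a named method, and the provider-specific fingerprint comes in as a parameter rather than being baked into a closure.

```rust
struct EthAdapter;

impl EthAdapter {
    // Placeholder body: a real implementation would issue the eth_getLogs
    // call; callers match failures against `too_many_logs_fingerprint`.
    fn logs_in_range(
        &self,
        from: u64,
        to: u64,
        too_many_logs_fingerprint: &'static str,
    ) -> Result<Vec<String>, String> {
        let _ = (from, to, too_many_logs_fingerprint);
        Ok(Vec::new())
    }
}
```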
```rust
        .from_err()
    })
    .map_err(move |e| {
        e.into_inner().unwrap_or_else(move || {
```
It looks like we're losing this error message. Is that deliberate?
Yes, since this is retried indefinitely, this error can never be printed afaict.
```rust
let string_err = e.to_string();
if string_err.contains(TOO_MANY_LOGS_FINGERPRINT) {
    let new_step = (step / 10).max(1);
    debug!(logger, "Reducing log step"; "new_step" => new_step);
```
How about "Reducing block range size to scan for events", and "new_size" for the key, since that's what the environment variable is called?
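Applied to the quoted snippet, that suggestion would read roughly (same slog syntax as above):

```rust
debug!(logger, "Reducing block range size to scan for events"; "new_size" => new_step);
```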
This reverts commit 810892a.
Also, I realize it's correct to have a step of 0, since we always increase the start and advance by a block.
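A tiny runnable illustration of why a step of 0 still makes progress (illustrative loop, not the PR's code): the range end is `start + step`, and `start` is bumped past the end each iteration, so step 0 simply scans one block per request.

```rust
fn main() {
    let (mut start, to, step) = (100u64, 102u64, 0u64);
    while start <= to {
        let end = (start + step).min(to);
        // Prints 100..=100, 101..=101, 102..=102: one block per request.
        println!("requesting logs for blocks {}..={}", start, end);
        start = end + 1;
    }
}
```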
Review addressed. This will help with slow syncs, but I've seen subgraphs quickly drop to a range of 10 blocks. Bringing parallel log requests back will probably be worth it.
Are we not doing parallel log requests anymore? Another thought: How soon do we go back to the configured block range size? If we have a block range size of 10,000 and we have to reduce the range size to 5 to get past the first block in this range, does that mean we stick to a range size of 5 for the remaining 9,999 blocks?
No, we dropped it in the block triggers PR. I was ok with that since it could have complicated making this change, which we need quickly, but we should bring it back with adaptive ranges in mind. Yes, that's right. We can try to increase the range more often, but there is a tradeoff: at one extreme we keep the low range forever, and at the other we start from the large range every time.
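One middle ground between those extremes, sketched here as a hypothetical AIMD-style policy (not part of this PR): shrink fast on failure, grow gradually on success, capped at the configured size.

```rust
// Hypothetical recovery policy, not part of this PR.
struct AdaptiveRange {
    configured: u64, // e.g. the value of ETHEREUM_BLOCK_RANGE_SIZE
    current: u64,
}

impl AdaptiveRange {
    // Too-many-logs error: divide by 10, as the PR does.
    fn on_failure(&mut self) {
        self.current /= 10;
    }

    // Successful request: double the range, never exceeding the configured size.
    fn on_success(&mut self) {
        self.current = self.current.saturating_mul(2).min(self.configured).max(1);
    }
}
```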
Resolves #776. If we start getting this error even when requesting logs for a single block, we should re-evaluate in a new issue.
The first thing this does is have `ETHEREUM_FAST_SCAN_END` affect the block stream as a whole; we now multiply `ETHEREUM_BLOCK_RANGE_SIZE` by 10 when doing a fast scan. The one-size-fits-all nature of these values can be an issue, so I also added a special case to go one block at a time if there are unconditional block triggers. We should at some point change the way we control the amount of triggers processed at a time.

The adaptive logic for log ranges first tries requesting the entire given range; if that doesn't work, the step is reduced on each failure until we no longer get the 1000 logs error.
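The range-size behavior described above could be summarized in a sketch like the following (simplified; the real code reads these values from the environment):

```rust
// Sketch of the described behavior, not the PR's exact code.
fn effective_range_size(
    configured: u64,                       // ETHEREUM_BLOCK_RANGE_SIZE
    fast_scan: bool,                       // still below ETHEREUM_FAST_SCAN_END
    has_unconditional_block_triggers: bool,
) -> u64 {
    if has_unconditional_block_triggers {
        1 // special case: go one block at a time
    } else if fast_scan {
        configured * 10
    } else {
        configured
    }
}
```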
There are also some refactors along the way.