Skip to content

geth progress when switching to trie download #20938

@CedMaire

Description

@CedMaire

Dear ETH Community,

I am trying to sync from scratch an Ethereum node in the default mode (fast). Without success as of now. To give some context, this is not my first attempt, more like the 4-5th one. This attempt is now running since more than 24 hours and the ones before where running for a few days (3-5) before I killed them and tried again.

System information

Geth version: Geth/v1.9.13-stable-cbc4ac26/linux-amd64/go1.14.2
OS & Version: Ubuntu 18.04.4 LTS, Linux 5.3.0-45-generic, x86-64
Commit hash : none

Memory: 16 GB

$ lscpu 
Architecture:        x86_64
CPU op-mode(s):      32-bit, 64-bit
Byte Order:          Little Endian
CPU(s):              8
On-line CPU(s) list: 0-7
Thread(s) per core:  2
Core(s) per socket:  4
Socket(s):           1
NUMA node(s):        1
Vendor ID:           GenuineIntel
CPU family:          6
Model:               60
Model name:          Intel(R) Core(TM) i7-4710MQ CPU @ 2.50GHz
Stepping:            3
CPU MHz:             1197.391
CPU max MHz:         3500.0000
CPU min MHz:         800.0000
BogoMIPS:            4988.47
Virtualization:      VT-x
L1d cache:           32K
L1i cache:           32K
L2 cache:            256K
L3 cache:            6144K
NUMA node0 CPU(s):   0-7
Flags:               fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm constant_tsc arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid aperfmperf pni pclmulqdq dtes64 monitor ds_cpl vmx est tm2 ssse3 sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 movbe popcnt tsc_deadline_timer aes xsave avx f16c rdrand lahf_lm abm cpuid_fault epb invpcid_single pti ssbd ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1 avx2 smep bmi2 erms invpcid xsaveopt dtherm ida arat pln pts md_clear flush_l1d
$ sudo smartctl -a /dev/sda
=== START OF INFORMATION SECTION ===
Device Model:     SAMSUNG MZ7TD512HAGM-000L1
Serial Number:    S151NYAF401503
LU WWN Device Id: 5 002538 500000000
Add. Product Id:  00000000
Firmware Version: DXT06L0Q
User Capacity:    512'110'190'592 bytes [512 GB]
Sector Size:      512 bytes logical/physical
Rotation Rate:    Solid State Device
Device is:        Not in smartctl database [for details use: -P showall]
ATA Version is:   ACS-2, ATA8-ACS T13/1699-D revision 4c
SATA Version is:  SATA 3.1, 6.0 Gb/s (current: 6.0 Gb/s)
Local Time is:    Sat Apr 18 01:17:01 2020 CEST
SMART support is: Available - device has SMART capability.
SMART support is: Enabled

Expected behaviour

The default fast sync completes in few hours as described in this blog post: geth-v1-9-0

Actual behaviour

The sync is pretty fast (few hours) to download the blocks until current - 64, then switches into the trie download ("Imported new state entries" logs) and never leaves this states even after days of syncing.

Steps to reproduce the behaviour

nohup geth --verbosity 3 --cache 8192 &> geth.log &

The nohup is required for me because I ssh into this machine.

Explanation

I already read pretty much all issues here on GitHub, many stackoverflow posts and also blogs all over the internet. Here are useful information:

  • CPU load is low: < 20% for the geth process on average
  • My disk is an SSD and the load is sometimes very low with an average of 20-30% and sometimes higher ~60%, I didn't figure out when this is changing
  • Network load is very low when downloading the trie states

I know about this explanation: issue-16218

From what I read online and also in the blog post, a default fast sync with geth is counted in the hour unit and not day unit. May it be that, even though I got and SSD, it is not fast enough and keeps slowing down at some points? The disk load is not constant, sometimes like this afternoon it was very low, and tonight while writing this issue it is higher.

I got many peers (~16 constantly). Here is my current eth.syncing:

{
  currentBlock: 9892652,
  highestBlock: 9892756,
  knownStates: 315252660,
  pulledStates: 315249558,
  startingBlock: 0
}

Here are logs that might help:
geth-0.log
geth-1.log
geth-2.log
geth-3.log
geth-4.log
geth-5.log
geth-6.log
geth-7.log

Thank you in advance for any help provided!

Best,

CedMaire

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions