Parallelize RAM Timestamp Count Initialization #292

sragss · 2024-04-15T18:05:05Z

On a 64-core machine at a cycle count of ~16M, Jolt spends ~3.5% of its time in a segment called memory_trace_processing. This segment allocates and computes the offline memory checking (a, v, t) polynomials used for the combined registers and RAM. Some additional details can be found in the wiki.

Currently this ~300-line segment takes ~3.5% of end-to-end time because it is computed entirely serially, with no use of additional CPU cores. We should parallelize it to get a speedup of up to NUM_CPU times.

The goal is to fill out the following.

Trace-length sized:

a_ram
v_read
v_write_rd
v_write_ram
t_read
t_write_ram

Memory sized:

v_final
t_final
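One possible way to parallelize this kind of computation, sketched below under heavy simplifying assumptions, is to shard the address space: the read value and read timestamp of an access depend only on earlier accesses to the same address, so disjoint address ranges can be processed independently. This is not Jolt's actual code. The `MemOp` and `Output` types, the function names, the single memory operation per step (so one `v_write` and `t_write` stand in for `v_write_rd`, `v_write_ram` and `t_write_ram`), and the choice of `step + 1` as the write timestamp are all hypothetical and exist only for illustration.

```rust
use std::thread;

/// Hypothetical simplified memory operation: every step reads `address`,
/// then writes back either a new value or the value it just read.
#[derive(Clone, Copy, Debug)]
struct MemOp {
    address: usize,
    write_value: Option<u64>,
}

/// Hypothetical output container. Trace-length sized: a_ram .. t_write.
/// Memory sized: v_final, t_final.
#[derive(Debug, PartialEq)]
struct Output {
    a_ram: Vec<usize>,
    v_read: Vec<u64>,
    v_write: Vec<u64>,
    t_read: Vec<u64>,
    t_write: Vec<u64>,
    v_final: Vec<u64>,
    t_final: Vec<u64>,
}

/// Serial baseline: one pass over the trace, tracking the current value and
/// last-write timestamp of every address. Assumes memory starts zeroed.
fn process_serial(trace: &[MemOp], memory_size: usize) -> Output {
    let mut out = Output {
        a_ram: Vec::new(), v_read: Vec::new(), v_write: Vec::new(),
        t_read: Vec::new(), t_write: Vec::new(),
        v_final: vec![0; memory_size], t_final: vec![0; memory_size],
    };
    for (step, op) in trace.iter().enumerate() {
        let (v, t) = (out.v_final[op.address], out.t_final[op.address]);
        let written = op.write_value.unwrap_or(v);
        out.a_ram.push(op.address);
        out.v_read.push(v);
        out.t_read.push(t);
        out.v_write.push(written);
        out.t_write.push(step as u64 + 1); // assumed timestamp convention
        out.v_final[op.address] = written;
        out.t_final[op.address] = step as u64 + 1;
    }
    out
}

/// Parallel sketch: each thread owns a contiguous shard of memory and handles
/// only the accesses that fall inside it, in trace order.
fn process_parallel(trace: &[MemOp], memory_size: usize, num_threads: usize) -> Output {
    assert!(trace.iter().all(|op| op.address < memory_size));
    let threads = num_threads.max(1);
    let shard_len = ((memory_size + threads - 1) / threads).max(1);
    let mut v_final = vec![0u64; memory_size];
    let mut t_final = vec![0u64; memory_size];

    // Each shard returns (step, v_read, t_read) for the steps it owns.
    let per_shard: Vec<Vec<(usize, u64, u64)>> = thread::scope(|s| {
        let handles: Vec<_> = v_final
            .chunks_mut(shard_len)
            .zip(t_final.chunks_mut(shard_len))
            .enumerate()
            .map(|(shard, (v_chunk, t_chunk))| {
                s.spawn(move || {
                    let base = shard * shard_len;
                    let mut reads = Vec::new();
                    for (step, op) in trace.iter().enumerate() {
                        if op.address < base || op.address >= base + v_chunk.len() {
                            continue; // owned by another shard
                        }
                        let local = op.address - base;
                        reads.push((step, v_chunk[local], t_chunk[local]));
                        if let Some(v) = op.write_value {
                            v_chunk[local] = v;
                        }
                        t_chunk[local] = step as u64 + 1;
                    }
                    reads
                })
            })
            .collect();
        handles.into_iter().map(|h| h.join().unwrap()).collect()
    });

    // Scatter the per-shard results back into trace order.
    let n = trace.len();
    let (mut v_read, mut t_read) = (vec![0u64; n], vec![0u64; n]);
    for (step, v, t) in per_shard.into_iter().flatten() {
        v_read[step] = v;
        t_read[step] = t;
    }
    let v_write = trace.iter().zip(&v_read).map(|(op, &r)| op.write_value.unwrap_or(r)).collect();
    Output {
        a_ram: trace.iter().map(|op| op.address).collect(),
        v_read, v_write, t_read,
        t_write: (1..=n as u64).collect(),
        v_final, t_final,
    }
}
```

In this sketch every thread still scans the whole trace and skips accesses it does not own, and the final scatter is serial, so the speedup is well short of NUM_CPU times. Bucketing step indices by shard up front, or using a work-stealing library, would likely be needed to get closer; the sketch only shows that sharding by address preserves the serial result.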

It may be helpful to review the tracing strategy for performance testing.


PatStiles · 2024-04-18T20:01:30Z

Interested!

moodlezoup · 2024-05-08T18:26:49Z

done in #338

sragss added the "good first issue" and "help wanted" labels Apr 15, 2024

sragss mentioned this issue Apr 15, 2024

Optimize / Parallelize InstructionLookups::polynomialize #293

Open

tahsintunan mentioned this issue May 1, 2024

Parallelize Memory Trace Processing #338

Merged

moodlezoup closed this as completed May 8, 2024
