Reed solomon erasure code #225

mrain · 2023-03-31T22:56:46Z

Description

closes: #215

A very naive implementation of Reed Solomon erasure code.

Encoding is a very naive polynomial evaluation. It could be accelerated by NTT/FFT which may be already in arkworks repo (PrimeField is also a FftField in ark_ff).
Decoding is also a very naive Lagrange interpolation. I don't find an easy way to optimize it yet.

Before we can merge this PR, please make sure that all the following items have been
checked off. If any of the checklist items are not applicable, please leave them but
write a little note why.

Targeted PR against correct branch (main)
Linked to GitHub issue with discussion and accepted design OR have an explanation in the PR that describes this work.
Wrote unit tests
Updated relevant documentation in the code
Added a relevant changelog entry to the Pending section in CHANGELOG.md
Re-reviewed Files changed in the GitHub PR explorer

ggutoski

API doc could be improved but otherwise ok for now.

primitives/src/erasure_code/mod.rs

primitives/src/erasure_code/reed_solomon_erasure.rs

ggutoski

a couple more questions

primitives/src/erasure_code/reed_solomon_erasure.rs

ggutoski · 2023-04-06T18:57:51Z

Now that I've had a chance to think about this PR wrt the VID use case: The level of abstraction here is too high. Encoding/decoding should not automatically split/recombine data into chunks of length reconstruction_size. (I need to do this again anyway in VID.)

I have a good idea of the API I want. Let's chat next week.

…de trait

ggutoski · 2023-04-11T13:59:04Z

Ready for review @mrain @chancharles92 .

TODO/Questions:

Proposal: change new() args to be data_size and parity_size so that num_shards = data_size + parity_size and it's impossible to give invalid args to new(). Any thoughts?
Shall we encode in coordinate form or coefficient form as per discussion?

primitives/src/erasure_code/mod.rs

primitives/src/erasure_code/reed_solomon_erasure.rs

ggutoski · 2023-04-11T15:12:00Z

Current erasure code API looks like this:

fn encode(&self, data: &[F])
fn decode(&self, shards: &[Self::Shard])

where data size and codeword size are implicit and set by the constructor new().

Problem: This API is stateful. Also, there's nothing to stop a user from taking shards from one erasure code and passing them to another (with different data and codeword sizes), so we don't get much argument safety anyway.

Proposal: make encode and decode associated functions (ie. don't take &self):

fn encode(codeword_size: usize, data: &[F]) // data size is data.len()
fn decode(data_size: usize, shards: &[Self::Shard]) // need data_size <= shards.len()

@mrain @chancharles92 any opinions?

primitives/src/erasure_code/mod.rs

reed solomon erasure code

9485ff0

mrain requested a review from ggutoski March 31, 2023 22:56

RS code for any input length

fb80661

ggutoski previously approved these changes Apr 3, 2023

View reviewed changes

ggutoski self-requested a review April 3, 2023 18:26

ggutoski reviewed Apr 3, 2023

View reviewed changes

primitives/src/erasure_code/reed_solomon_erasure.rs Outdated Show resolved Hide resolved

primitives/src/erasure_code/reed_solomon_erasure.rs Outdated Show resolved Hide resolved

ggutoski reviewed Apr 5, 2023

View reviewed changes

primitives/src/erasure_code/reed_solomon_erasure.rs Outdated Show resolved Hide resolved

erasure_code PrimeField -> Field

a387329

ggutoski dismissed their stale review via a387329 April 5, 2023 19:35

Merge branch 'main' into erasure-code

60836a9

ggutoski added 3 commits April 10, 2023 12:45

enforce single-chunk encoding, remove Field assoc type from ErasureCo…

83aec6b

…de trait

remove unneeded new(), trait bounds from ErasureCode trait

c6dff5d

error handling and comments

43a4d2a

mrain commented Apr 11, 2023

View reviewed changes

primitives/src/erasure_code/mod.rs Outdated Show resolved Hide resolved

mrain commented Apr 11, 2023

View reviewed changes

primitives/src/erasure_code/reed_solomon_erasure.rs Outdated Show resolved Hide resolved

ggutoski added 5 commits April 11, 2023 11:22

num_shards -> parity_size, don't return Result from new()

f059a05

stateless API, can't return error

3d42950

remove unnecessary where clause

b3d6f01

restore trait bounds on Shard, return Result from encode,decode

984e358

clean comments

ef10f7a

mrain commented Apr 11, 2023

View reviewed changes

primitives/src/erasure_code/mod.rs Outdated Show resolved Hide resolved

add data_size arg to decode

2acd2c7

ggutoski previously approved these changes Apr 12, 2023

View reviewed changes

make commented url into hyperlink :rolleyes:

9cfb723

ggutoski dismissed their stale review via 9cfb723 April 12, 2023 13:17

ggutoski approved these changes Apr 12, 2023

View reviewed changes

ggutoski merged commit 26a84c0 into main Apr 12, 2023

ggutoski deleted the erasure-code branch April 12, 2023 14:09

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reed solomon erasure code #225

Reed solomon erasure code #225

mrain commented Mar 31, 2023 •

edited

Loading

ggutoski left a comment

ggutoski left a comment

ggutoski commented Apr 6, 2023

ggutoski commented Apr 11, 2023

ggutoski commented Apr 11, 2023 •

edited

Loading

Reed solomon erasure code #225

Reed solomon erasure code #225

Conversation

mrain commented Mar 31, 2023 • edited Loading

Description

ggutoski left a comment

Choose a reason for hiding this comment

ggutoski left a comment

Choose a reason for hiding this comment

ggutoski commented Apr 6, 2023

ggutoski commented Apr 11, 2023

ggutoski commented Apr 11, 2023 • edited Loading

mrain commented Mar 31, 2023 •

edited

Loading

ggutoski commented Apr 11, 2023 •

edited

Loading