feat(information_theory/linear_code): Define linear codes #16774

BoltonBailey · 2022-10-03T04:39:49Z

Add linear codes and defines the Reed-Solomon code.

depends on: [Merged by Bors] - feat(topology/metric_space/infsep): Add "infimum separation". #15689

YaelDillies · 2022-10-08T09:32:56Z

src/information_theory/linear_code.lean

+
+* `linear_code 𝓓 F`: The type of linear codes with domain `𝓓` over field `F`


Suggested change

* `linear_code 𝓓 F`: The type of linear codes with domain `𝓓` over field `F`

* `linear_code 𝓓 F`: The type of linear codes with domain `𝓓` over field `F`

YaelDillies · 2022-10-08T09:33:49Z

src/information_theory/linear_code.lean

+A linear error-correcting code, defined as a subspace of the vector space of functions from a
+domain into a field.
+-/
+def linear_code (𝓓 F : Type) [fintype 𝓓] [field F] := submodule F ( 𝓓 -> F )


The fintype assumption is unused.

Which incidentally is to be expected - this is a problem I ran into. It's actually not really necessary that this be finite....

Given that you're interested in matrices associated with the code, maybe it would be best to go with domain fin n

Not for the general definition.

Incidentally, you should use hamm (𝓓 -> F) as the type of the vectors: this is 𝓓 -> F equipped with the hamming norm as a metric.

Yeah I tried that but module F (hamming (λ (d : 𝓓), F)) failed to synthesize, presumably at some point I should go back and switch that.

Hmm - it shouldn't have done, in theory. I would be grateful if you could try and replicate the issue in case it needs a patch.

Maybe one should not hardcode hamm as the metric. Different types of codes use different metrics. E.g. your code may live in a matrix space, with distance the rank distance, i.e. dist(A,B)=rank(A-B).

Yes, this is another reason I have hesitated before defining codes. I think it does want to be generalised ideally. It might want to be specific to integer-valued distances but this might not actually be necessary.

YaelDillies · 2022-10-08T09:34:32Z

src/information_theory/linear_code.lean

+def length (C : linear_code 𝓓 F) : ℕ := fintype.card 𝓓
+
+/-- The set of all valid codewords -/
+def codewords (C : linear_code 𝓓 F) := C.carrier


Probably this should be a set_like instance instead.

YaelDillies · 2022-10-08T09:35:42Z

src/information_theory/linear_code.lean

+hamming distance of 0 from any nonzero element of the code.
+-/
+noncomputable def distance [decidable_eq F] (C : linear_code 𝓓 F) : ℕ :=
+Inf (set.image (λ w : hamming (λ i : 𝓓, F), hamming_dist w 0) (C.codewords \ {0}))


Please use the '' notation You can also use nat.find maybe.

I've been working on this for a little while (actually longer than I probably should have been). Well done for submitting it. I'm not sure this is the right approach to the minimum distance, however - at the very least, there's a number of theorems you'll want to show are true about it. I have a branch called infsep where I've been trying to define this concept rigourously.

YaelDillies · 2022-10-08T09:37:21Z

src/information_theory/linear_code.lean

+Inf (set.image (λ w : hamming (λ i : 𝓓, F), hamming_dist w 0) (C.codewords \ {0}))
+
+/-- The proportion of the code dimension to the size of the code -/
+noncomputable def rate (C : linear_code 𝓓 F) : ℚ := rat.mk C.dimension C.length


rat.mk has a better name in informal maths 😉

Suggested change

noncomputable def rate (C : linear_code 𝓓 F) : ℚ := rat.mk C.dimension C.length

noncomputable def rate (C : linear_code 𝓓 F) : ℚ := C.dimension / C.length

YaelDillies · 2022-10-08T09:38:43Z

src/information_theory/linear_code.lean

+      intros a b ha hb,
+      rw set.mem_set_of at ha hb ⊢,
+      rcases ha with ⟨pa, hap⟩,
+      rcases hb with ⟨pb, hbp⟩,


Suggested change

intros a b ha hb,

rw set.mem_set_of at ha hb ⊢,

rcases ha with ⟨pa, hap⟩,

rcases hb with ⟨pb, hbp⟩,

rintro a b ⟨pa, hap⟩ ⟨pb, hbp⟩,

linesthatinterlace · 2022-10-08T10:00:34Z

I feel a bit stupid because I've written versions of this same file a bunch of times in the last few months, I never submitted it, and now you actually did it - well done. However: my suspicion is that you might run into the same issues I did.

What this file lacks is many or any theorems. You're doing a lot of defining things - but every definition comes with a cost. What I want to see is some theorems proving (not necessarily complicated!) facts about these objects.

In addition - it doesn't make sense to me to define linear codes without first defining block codes, which was a sticking point for me. You're not really using the linearity that much in what you've done so far.

I would expect to see some mention of generating matrices and parity check matrices...

linesthatinterlace · 2022-10-08T15:09:36Z

I would be interested to see if you could base this off my infsep branch. That construction was intended for use in a PR like this - it was held up for a month by some PR issues.

mathlib-dependent-issues-bot · 2022-10-15T10:58:09Z

This PR/issue depends on:

~~[Merged by Bors] - feat(topology/metric_space/infsep): Add "infimum separation". #15689~~
By Dependent Issues (🤖). Happy coding!

eric-wieser · 2022-10-18T23:58:28Z

src/information_theory/linear_code.lean

+/-- The repetition code, where all symbols in each codeword are the same. This is equivalent to a
+Reed-Solomon code with max degree 0 -/
+def repetition : linear_code 𝓓 F :=
+{ carrier :=  {w | ∃ f : F, w = (λ x, f)},


This would be much easier to prove as something like

linear_map.range { to_fun := function.const, map_add' := pi.const_add }

eric-wieser · 2022-10-19T00:00:27Z

src/information_theory/linear_code.lean

+field.
+-/
+def reed_solomon (k : ℕ) (D : finset F) : linear_code D F :=
+{ carrier := {w | ∃ p : polynomial F, p.nat_degree ≤ k ∧ w = (λ x, polynomial.eval x p)},


You should be able to write this as the submodule.image of eval on the polynomials of degree-less- than-k.

BoltonBailey added 2 commits October 2, 2022 21:38

linear code

6b70943

move, add doc

f988de6

BoltonBailey changed the title ~~feat(information_theory/block_code): Define linear codes~~ feat(information_theory/linear_code): Define linear codes Oct 4, 2022

BoltonBailey added the awaiting-CI The author would like to see what CI has to say before doing more work. label Oct 4, 2022

BoltonBailey added 4 commits October 4, 2022 23:38

docs

f520e64

reword

5ee9e76

add repetition code as inhabited instance

13bb42d

fixing docs

a0b63b0

YaelDillies reviewed Oct 8, 2022

View reviewed changes

YaelDillies requested a review from linesthatinterlace October 8, 2022 09:39

eric-wieser reviewed Oct 18, 2022

View reviewed changes

eric-wieser reviewed Oct 19, 2022

View reviewed changes

BoltonBailey added the WIP Work in progress label Oct 19, 2022

semorrison added the too-late This PR was ready too late for inclusion in mathlib3 label Jul 16, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(information_theory/linear_code): Define linear codes #16774

feat(information_theory/linear_code): Define linear codes #16774

BoltonBailey commented Oct 3, 2022 •

edited by YaelDillies

Loading

YaelDillies Oct 8, 2022

YaelDillies Oct 8, 2022

linesthatinterlace Oct 8, 2022

BoltonBailey Oct 8, 2022

linesthatinterlace Oct 8, 2022

linesthatinterlace Oct 8, 2022

BoltonBailey Oct 8, 2022

linesthatinterlace Oct 8, 2022

dimpase Oct 19, 2022

linesthatinterlace Oct 19, 2022

YaelDillies Oct 8, 2022

YaelDillies Oct 8, 2022

linesthatinterlace Oct 8, 2022

YaelDillies Oct 8, 2022

YaelDillies Oct 8, 2022

linesthatinterlace commented Oct 8, 2022

linesthatinterlace commented Oct 8, 2022

mathlib-dependent-issues-bot commented Oct 15, 2022

eric-wieser Oct 18, 2022

eric-wieser Oct 19, 2022


		* `linear_code 𝓓 F`: The type of linear codes with domain `𝓓` over field `F`

	noncomputable def rate (C : linear_code 𝓓 F) : ℚ := rat.mk C.dimension C.length
	noncomputable def rate (C : linear_code 𝓓 F) : ℚ := C.dimension / C.length

feat(information_theory/linear_code): Define linear codes #16774

Are you sure you want to change the base?

feat(information_theory/linear_code): Define linear codes #16774

Conversation

BoltonBailey commented Oct 3, 2022 • edited by YaelDillies Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

linesthatinterlace commented Oct 8, 2022

linesthatinterlace commented Oct 8, 2022

mathlib-dependent-issues-bot commented Oct 15, 2022

Choose a reason for hiding this comment

Choose a reason for hiding this comment

BoltonBailey commented Oct 3, 2022 •

edited by YaelDillies

Loading