perf(Data/Multiset/Powerset): redefine powersetAux #7388

collares · 2023-09-26T19:39:30Z

This fixes a porting note and fixes the timeouts reported at https://leanprover.zulipchat.com/#narrow/stream/287929-mathlib4/topic/brute.20force.20calculation.20of.20Bell.2FStirling.20numbers

Golfs welcome!

eric-wieser · 2023-09-27T22:08:36Z

Mathlib/Data/Multiset/Powerset.lean

@@ -23,15 +23,21 @@ variable {α : Type*}

 /-! ### powerset -/

--Porting note: TODO: Write a more efficient version


In what sense is the new version more efficient? In terms of algorithmic complexity they're the same, right? Is the problem the intermediate memory?

If nothing else, it fixes timeouts and seems to be a more faithful port of the Lean 3 version...

I suspect the cause of the slowdown was the use of Array in List.sublists. An Array in the kernel is just a List, but it is compiled to something more efficient. List.sublists uses Array.push, which is slow in the kernel because it's l ++ [a] on the underlying list, so it's linear time instead of constant time. So this new implementation has better complexity in the kernel.
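To make the cost argument concrete, here is a toy model of the kernel's view of an array. `KArray`, `KArray.push` and `KArray.ofList` are hypothetical names made up for this sketch; it is not the actual core definition of `Array`, only an illustration of why a push that appends to the end of a list is linear in the current length.

```lean
/-- Toy model (hypothetical): the kernel sees an array as a wrapper around a list. -/
structure KArray (α : Type) where
  data : List α

/-- Kernel-style push: `++ [x]` has to walk the whole underlying list. -/
def KArray.push {α : Type} (a : KArray α) (x : α) : KArray α :=
  ⟨a.data ++ [x]⟩

/-- Building an `n`-element array by repeated pushes costs about
`0 + 1 + ... + (n - 1)` list steps in this model, i.e. quadratic overall. -/
def KArray.ofList {α : Type} (l : List α) : KArray α :=
  l.foldl KArray.push (⟨[]⟩ : KArray α)

-- The result is what you would expect; only the reduction cost is affected.
example : (KArray.ofList [1, 2, 3]).data = [1, 2, 3] := rfl
```

Compiled code does not pay this cost, because the compiler replaces arrays with a native implementation; the issue only shows up when the kernel (or `rfl`/`decide`) has to unfold the definition.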

Does that mean List.sublists is also inefficient in the kernel? Can we change it back to be array-free (using the definition in mathlib3port, for example), so that this definition gets the same improvement?

If we're worried about runtime performance, then we can add a csimp lemma in Std that converts the kernel-friendly version into the execution-friendly version. The advantage of doing that in Std to List.sublists is that it should cause everything downstream to be optimal automatically.
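As a rough illustration of the csimp pattern, under the assumption that a simple list sum is a fair stand-in: `sumSlow` and `sumFast` below are hypothetical names, not anything from Std. The first definition is the one the kernel and proofs see; the `@[csimp]` theorem tells the compiler to run the second one instead.

```lean
/-- Kernel-friendly definition: plain structural recursion, easy to unfold. -/
def sumSlow : List Nat → Nat
  | [] => 0
  | x :: xs => x + sumSlow xs

/-- Execution-friendly definition: tail-recursive with an accumulator. -/
def sumFast (l : List Nat) : Nat :=
  go l 0
where
  go : List Nat → Nat → Nat
    | [], acc => acc
    | x :: xs, acc => go xs (acc + x)

/-- The accumulator loop computes the same sum as the structural version. -/
theorem sumFast_go (l : List Nat) (acc : Nat) :
    sumFast.go l acc = acc + sumSlow l := by
  induction l generalizing acc with
  | nil => simp [sumFast.go, sumSlow]
  | cons x xs ih => simp [sumFast.go, sumSlow, ih, Nat.add_assoc]

/-- With this lemma, compiled code that mentions `sumSlow` runs `sumFast`. -/
@[csimp] theorem sumSlow_eq_sumFast : @sumSlow = @sumFast := by
  funext l
  simp [sumFast, sumFast_go]
```

The point of the design is that downstream definitions keep referring to the kernel-friendly version, so proofs by `rfl` stay cheap while compiled code still gets the fast implementation.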

For reference, my check for whether alternate definitions are "good enough" is the following example, which essentially comes from the Zulip thread:

import Mathlib.Order.Partition.Finpartition

open Finset

instance Finpartition.fintype_finset {α : Type _} [DecidableEq α] (a : Finset α) :
    Fintype (Finpartition a) where
  elems := a.powerset.powerset.image
    (λ p => if h : p.SupIndep id ∧ p.sup id = a ∧ ⊥ ∉ p then ⟨p, h.1, h.2.1, h.2.2⟩ else ⊥)
  complete := by
    rintro p
    rw [mem_image]
    refine' ⟨p.parts, _, _⟩
    · simp only [mem_powerset]
      intros i hi
      rw [mem_powerset]
      exact p.le hi
    · rw [dif_pos]
      simp only [p.supIndep, p.supParts, p.not_bot_mem, eq_self_iff_true, not_false_iff,
        and_self]

example : @Fintype.card (Finpartition (range 3)) (Finpartition.fintype_finset _) = 5 := by
  rfl

@eric-wieser I checked by reverting my changes and replacing the Std definition of sublists by

l.foldr (fun a acc => join (acc.map fun x => [x, a :: x])) [[]]

(and sorrying out the affected theorems) and the example in the previous comment didn't time out. Therefore, I fully agree that we should fix it in Std. Unfortunately I don't have the time to prepare and shepherd a Std PR right now, but I'd be happy if you or someone else did it.
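For anyone who wants to try the array-free shape in isolation, here is a minimal sketch. `sublistsFoldr` is a hypothetical name, and the inner `join`/`map` is spelled as a second `foldr` so the snippet does not depend on what the flattening function is called in a given Lean version; it is meant to compute the same list as the one-liner above, not to be the Std definition.

```lean
/-- Array-free sublists via nested `foldr` (hypothetical name, sketch only).
For each element `a`, every sublist `x` built so far is emitted twice:
once unchanged and once with `a` consed on. -/
def sublistsFoldr {α : Type} (l : List α) : List (List α) :=
  l.foldr (fun a acc => acc.foldr (fun x r => x :: (a :: x) :: r) []) [[]]

-- Small sanity check that the kernel can evaluate this by `rfl`.
example : sublistsFoldr [1, 2] = [[], [1], [2], [1, 2]] := rfl
```

Everything here is structural recursion on lists, so the kernel never has to unfold an `Array.push`.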

I've tried to adopt this in #7746

Do recent elaborator changes have any effect on this problem?

The performance difference is still noticeable; see the example in the Std issue.


redefine powersetAux

eb5b2ad

collares requested review from b-mehta and ChrisHughes24 September 26, 2023 19:40

collares changed the title ~~feat(Data/Multiset/Powerset): redefine powersetAux~~ perf(Data/Multiset/Powerset): redefine powersetAux Sep 26, 2023

lint

b375380

collares added the awaiting-review The author would like community review of the PR label Sep 26, 2023

eric-wieser reviewed Sep 27, 2023


eric-wieser reviewed Sep 27, 2023


eric-wieser added awaiting-author A reviewer has asked the author a question or requested changes and removed awaiting-review The author would like community review of the PR labels Sep 27, 2023

collares force-pushed the collares/redefine-powersetAux branch from 7ffbf19 to fcd3053 Compare September 28, 2023 11:35

address review comments

456ab83

collares force-pushed the collares/redefine-powersetAux branch from fcd3053 to 456ab83 Compare September 28, 2023 14:41

collares added help-wanted The author needs attention to resolve issues please-adopt and removed awaiting-author A reviewer has asked the author a question or requested changes help-wanted The author needs attention to resolve issues labels Oct 3, 2023

Merge remote-tracking branch 'origin/master' into collares/redefine-p…

0af98dc

…owersetAux

eric-wieser added a commit to eric-wieser/std4 that referenced this pull request Oct 18, 2023

perf: improve kernel reduction of List.sublists

f9de3c7

This replaces leanprover-community/mathlib4#7388.

eric-wieser added a commit to eric-wieser/std4 that referenced this pull request Oct 18, 2023

perf: improve kernel reduction of List.sublists

528fb3d

This replaces leanprover-community/mathlib4#7388.

eric-wieser mentioned this pull request Oct 18, 2023

perf: improve kernel reduction of List.sublists leanprover-community/batteries#301

Merged

eric-wieser added a commit to eric-wieser/std4 that referenced this pull request Oct 18, 2023

perf: improve kernel reduction of List.sublists

7db655a

This replaces leanprover-community/mathlib4#7388.

leanprover-community-mathlib4-bot added the merge-conflict The PR has a merge conflict with master, and needs manual merging. label Oct 19, 2023

collares closed this Oct 20, 2023


collares commented Sep 26, 2023

eric-wieser Sep 27, 2023

b-mehta Sep 28, 2023

ChrisHughes24 Sep 28, 2023

eric-wieser Sep 28, 2023 (edited)

eric-wieser Sep 28, 2023

collares Sep 28, 2023

collares Sep 28, 2023 (edited)

eric-wieser Oct 18, 2023

ChrisHughes24 Oct 18, 2023

eric-wieser Oct 18, 2023
