feat: OrderedAssocList #556

kim-em · 2024-01-23T06:50:25Z

Ordered wrapper around AssocList, and basic functions find?, insert?, filterMapVal, and merge. An extensionality lemma (∀ a, l₁.find? a = l₂.find? a) → l₁ = l₂.

I will later use this to implement sparse coefficients in omega.

joehendrix · 2024-01-30T01:20:37Z

I'm assuming this is useful for kernel execution efficiency. Is that the case? Could you test performance against RBMap?

Std/Data/OrderedAssocList.lean

kim-em · 2024-01-30T06:29:04Z

I'm assuming this is useful for kernel execution efficiency. Is that the case? Could you test performance against RBMap?

No, actually, there's no need to run any of this in the kernel. I want this because I think it will be easier to write the theorems about algebraic operations on finitely supported functions Nat \to Int that I need in this representation than in RBMap.

Co-authored-by: Mario Carneiro <di.gama@gmail.com>

digama0 · 2024-01-30T06:55:25Z

In that case I'm a bit dubious of adding a new data structure like this to std, since it has neither good kernel performance nor good runtime performance, and we already have other ways to write finitely supported functions for proving purposes.

kim-em · 2024-01-30T06:57:12Z

Hmm, okay. I will see how proofs go on top of RBMap + a wrapper asserting there are no zero values.

kim-em · 2024-01-30T07:00:03Z

Currently RBMap.mergeWith is O(n₂ * log (n₁ + n₂)), whereas the merge here is O(n₁ + n₂). (I hope. :-)

digama0 · 2024-01-30T07:13:45Z

Hmm, okay. I will see how proofs go on top of RBMap + a wrapper asserting there are no zero values.

What are you trying to do more specifically? Why do you need proofs in this case? If this is an omega data structure I would expect proofs to not be necessary.

Currently RBMap.mergeWith is O(n₂ * log (n₁ + n₂)), whereas the merge here is O(n₁ + n₂). (I hope. :-)

Wikipedia describes a better algorithm for RBMap merge with complexity O(n₂ * log (n₁ / n₂ + 1)), which is strictly better than O(n₁ + n₂) (it wins mainly in the unbalanced case). The implementation here is a bit lazy, but with some proof work I think it wouldn't be too bad to implement the more efficient algorithm.

kim-em · 2024-01-30T07:33:29Z

Hmm, okay. I will see how proofs go on top of RBMap + a wrapper asserting there are no zero values.

What are you trying to do more specifically? Why do you need proofs in this case? If this is an omega data structure I would expect proofs to not be necessary.

This is very slightly inaccurate (because of subsequent laziness taking some shortcuts), but the functions and theorems I need on the representation of coefficients are given in https://github.com/leanprover/std4/blob/main/Std/Tactic/Omega/Coeffs/IntList.lean

Currently all these functions are implemented on List Int, i.e. a dense representation of coefficients, only trimming trailing zeros as needed to make the theorems true!

As you can see the intention is that I should be able to swap out any representation satisfying that API.

This is not particularly urgent, as right now now one is actually calling omega with more than ~10 variables... But Leo assures me that that someone will want to eventually.

digama0 · 2024-01-30T07:39:02Z

So why not just use AssocList there? Or Array (Nat x Int) for that matter.

kim-em · 2024-01-30T08:02:54Z

So why not just use AssocList there? Or Array (Nat x Int) for that matter.

Because then the merge operation (which is actually run, not just proved about) is O(n1 * n2), and I wanted to avoid that.

digama0 · 2024-01-30T21:19:03Z

So why not just use AssocList there? Or Array (Nat x Int) for that matter.

Because then the merge operation (which is actually run, not just proved about) is O(n1 * n2), and I wanted to avoid that.

I mean, you can still implement a O(n1 + n2) merge operation on AssocList which works provided the inputs are sorted. That is, you are working with ordered AssocLists, just not as a distinct type.

kim-em · 2024-01-31T06:43:50Z

So why not just use AssocList there? Or Array (Nat x Int) for that matter.

Because then the merge operation (which is actually run, not just proved about) is O(n1 * n2), and I wanted to avoid that.

I mean, you can still implement a O(n1 + n2) merge operation on AssocList which works provided the inputs are sorted. That is, you are working with ordered AssocLists, just not as a distinct type.

That's what this PR does! AssocList.orderedMerge. It seems really cumbersome to then state all the theorems about it (e.g. how it relates to find?, associativity, etc) if you forbid yourself from mentioning the bundled type that wraps up the witness of ordered-ness.

digama0 · 2024-02-01T05:32:28Z

Is it? I would expect it to just be a simple predicate, e.g.

theorem orderedMerge_ordered : Ordered l1 -> Ordered l2 -> Ordered (l1.orderedMerge l2) := ...

Actually it might not even be Ordered, it could be Pairwise R or something, at least for the low level primitive. You can therefore manipulate the ordered-ness independently of the list itself, and this features quite prominently in sorting functions for lists, for example, so I don't think it's a particularly bad design.

kim-em · 2024-02-03T23:29:07Z

Is it?

I've started refactoring, and agree this is better. Thanks for talking me around. :-)

Std/Data/AssocList.lean

fgdorais · 2024-02-06T17:26:20Z

I think all my concerns have been addressed. Just one additional remark: the design of AssocList so far is that defs are specced using toList, correctness and other lemmas are then proved using the toList translation and the List library. I think this makes good sense and should be considered in the revisions.

kim-em · 2024-02-07T22:55:19Z

the design of AssocList so far is that defs are specced using toList, correctness and other lemmas are then proved using the toList translation and the List library.

This is still WIP, I will probably eventually remove the bundled OrderedAssocList entirely when I get back to this PR.

However, I am not intending to use toList for specification: the great property of an ordered assoc list is that find? suffices for specifications, and this is intentionally different from what we have to do for AssocList.

kim-em · 2024-11-23T11:00:35Z

I've cleaned this up. I decided to keep the bundled structure because the whole point is the nice extensionality lemma available there.

leanprover-community-bot · 2024-11-23T11:44:25Z

Mathlib CI status (docs):

✅ Mathlib branch batteries-pr-testing-556 has successfully built against this PR. (2024-11-23 11:44:24) View Log

fgdorais

I only made it part way but I don't know when I will come back to it :/

fgdorais · 2024-11-23T14:21:36Z

Batteries/Data/OrderedAssocList.lean

+The predicate that the keys of an `AssocList` are
+in strictly increasing order according to the comparator `cmp`.
+-/
+def keysOrdered (cmp : α → α → Ordering) : AssocList α β → Prop


Make Bool valued. It's nicely tail recursive too!

I'm skeptical. What is the pay-off? It's kind of annoying to adapt to this.

True, it's a major refactor. My concern is that the decision procedure for the current version is not necessarily as efficient. Can we have both? Also, keysOrdered should be KeysOrdered if it's a proposition.

Have you considered using ltHeadKey? here to save some cases?

def KeysOrdered (cmp : α → α → Ordering) : AssocList α β → Prop | .nil => True | .cons a _ t => ltHeadKey? cmp a t ∧ KeysOrdered cmp t

fgdorais · 2024-11-24T00:35:18Z

Batteries/Data/OrderedAssocList.lean

+/--
+The condition that an element is less than the first key of an `AssocList`, or that list is empty.
+-/
+abbrev ltHeadKey? (cmp : α → α → Ordering) (a : α) (t : AssocList α β) : Prop :=


Makes more sense as Bool valued. Something about the ? makes me unhappy here but it's hard to think of a more accurate alternative.

fgdorais · 2024-11-24T00:40:14Z

Batteries/Data/OrderedAssocList.lean

+  | .cons _ _ .nil => True
+  | .cons a _ (.cons x y t) => cmp a x = .lt ∧ keysOrdered cmp (.cons x y t)
+
+instance instKeysOrderedDecidablePred : DecidablePred (keysOrdered cmp : AssocList α β → Prop) := by


Won't be necessary if keysOrdered returns Bool.

kim-em added 2 commits January 23, 2024 17:47

OrderedAssocList

655c9af

add to Std.lean

1a1d1a1

kim-em added the awaiting-review This PR is ready for review; the author thinks it is ready to be merged. label Jan 23, 2024

lint

ceaf996

digama0 reviewed Jan 30, 2024

View reviewed changes

Std/Data/OrderedAssocList.lean Outdated Show resolved Hide resolved

Update Std/Data/OrderedAssocList.lean

aa2147c

Co-authored-by: Mario Carneiro <di.gama@gmail.com>

kim-em added 4 commits February 1, 2024 12:10

start unbundling

6de3b08

in progress

390d17f

ugh

6ebc8d9

checkpoint

618c18c

kim-em added WIP work in progress and removed awaiting-review This PR is ready for review; the author thinks it is ready to be merged. labels Feb 3, 2024

kim-em added 3 commits February 5, 2024 23:19

more progresss unbundling

50a48f6

Merge remote-tracking branch 'origin/main' into ordered_assoc_list

33c0a78

fix after merge

1f0f20d

fgdorais reviewed Feb 6, 2024

View reviewed changes

Std/Data/AssocList.lean Outdated Show resolved Hide resolved

leanprover-community-mathlib4-bot added the merge-conflict This PR has merge conflicts with the `main` branch which must be resolved by the author. label Mar 5, 2024

big merge

aa85aae

leanprover-community-mathlib4-bot removed the merge-conflict This PR has merge conflicts with the `main` branch which must be resolved by the author. label Nov 23, 2024

kim-em added 5 commits November 23, 2024 21:18

merge main

009746c

GetElem?

552b38a

cleanup

2476418

module doc

161e49b

lint

17d5249

kim-em added 2 commits November 23, 2024 22:01

.

22fd20e

updateBatteries

eae28d1

leanprover-community-mathlib4-bot added a commit to leanprover-community/mathlib4 that referenced this pull request Nov 23, 2024

Update Batteries branch for testing leanprover-community/batteries#556

2952699

leanprover-community-bot added the builds-mathlib label Nov 23, 2024

fgdorais reviewed Nov 24, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: OrderedAssocList #556

feat: OrderedAssocList #556

kim-em commented Jan 23, 2024

joehendrix commented Jan 30, 2024

kim-em commented Jan 30, 2024

digama0 commented Jan 30, 2024 •

edited

Loading

kim-em commented Jan 30, 2024

kim-em commented Jan 30, 2024 •

edited

Loading

digama0 commented Jan 30, 2024 •

edited

Loading

kim-em commented Jan 30, 2024

digama0 commented Jan 30, 2024

kim-em commented Jan 30, 2024

digama0 commented Jan 30, 2024 •

edited

Loading

kim-em commented Jan 31, 2024

digama0 commented Feb 1, 2024 •

edited

Loading

kim-em commented Feb 3, 2024

fgdorais commented Feb 6, 2024

kim-em commented Feb 7, 2024

kim-em commented Nov 23, 2024 •

edited

Loading

leanprover-community-bot commented Nov 23, 2024

fgdorais left a comment

fgdorais Nov 23, 2024

kim-em Nov 24, 2024

fgdorais Nov 24, 2024 •

edited

Loading

fgdorais Nov 24, 2024

fgdorais Nov 24, 2024

feat: OrderedAssocList #556

Are you sure you want to change the base?

feat: OrderedAssocList #556

Conversation

kim-em commented Jan 23, 2024

joehendrix commented Jan 30, 2024

kim-em commented Jan 30, 2024

digama0 commented Jan 30, 2024 • edited Loading

kim-em commented Jan 30, 2024

kim-em commented Jan 30, 2024 • edited Loading

digama0 commented Jan 30, 2024 • edited Loading

kim-em commented Jan 30, 2024

digama0 commented Jan 30, 2024

kim-em commented Jan 30, 2024

digama0 commented Jan 30, 2024 • edited Loading

kim-em commented Jan 31, 2024

digama0 commented Feb 1, 2024 • edited Loading

kim-em commented Feb 3, 2024

fgdorais commented Feb 6, 2024

kim-em commented Feb 7, 2024

kim-em commented Nov 23, 2024 • edited Loading

leanprover-community-bot commented Nov 23, 2024

fgdorais left a comment

Choose a reason for hiding this comment

fgdorais Nov 23, 2024

Choose a reason for hiding this comment

kim-em Nov 24, 2024

Choose a reason for hiding this comment

fgdorais Nov 24, 2024 • edited Loading

Choose a reason for hiding this comment

fgdorais Nov 24, 2024

Choose a reason for hiding this comment

fgdorais Nov 24, 2024

Choose a reason for hiding this comment

digama0 commented Jan 30, 2024 •

edited

Loading

kim-em commented Jan 30, 2024 •

edited

Loading

digama0 commented Jan 30, 2024 •

edited

Loading

digama0 commented Jan 30, 2024 •

edited

Loading

digama0 commented Feb 1, 2024 •

edited

Loading

kim-em commented Nov 23, 2024 •

edited

Loading

fgdorais Nov 24, 2024 •

edited

Loading