feat: asynchronous kernel checking #6368

Kha · 2024-12-11T22:44:21Z

This PR implements executing kernel checking in parallel to elaboration, which is a prerequisite for parallelizing elaboration itself.

Stacked on #6214

Co-Authored-By: Sebastian Ullrich <sebasti@nullri.ch>

leanprover#5835 contains a brittle test that uses an FVar ID, which caused a failure on master. This PR changes that test to use a declaration instead.

This is syntactically more general than before, though up to eta expansion it make no difference.

…eanprover#5837) This PR fixes an old auto-completion bug where `x.` would issue nonsensical completions when `x.` could not be elaborated as a dot completion.

…5840) This PR changes `lean_sharecommon_{eq,hash}` to only consider the salient bytes of an object, and not any bytes of any unspecified/uninitialized unused capacity. Accessing uninitialized storage results in undefined behaviour. This does not seem to have any semantics disadvantages: If objects compare equal after this change, their salient bytes are still equal. By contrast, if the actual identity of allocations needs to be distinguished, that can be done by just comparing pointers to the storage. If we wanted to retain the current logic, we would need initialize the otherwise unused parts to some specific value to avoid the undefined behaviour. Closes leanprover#5831

This PR fixes a bug at `isDefEq` when `zetaDelta := false`. See new test for a small example that exposes the issue.

This PR adds a case to `Level.geq` that is present in the kernel's level `is_geq` procedure, making them consistent with one another. This came up during testing of `lean4lean`. Currently `Level.geq` differs from `level::is_geq` in the case of `max u v >= imax u v`. The elaborator function is overly pessimistic and yields `false` on this while the kernel function yields true. This comes up concretely in the `Trans` class: ```lean class Trans (r : α → β → Sort u) (s : β → γ → Sort v) (t : outParam (α → γ → Sort w)) where trans : r a b → s b c → t a c ``` The type of this class is `Sort (max (max (max (max (max (max 1 u) u_1) u_2) u_3) v) w)` (where `u_1 u_2 u_3` are the levels of `α β γ`), but if you try writing that type explicitly then the `class` command fails. Omitting the type leaves the `class` to infer the universe level (the command assumes the level is correct, and the kernel agrees it is), but including the type then the elaborator checks the level inequality with `Level.geq` and fails. --------- Co-authored-by: Kyle Miller <kmill31415@gmail.com>

…prover#6131) This PR fixes a bug at the definitional equality test (`isDefEq`). At unification constraints of the form `c.{u} =?= c.{v}`, it was not trying to unfold `c`. This bug did not affect the kernel. closes leanprover#6117

This PR improves the `#print` command for structures to show all fields and which parents the fields were inherited from, hiding internal details such as which parents are represented as subobjects. This information is still present in the constructor if needed. The pretty printer for private constants is also improved, and it now handles private names from the current module like any other name; private names from other modules are made hygienic. Example output for `#print Monad`: ``` class Monad.{u, v} (m : Type u → Type v) : Type (max (u + 1) v) number of parameters: 1 parents: Monad.toApplicative : Applicative m Monad.toBind : Bind m fields: Functor.map : {α β : Type u} → (α → β) → m α → m β Functor.mapConst : {α β : Type u} → α → m β → m α Pure.pure : {α : Type u} → α → m α Seq.seq : {α β : Type u} → m (α → β) → (Unit → m α) → m β SeqLeft.seqLeft : {α β : Type u} → m α → (Unit → m β) → m α SeqRight.seqRight : {α β : Type u} → m α → (Unit → m β) → m β Bind.bind : {α β : Type u} → m α → (α → m β) → m β constructor: Monad.mk.{u, v} {m : Type u → Type v} [toApplicative : Applicative m] [toBind : Bind m] : Monad m resolution order: Monad, Applicative, Bind, Functor, Pure, Seq, SeqLeft, SeqRight ``` Suggested by Floris van Doorn [on Zulip](https://leanprover.zulipchat.com/#narrow/channel/270676-lean4/topic/.23print.20command.20for.20structures/near/482503637).

…er#6120) This PR adds theorems `BitVec.(getMsbD, msb)_(rotateLeft, rotateRight)`. We follow the same strategy taken for `getLsbD`, constructing the necessary auxilliary theorems first (relying on different hypotheses) and then generalizing. --------- Co-authored-by: Siddharth <siddu.druid@gmail.com> Co-authored-by: Tobias Grosser <tobias@grosser.es>

…#6132) This PR duplicates the verification API for `List.attach`/`attachWith`/`pmap` over to `Array`.

…er#6133) This PR replaces `Array.feraseIdx` and `Array.insertAt` with `Array.eraseIdx` and `Array.insertIdx`, both of which take a `Nat` argument and a tactic-provided proof that it is in bounds. We also have `eraseIdxIfInBounds` and `insertIdxIfInBounds` which are noops if the index is out of bounds. We also provide a `Fin` valued version of `Array.findIdx?`. Together, these quite ergonomically improve the array indexing safety at a number of places in the compiler/elaborator.

This PR adds support for displaying multiple threads in the trace profiler output. `TraceState.tid` needs to be adjusted for this purpose, which is not done yet by the Lean elaborator as it is still single-threaded.

This PR fixes `trace.profiler.pp` not using the term pretty printer. Fixes leanprover#5872

This PR fixes the run-time evaluation of `(default : Float)`.

…ver#6141)

This PR does the same fix as leanprover#6104, but such that it doesn't break the test/the file in `Plausible`. This is done by not creating unused let binders in metavariable types that are made by `elimMVar`. (This is also a positive thing for users looking at metavariable types, for example in error messages) We get rid of `skipAtMostNumBinders`. This function was originally defined for the purpose of making this test work, but it is a hack because it allows cycles in the metavariable context. It would make sense to split these changes into 2 PRs, but I combined them here to show that the combination of them closes leanprover#6013 without breaking anything Closes leanprover#6013

This PR converts 3 doc-string to module docs since it seems that this is what they were intended to be!

This PR avoids runtime array bounds checks in places where it can trivially be done at compile time. None of these changes are of particular consequence: I mostly wanted to learn how much we do this, and what the obstacles are to doing it less.

This PR adds lemmas for extracting a given bit of a `BitVec` obtained via `sub`/`neg`/`sshiftRight'`/`abs`. --------- Co-authored-by: Kim Morrison <scott@tqft.net>

This PR enables contributors to modify `changelog-*` labels simply by writing a comment with the desired label.

…orem (leanprover#6146) This PR fixes a non-termination bug that occurred when generating the match-expression splitter theorem. The bug was triggered when the proof automation for the splitter theorem repeatedly applied `injection` to the same local declaration, as it could not be removed due to forward dependencies. See issue leanprover#6065 for an example that reproduces this issue. closes leanprover#6065

…nprover#6161) This PR ensures whitespace is printed before `+opt` and `-opt` configuration options when pretty printing, improving the experience of tactics such as `simp?`. Reported [on Zulip](https://leanprover.zulipchat.com/#narrow/channel/270676-lean4/topic/Minor.20simp.3F.20annoyances/near/483736310)

This PR adds a slight performance improvement to reflection of `if` statements that I noticed by profiling Leanwuzla against SMTCOMP's `non-incremental/QF_BV/fft/Sz256_6616.smt2`. In particular: 1. The profile showed about 4 percent of the total run time were spent constructing these decidable instances in reflection of `if` statements. We can construct them much quicker by hand as they always have the same structure 2. This delays construction of these statements until we actually generate the reflection proof that we wish to submit to the kernel. Thus if we encounter a SAT instad of an UNSAT problem we will not spend time generating these expressions anymore. ``` baseline Time (mean ± σ): 31.236 s ± 0.258 s Range (min … max): 30.899 s … 31.661 s 10 runs after Time (mean ± σ): 30.671 s ± 0.288 s Range (min … max): 30.350 s … 31.156 s 10 runs ```

This PR adds support for `Float32` and fixes a bug in the runtime.

This PR brings Vector lemmas about membership and indexing to parity with List and Array.

…over#3696) This PR makes all message constructors handle pretty printer errors. Prior to this change, pretty printer errors in messages were not uniformly handled. In core, some printers capture their errors (e.g., `ppExprWithInfos` and `ppTerm` ) and some do not (e.g., `ppGoal` and `ppSignature`) propagate them to whatever serializes the message (e.g., the frontend). To resolve this inconsistency and uniformly handle errors, the signature for `ofLazy` now uses `BaseIO`. As such, all printers been adapted to capture any errors within them and print similar messages to `ppExprWithInfos` and `ppTerm` on such errors.

This PR generalizes `DecidableRel` to allow a heterogeneous relation.

Fixes leanprover#4460 (and similar future changes) making prior messages inaccessible to metaprograms such as linters

leanprover-community-bot · 2024-12-13T09:54:31Z

Mathlib CI status (docs):

❗ Batteries/Mathlib CI will not be attempted unless your PR branches off the nightly-with-mathlib branch. Try git rebase b862e2d25163584322bcca54e5e56d671da1f258 --onto 48be424eaa2ae06972e9cfec4d355906b532204d. (2024-12-13 09:54:31)
❗ Batteries/Mathlib CI will not be attempted unless your PR branches off the nightly-with-mathlib branch. Try git rebase b862e2d25163584322bcca54e5e56d671da1f258 --onto fe45ddd6105078a0a3bd855e5d94673e794f6b88. (2024-12-28 17:23:57)
❗ Batteries/Mathlib CI will not be attempted unless your PR branches off the nightly-with-mathlib branch. Try git rebase b862e2d25163584322bcca54e5e56d671da1f258 --onto 24a8561ec4e302f4e0cba07632fddacd6f6e0323. (2024-12-30 11:43:03)

Kha · 2024-12-28T22:05:46Z

!bench

leanprover-bot · 2024-12-28T22:14:34Z

Here are the benchmark results for commit 9b36754.
The entire run failed.
Found no significant differences.

Kha · 2024-12-30T11:22:38Z

!bench

leanprover-bot · 2024-12-30T11:40:59Z

Here are the benchmark results for commit ed17ab3.
There were significant changes against commit 4600bb1:

  Benchmark                  Metric                    Change
  ========================================================================
- big_do                     branches                    4.0%    (329.8 σ)
- big_do                     instructions                1.1%     (63.6 σ)
- big_do                     maxrss                      5.8%    (124.1 σ)
+ big_omega.lean             branch-misses             -64.9%   (-143.9 σ)
+ big_omega.lean             branches                  -70.2%  (-4059.2 σ)
+ big_omega.lean             instructions              -71.1%  (-3756.4 σ)
+ big_omega.lean             maxrss                    -54.7%   (-105.7 σ)
+ big_omega.lean             task-clock                -74.6%   (-347.4 σ)
+ big_omega.lean             wall-clock                -74.8%   (-327.5 σ)
+ big_omega.lean MT          branch-misses             -69.1%   (-546.0 σ)
+ big_omega.lean MT          branches                  -72.4%  (-9861.4 σ)
+ big_omega.lean MT          instructions              -72.9% (-13535.0 σ)
+ big_omega.lean MT          maxrss                    -66.4%    (-47.5 σ)
+ big_omega.lean MT          task-clock                -80.9%   (-135.4 σ)
+ big_omega.lean MT          wall-clock                -80.9%   (-135.2 σ)
+ binarytrees                instructions               -8.4%  (-3211.4 σ)
+ binarytrees.st             instructions               -8.4%  (-5177.1 σ)
- bv_decide_mul              branches                    1.5%    (382.5 σ)
+ bv_decide_mul              instructions               -1.2%   (-215.5 σ)
- bv_decide_mul              maxrss                      1.6%     (42.3 σ)
+ bv_decide_realworld        branches                   -1.5%   (-178.8 σ)
+ bv_decide_realworld        instructions               -3.9%   (-529.8 σ)
- bv_decide_realworld        maxrss                      4.1%     (15.4 σ)
+ bv_decide_realworld        task-clock                 -1.8%    (-11.5 σ)
+ bv_decide_realworld        wall-clock                 -2.1%    (-15.1 σ)
+ const_fold                 instructions               -3.5%    (-82.7 σ)
+ deriv                      instructions               -1.9%    (-45.7 σ)
+ ilean roundtrip            instructions               -3.3%   (-500.2 σ)
- import Lean                branches                  340.3%   (6011.9 σ)
- import Lean                instructions              238.0%   (3468.3 σ)
- import Lean                maxrss                      2.9%     (30.2 σ)
- import Lean                task-clock                 84.4%     (17.1 σ)
- import Lean                wall-clock                 84.1%     (17.1 σ)
- language server startup    branches                   50.5%     (88.1 σ)
- language server startup    instructions               38.5%     (61.2 σ)
- language server startup    wall-clock                 19.1%     (14.3 σ)
- liasolver                  instructions                3.5%   (1661.0 σ)
+ liasolver                  maxrss                     -1.4%    (-37.9 σ)
- libleanshared.so           binary size                 3.9%
+ nat_repr                   instructions               -5.0%  (-1756.3 σ)
+ parser                     instructions               -4.4%  (-1586.4 σ)
+ parser                     maxrss                     -1.4%    (-37.9 σ)
+ qsort                      instructions               -4.0%  (-1866.2 σ)
+ qsort                      maxrss                     -1.4%    (-16.6 σ)
- rbmap                      instructions                1.5%    (184.0 σ)
+ rbmap_1                    instructions               -1.8%   (-102.6 σ)
+ rbmap_fbip                 instructions              -10.3%   (-708.9 σ)
+ rbmap_library              instructions               -1.9%   (-171.1 σ)
- reduceMatch                instructions               17.1%    (675.1 σ)
- reduceMatch                maxrss                     19.8%    (820.9 σ)
- reduceMatch                task-clock                 12.8%     (14.3 σ)
- reduceMatch                wall-clock                 12.4%     (12.6 σ)
- simp_arith1                branch-misses               3.6%     (12.7 σ)
- simp_arith1                branches                  129.5%   (4425.6 σ)
- simp_arith1                instructions               84.0%   (2749.9 σ)
- simp_arith1                maxrss                      3.6%     (64.0 σ)
- simp_arith1                task-clock                 43.7%     (66.8 σ)
- simp_arith1                wall-clock                 43.3%     (55.9 σ)
- stdlib                     attribute application     199.1%     (90.3 σ)
- stdlib                     dsimp                      13.6%     (20.7 σ)
- stdlib                     fix level params           10.8%     (21.2 σ)
- stdlib                     instructions               13.0%   (4005.3 σ)
- stdlib                     maxrss                     39.4%    (163.9 σ)
- stdlib                     process pre-definitions   162.7%    (292.1 σ)
- stdlib                     share common exprs         12.5%    (131.1 σ)
- stdlib                     tactic execution           25.2%    (338.6 σ)
- stdlib                     task-clock                 16.7%   (1478.4 σ)
- stdlib                     type checking              38.8%     (56.0 σ)
- stdlib                     wall-clock                 14.9%     (77.2 σ)
- stdlib size                bytes .olean                2.7%
- stdlib size                lines                       2.3%
- stdlib size                lines C                     1.4%
- stdlib size                max dynamic symbols         1.4%
- tests/bench/ interpreted   instructions                3.1%    (515.5 σ)
- tests/bench/ interpreted   maxrss                      3.6%     (11.5 σ)
- tests/bench/ interpreted   task-clock                 32.8%     (35.8 σ)
- tests/bench/ interpreted   wall-clock                 12.2%     (21.4 σ)
- tests/compiler             sum binary sizes            4.7%
+ unionfind                  instructions               -8.2%  (-2666.8 σ)
+ unionfind                  task-clock                -11.6%    (-20.6 σ)
+ unionfind                  wall-clock                -11.6%    (-20.6 σ)
- workspaceSymbols           instructions                1.7%   (2541.3 σ)
- workspaceSymbols           maxrss                      3.5%     (73.0 σ)

mhuisi and others added 30 commits November 19, 2024 09:26

feat: structure auto-completion & partial InfoTrees

2a02c12

refactor: split Completion.lean

95bf45f

feat: use new structInstFields parser to tag structure instance fields

aadf3f1

chore: prepare for bootstrap

64538cf

Co-Authored-By: Sebastian Ullrich <sebasti@nullri.ch>

chore: update stage0

4bef358

test: fix brittle structure instance completion test (leanprover#6127)

a385666

leanprover#5835 contains a brittle test that uses an FVar ID, which caused a failure on master. This PR changes that test to use a declaration instead.

chore: generalize List.get_mem (leanprover#6095)

d6f8980

This is syntactically more general than before, though up to eta expansion it make no difference.

fix: don't issue atomic id completions when there is a dangling dot (l…

b7667c1

…eanprover#5837) This PR fixes an old auto-completion bug where `x.` would issue nonsensical completions when `x.` could not be elaborated as a dot completion.

chore: update stage0

69530af

fix: isDefEq when zetaDelta := false (leanprover#6129)

1f32477

This PR fixes a bug at `isDefEq` when `zetaDelta := false`. See new test for a small example that exposes the issue.

feat: duplicate List.attach/attachWith/pmap API for Array (leanprover…

c8b4f6b

…#6132) This PR duplicates the verification API for `List.attach`/`attachWith`/`pmap` over to `Array`.

feat: thread support for trace.profiler.output (leanprover#6137)

17419ac

This PR adds support for displaying multiple threads in the trace profiler output. `TraceState.tid` needs to be adjusted for this purpose, which is not done yet by the Lean elaborator as it is still single-threaded.

fix: trace.profiler pretty-printing (leanprover#6138)

2fbc466

This PR fixes `trace.profiler.pp` not using the term pretty printer. Fixes leanprover#5872

fix: Inhabited Float produced a bogus run-time value (leanprover#6136)

7fbe8e3

This PR fixes the run-time evaluation of `(default : Float)`.

refactor: make use of recursive structures in snapshot types (leanpro…

b30903d

…ver#6141)

doc: doc-strings to module docs in Data/Array/Lemmas (leanprover#6144)

56a80de

This PR converts 3 doc-string to module docs since it seems that this is what they were intended to be!

feat: BitVec.getElem_[sub|neg|sshiftRight'|abs] (leanprover#6126)

459c6e2

This PR adds lemmas for extracting a given bit of a `BitVec` obtained via `sub`/`neg`/`sshiftRight'`/`abs`. --------- Co-authored-by: Kim Morrison <scott@tqft.net>

chore: add changelog-* labels via comment (leanprover#6147)

9cf8370

This PR enables contributors to modify `changelog-*` labels simply by writing a comment with the desired label.

refactor: one more recursive structure (leanprover#6159)

19a701e

leodemoura and others added 8 commits December 11, 2024 02:55

feat: add Float32 support (leanprover#6366)

633c825

This PR adds support for `Float32` and fixes a bug in the runtime.

feat: lemmas about indexing and membership for Vector (leanprover#6367)

cb31ddc

This PR brings Vector lemmas about membership and indexing to parity with List and Array.

chore: DecidableRel allows a heterogeneous relation (leanprover#6341)

8709ca3

This PR generalizes `DecidableRel` to allow a heterogeneous relation.

chore: preserve reported messages in MessageLog (leanprover#6307)

b862e2d

Fixes leanprover#4460 (and similar future changes) making prior messages inaccessible to metaprograms such as linters

feat: split Lean.Kernel.Environment from Lean.Environment

d9a71be

chore: update stage0

0606179

chore: clean up after update-stage0

a9fd0d1

Kha added the changelog-language Language features, tactics, and metaprograms label Dec 11, 2024

Kha force-pushed the async-tc branch 3 times, most recently from 68aba49 to 08859a4 Compare December 11, 2024 23:09

refactor: move registration of namespaces on kernel add into elaborator

4aaed4c

Kha force-pushed the async-tc branch from 08859a4 to 9df65ba Compare December 12, 2024 16:07

Kha added 2 commits December 13, 2024 10:23

feat: Environment.addConstAsync

271c785

fix: do not double-report snapshotTasks after wrapAsyncAsSnapshot

a2efd60

Kha force-pushed the async-tc branch from 282c65c to 06a6e24 Compare December 13, 2024 09:23

Kha added 2 commits December 13, 2024 10:36

feat: asynchronous kernel checking

afc0c16

also disable async TC in bv_decide et al

b44a2ff

Kha force-pushed the async-tc branch from 06a6e24 to b44a2ff Compare December 13, 2024 09:36

github-actions bot added the toolchain-available A toolchain is available for this PR, at leanprover/lean4-pr-releases:pr-release-NNNN label Dec 13, 2024

Kha changed the base branch from master to kernel-env-base December 28, 2024 16:59

fix compilation skip on kernel error

9b36754

Kha force-pushed the async-tc branch from 076fe30 to 9b36754 Compare December 28, 2024 17:04

disable Lake benchmarks

ed17ab3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: asynchronous kernel checking #6368

feat: asynchronous kernel checking #6368

Kha commented Dec 11, 2024

leanprover-community-bot commented Dec 13, 2024 •

edited

Loading

Kha commented Dec 28, 2024

leanprover-bot commented Dec 28, 2024

Kha commented Dec 30, 2024

leanprover-bot commented Dec 30, 2024

feat: asynchronous kernel checking #6368

Are you sure you want to change the base?

feat: asynchronous kernel checking #6368

Conversation

Kha commented Dec 11, 2024

leanprover-community-bot commented Dec 13, 2024 • edited Loading

Kha commented Dec 28, 2024

leanprover-bot commented Dec 28, 2024

Kha commented Dec 30, 2024

leanprover-bot commented Dec 30, 2024

leanprover-community-bot commented Dec 13, 2024 •

edited

Loading