Improve MHP precision using ancestor locksets #1865

dabund24 · 2025-11-05T18:14:11Z

First part of #1805.
Second case will be handled in a separate PR.

To be handled

Non-transitive version

When $t_0$ creates $t_1$, it must hold a lock $l$ (that is, $l$ is in its must-lockset at the create). If $t_0$ does not release $l$ before $t_1$ is definitely joined into $t_0$, then $t_1$ is protected by $l$.

Examples

graph TB;
subgraph t1;
    E["..."]-->F["return;"];
end;
subgraph t0;
    A["lock(l);"]-->B;
    B["create(t1);"]-->C;
    C["join(t1);"]-->D["unlock(l);"]
end;
B-.->E
F-.->C

graph TB;
subgraph t1;
    E["..."]-->F["return;"];
end;
subgraph t0;
    A["lock(l);"]-->B;
    B["create(t1);"]-->C["return;"];
end;
B-.->E

General version

Let $t_d$ be a may-descendant of $t_1$. When $t_0$ creates $t_1$, it must hold a lock $l$ (that is, $l$ is in its must-lockset at the create). If $t_0$ does not release $l$ before $t_d$ is definitely joined into $t_0$, then $t_d$ is protected by $l$.

Example

graph TB;
subgraph td;
    G["..."]-->H["return;"];
end;
subgraph t1;
    E["create(td);"]-->F["return;"];
end;
subgraph t0;
    A["lock(l);"]-->B;
    B["create(t1);"]-->C;
    C["join(td);"]-->D["unlock(l);"]
end;
B-.->E
E-.->G
H-.->C

Dependency Analyses

$\mathcal T$: Ego thread id at the program point, with ana.thread.domain set to "history" and both ana.thread.include-node and ana.thread.context.create-edges enabled
$\mathcal L$: Must-Lockset at program point
$\mathcal C$: May-Creates of ego thread before program point
$\mathcal J$: Transitive Must-Joins of ego thread before program point
$\mathcal{DES}(t)$: Descendant threads of $t$ (implemented in this PR)
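
As an illustration of what $\mathcal{DES}(t)$ denotes, here is a minimal sketch that computes descendants as the transitive closure of a may-create relation. It is not the implementation from this PR: thread ids are modelled as plain strings, and `may_creates`, `descendants` and the `creates` table are hypothetical names chosen for the example.

```ocaml
(* Hypothetical model: thread ids are strings and [creates] lists the
   threads each thread may create directly. These are not Goblint APIs. *)
module SS = Set.Make (String)

(* Direct may-creates of [t]; threads without an entry create nothing. *)
let may_creates creates t =
  match List.assoc_opt t creates with
  | Some children -> SS.of_list children
  | None -> SS.empty

(* DES(t): every thread reachable from [t] via one or more create edges.
   The [visited] set makes the traversal terminate on circular creations. *)
let descendants creates t =
  let rec go visited frontier =
    match SS.choose_opt frontier with
    | None -> visited
    | Some c ->
      let frontier = SS.remove c frontier in
      if SS.mem c visited then go visited frontier
      else go (SS.add c visited) (SS.union frontier (may_creates creates c))
  in
  go SS.empty (may_creates creates t)

let () =
  (* General example from above: t0 creates t1, and t1 creates td. *)
  let creates = [ ("t0", [ "t1" ]); ("t1", [ "td" ]) ] in
  assert (SS.elements (descendants creates "t0") = [ "t1"; "td" ]);
  assert (SS.elements (descendants creates "t1") = [ "td" ])
```

With circular creations, a thread can end up in its own descendant set under this model, since it is reachable from itself through the cycle.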

Conditions to satisfy

1. maybe $\exists$ create(t1) in $t_0$ with $l\in\mathcal L$ and $t_d\in\mathcal{DES}(t_1)$
2. $\neg$ (maybe $\exists$ create(t1) in $t_0$ with $l\notin\mathcal L$ and $t_d\in\mathcal{DES}(t_1)$ ). If 1. holds, we get this for free; see final section
3. $\neg$ (maybe $\exists$ unlock(l) in $t_0$ with $t_d\in\left(\mathcal C\cup\bigcup_{c\in\mathcal C}\mathcal{DES}(c)\right)\setminus\mathcal J$ )

Analyses

Creation Lockset

$\mathcal{CL}: T\to 2^{T\times L}$
May-Set
Flow-Insensitive
Condition 1 is satisfied if $(t_0, l)\in\mathcal{CL}(t_d)$

Contributions

create(t1):
$\forall t\in \{t_1\}\cup\mathcal{DES}(t_1):\mathcal{CL}(t)\sqsupseteq\{\mathcal T\}\times\mathcal L$
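
The contribution rule above can be sketched as follows. This is a simplified model under stated assumptions, not the code of creationLockset.ml: thread ids and locks are strings, the flow-insensitive global is an ordinary map, and `on_create` and `lookup` are hypothetical helper names.

```ocaml
module SS = Set.Make (String)

(* Sets of (ancestor thread id, lock) pairs. *)
module PS = Set.Make (struct
  type t = string * string
  let compare = compare
end)

module SM = Map.Make (String)

(* CL(t), with the empty set for threads that received no contribution. *)
let lookup t cl = Option.value (SM.find_opt t cl) ~default:PS.empty

(* create(child) executed by [ego] while holding [lockset]:
   join {ego} x lockset into CL(t) for the child and all its descendants. *)
let on_create ~ego ~lockset ~child ~descendants cl =
  let pairs = SS.fold (fun l acc -> PS.add (ego, l) acc) lockset PS.empty in
  let contribute t g = SM.add t (PS.union (lookup t g) pairs) g in
  SS.fold contribute (SS.add child descendants) cl

let () =
  (* General example: t0 holds l at create(t1), and DES(t1) = {td}. *)
  let cl =
    on_create ~ego:"t0" ~lockset:(SS.singleton "l") ~child:"t1"
      ~descendants:(SS.singleton "td") SM.empty
  in
  assert (PS.elements (lookup "td" cl) = [ ("t0", "l") ]);
  assert (PS.elements (lookup "t1" cl) = [ ("t0", "l") ])
```

Because contributions are only ever joined in, $\mathcal{CL}$ is a may-set: a pair stays in $\mathcal{CL}(t)$ once any create statement has contributed it.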

Tainted Creation Lockset

$\mathcal{TCL}: T\to 2^{T\times L}$
May-Set
Flow-Insensitive
Condition 3 is satisfied if $(t_0, l)\notin\mathcal{TCL}(t_d)$

Contributions

unlock(l):
$\forall t\in \left(\mathcal C\cup\bigcup_{c\in\mathcal C}\mathcal{DES}(c) \right)\setminus\mathcal J:\mathcal{TCL}(t)\sqsupseteq \{(\mathcal T,l)\}$
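
A minimal sketch of this contribution, again with strings as thread ids and locks. The names `taint_targets` and `on_unlock` are hypothetical, and `des` stands in for the descendants query; the sketch only mirrors the set expression above and is not the PR's implementation.

```ocaml
module SS = Set.Make (String)

(* Sets of (ancestor thread id, lock) pairs. *)
module PS = Set.Make (struct
  type t = string * string
  let compare = compare
end)

module SM = Map.Make (String)

(* (C ∪ ⋃_{c ∈ C} DES(c)) \ J: created threads and their descendants
   that have not been must-joined yet. *)
let taint_targets ~created ~des ~joined =
  let reachable = SS.fold (fun c acc -> SS.union (des c) acc) created created in
  SS.diff reachable joined

(* unlock(lock) executed by [ego]: add (ego, lock) to TCL(t) for each target. *)
let on_unlock ~ego ~lock ~created ~des ~joined tcl =
  let add t g =
    SM.update t
      (function
        | None -> Some (PS.singleton (ego, lock))
        | Some old -> Some (PS.add (ego, lock) old))
      g
  in
  SS.fold add (taint_targets ~created ~des ~joined) tcl

let () =
  (* General example at unlock(l) in t0: C = {t1}, DES(t1) = {td}, J = {td}. *)
  let des t = if t = "t1" then SS.singleton "td" else SS.empty in
  let tcl =
    on_unlock ~ego:"t0" ~lock:"l" ~created:(SS.singleton "t1") ~des
      ~joined:(SS.singleton "td") SM.empty
  in
  assert (SM.mem "t1" tcl);      (* t1 was never joined, so it is tainted *)
  assert (not (SM.mem "td" tcl)) (* td was joined before the unlock *)
```

In the general example this leaves $t_d$ untainted while $t_1$ is tainted, which matches the claim that only $t_d$ is protected by $l$ there.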

Rules for MHP exclusion

Let $\mathcal{IL}(t):=\mathcal{CL}(t)\setminus\mathcal{TCL}(t)$.
Program points $s_1$ with $\mathcal T_1$, $\mathcal L_1$ and $\mathcal{IL}_1$ and $s_2$ with $\mathcal T_2$, $\mathcal L_2$ and $\mathcal{IL}_2$ cannot happen in parallel if at least one condition holds:

$\exists (t_a,l_a)\in\mathcal{IL}_1:l_a\in\mathcal L_2,t_a\neq \mathcal T_2$
$\exists (t_a,l_a)\in\mathcal{IL}_2:l_a\in\mathcal L_1,t_a\neq \mathcal T_1$
$\exists(t_{a1},l)\in\mathcal{IL}_1,(t_{a2},l)\in\mathcal{IL}_2: t_{a1}\neq t_{a2}$
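
The three rules can be written down directly as a predicate. The sketch below is illustrative only: the record type `point` and the function names are assumptions made for the example and do not correspond to the A module in this PR.

```ocaml
module SS = Set.Make (String)

(* Sets of (ancestor thread id, lock) pairs. *)
module PS = Set.Make (struct
  type t = string * string
  let compare = compare
end)

(* A program point with its ego thread id T, must-lockset L and IL(T). *)
type point = { tid : string; lockset : SS.t; il : PS.t }

(* IL(t) := CL(t) \ TCL(t) *)
let inter_threaded ~cl ~tcl = PS.diff cl tcl

(* Rules 1 and 2: an ancestor of [a] holds a lock that [b] also holds,
   and [b] is not that ancestor itself. *)
let protected_against a b =
  PS.exists (fun (ta, la) -> SS.mem la b.lockset && ta <> b.tid) a.il

(* Rule 3: both points are protected by the same lock, held by different ancestors. *)
let same_lock_distinct_ancestors a b =
  PS.exists
    (fun (ta1, l1) -> PS.exists (fun (ta2, l2) -> l1 = l2 && ta1 <> ta2) b.il)
    a.il

let cannot_happen_in_parallel s1 s2 =
  protected_against s1 s2 || protected_against s2 s1
  || same_lock_distinct_ancestors s1 s2

let () =
  (* t1 was created by t0 under l, and l was never tainted. *)
  let il = inter_threaded ~cl:(PS.singleton ("t0", "l")) ~tcl:PS.empty in
  let s1 = { tid = "t1"; lockset = SS.empty; il } in
  let other = { tid = "t2"; lockset = SS.singleton "l"; il = PS.empty } in
  let creator = { tid = "t0"; lockset = SS.singleton "l"; il = PS.empty } in
  assert (cannot_happen_in_parallel s1 other);
  assert (not (cannot_happen_in_parallel s1 creator))
```

The second assertion shows why the side condition $t_a\neq\mathcal T_2$ matters: the creator $t_0$ holds $l$ on behalf of $t_1$, so the lock does not separate $t_1$ from $t_0$ itself.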

Notes on non-unique thread ids

By requiring thread ids to include the full history and the creation point, we work around the problem of incorrectly marking two program points as sequential because of an ambiguous creation history. Notes on some edge cases:

Ambiguity due to multiple thread creations in one thread

graph TB;
    A((t1))-->B((t2));
    A-->B;

If $l$ were included in only some, but not all, of the locksets at the create(t2) statements in $t_1$, the analysis would not work. However, since we include the creation node in the thread id, this case is impossible: each create statement yields a different thread id.

Ambiguity due to diamond-like thread creations

graph TB;
    A((t1))-->B((t2));
    A-->C((t3));
    B-->D((t4));
    C-->D;

If $t_4$ were marked as protected by $t_2$ only, this would be incorrect, since a creation via $t_3$ is also possible. However, this cannot happen, since $t_4$ would have two different histories (each following one path of the diamond) and thus two different thread ids.
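
To make the diamond argument concrete, here is a toy model in which a thread id is simply the list of creation nodes leading to the thread. This is a deliberate simplification of the history thread domain, and `tid`, `main` and `create` are hypothetical names used only for this sketch.

```ocaml
(* A thread id as the sequence of creation nodes, oldest first. *)
type tid = string list

let main : tid = []

(* Creating a thread at [node] extends the creator's history. *)
let create (parent : tid) (node : string) : tid = parent @ [ node ]

let t1 = create main "n1"
let t2 = create t1 "n2"
let t3 = create t1 "n3"

(* The same create statement n4 reached along the two paths of the diamond. *)
let t4_via_t2 = create t2 "n4"
let t4_via_t3 = create t3 "n4"

(* Different histories give different thread ids. *)
let () = assert (t4_via_t2 <> t4_via_t3)
```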

Ambiguity due to circular thread creations

graph TB;
    A((t1))-->B((t2));
    B-->C((t3));
    C-->B;

Marking $t_2$ as protected by $t_3$ would be problematic, since the loop does not need to be entered at all. However, the first iteration of circular loops still receives a non-unique thread id which is not marked as a descendant of $t_3$, so problematic program points in $t_2$ are flagged as racing nevertheless.

dabund24

Some general questions:

Is there a way to enforce that configuration settings are set to certain values in order for the analysis to work (i.e. similar to how analyses can be declared as dependencies)? This would be necessary for the settings revolving around thread ids as described in the PR summary. Checking settings using GobConfig.get_string at the start of transfer functions doesn't really seem ideal to me.
Is there an ideal number of tests I should write, or can I write as many as I feel are necessary?

dabund24 · 2025-12-04T13:15:07Z

src/analyses/creationLockset.ml

+    match tid_lifted, child_tid_lifted with
+    | `Lifted tid, `Lifted child_tid ->
+      let descendants = descendants_closure child_ask child_tid in
+      let lockset = ask.f Queries.MustLockset in
+      let to_contribute = cartesian_prod (TIDs.singleton tid) lockset in
+      TIDs.iter (contribute_lock man to_contribute) descendants
+    | _ -> (* TODO deal with top or bottom? *) ()


What does it mean for a thread id to be top or bottom? Not sure how to deal with this here and in some other places

sim642 · 2025-12-05T08:22:17Z

> Is there a way to enforce configuration settings to be set to certain values in order for the analysis to work (i.e. similar to how analyses can be declared as dependencies)? This would be necessary for the settings revolving around thread ids as described in the pr summary. Checking settings using GobConfig.get_string at the start of transfer functions doesn't really seem ideal to me

Maingoblint.check_arguments has a bunch of ad-hoc checks like this.

sim642 · 2025-12-05T08:33:55Z

> is there an ideal amount of tests I should write or can I write as many as I feel to be necessary?

As many as necessary to cover all the functionality and ideally the corner cases of the domains/analyses.
We also have code coverage available, but it's not required, although it might be useful for finding untested stuff.

dabund24 · 2025-12-05T10:51:07Z

> > Is there a way to enforce configuration settings to be set to certain values in order for the analysis to work (i.e. similar to how analyses can be declared as dependencies)? This would be necessary for the settings revolving around thread ids as described in the pr summary. Checking settings using GobConfig.get_string at the start of transfer functions doesn't really seem ideal to me
>
> Maingoblint.check_arguments has a bunch of ad-hoc checks like this.

that's what I needed, nice :D
added in 13443f9

dabund24 · 2025-12-05T10:54:12Z

> > is there an ideal amount of tests I should write or can I write as many as I feel to be necessary?
>
> As many as necessary to cover all the functionality and ideally the corner cases of the domains/analyses. We also have code coverage available, but it's not required, although it might be useful for finding untested stuff.

I'm going to add some more in the coming days 👍

dabund24 · 2025-12-05T11:57:51Z

The CI builds failed due to an incorrect semicolon. I fixed this in f1efd32, so everything should be working now.

dabund24 added 5 commits October 17, 2025 10:27

add inter-procedural lock c files

7831cbf

add lock-fork hb relationship c file

3ef7082

use pthread_create() and pthread_join() instead of race macros for inter-proc lock regression tests

9d88584

activate creationLockset analysis for inter-threaded lock regressions tests

b7d0d35

initial version of creationLockset analysis

93a513a

sim642 added feature student-job precision labels Nov 5, 2025

dabund24 added 6 commits November 7, 2025 12:26

AncestorLocksetSpec as common base module

946536b

initial version of TaintedCreationLockset analysis

a78e44c

use thread domain instead of lifted thread domain

8ba9094

add threadJoins as dependency for TaintedCreationLockset analysis

ead4843

initial version of transitive descendants analysis

73e6d9c

some comments in transitiveDescendats analysis

f212c45

sim642 changed the title ~~Improve mhp precision using ancestor locksets~~ Improve MHP precision using ancestor locksets Nov 10, 2025

dabund24 added 15 commits November 11, 2025 12:29

query for descendant analysis

4633ec7

get rid of unnecessary match expression

3f3e2f3

MayCreationLockset query

691cdfd

InterThreadedLockset query

61d5c3f

fix incorrect query answer type in transitive descendants analysis

e1719b2

cartesian product helper functions

8720b23

remove unused function from TaintedCreationLocksetSpec

7c6e5d9

correct comment in tainted lockset analysis

61a48dc

replace threadset and lockset module references with shorthand

31ceff8

function for getting currently running tids

450d349

inter-threaded lockset A module

7d1fa1a

use topped set for global domain in AncestorLocksetSpec

3c52c76

replace comparison operators with equals function of domains

8b1727f

add creationLockset analysis to dependencies of taintedCreationLockset analysis

b0751dc

fix regression test files

09f28ba

dabund24 added 14 commits November 20, 2025 12:10

hash descendant thread query param

3a5513d

transitive version of (tainted) creation locksets

cecf244

add race and transitive descendants analyses as dependencies to creation lockset analyses

55184fe

regression tests for transitive creation locksets

611a91e

rename regression test for second case

c1ad680

add thread param to MayCreationLockset query

70dabb7

handle unlock of unknown mutex

0e63731

edit some comments

3514a37

Merge branch 'goblint:master' into master

19ce960

remove redundant must-ancestor check

61b9609

fix and update some comments

52ce9f0

minimize contributions to tcl when unlock of unknown thread is encountered

7f4e531

align naming style of A module with other modules

2158f23

Merge branch 'master' into master

e531002

dabund24 marked this pull request as ready for review December 4, 2025 13:04

remove irrelevant test case

85aad48

dabund24 added 2 commits December 4, 2025 14:57

add new modules to goblint_lib

4b9c2c7

add params to tests

2b62168

dabund24 added 5 commits December 5, 2025 11:05

remove comment/question concerning contexts

dbf73f1

align hash calls for threads in queries with other hash calls

e1299ea

move address to must lock conversion from mutex ghosts/creation lockset analyses to must lock domain

87dde05

should_print in A module

b8416ef

impose conditions on config before running creation lockset analyses

13443f9

remove bad semicolon

f1efd32
