
Experiment: Attempt to run tests in parallel in github actions. #1407


Draft: jwboth wants to merge 10 commits into develop

Conversation

@jwboth (Contributor) commented Apr 13, 2025

Proposed changes

This PR aims at utilizing the parallel environment on GitHub. Currently, the tests are run sequentially. Exploiting pytest-xdist, it seems that one can invoke up to 4 workers to run the tests in parallel.
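For reference, a minimal invocation of this kind might look as follows (the worker count of 4 is the figure mentioned above; `-n auto` would let pytest-xdist pick one):

```sh
# Install the plugin alongside the usual test dependencies.
pip install pytest-xdist

# Run the suite on four worker processes instead of sequentially.
pytest -n 4 tests/
```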

The tutorials pass the parallel tests, and the run time is cut somewhat.

The remaining tests seem to have some trouble when, e.g., external files need to be accessed; four tests fail. Still, the runtime seems to be cut.

@keileg @IvarStefansson if this is of interest, I can try to understand what goes wrong in the failing tests.

Types of changes

What types of changes does this PR introduce to PorePy?
Put an x in the boxes that apply.

  • Minor change (e.g., dependency bumps, broken links).
  • Bugfix (non-breaking change which fixes an issue).
  • New feature (non-breaking change which adds functionality).
  • Breaking change (fix or feature that would cause existing functionality to not work as expected).
  • Testing (contribution related to testing of existing or new functionality).
  • Documentation (contribution related to adding, improving, or fixing documentation).
  • Maintenance (e.g., improve logic and performance, remove obsolete code).
  • Other:

Checklist

Put an x in the boxes that apply or explain briefly why the box is not relevant.

  • The documentation is up-to-date.
  • Static typing is included in the update.
  • This PR does not duplicate existing functionality.
  • The update is covered by the test suite (including tests added in the PR).
  • If new skipped tests have been introduced in this PR, pytest was run with the --run-skipped flag.

@jwboth marked this pull request as draft April 13, 2025 06:22
@keileg (Contributor) commented Apr 14, 2025

@Yuriyzabegaev and I looked into this some months ago. The problem is all tests that interact with gmsh, where the common file name gmsh_frac_file leads to all kinds of trouble. A solution may be to assign random names to files during testing. I'll do some experiments now.
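A minimal sketch of that idea (the helper and its call site are hypothetical, and a uuid suffix stands in for the random number; only the pattern of test-only randomization matters):

```python
import os
import uuid


def gmsh_file_name(base: str = "gmsh_frac_file") -> str:
    """Return the gmsh file name, randomized only while pytest runs.

    PYTEST_CURRENT_TEST is set by pytest for the duration of each test,
    so regular (non-test) use keeps the plain, predictable name.
    """
    if "PYTEST_CURRENT_TEST" in os.environ:
        # Random hex suffix so concurrent workers write distinct files.
        return f"{base}_{uuid.uuid4().hex}"
    return base
```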

@jwboth (Contributor, Author) commented Apr 14, 2025

> @Yuriyzabegaev and I looked into this some months ago. The problem is all tests that interact with gmsh, where the common file name gmsh_frac_file leads to all kinds of trouble. A solution may be to assign random names to files during testing. I'll do some experiments now.

Cool. I am looking forward to the outcome of the currently running tests!

It all makes sense, and is consistent with my experience running many simulations side by side. These look like classical race conditions. In my simple cases there was a simple fix of adding a few seconds of waiting time (I also used dedicated file names for gmsh files, but if I remember correctly this was not sufficient; I did not dig too deep, as it worked in the end). Here, I find it is not so trivial.

@keileg (Contributor) commented Apr 14, 2025

I pushed a change that assigns random names to gmsh files during testing, but not in regular use. The good news is that the testing time is almost cut in half locally (which is in a sense disappointing given four workers, but I'll take the improvement). The not-so-good news:

  1. There are occasional random errors from tests that seem to trace back to conflicting gmsh file names. The file names are assigned a random number between 1 and 10^6 (or was it ^8), so the chance of collision should be minuscule, but it could be that the workers all start with the same random seed (see the seeding sketch after this list).
  2. There are also issues with some vtk exporting. I'll leave these to you, @jwboth.
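If the identical-seed hypothesis holds, seeding each worker differently would rule it out. A minimal sketch for a conftest.py, using the PYTEST_XDIST_WORKER variable that pytest-xdist sets in each worker process (the fixture itself is illustrative):

```python
# conftest.py
import os
import random

import pytest


@pytest.fixture(autouse=True)
def _per_worker_seed():
    # pytest-xdist names its workers "gw0", "gw1", ...; fall back to a
    # fixed label when running without -n.
    worker = os.environ.get("PYTEST_XDIST_WORKER", "gw0")
    # Mixing the worker id and pid into the seed keeps the random
    # streams of concurrent workers disjoint.
    random.seed(f"{worker}-{os.getpid()}")
```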

It remains to be seen whether we also get a meaningful speedup on GH Actions, but local speedups help too.

In total, I would say this is worth pursuing a bit more, but not necessarily with very high priority right now. Thoughts?

@jwboth (Contributor, Author) commented Apr 14, 2025

> I pushed a change that assigns random names to gmsh files during testing, but not in regular use. The good news is that the testing time is almost cut in half locally (which is in a sense disappointing given four workers, but I'll take the improvement). The not-so-good news:
>
>   1. There are occasional random errors from tests that seem to trace back to conflicting gmsh file names. The file names are assigned a random number between 1 and 10^6 (or was it ^8), so the chance of collision should be minuscule, but it could be that the workers all start with the same random seed.
>   2. There are also issues with some vtk exporting. I'll leave these to you, @jwboth.
>
> It remains to be seen whether we also get a meaningful speedup on GH Actions, but local speedups help too.
>
> In total, I would say this is worth pursuing a bit more, but not necessarily with very high priority right now. Thoughts?

I would suggest one more attempt. Look here: https://pytest-xdist.readthedocs.io/en/stable/distribution.html

It is possible to control the distribution. Without having done any deeper analysis, my intuition says the conflicts arise when certain tests run in parallel, and the likelihood that those tests come from the same file may be quite high. One can request that tests from the same file be run by the same worker, again via a simple flag on the run command.
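Concretely, per the pytest-xdist documentation linked above, the flag in question is:

```sh
# Send all tests from one file to the same worker, so file-based
# fixtures no longer race across workers.
pytest -n 4 --dist loadfile
```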

@jwboth (Contributor, Author) left a comment


Distribute tests from the same file to the same worker; this should hopefully fix the vtk tests.

Distribute tests according to file.
Allow for aggressive distribution of tutorials in testing.
@jwboth (Contributor, Author) commented Apr 14, 2025

> Distribute tests from the same file to the same worker; this should hopefully fix the vtk tests.

The last test run succeeded. Indeed, checking the tutorials now with aggressive distribution of tasks seems to save ~1 minute using 2 workers. The standard test suite with conservative distribution shows no groundbreaking speedup, but still saves 2-3 minutes. Just to test the potential, I will run one experiment with aggressive distribution (and, I assume, massively many failures).
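For concreteness, the conservative and aggressive modes referred to here presumably map to pytest-xdist's distribution flags as follows (worker counts are illustrative):

```sh
# Conservative: whole test files pinned to a single worker each.
pytest -n 2 --dist loadfile

# Aggressive: individual tests handed to whichever worker is free.
pytest -n 2 --dist load
```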

Experiment with aggressive distribution of tests
@Yuriyzabegaev (Contributor) commented

> I pushed a change that assigns random names to gmsh files during testing, but not in regular use. The good news is that the testing time is almost cut in half locally (which is in a sense disappointing given four workers, but I'll take the improvement). The not-so-good news:
>
>   1. There are occasional random errors from tests that seem to trace back to conflicting gmsh file names. The file names are assigned a random number between 1 and 10^6 (or was it ^8), so the chance of collision should be minuscule, but it could be that the workers all start with the same random seed.
>   2. There are also issues with some vtk exporting. I'll leave these to you, @jwboth.
>
> It remains to be seen whether we also get a meaningful speedup on GH Actions, but local speedups help too.
>
> In total, I would say this is worth pursuing a bit more, but not necessarily with very high priority right now. Thoughts?

I was also thinking along these lines when we investigated this. As a side note, could we avoid the random collisions by naming files after the test, rather than at random? The test name can be accessed from the same PYTEST_CURRENT_TEST environment variable.
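A minimal sketch of that suggestion; PYTEST_CURRENT_TEST holds a string like `tests/test_meshing.py::test_foo[param] (call)`, which needs sanitizing before use in a file name (the helper name is hypothetical):

```python
import os
import re


def gmsh_file_name_from_test(base: str = "gmsh_frac_file") -> str:
    """Derive a deterministic, per-test file name from the test id."""
    current = os.environ.get("PYTEST_CURRENT_TEST", "")
    if not current:
        return base  # Regular, non-test use keeps the plain name.
    # Keep only file-system-safe characters from the test id.
    safe = re.sub(r"[^A-Za-z0-9_]+", "_", current)
    return f"{base}_{safe}"
```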

@keileg (Contributor) commented Apr 15, 2025

> I was also thinking along these lines when we investigated this. As a side note, could we avoid the random collisions by naming files after the test, rather than at random? The test name can be accessed from the same PYTEST_CURRENT_TEST environment variable.

If so, the file name would also need to include information on the test parametrization. Better, then, to generate a temporary file using techniques like this.
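The link appears lost in this copy; it presumably points at something like Python's tempfile module or pytest's built-in tmp_path fixture. A minimal sketch of the latter, with a placeholder test body:

```python
def test_mesh_generation(tmp_path):
    # tmp_path is a unique, per-test directory provided by pytest, so
    # neither parallel workers nor parametrized cases can collide.
    mesh_file = tmp_path / "gmsh_frac_file.msh"
    mesh_file.write_text("")  # Placeholder for the actual gmsh call.
    assert mesh_file.exists()
```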

@IvarStefansson (Contributor) commented
When reading through the discussion, I wondered whether the times.json files might also interfere with each other. Just a thought.

@jwboth (Contributor, Author) commented May 23, 2025

I did not investigate the latest suggestions, as the current setup seems to have worked (for the few test runs above). The latest failing tests were merely due to mypy errors, which are fixed now. A simple comparison with a current PR shows that the time gains are indeed not too large: the tutorials see no real speedup (possibly <1 minute out of 6), and the remaining tests are ca. 2-3 minutes faster against a 10-11 minute reference.
