
Regression in Gatling Mill build clean compile performance due to worker closing #3920

Open
lihaoyi opened this issue Nov 7, 2024 · 10 comments

Comments

@lihaoyi
Member

lihaoyi commented Nov 7, 2024

In Mill 0.11.12, ./mill clean && time ./mill -j1 __.compile on the example/thirdparty/gatling/ build used to start at about 30s, and when run repeatedly it would drop to 10-15s as the JVM warmed up. In Mill 0.12.0 and after, it seems to stick around 20-30s, which is a significant slowdown. We should investigate and figure out why.

On a cursory investigation, the total number of source files reported in the logs seems identical between the two versions, and the Netty Mill build does not seem to suffer from this slowdown (when I re-ran the benchmarks in #3918). Passing --ticker false to disable the new PromptLogger doesn't help either.

@lihaoyi
Member Author

lihaoyi commented Nov 7, 2024

Seems the regression happens somewhere between 0.12.0-RC1 and 0.12.0-RC2

@lefou
Member

lefou commented Nov 7, 2024

It could be the mill clean command, which now also cleans workers, so any hot Zinc compiler instance is dropped.

@lefou
Member

lefou commented Nov 7, 2024

Maybe we should not clean workers by default. We could introduce an option or a dedicated cleanWorker command.

@lihaoyi
Member Author

lihaoyi commented Nov 7, 2024

@lefou good catch, yes that probably explains the behavior I'm seeing. The Java compiler is probably fast enough that it doesn't matter for Netty, and in day-to-day work we don't clean everything that often, so the impact falls mostly on my benchmarks. The other alternative is to clean but preserve workers. We might need to think about this to decide what the proper thing to do is; if it's just a benchmarking problem, we might introduce a cleanNonWorkers command instead.

@lefou
Member

lefou commented Nov 7, 2024

I think cleaning workers isn't what users want. It's most likely a seldom-used operation to free up memory, close ports, or deal with a poorly implemented worker. Most of the time, when using clean, we don't want to clean workers, especially since shutting down the Mill server already resets workers. I vote to disable worker cleanup in clean and add a dedicated cleanWorker command. Alternatively, we could add an option such as clean --worker=yes.

@lihaoyi
Member Author

lihaoyi commented Nov 7, 2024

One issue about not cleaning workers I thought about: what if the workers depend on upstream tasks' output files? Won't the references to the upstream task outputs be invalidated? I thought such a failure mode would cause issues with any workers that load upstream classpath files into a classloader, so I'm not sure why that never caused problems in the past, when clean still preserved the workers. Maybe when the task graph is evaluated, any missing files end up getting re-generated anyway before the worker is used?

@lefou
Member

lefou commented Nov 7, 2024

Yeah, workers were always rebuilt when their inputs changed. But a clean typically does not result in changed input hashes, since workers usually only depend on rather stable versions or tooling classpaths. clean is typically used to prune the Mill caches, and chances are high that workers won't be re-triggered after a clean. Instead, workers might re-trigger after a version bump, even without a clean.
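
To make that concrete, here is a minimal sketch of the two cases in a build file (Mill 0.12.x-style syntax; ToolWorker, toolVersion, and the module shown are made up for illustration and are not Mill internals):

```scala
package build
import mill._, scalalib._

// Placeholder for the long-lived, in-memory state a worker keeps warm
// between evaluations, e.g. a compiler instance. Made up for this sketch.
class ToolWorker(toolVersion: String, upstreamClasses: Seq[os.Path])

object foo extends ScalaModule {
  def scalaVersion = "2.13.15"

  // A stable input: `clean` wipes cached outputs but does not change this
  // value, so a worker keyed only on it keeps the same input hash and is
  // not re-instantiated.
  def toolVersion = Task { "1.2.3" }

  // This worker also captures an upstream task's output path. It is only
  // rebuilt when its inputs' hashes change, e.g. after `compile` re-runs
  // with a different result, not merely because `clean` deleted files.
  def toolWorker = Task.Worker {
    new ToolWorker(toolVersion(), Seq(compile().classes.path))
  }
}
```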

@lihaoyi lihaoyi changed the title Regression in Gatling Mill build clean compile performance Regression in Gatling Mill build clean compile performance due to worker closing Nov 8, 2024
@lihaoyi
Member Author

lihaoyi commented Nov 8, 2024

From #3276, it seems like the issue is that we were wiping the worker out dir, because the naive clean does not discriminate about what it is deleting.

A possible workaround that preserves the workers would be to make clean resolve each task's metadata JSON before deleting it, preserving the dest folder if the task is a worker. That adds considerable complexity to clean, which currently has a shortcut when run without args, but it might be worth it.
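
A rough sketch of that idea (illustrative only, not Mill's actual clean implementation; the isWorkerTask predicate is hypothetical, since working out from the on-disk metadata whether a task is a worker is exactly the added complexity mentioned above):

```scala
// Uses os-lib (com.lihaoyi::os-lib), which Mill already depends on.
// Walks out/, and for each task metadata file <task>.json deletes it and
// the matching <task>.dest/ folder, unless the (hypothetical) predicate
// reports that the task is a worker.
def cleanPreservingWorkers(outDir: os.Path, isWorkerTask: os.Path => Boolean): Unit = {
  val metaFiles = os
    .walk(outDir, skip = _.last.endsWith(".dest")) // don't descend into dest dirs
    .filter(p => p.ext == "json" && os.isFile(p))
  for (meta <- metaFiles if !isWorkerTask(meta)) {
    val dest = meta / os.up / s"${meta.baseName}.dest"
    os.remove.all(meta)                       // cached metadata
    if (os.exists(dest)) os.remove.all(dest)  // the task's scratch/output dir
  }
}
```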

Another approach would be to provide another workflow to run my benchmarks. I think most users out there don't mind workers being wiped as part of clean, and as discussed in the original ticket it does kind of make sense to delete workers and their in-memory state.

We probably need to think about this a bit; since it probably doesn't impact users, we can take our time.

@roman-mibex-2

Yes, the origin of the worker stopping was that the working directory of a worker also got wiped.
So if a worker relied on its working directory, then a ./mill clean brought the build into an unstable and inconsistent state.

@lefou
Member

lefou commented Nov 8, 2024

Another approach would be to provide another workflow to run my benchmarks.

For the benchmark, you could try to run clean with a narrower selector like gatling.__, which should exclude workers living in external modules.

Additionally, we could try to improve our type selector support, which currently only works for modules. If we could request something in the spirit of clean __:^Worker or clean __:^WorkerType (where WorkerType is the T in Worker[T] or Task[T]), that would be great.
