perf: cache activation environment variables #1832

borchero · 2024-08-18T19:32:37Z

Motivation

Resolves #973.

This PR obviously misses some tests, I just wanted to check in early if this goes into the right direction.

ruben-arts · 2024-08-19T06:03:58Z

Thanks @borchero, cool feature. I'm a little worried about the invalidation as on windows this might not work as well. @baszalmstra can you give your feedback.

baszalmstra · 2024-08-19T15:20:16Z

I like this a lot! Thanks for the contribution.

The biggest problem is that some of these environment variables are initialized from system environment variables. For instance, when you add the compiler activation scripts on Windows the path to a specific MSVC compiler is added. However, when a new update of the compiler is installed (which happens from time to time) this path is no longer valid. The same goes for the system PATH variable, this can be updated but the changes would not be reflected in our cached entry.

I think it would be good to add a time-to-live to the cache entry so that the cache entry will be recomputed after a period of time (30 minutes?).

Maybe we can also invalidate the cache if certain environment variables no longer match what they were when the cache was created? I think the PATH variable is a prime example.

The entries should also be cleared on pixi clean, or is that already the case?

borchero · 2024-08-19T16:12:59Z

I think it would be good to add a time-to-live to the cache entry so that the cache entry will be recomputed after a period of time (30 minutes?).

I'm not sure this is the best solution to be honest. I think it introduces hard-to-debug issues that are non-deterministic over time 👀

That being said, I think it's important that we identify a set of items that need to be hashed. The lockfile is an obvious one. Using the PATH environment variable also makes sense to me. Eventually, I'm unsure how one can get around non-deterministic activation scripts (such as the compiler activation scripts you mentioned) 👀

baszalmstra · 2024-08-19T16:18:29Z

Yeah I agree a TTL is very much suboptimal.

Maybe we can start by identifying a larger set of environment variables to use as input? I guess its less anoying when you have a stale cache than an invalid one.

borchero · 2024-08-19T16:58:09Z

Maybe we can start by identifying a larger set of environment variables to use as input

That'd be great 😄 do you have any proposal where to start with this? I personally know very little about Windows and that's usually where most issues arise 🙃

baszalmstra · 2024-08-19T18:06:14Z

If you start the implementation and have a list somewhere Ill investigate what we need on windows!

ruben-arts · 2024-09-16T14:50:34Z

@borchero are you planning to work on this?

borchero added 2 commits August 18, 2024 21:30

perf: Cache activation environment variables

f14f9c8

Fix ci

6e1a3a3

baszalmstra self-assigned this Aug 20, 2024

ruben-arts mentioned this pull request Sep 17, 2024

Cache environment activation #973

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

perf: cache activation environment variables #1832

perf: cache activation environment variables #1832

borchero commented Aug 18, 2024

ruben-arts commented Aug 19, 2024

baszalmstra commented Aug 19, 2024

borchero commented Aug 19, 2024

baszalmstra commented Aug 19, 2024

borchero commented Aug 19, 2024

baszalmstra commented Aug 19, 2024

ruben-arts commented Sep 16, 2024

perf: cache activation environment variables #1832

Are you sure you want to change the base?

perf: cache activation environment variables #1832

Conversation

borchero commented Aug 18, 2024

Motivation

ruben-arts commented Aug 19, 2024

baszalmstra commented Aug 19, 2024

borchero commented Aug 19, 2024

baszalmstra commented Aug 19, 2024

borchero commented Aug 19, 2024

baszalmstra commented Aug 19, 2024

ruben-arts commented Sep 16, 2024