GitHub Action
Cache Nix store
A GitHub Action to cache Nix store paths using GitHub Actions cache.
This action is based on actions/cache.
- Restore and save the Nix store and/or other paths.
- Collect garbage in the Nix store before saving a new cache.
- Merge caches produced by several jobs.
- Purge caches created or last accessed at least the given time ago.
-
The nix-quick-install-action installs Nix and makes
/nix/store
owned by an unpriviliged user. -
Restore
phase:-
The
cache-nix-action
tries to restore a cache whose key is the same as the primary key. -
When it can't restore, the
cache-nix-action
tries to restore a cache whose key matches a prefix in a given list of key prefixes. -
The
cache-nix-action
restores all caches whose keys match some of the prefixes in another given list of key prefixes.
-
-
Other job steps run.
-
Post Restore
phase:-
The
cache-nix-action
purges caches whose keys are the same as the primary key and that were created more than a given time ago. -
When there's no cache whose key is the same as the primary key, the
cache-nix-action
collects garbage in the Nix store and saves a new cache. -
The
cache-nix-action
purges caches whose keys match some of the given prefixes in a given list of key prefixes and that were created more than a given time ago.
-
- Requires
nix-quick-install-action
. - Supports only
Linux
andmacOS
runners. GitHub
allows only10GB
of caches and then removes the least recently used entries (see its eviction policy). Workarounds:- Nix store size is limited by a runner storage size (link). Workaround:
- The jlumbroso/free-disk-space action frees
~30GB
of disk space in several minutes.
- The jlumbroso/free-disk-space action frees
- Caches are isolated for restoring between refs (link). Workaround:
- Provide caches for PRs on the default or base branches.
- For purging, a workflow requires the permission
action: write
and thetoken
must have arepo
scope (link). - Purges caches scoped to the current GITHUB_REF.
- Purge time is calculated relative to the start of the
Post Restore
phase of this action. - Purges caches by keys without considering caches versions (see Cache version).
See Caching Approaches.
- uses: nixbuild/nix-quick-install-action@v26
with:
nix_conf: |
substituters = https://cache.nixos.org/ https://nix-community.cachix.org
trusted-public-keys = cache.nixos.org-1:6NCHdD59X431o0gWypbMrAURkbJ16ZPMQFGspcDShjY= nix-community.cachix.org-1:mB9FSh9qf2dCimDSUo8Zy7bkq5CX+/rkCWyvRCYg3Fs=
keep-outputs = true
- name: Restore and cache Nix store
uses: nix-community/cache-nix-action@v5
with:
primary-key: nix-${{ runner.os }}-${{ hashFiles('**/*.nix') }}
restore-prefixes-first-match: nix-${{ runner.os }}-
purge: true
purge-prefixes: cache-${{ matrix.os }}-
purge-created: 42
purge-last-accessed: 42
- Due to
gc-max-store-size-linux: 1073741824
, onLinux
runners, Nix store will be garbage collected until its size reaches1GB
or until there's nothing to garbage collect. - Since
gc-max-store-size-macos
isn't set to a number, onmacOS
runners, Nix store won't be garbage collected. - The
cache-nix-action
will purge caches:- with a key prefix
cache-${{ matrix.os }}-
AND (created more than42
seconds ago OR last accessed more than42
seconds ago).
- with a key prefix
See action.yml.
name | description | default | required |
---|---|---|---|
|
|
| |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| The action uses it to communicate with GitHub API. |
|
|
name | description |
---|---|
|
|
|
|
|
|
|
|
|
|
|
|
- Use action-tmate to debug on a runner via SSH.
On Linux
runners, when gc-max-store-size-linux
is set to a number, the cache-nix-action
will run nix store gc --max R
before saving a cache.
Here, R
is max(0, S - gc-max-store-size-linux)
, where S
is the current store size.
Respective conditions hold for macOS
runners.
There are alternative approaches to garbage collection (see Garbage collection).
The cache-nix-action
allows to delete old caches after saving a new cache (see purge-*
inputs in Inputs and the compare-run-times
job in the Example workflow).
The purge-cache action allows to remove caches based on their last accessed
or created
time without branch limitations.
Alternatively, you can use the GitHub Actions Cache API.
GitHub
evicts least recently used caches when their total size exceeds 10GB
(see Limitations).
If you have multiple similar caches produced on runners with the same OS (Linux
or macOS
), you can merge them into a single cache and store just it to save space.
In short:
- Matrix jobs produce similar individual caches.
- The next job restores all of these individual caches, saves a common cache, and purges individual caches.
- On subsequent runs, matrix jobs use the common cache.
See the make-similar-caches
and merge-similar-caches
jobs in the example workflow.
Pros: if N
individual caches are very similar, a common cache will take approximately N
times less space.
Cons: if caches aren't very similar, run time may increase due to a bigger common cache.
Discussed in more details here and here.
Caching approaches work at different "distances" from /nix/store
of GitHub Actions runner.
These distances affect the restore and save speed.
Pros:
- Free.
- Uses
GitHub Actions Cache
and works fast. - Easy to set up.
- Allows to save a store of at most a given size (see Garbage collection parameters).
- Allows to save outputs from garbage collection (see Garbage collection).
- When there's a cache hit, restoring from a GitHub Actions cache can be faster than downloading multiple paths from binary caches.
- You can compare run times of jobs with and without store caching in Actions.
- Open a run and click on the time under
Total duration
.
- Open a run and click on the time under
- You can compare run times of jobs with and without store caching in Actions.
Cons: see Limitations
Pros (link):
- Free.
- Uses
GitHub Actions Cache
and works fast. - Easy to set up.
- Restores and saves paths selectively.
Cons:
- Collects telemetry (link)
- May trigger rate limit errors (link).
- Follows the GitHub Actions Cache semantics (link).
- Caches are isolated between branches (link).
- Saves a cache for each path in a store and quickly litters
Caches
.
If used with nix-quick-install-action, it's similar to the cache-nix-action.
If used with install-nix-action and a chroot local store:
Pros:
- Quick restore and save
/tmp/nix
.
Cons:
- Slow nix copy from
/tmp/nix
to/nix/store
.
If used with install-nix-action and this trick, it's similar to the cache-nix-action, but slower (link).
See binary cache, HTTP Binary Cache Store.
Pros:
- Restore and save paths selectively.
- Provide least recently used garbage collection strategies (cachix, attic).
- Don't cache paths available from the NixOS cache (cachix).
- Allow to share paths between projects (cachix).
Cons:
- Have limited free storage (cachix gives 5GB for open-source projects).
- Need good bandwidth for receiving and pushing paths over the Internet.
- Can be down.
When restoring a Nix store from a cache, the store may contain old unnecessary paths. These paths should be removed sometimes to limit cache size and ensure the fastest restore/save steps.
Produce a cache once, use it multiple times. Don't collect garbage.
Advantages:
- Unnecessary paths are saved to a cache only during a new save.
Disadvantages:
- Unnecessary paths can accumulate between new saves.
- A job at the firs run produces a path
A
and saves a cache. - The job at the second run restores the cache, produces a path
B
, and saves a cache. The cache has bothA
andB
. - etc.
- A job at the firs run produces a path
Collect garbage before saving a cache.
Advantages:
- Automatically keep cache at a minimal/limited size
Disadvantages:
- No standard way to gc only old paths.
- Use
nix profile install
to save installables from garbage collection. - Keep inputs (see this issue and this issue).
- Start direnv in background.
- Use nix-heuristic-gc for cache eviction via
atime
. - gc via gc roots nix-cache-cut.
- gc based on time cache-gc.
-
Improve README.
-
Report errors, suggest improvements in issues.
-
Upgrade code.
- Read about JavaScript actions
- See main files:
-
Upgrade docs.
-
Edit action.nix.
-
Update
action.yml
-s andREADME.md
-s:nix run .#write
-
-
Update actions-toolkit. It was added via
git subtree
. See tutorial.
See "Caching dependencies to speed up workflows".
- Added support for caching in GHES 3.5+.
- Fixed download issue for files > 2GB during restore.
- Updated the minimum runner version support from node 12 -> node 16.
- Fixed avoiding empty cache save when no files are available for caching.
- Fixed tar creation error while trying to create tar with path as
~/
home folder onubuntu-latest
. - Fixed zstd failing on amazon linux 2.0 runners.
- Fixed cache not working with github workspace directory or current directory.
- Fixed the download stuck problem by introducing a timeout of 1 hour for cache downloads.
- Fix zstd not working for windows on gnu tar in issues.
- Allowing users to provide a custom timeout as input for aborting download of a cache segment using an environment variable
SEGMENT_DOWNLOAD_TIMEOUT_MINS
. Default is 10 minutes. - New actions are available for granular control over caches - restore and save.
- Added option to fail job on cache miss. See Exit workflow on cache miss for more info.
- Fix zstd not being used after zstd version upgrade to 1.5.4 on hosted runners
- Added option to lookup cache without downloading it.
- Reduced segment size to 128MB and segment timeout to 10 minutes to fail fast in case the cache download is stuck.
See the v2 README.md for older updates.
Create a workflow .yml
file in your repository's .github/workflows
directory. An example workflow is available below. For more information, see the GitHub Help Documentation for Creating a workflow file.
If you are using this inside a container, a POSIX-compliant tar
needs to be included and accessible from the execution path.
If you are using a self-hosted
Windows runner, GNU tar
and zstd
are required for Cross-OS caching to work. They are also recommended to be installed in general so the performance is on par with hosted
Windows runners.
SEGMENT_DOWNLOAD_TIMEOUT_MINS
- Segment download timeout (in minutes, default10
) to abort download of the segment if not completed in the defined number of minutes. Read more
The cache is scoped to the key, version, and branch. The default branch cache is available to other branches.
See Matching a cache key for more info.
name: Caching Primes
on: push
jobs:
build:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
- name: Cache Primes
id: cache-primes
uses: actions/cache@v3
with:
primary-key: ${{ runner.os }}-primes
paths: prime-numbers
- name: Generate Prime Numbers
if: steps.cache-primes.outputs.cache-hit != 'true'
run: /generate-primes.sh -d prime-numbers
- name: Use Prime Numbers
run: /primes.sh -d prime-numbers
The cache
action provides a cache-hit
output which is set to true
when the cache is restored using the primary key
and false
when the cache is restored using restore-keys
or no cache is restored.
name: Caching Primes
on: push
jobs:
build:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
- name: Restore cached Primes
id: cache-primes-restore
uses: nix-community/cache-nix-action/restore@v5
with:
primary-key: ${{ runner.os }}-primes
paths: |
path/to/dependencies
some/other/dependencies
# other steps
- name: Save Primes
id: cache-primes-save
uses: nix-community/cache-nix-action/save@v5
with:
primary-key: ${{ steps.cache-primes-restore.outputs.cache-primary-key }}
paths: |
path/to/dependencies
some/other/dependencies
Note You must use the
cache
orrestore
action in your workflow before you need to use the files that might be restored from the cache. If the providedkey
matches an existing cache, a new cache is not created and if the providedkey
doesn't match an existing cache, a new cache is automatically created provided the job completes successfully.
With the introduction of the restore
and save
actions, a lot of caching use cases can now be achieved. Please see the caching strategies document for understanding how you can use the actions strategically to achieve the desired goal.
Every programming language and framework has its own way of caching.
See Examples for a list of actions/cache
implementations for use with:
- C# - NuGet
- Clojure - Lein Deps
- D - DUB
- Deno
- Elixir - Mix
- Go - Modules
- Haskell - Cabal
- Haskell - Stack
- Java - Gradle
- Java - Maven
- Node - npm
- Node - Lerna
- Node - Yarn
- OCaml/Reason - esy
- PHP - Composer
- Python - pip
- Python - pipenv
- R - renv
- Ruby - Bundler
- Rust - Cargo
- Scala - SBT
- Swift, Objective-C - Carthage
- Swift, Objective-C - CocoaPods
- Swift - Swift Package Manager
- Swift - Mint
A cache key can include any of the contexts, functions, literals, and operators supported by GitHub Actions.
For example, using the hashFiles
function allows you to create a new cache when dependencies change.
- uses: nix-community/cache-nix-action@v5
with:
primary-key: ${{ runner.os }}-${{ hashFiles('**/lockfiles') }}
paths: |
path/to/dependencies
some/other/dependencies
Additionally, you can use arbitrary command output in a cache key, such as a date or software version:
# http://man7.org/linux/man-pages/man1/date.1.html
- name: Get Date
id: get-date
run: echo "date=$(/bin/date -u "+%Y%m%d")" >> $GITHUB_OUTPUT
shell: bash
- uses: nix-community/cache-nix-action@v5
with:
primary-key: ${{ runner.os }}-${{ steps.get-date.outputs.date }}-${{ hashFiles('**/lockfiles') }}
paths: path/to/dependencies
See Using contexts to create cache keys
A repository can have up to 10GB of caches. Once the 10GB limit is reached, older caches will be evicted based on when the cache was last accessed. Caches that are not accessed within the last week will also be evicted.
Using the hit-primary-key
output, subsequent steps (such as install or build) can be skipped when a cache hit occurs on the primary key. It is recommended to install missing/updated dependencies in case of a partial key match when the key is dependent on the hash
of the package file.
Example:
steps:
- uses: actions/checkout@v3
- uses: nix-community/cache-nix-action@v5
id: cache
with:
primary-key: ${{ runner.os }}-${{ hashFiles('**/lockfiles') }}
path: path/to/dependencies
- name: Install Dependencies
if: steps.cache.outputs.hit-primary-key != true
run: /install.sh
Note The
id
defined innix-community/cache-nix-action
must match the[id]
in theif
statement (i.e.steps.[id].outputs.hit-primary-key
)
Cache version is a hash generated for a combination of compression tool used (Gzip, Zstd, etc. based on the runner OS) and the path
of directories being cached. If two caches have different versions, they are identified as unique caches while matching. This, for example, means that a cache created on a windows-latest
runner can't be restored on ubuntu-latest
as cache Version
s are different.
Pro tip: The list caches API can be used to get the version of a cache. This can be helpful to troubleshoot cache miss due to version.
Example
The workflow will create 3 unique caches with same keys. Ubuntu and windows runners will use different compression technique and hence create two different caches. And `build-linux` will create two different caches as the `paths` are different.jobs:
build-linux:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
- name: Cache Primes
id: cache-primes
uses: nix-community/cache-nix-action@v5
with:
primary-key: primes
paths: prime-numbers
- name: Generate Prime Numbers
if: steps.cache-primes.outputs.cache-hit != 'true'
run: ./generate-primes.sh -d prime-numbers
- name: Cache Numbers
id: cache-numbers
uses: nix-community/cache-nix-action@v5
with:
primary-key: primes
paths: numbers
- name: Generate Numbers
if: steps.cache-numbers.outputs.cache-hit != 'true'
run: ./generate-primes.sh -d numbers
build-windows:
runs-on: windows-latest
steps:
- uses: actions/checkout@v3
- name: Cache Primes
id: cache-primes
uses: nix-community/cache-nix-action@v5
with:
nix: false
primary-key: primes
paths: prime-numbers
- name: Generate Prime Numbers
if: steps.cache-primes.outputs.cache-hit != 'true'
run: ./generate-primes -d prime-numbers
There are a number of community practices/workarounds to fulfill specific requirements. You may choose to use them if they suit your use case. Note these are not necessarily the only solution or even a recommended solution.
A cache gets downloaded in multiple segments of fixed sizes (1GB
for a 32-bit
runner and 2GB
for a 64-bit
runner). Sometimes, a segment download gets stuck which causes the workflow job to be stuck forever and fail. Version v3.0.8
of actions/cache
introduces a segment download timeout. The segment download timeout will allow the segment download to get aborted and hence allow the job to proceed with a cache miss.
Default value of this timeout is 10 minutes and can be customized by specifying an environment variable named SEGMENT_DOWNLOAD_TIMEOUT_MINS
with timeout value in minutes.
A cache today is immutable and cannot be updated. But some use cases require the cache to be saved even though there was a hit during the Restore phase
. To do so, always purge old versions of that cache:
- name: update cache on every commit
uses: actions/cache@v3
with:
primary-key: primes-${{ runner.os }}
paths: prime-numbers
purge: true
purge-primary-key: always
Please note that this will create a new cache on every run and hence will consume the cache quota.
Reusing cache across feature branches is not allowed today to provide cache isolation. However if both feature branches are from the default branch, a good way to achieve this is to ensure that the default branch has a cache. This cache will then be consumable by both feature branches.
Caches have branch scope restriction in place. This means that if caches for a specific branch are using a lot of storage quota, it may result into more frequently used caches from default
branch getting thrashed. For example, if there are many pull requests happening on a repo and are creating caches, these cannot be used in default branch scope but will still occupy a lot of space till they get cleaned up by eviction policy. But sometime we want to clean them up on a faster cadence so as to ensure default branch is not thrashing. In order to achieve this, gh-actions-cache cli can be used to delete caches for specific branches.
This workflow uses gh-actions-cache
to delete all the caches created by a branch.
Example
name: cleanup caches by a branch
on:
pull_request:
types:
- closed
workflow_dispatch:
jobs:
cleanup:
runs-on: ubuntu-latest
permissions:
# `actions:write` permission is required to delete caches
# See also: https://docs.github.com/en/rest/actions/cache?apiVersion=2022-11-28#delete-a-github-actions-cache-for-a-repository-using-a-cache-id
actions: write
contents: read
steps:
- name: Check out code
uses: actions/checkout@v3
- name: Cleanup
run: |
gh extension install actions/gh-actions-cache
REPO=${{ github.repository }}
BRANCH=refs/pull/${{ github.event.pull_request.number }}/merge
echo "Fetching list of cache key"
cacheKeysForPR=$(gh actions-cache list -R $REPO -B $BRANCH | cut -f 1 )
## Setting this to not fail the workflow while deleting cache keys.
set +e
echo "Deleting caches..."
for cacheKey in $cacheKeysForPR
do
gh actions-cache delete $cacheKey -R $REPO -B $BRANCH --confirm
done
echo "Done"
env:
GH_TOKEN: ${{ secrets.GITHUB_TOKEN }}
The scripts and documentation in this project are released under the MIT License