taskrunner

Features

Runs tasks (which are defined as shell scripts)
Ensures only one instance of a task runs at a time (globally in the whole system) - using file-based locks
Tasks can run in parallel
Output from parallel tasks is correctly annotated with original task name
Output lines are timestamped
Parallelism can be nested
- But task name annotations are flattened, i.e. are not added repeatedly at each level
Output from each task is also collected to a separate file, to easier debug that particular task
Tasks can have specified inputs
- Input can be a file, an environment variable, or result of an arbitrary command
If a task is invoked again for the same input (on a given machine), it is not re-run
Tasks can have specified outputs (files)
Task outputs can be cached in an object store
If a task is invoked with inputs for which it was already computed on CI, the result is fetched from remote cache and task is not re-run
If a task is called multiple times in the same execution, it is only executed once.
When running on CI:
- Task output (stdout and stderr) is uploaded to an object store after task is finished
- task status is reported as a GitHub check for a commit
  - pending while it's running
  - success or failed when finished
- GitHub check details link points to the uploaded task output
Unstable inputs (i.e. inputs that have changed during the job execution) are detected
- TODO: do something about input-output files like package-lock.json
Tasks can have command-line arguments
- But some options are meant for the task runner, such as -f
Can force reexecution of a specific task (excl. dependencies) (-f)
- Note: in previous implementation -f forced reexecution of all tasks. This seems less useful, will be under another option.
Task can be cancelled using SIGINT or SIGTERM, and state is maintained appropriately
Fast - if there's nothing to do, returns quickly (<1s, ideally <300ms)

"Prime cache" mode

When migrating from another system behind a flag, it is sometimes desirable to build on the old system but still fill remote cache one the new one. For that occasion, a special "prime cache mode" is there.

It modifies the behavior in the following way:

snapshot never downloads remote cache (incl. fuzzy) - to avoid overwriting stuff (which we assume is already built via another mechanism)
snapshot always skips the job
remote cache is uploaded, despite job being skipped

To use it, first build using another system, and the run taskrunner with TASKRUNNER_PRIME_CACHE_MODE=1

Directory structure

$TASKRUNNER_STATE_DIRECTORY (default: /tmp/taskrunner)
- locks - global locks per job
  - ${jobName}.lock - job lock file, job takes the lock when running
- hash - hashes of inputs of already-done jobs
  - ${jobName}.hash - first line is hash, rest is hash input (for debugging)
- builds - build state directory for each build. Each toplevel invocation creates a subdirectory here.
  - ${buildId} - state dir of a specific build. buildId is derived from the invocation time.
    - logs - logs produced by jobs in that build.
      - ${jobName}.log - log, without ANSI sequences stripped
    - results - per-build cache of job results (we don't re-run jobs twice inside a build, even without snapshot)
      - ${jobName} - file with status code of the job

Configuration variables

TASKRUNNER_DEBUG - whether to output debug messages to toplevel output. Note that debug messages are always written to per-task logs, regardless of this setting.
TASKRUNNER_LOG_INFO - whether to output "info" messages to toplevel output. They are minimal messages, produced only when there's actually something to be done (including fetching from cache).
more...

Possible features

Support stdin? For now redirected from /dev/null
Quiet output like gradle, only report what is running and progress, not full output, and no output if nothing to do
Marker files - additional hash file in .stack-work, node_modules etc., so that if that dir is cleared, we redo the action
- Or: remember hash of some of the output files and check they're still there
- Only some because there can be benign changes
Can dump enough info to reproduce failures
- For example: hashes of inputs, caches etc.
Generate a trace (otlp for analysis, or render to a gantt chart)

Things we should do better than previous version

less confusing output for cache miss (no "error")
??? Something, can't recall now
--cmd replaced with --raw, since we can't really execute in the context of the original script

Things to handle

Task leaks stdin/out/err handle - have a timeout on draining output
Parallel task failed and we're killed - report status correctly
snapshot - how to communicate with controller process?
- pipe and pass fd to child process?
- named pipe and pass name to child process via env?
Nested tasks - each should write to original stdout
Unmerged files when hashing
bad usage of snapshot - e.g. called twice
why ls-tree -r is needed - git option of quoting
save cache tar error

Misc TODO

Better output of error messages (to normal streams)
String/Text unification
Debugging - show hash input
More specialized tests for input handling
In parent task's log, add reference to nested log file
Debugging aid: when replacing saved hash, show diff between old and new hash input (or save old hash input to compare)
Bug: pendolino sometimes rebuilds randomly with scripts/UPDATE
- Probable cause: helper generation races with its input hashing
  - nope, it generates in another directory
More tests for interaction between remote cache and local hash, especially:
- restoring remote cache should also store local hash, but not store remote cache again
test for root dir != cwd
test for commit status
Somehow test content-type in log upload?

Output principles (output generated by runner itself)

"quiet" operation - no output except when an error happens
standard operation ("info" mode):
- when job does nothing (already done locally), no output
- when resuming from cache, output one line for start (so that we know something's happening), one for done
- when running, output one line for start, one for done - only for snapshottable jobs
debug: log everything (maybe later categorize)

Performance goals

Previous impl no-op tests/scripts/UPDATE: ~1.6s
Current impl no-op tests/scripts/UPDATE: ~2.3s

Snapshot Command Flags

The snapshot command supports the following flags:

--outputs: Specifies files to be cached in remote cache.
--cache-success: Use remote cache even when no outputs are specified. The task is not rerun if it succeeded previously with the same inputs. Useful e.g. for test suites.
--raw: Specifies raw input strings that are used to compute the task's hash.
--fuzzy-cache: Enables the use of a fuzzy cache, which attempts to restore from a cache of a similar task if the exact cache is not available.
--cache-root: Specifies the root directory for caching. Use when caching things outside of the repository, e.g. ~/.stack.
--cache-version: Specifies a version string for the cache. --fuzzy-cache will not download cache from another version, allowing clean breaks when making big changes, e.g. upgrading a compiler.
--commit-status: Enables reporting of the task's status to a commit status system, such as GitHub checks.
--long-running: Indicates that the task is expected to run for a long time (e.g. a server). Currently doens't have any effect though, TODO: can we remove it?

Testing

This project uses tasty-golden for snapshot-based testing.

Running Tests

# Run all tests (auto-detects S3 credentials)
stack test

# Run tests, skipping slow ones for faster development
export SKIP_SLOW_TESTS=1
stack test

# Run specific test by pattern
stack test --test-arguments "--pattern hello"

# List all available tests
stack test --test-arguments "--list-tests"

Test Structure

Tests are located in test/t/ directory with two files per test:

.txt file - Shell script to execute
.out file - Expected output (golden file)

Test Directives

Special comments in .txt files control test behavior:

# check output - Check stdout/stderr (default)
# check github - Check GitHub API calls
# no toplevel - Don't wrap in taskrunner
# s3 - Requires S3 credentials (auto-skipped if missing)
# github keys - Provide GitHub credentials
# quiet - Run in quiet mode

S3 Test Auto-Detection

15 tests require S3 credentials and are automatically skipped if credentials are missing.

To run S3 tests, set these environment variables:

export TASKRUNNER_TEST_S3_ENDPOINT=your-s3-endpoint
export TASKRUNNER_TEST_S3_ACCESS_KEY=your-access-key
export TASKRUNNER_TEST_S3_SECRET_KEY=your-secret-key
stack test

Accepting Golden Test Changes

When golden tests fail due to expected output changes:

stack test --test-arguments --accept

This updates the .out files with new expected output. Review changes carefully before committing.

Name		Name	Last commit message	Last commit date
Latest commit History 122 Commits
.claude		.claude
.github/workflows		.github/workflows
.tmp		.tmp
app		app
src		src
test		test
.gitignore		.gitignore
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
Setup.hs		Setup.hs
package.yaml		package.yaml
stack.yaml		stack.yaml
stack.yaml.lock		stack.yaml.lock
taskrunner.cabal		taskrunner.cabal

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

taskrunner

Features

"Prime cache" mode

Directory structure

Configuration variables

Possible features

Things we should do better than previous version

Things to handle

Misc TODO

Output principles (output generated by runner itself)

Performance goals

Snapshot Command Flags

Testing

Running Tests

Test Structure

Test Directives

S3 Test Auto-Detection

Accepting Golden Test Changes

About

Uh oh!

Releases 23

Packages

Uh oh!

Contributors 4

Uh oh!

Languages

Uh oh!

License

Uh oh!

restaumatic/taskrunner

Folders and files

Latest commit

History

Repository files navigation

taskrunner

Features

"Prime cache" mode

Directory structure

Configuration variables

Possible features

Things we should do better than previous version

Things to handle

Misc TODO

Output principles (output generated by runner itself)

Performance goals

Snapshot Command Flags

Testing

Running Tests

Test Structure

Test Directives

S3 Test Auto-Detection

Accepting Golden Test Changes

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 23

Packages 0

Uh oh!

Contributors 4

Uh oh!

Languages

Packages