Implement `NoThrowDAG` #6

hannahilea · 2023-08-01T18:42:19Z

Implement concrete AbstractTransformationSpecification NoThrowTransformChain that consists of an ordered set of NoThrowTransformations interleaved with transforms that create a step's input from all outputs of the upstream steps.

Okay, so here's the deal with this one: there are a gazillion different design decisions that could be made here, and I had to fight not to fall into a self-sniped state (and did, in fact, change my mind about several of these decisions mid-implementation). So for the sake of "steady progress", I'm opening this for review now, and in future PRs we can iterate on things like (but not limited to):

better/nicer error throwing during chain construction
whether this should actually be parameterized in some way and/or allow a non-NoThrow option
the type and construction of the transformation called between each step

To see the specific tasks where the Asana app for GitHub is being used, see below:
- https://app.asana.com/0/0/1205118159536349

src/nothrow_chain.jl

test/nothrow_chain.jl

Co-authored-by: Glenn Moynihan <glennmoy@gmail.com>

kleinschmidt

okay, haven't done a complete review but I've thought a bit more about my high level/spiritual concern with the "input assembler" bit: it basically throws the specification bit out the window, if you're allowed (nay, encouraged!) to go in and rename/modify fields in your inputs willy-nilly. especially when that is happening inside a (possibly anonymous) function that's basically a black box as far as readers of the code/the compiler is concerned. if a step takes two distinct kinds of input then I think we should just bite the bullet and do the refactoring to all that to be declared, or use some kind of composition to make it so that multiple outputs from shared input can be merged in a way that's declarative (something like a "multi step" that does the merging of different steps before checking the output spec).

kleinschmidt

okay, as we discussed on slack I'm okay with this more-or-less as-is. I had one suggestion for a possibility to use a more restrictive input assembler that just maps fields from previous output to next step input, since that seems to be what the current validation assumes is happening. but you can take or leave that!

src/nothrow_dag.jl

kleinschmidt · 2023-08-09T16:35:11Z

src/nothrow_dag.jl

+function _validate_input_assembler(dag::NoThrowDAG,
+                                   input_assembler::TransformSpecification)
+    transform(input_assembler, dag._step_output_fields) # Will throw if any field doesn't exist
+    return nothing
+end


okay, let me check that I'm understanding what's going on here: the idea is that in order to verify that an input assembler will not error at run time, when we construct the dag, we use a stand-in dictionary with the same structure as the workspace dict tha twill be created during actual execution (maps the names of the existing steps to Dicts that can be accessed with getindex(fields, ::Symbol)).

if that's right, it suggests to me that the actual intent of the input assembler is actually much more restrictive than any generic anonymous function, and if someone tries to create one that doesn't follow that structure you're going to get some kind of obscure error at DAG construction time.

but it also means that we might be able to do something like this:

struct Field in::Symbol out::Symbol end # only really needed for showing these... Base.show(io::IO, f::Field) = f.in == f.out ? show(io, f.in) : show(io, f.in => f.out) Field(x::Symbol) = Field(x, x) Field(x::Pair{Symbol,Symbol}) = Field(x...) struct InputAssembler <: AbstractTransformSpecification deps::Dict{String,Vector{Field}} function InputAssembler(deps::Dict{String,Vector{Field}}) # make sure that there are no name collisions out_fields = (f.out for f in Iterators.flatten(values(deps))) # TODO: actually find the non-unique fields to help users allunique(out_fields) || throw(ArgumentError("all output fields must be unique")) new(deps) end end function InputAssembler(; inputs...) deps = Dict(string(dep) => Field.(fields) for (dep, fields) in inputs) return InputAssembler(deps) end _get(x, f::Field) = f.out => x[f.in] function transform(input::InputAssembler, upstream) output = (;) for (dep, fields) in input.deps # get the upstream namedtuple-like for this dependency updep = upstream[dep] # create the output namedtuple with the new field names this_output = (; [_get(updep, field) for field in fields]...) output = merge(output, this_output) end return output end

Filed to implement in #8.

Co-authored-by: Dave Kleinschmidt <dave.f.kleinschmidt@gmail.com>

…rmSpecifications.jl into hr/chain-brain

src/nothrow_dag.jl

hannahilea added 30 commits July 21, 2023 16:38

created empty subpackage in sleeplab repo, copied over

a0297f5

clean up

6174bc2

more cleanup

b845d64

shuffle prototype for new repo

56faf81

Update dependencies

4ea8341

Merge branch 'main' into hr/init-prototype

0bae5f2

whoops forgot in init pr

71896d7

pull out mermaid stuff

88b5632

move memaid to next pr

8c6942e

ctd

cbb7087

kill your darlings

7be9818

rename in code

ea137d5

rename files

1f3dd7a

Merge branch 'hr/a-repo-by-any-other-name' into hr/init-prototype

cf6a78f

update names

2facaa1

rename ctd

74dcf3d

clean up for rename

7d53730

more renaming

929d0d5

clean up basic transform

099e8ad

clean up identity

85f51bb

fix

4ed064b

Merge branch 'main' into hr/init-prototype

32ee6ea

non-mutating

b88785a

update docstring

71356c8

remove chain from this pr

daca5ec

lint

ee69817

comments

f3d82ab

Add additional safety test

16a6282

add interpret_input function

83a4bca

don't use fun feature not supported by julia 1.7

a60164c

hannahilea added 4 commits August 7, 2023 20:14

remove doctest here

bd3eba6

add example docstring

09b8f21

fix doctests

f77fd29

fix doctests

d65e3e4

glennmoy reviewed Aug 8, 2023

View reviewed changes

hannahilea and others added 6 commits August 8, 2023 10:46

Update src/nothrow_chain.jl

2643131

Co-authored-by: Glenn Moynihan <glennmoy@gmail.com>

update var naming in examples

fc1c461

update comment

c330eef

rename NoThrowTransformChain to NoThrowDAG

ceb798d

rename files too

05bf94b

rename ChainStep to DAGStep

67a3256

kleinschmidt reviewed Aug 9, 2023

View reviewed changes

hannahilea added 5 commits August 9, 2023 10:35

rename chain to dag

45ed9b3

missed rename

d7d8835

reformat for code review response

47443ac

add more comments

5a22e1c

rename field_map

591bba5

kleinschmidt approved these changes Aug 9, 2023

View reviewed changes

hannahilea changed the title ~~Implement NoThrowTransformChain~~ Implement NoThrowDAG Aug 9, 2023

hannahilea and others added 2 commits August 9, 2023 13:51

review feedback

5bbc03c

Update src/nothrow_dag.jl

f37dbd6

Co-authored-by: Dave Kleinschmidt <dave.f.kleinschmidt@gmail.com>

This was referenced Aug 9, 2023

Refactor NoThrowDAG fields into single vector of DAGSteps #7

Open

Refactor input_assembler to be more specific #8

Open

hannahilea added 2 commits August 9, 2023 14:59

clean-up docstrings

77e394d

Merge branch 'hr/chain-brain' of github.com:beacon-biosignals/Transfo…

183ce0f

…rmSpecifications.jl into hr/chain-brain

github-actions bot reviewed Aug 9, 2023

View reviewed changes

src/nothrow_dag.jl Outdated Show resolved Hide resolved

hannahilea added 2 commits August 9, 2023 15:04

oh linting you're so fine

8b9afd9

docfix

0753e11

hannahilea merged commit 58d7a41 into main Aug 9, 2023

hannahilea deleted the hr/chain-brain branch August 9, 2023 19:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Implement `NoThrowDAG` #6

Implement `NoThrowDAG` #6

hannahilea commented Aug 1, 2023 •

edited

Loading

kleinschmidt left a comment

kleinschmidt left a comment

kleinschmidt Aug 9, 2023

kleinschmidt Aug 9, 2023

hannahilea Aug 9, 2023

Implement NoThrowDAG #6

Implement NoThrowDAG #6

Conversation

hannahilea commented Aug 1, 2023 • edited Loading

kleinschmidt left a comment

Choose a reason for hiding this comment

kleinschmidt left a comment

Choose a reason for hiding this comment

kleinschmidt Aug 9, 2023

Choose a reason for hiding this comment

kleinschmidt Aug 9, 2023

Choose a reason for hiding this comment

hannahilea Aug 9, 2023

Choose a reason for hiding this comment

Implement `NoThrowDAG` #6

Implement `NoThrowDAG` #6

hannahilea commented Aug 1, 2023 •

edited

Loading