Graph Spec improvements #227

EvanDietzMorris · 2024-05-10T19:55:08Z

There are a few obvious things we need to change about the way graph specs are processed.

We should have the ability to specify multiple graph specs at once and queue up building graphs from multiple specs at once.
We also need the ability to have sub graph dependencies cross over from one spec to another. For example the Baseline graph is shared by robokopkg and yobokop but currently there's no way to have it in one place and have each reference the same thing.
Right now you can build just one graph from a graph spec but it still checks for latest versions of every source in the spec (for sources that don't have a pinned version). This is bad because when you just want to build one graph in a spec, it's a waste of time to check them all, and a failed version check could disrupt building a graph that doesn't even use that source.

EvanDietzMorris · 2024-05-10T20:57:38Z

It might also be nice, but is a way lower priority, to have the ability to reference another graph but say you want to build a that graph without a particular source. For example the rule mining kp is the baseline minus tmkp, but currently there's no way to do that without just making another copy of the spec that's mostly redundant.

EvanDietzMorris · 2024-07-23T16:17:10Z

Another thing that is simple but would be a nice change, is that currently if the load_manager pipeline fails for a single data source it crashes the entire graph it was part of, but it should just continue to process the rest of the data sources (but not attempt to build the graph) so that when you come back and fix the failure the rest of the work is done

EvanDietzMorris mentioned this issue Jul 23, 2024

get_latest_source_version() being called for data sources that aren't needed for graph being built. #236

Open

DnlRKorn mentioned this issue Jul 23, 2024

Fix for get_latest_source_version being called unnecessarily. #237

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Graph Spec improvements #227

Graph Spec improvements #227

EvanDietzMorris commented May 10, 2024

EvanDietzMorris commented May 10, 2024

EvanDietzMorris commented Jul 23, 2024

Graph Spec improvements #227

Graph Spec improvements #227

Comments

EvanDietzMorris commented May 10, 2024

EvanDietzMorris commented May 10, 2024

EvanDietzMorris commented Jul 23, 2024