Disconnected operation of a controller #111
Closed
billdenney
started this conversation in
Ideas
Replies: 1 comment 2 replies
-
If the connection is unreliable, it would be much better to log directly into the login node of the cluster and run the pipeline from there. In fact, taking a step back here, I hope to make this sort of situation easier to navigate using cloud storage in |
Beta Was this translation helpful? Give feedback.
2 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I have an idea that would take a different form from #106, but could possibly accomplish a similar purpose.
Use case 1: I want to be able to close my laptop and have jobs continue to run on a computational cluster
Use case 2: I want to use my laptop to send jobs to a cluster where I may then have intermittent access (e.g. I'm flying on a plane)
If there were some form of store-and-forward operation at the controller level, then disconnected operation like the above could become feasible. It would also enable multi-hop topologies as in the diagram in #106.
The concept that I'm thinking of is that crew controllers could have scratch space to store results locally. Then, the user could connect intermittently to the controller and ask "do you have any results in your scratch space to send me?" If the user is already connected (i.e. how crew operates now), then results would be sent back immediately.
I see important complexity overhead here that should not be underestimated, but at the same time, the benefits could be significant in terms of allowing different topologies (as in #106).
Beta Was this translation helpful? Give feedback.
All reactions