-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
(non-standard-operations) add tier2,3 and 4 to computer shutdown #88
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
One minor comment, I have asked Tiago Ribeiro for confirmation. Otherwise this looks good to me. Once we get the confirmation from Tiago I will approve.
* Control System | ||
yagan[13-20].cp.lsst.org | ||
azar[02-03].cp.lsst.org |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think we now have control system dependencies to the /net/obs nfs mount on azar02 via the scriptqueue and scheduler, and taking this offline would take down the control system. If so, we should move azar02 to Tier 3 (but not azar03). @tribeiro can you confirm that is the case?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think at this point we can consider that we are running the system in degraded mode and won't be able to update the obs env.
Alternatively, we can move this functionality to azar1. I don't think it is worth keeping an additional node for this.
There should be another tier between 3 and 4, which is supporting camera liveness (does not need the control network, but does require the camera clusters). this includes the switches, DAQs, and most CCS machines. |
The tiers in addition to an order of shutdown, represent a threat level for the hardware, so adding another tier wouldn't change the end result since tier 3 is already a high threat level that requires to shutdown as much as possible (including the daqs). Tier 3 also requires coordination with "system owners" so that would be sufficient to wait for the warm-ups. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me!
No description provided.