-
Dear all, We are having troubles with the Configuration Service which seems to be saturated. I've observed that many clients (jobs) get timeout errors when connecting to the master instance dips://dcta-servers02.pic.es:9135/Configuration/Server (while I don't observe errors for connecting to the other slave instances). We have several slave CS services, but I suspect that pilots are currently pointing to a single CS instance (the master) and it's maybe the cause of CS getting saturated.
I was thinking about adding ExtraPilotOptions to add the other CS endpoints, something like:
(Even if I'm not sure this is the right way to do.) Then, we also have only 1 CS endpoint (not the master) configured under:
and I would like to add all other endpoints. However, since I'm not able to connect to the CS (neither from the Web portal, neither from dirac-configuration-cli) I'm not able to change any configuration parameter. I've tried to change the section directly on the dirac.cfg on the server where the CS master instance is running:
but it doesn't seem to have any positive effect. Can you suggest some other means to fix this problem and to commit changes to the CS in this particular situation? Thank you. |
Beta Was this translation helpful? Give feedback.
Replies: 5 comments 7 replies
-
Just to let you know that when the load went down, I've been able to update CS with:
and as far as I understand, this is the list of CS endpoints also used for pilots. Is that correct? In the pilot log I find:
which seems to confirm that all the list is taken into account. So in principle I don't need any ExtraPilotOptions. I will monitor anyway if the timeouts to the Master endpoints will decraese:
Thanks. |
Beta Was this translation helpful? Give feedback.
-
Hi Luisa,
|
Beta Was this translation helpful? Give feedback.
-
Hi Federico,
So reading your questions, I understand that we still have a unique endpoint as first list used by the pilot and that we will probably hit the saturation problem again. Is that correct? Thank you. |
Beta Was this translation helpful? Give feedback.
-
The risk of saturation in the current situation is still real. |
Beta Was this translation helpful? Give feedback.
-
Thanks. |
Beta Was this translation helpful? Give feedback.
Just to let you know that when the load went down, I've been able to update CS with:
and as far as I understand, this is the list of CS endpoints also used for pilots. Is that correct?
In the pilot log I find:
2022-02-23 08:13:01 UTC DE…