Skip to content
This repository has been archived by the owner on Sep 12, 2024. It is now read-only.

email "phedex is acting up" #510

Closed
vlimant opened this issue Mar 24, 2020 · 5 comments
Closed

email "phedex is acting up" #510

vlimant opened this issue Mar 24, 2020 · 5 comments

Comments

@vlimant
Copy link
Contributor

vlimant commented Mar 24, 2020

For the record on a mystery failure of Phedex, that had impacted unified in the past, that has a protection which @nsmith- and @dr-stringfellow are wondering about.

subscribor is try to get all unassigned blocks

https://github.com/CMSCompOps/WmAgentScripts/blob/master/Unified/subscribor.py#L56

and make a subscription to DataOps (without an actual transfer) just so that it belongs and is counted against dataops quota : in an effort to "fix" dmwm/WMCore#5945 (in the wrong place obviously)

getDatasetBlockAndSite has been failing very badly rarely dmwm/PHEDEX#1117, providing things completely irrelevant to the initial query to phedex (@nataliaratnikova)

this 0522c2e and the email eb4687a was put in place so that things don't go astray in unified and downstream.

a band-aid on a band-aid, on a band-aid ...

@dr-stringfellow
Copy link

@amaltaro @vlimant @nsmith- @z4027163
Since a week or so, we are seeing this email dozens of times a day. There is no development work going on in phedex anymore, but something is now clearly causing this to happen very frequently.

@vlimant
Copy link
Contributor Author

vlimant commented May 5, 2020

I have not seen any such emails recently myself. Anything bad happening, is happening on phedex side c.f. dmwm/PHEDEX#1117

@nsmith-
Copy link

nsmith- commented May 5, 2020

Can you be more specific about what date the rate of such mails increased? Maybe we can correlate it with more load on datasvc, e.g. via rucio sync or otherwise

@dr-stringfellow
Copy link

Sorry, couldn't find a better way to get you this information other than:

https://www.dropbox.com/sh/c90bs1830rm5vlg/AABzw7p2cNBc7Z050IIymSkXa?dl=0

@vlimant Yes, the bug is on the PhEDEx side, and it won't be fixed :(, but maybe other services need to know about this right now. In case they see something strange, it might be because of the total nonsense phedex response.

@haozturk
Copy link
Collaborator

haozturk commented Dec 3, 2020

Phedex is not used anymore, closing this issue.

@haozturk haozturk closed this as completed Dec 3, 2020
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants