-
Notifications
You must be signed in to change notification settings - Fork 301
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
HPCC-32540 Roxie may flood NIC of target agents if no agents running on a channel #19050
Conversation
This change should not affect functionality at all, but makes it possible to make future changes allowing per-channel back-off. Signed-off-by: Richard Chapman <rchapman@hpccsystems.com>
…owledged Signed-off-by: Richard Chapman <rchapman@hpccsystems.com>
Jira Issue: https://hpccsystems.atlassian.net//browse/HPCC-32540 Jirabot Action Result: |
Note HPCC-31061 (found when going through changes for the last year) |
The scaled back-off on the ack timeout may help address the concerns that led to HPCC-31061 |
…cknowledge enabled Signed-off-by: Richard Chapman <rchapman@hpccsystems.com>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
That looks good so far to me.
{ | ||
return timeFirstSent && !acknowledged && now-timeFirstSent > timeout; | ||
bool ret = timeFirstSent && !acknowledged && now-timeFirstSent > packetAcknowledgeTimeout*(resends+1); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
for discussion: (packetAcknowledgeTimeout << resends) might be better
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks ok to me.
I want to test it.
@mckellyln have you had a chance to test this yet? |
Close and open to restart the tests. @mckellyln I am inclined to merge, but some extra testing would be valuable. |
Jira Issue: https://hpccsystems.atlassian.net//browse/HPCC-32540 Jirabot Action Result: |
I don't really have access anymore to a large cluster with real data on it for testing. |
Type of change:
Checklist:
Smoketest:
Testing: