Provide a way to force the driver to reconnect #2909

tishun · 2024-07-09T11:19:03Z

Discussed in #2870

^{Originally posted by e-ts June 3, 2024}
Can I reconnect to the node used when I catch a RedisCommandTimeoutException for a command to a Redis Cluster?

We are having a problem where the old master does not respond for 10 seconds after a FAILOVER is issued to its replica. TCP packets with new requests still get acked during these 10 seconds. As the connection is clearly not dead, Lettuce keeps sending new commands to the old master. Eventually, it will receive all the MOVED response at once but this is too late for us.

For our specific problem, it would be better if Lettuce reconnected to the node on command timeout as the bug only seems to affect a single TCP socket. A command on a new socket will get an immediate MOVED response, allowing Lettuce to continue on the master.

I guess it could be tricky to get this right as all the requests in flight will time out at different times and we probably do not want to reconnect for each timeout.

Of course, we are trying to get the underling problem with Redis resolved too, see #2572 but a work-around like this would still be useful until that gets fixed.

I have checked the wiki, GitHub issues and GitHub Discussions and found #2082 which is similar but in that case, the TCP packets do not get acked, leading to another solution.

I tried setting an absurdly low periodic refresh of a few hundred milliseconds but that does not seem to help, which might be a bug but I have not looked into it yet.

tishun · 2024-07-17T21:20:50Z

Suggested a solution in the discussion, waiting for user feedback.

tishun modified the milestones: 7.x, Backlog Jul 9, 2024

tishun added the status: help-wanted An issue that a contributor can help us with label Jul 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Provide a way to force the driver to reconnect #2909

Provide a way to force the driver to reconnect #2909

tishun commented Jul 9, 2024

tishun commented Jul 17, 2024

Provide a way to force the driver to reconnect #2909

Provide a way to force the driver to reconnect #2909

Comments

tishun commented Jul 9, 2024

Discussed in #2870

tishun commented Jul 17, 2024