
If a secondary goes down, nodes take a long time to refresh

Open dblock opened this issue 11 years ago • 3 comments

We had a secondary go down, causing a heartbeat check failure in MongoDB and the following in the MongoDB logs:

Sun Sep 28 13:27:21.905 [rsHealthPoll] couldn't connect to c0.gravity.member4.mongolayer.com:27017: couldn't connect to server c0.gravity.member4.mongolayer.com:27017 

This made our response time 5x slower: creating a new Moped session took about 400ms. I didn't debug where exactly the time was being spent, but I imagine it's not too hard to reproduce.

I would expect something a bit more efficient. IMO node rediscovery should not block a request.
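
To illustrate that expectation, here is a minimal sketch of non-blocking rediscovery. It is not Moped's actual code: the `BackgroundRefreshedNodes` class, its `interval:` option, and the discovery block are all hypothetical names made up for this example. Callers always get the last known node list right away, while a slow or failing discovery runs on a background thread.

```ruby
# Hypothetical illustration only: none of these names are part of Moped's API.
class BackgroundRefreshedNodes
  # seeds:    initial list of node addresses
  # interval: seconds before the cached list is considered stale
  # discover: block that performs the (possibly slow) node discovery
  def initialize(seeds, interval: 0.3, &discover)
    @nodes = seeds.dup.freeze
    @interval = interval
    @discover = discover
    @lock = Mutex.new
    @refreshing = false
    @refreshed_at = nil
    @thread = nil
  end

  # Returns the cached node list immediately; never waits on discovery.
  def nodes
    @lock.synchronize do
      start_refresh if !@refreshing && stale?
      @nodes
    end
  end

  # Blocks until any in-flight refresh finishes (useful in tests or at shutdown).
  def wait
    thread = @lock.synchronize { @thread }
    thread.join if thread
  end

  private

  def now
    Process.clock_gettime(Process::CLOCK_MONOTONIC)
  end

  def stale?
    @refreshed_at.nil? || now - @refreshed_at >= @interval
  end

  # Must be called while holding @lock.
  def start_refresh
    @refreshing = true
    @thread = Thread.new do
      begin
        fresh = @discover.call.freeze
        @lock.synchronize { @nodes = fresh }
      rescue StandardError
        # Discovery failed (e.g. a secondary is down): keep the last known nodes.
      ensure
        @lock.synchronize do
          @refreshed_at = now
          @refreshing = false
        end
      end
    end
  end
end
```

With something along these lines, an unreachable secondary would slow down the background thread rather than session creation. The trade-off is that requests may briefly see a stale node list.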

dblock avatar Sep 28 '14 20:09 dblock

Hi @dblock, could you check whether the code in #352 fixes this? If not, let me know.

wandenberg avatar Feb 13 '15 23:02 wandenberg

We'll try to upgrade production to moped HEAD this week.

dblock avatar Feb 14 '15 17:02 dblock

FYI, I believe this is the same bug I reported on Mongoid: https://github.com/mongoid/mongoid/issues/3892

johnnyshields avatar Feb 15 '15 21:02 johnnyshields