@hawka
Contributor

@hawka hawka commented Aug 7, 2015

In our system, we've been seeing an issue where, once etcd becomes unreachable from registrator, registrator has to be kicked manually before it publishes its local service entries to etcd again. We believe that by using the etcd client's SyncCluster functionality, we can keep registrator's view of the etcd servers in sync with the cluster and avoid this situation. Our proposal is that the client initially uses the local proxy and, through a proxied request, discovers the current cluster topology. From then on it talks to that topology directly and no longer uses the proxy. We think this will give us better retry behavior against the etcd servers.
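
To illustrate the flow described above, here is a minimal sketch rather than the actual patch. The `cluster` type, its `members` callback, the `register` helper, and all addresses and keys are hypothetical stand-ins invented for this example; they are not the real etcd client API. Only the idea of a SyncCluster-style call that swaps the proxy endpoint for the discovered member list is taken from the description.

```go
package main

import (
	"errors"
	"fmt"
)

// errUnreachable simulates a failed write to one etcd member.
var errUnreachable = errors.New("unreachable")

// cluster is a hypothetical stand-in for an etcd client.
type cluster struct {
	endpoints []string                 // endpoints the client currently targets
	members   func() ([]string, error) // fetches the member list through the current endpoints
}

// SyncCluster replaces the endpoint list with the discovered members.
// It returns false and keeps the old endpoints if discovery fails.
func (c *cluster) SyncCluster() bool {
	m, err := c.members()
	if err != nil || len(m) == 0 {
		return false
	}
	c.endpoints = m
	return true
}

// register syncs the cluster view, then tries each known endpoint in turn
// and returns the first one that accepts the write.
func register(c *cluster, put func(endpoint, key, value string) error, key, value string) (string, error) {
	if !c.SyncCluster() {
		fmt.Println("cluster sync failed; keeping endpoints", c.endpoints)
	}
	for _, ep := range c.endpoints {
		if err := put(ep, key, value); err == nil {
			return ep, nil
		}
	}
	return "", errors.New("no etcd endpoint accepted the write")
}

func main() {
	c := &cluster{
		// Assumed address of the local etcd proxy.
		endpoints: []string{"http://127.0.0.1:4001"},
		// Assumed topology returned by the proxied discovery request.
		members: func() ([]string, error) {
			return []string{"http://10.0.0.1:4001", "http://10.0.0.2:4001"}, nil
		},
	}
	// Simulate the first member being down, so the write must be retried.
	put := func(ep, key, value string) error {
		if ep == "http://10.0.0.1:4001" {
			return errUnreachable
		}
		return nil
	}
	ep, err := register(c, put, "/services/web/host1:8080", "10.0.0.5:8080")
	fmt.Println(ep, err)
}
```

In this sketch the sync runs on every registration, so a client that started out knowing only the proxy ends up with the full member list and can fall through to another server when one is down.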

@progrium
Contributor

progrium commented Aug 7, 2015

This is great, and sounds reasonable. Has this improved the situation you initially described?

@hawka
Contributor Author

hawka commented Aug 10, 2015

We haven't deployed to production yet, as we wanted to get your input first, and we're planning to make a similar patch to our copy of confd. However, we did test against a test etcd cluster, and the results were positive.

@progrium
Contributor

I don't see anything wrong with it, but I'll leave this open for anyone else to comment until you confirm you're happy with it.

@progrium
Contributor

progrium commented Sep 1, 2015

@hawka any update on this working out?

@hawka
Contributor Author

hawka commented Sep 8, 2015

@progrium I wanted to make sure we were running successfully in production before I updated this PR, and we've been running in production for about two weeks now. This patch has fixed the error we were seeing, with no negative side effects. I'm confident in asking for it to be committed, if you are amenable!

@progrium
Contributor

progrium commented Sep 8, 2015

@hawka that's great news! Looked it over again and it still looks great.

progrium added a commit that referenced this pull request Sep 8, 2015
Synchornize etcd cluster in registrator on service registration
@progrium progrium merged commit 987704d into gliderlabs:master Sep 8, 2015
This was referenced Jun 25, 2018
Closed