[Questions] Quorum queue setup on 2 WAN datacenters with exactly two availability zones (racks) each #13165
-
Community Support Policy
RabbitMQ version used4.0.5 Erlang version used27.2.x Operating system (distribution) usedlinux How is RabbitMQ deployed?Community Docker image rabbitmq-diagnostics status outputN/A Logs from node 1 (with sensitive values edited out)N/A Logs from node 2 (if applicable, with sensitive values edited out)N/A Logs from node 3 (if applicable, with sensitive values edited out)N/A rabbitmq.confN/A Steps to deploy RabbitMQ clusterN/A Steps to reproduce the behavior in questionN/A advanced.configNo response Application codeNo response Kubernetes deployment fileNo response What problem are you trying to solve?Hi, If this is possible, and if the WAN network works well enough (no lost connections, only higher latencies) during the period of outage, is this kind of setup supported ? This could be compared to running an elasticsearch cluster with a dedicated tie breaker like in this post https://discuss.elastic.co/t/tie-breaker-master-node-with-higher-latency/89683 Thanks in advance |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment
-
@jonenst it is not possible, witness replicas/node types have never made it to our Raft implementation despite two attempts, and this idea does not fit the quorum queue or stream replication "model" and user-exposed features. Four node clusters are explicitly not recommended, you'd need 3 or 5 nodes. If the latency is low (I'd say, a few ms), you can try spanning a cluster across a WAN. Some users successfully do it over availability zones on AWS, for example. However, for as long as the Mnesia is supported in any way, and thus partition handling strategies still exist, you will run the risk of a latency spike creating completely unnecessary leader migrations and such. |
Beta Was this translation helpful? Give feedback.
@jonenst it is not possible, witness replicas/node types have never made it to our Raft implementation despite two attempts, and this idea does not fit the quorum queue or stream replication "model" and user-exposed features.
Four node clusters are explicitly not recommended, you'd need 3 or 5 nodes.
If the latency is low (I'd say, a few ms), you can try spanning a cluster across a WAN. Some users successfully do it over availability zones on AWS, for example. However, for as long as the Mnesia is supported in any way, and thus partition handling strategies still exist, you will run the risk of a latency spike creating completely unnecessary leader migrations and such.