moby--moby

mirror of https://github.com/moby/moby.git synced 2022-11-09 12:21:53 -05:00

Author	SHA1	Message	Date
Flavio Crisciani	55e4cc7262	Optimize networkDB queue Added some optimizations to reduce the messages in the queue: 1) on join network the node execute a tcp sync with all the nodes that it is aware part of the specific network. During this time before the node was redistributing all the entries. This meant that if the network had 10K entries the queue of the joining node will jump to 10K. The fix adds a flag on the network that would avoid to insert any entry in the queue till the sync happens. Note that right now the flag is set in a best effort way, there is no real check if at least one of the nodes succeed. 2) limit the number of messages to redistribute coming from a TCP sync. Introduced a threshold that limit the number of messages that are propagated, this will disable this optimization in case of heavy load. Signed-off-by: Flavio Crisciani <flavio.crisciani@docker.com>	2018-07-02 16:59:45 -07:00
Flavio Crisciani	8c31217a44	NetworkDB create NodeID for cluster nodes Separate the hostname from the node identifier. All the messages that are exchanged on the network are containing a nodeName field that today was hostname-uniqueid. Now being encoded as strings in the protobuf without any length restriction they plays a role on the effieciency of protocol itself. If the hostname is very long the overhead will increase and will degradate the performance of the database itself that each single cycle by default allows 1400 bytes payload Signed-off-by: Flavio Crisciani <flavio.crisciani@docker.com>	2017-09-26 10:48:04 -07:00
Flavio Crisciani	2d2a2bc568	Fix reapTime logic in NetworkDB - Added remainingReapTime field in the table event. Wihtout it a node that did not have a state for the element was marking the element for deletion setting the max reapTime. This was creating the possibility to keep the entry being resync between nodes forever avoding the purpose of the reap time itself. - On broadcast of the table event the node owner was rewritten with the local node name, this was not correct because the owner should continue to remain the original one of the message Signed-off-by: Flavio Crisciani <flavio.crisciani@docker.com>	2017-09-21 09:37:37 -07:00
Flavio Crisciani	e77c245e45	2x faster to converge - Introduced back the Invalidate - optimized the rebroadcast logic Signed-off-by: Flavio Crisciani <flavio.crisciani@docker.com>	2017-08-01 13:47:18 -07:00
Alessandro Boch	1323730eca	On send node envents, notify only if there are peers - Otherwise operation will unnecessarely block for five seconds. - This is particularly noticeable on graceful shutdown of daemon in one node cluster. Signed-off-by: Alessandro Boch <aboch@docker.com>	2017-04-21 10:19:08 -07:00
Alessandro Boch	9c3c86a931	Do not invalidate table event messages - Do not run the risk of suppressing meaningful messages for the rest of the cluster, as a many services depend on it, like the service records and the distributed load balancers. Signed-off-by: Alessandro Boch <aboch@docker.com>	2017-03-16 00:49:58 -07:00
Ke Li	23ac56fdd0	Remove unnecessary string formats Signed-off-by: Ke Li <kel@splunk.com>	2016-11-22 09:29:53 +08:00
Jana Radhakrishnan	5f5dad3c02	Recover from transient gossip failures Currently if there is any transient gossip failure in any node the recoevry process depends on other nodes propogating the information indirectly. In cases if these transient failures affects all the nodes that this node has in its memberlist then this node will be permenantly cutoff from the the gossip channel. Added node state management code in networkdb to address these problems by trying to rejoin the cluster via the failed nodes when there is a failure. This also necessitates the need to add new messages called node event messages to differentiate between node leave and node failure. Signed-off-by: Jana Radhakrishnan <mrjana@docker.com>	2016-09-19 15:58:14 -07:00
Jana Radhakrishnan	98b571a524	Make sure broadcast queue is valid broadcasting When broadcasting table event, make sure the broadcast queue is valid. The network may have been removed while in the process of sending the broadcast. Signed-off-by: Jana Radhakrishnan <mrjana@docker.com>	2016-06-12 11:58:03 -07:00
Jana Radhakrishnan	77abea9c1e	Use protobuf in networkdb core messages Convert all networkdb core message types from go message types to protobuf message types. This faciliates future modification of the message structure without breaking backward compatibility. Signed-off-by: Jana Radhakrishnan <mrjana@docker.com>	2016-05-17 09:18:24 -07:00
Jana Radhakrishnan	28f4561e3f	Add network scoped gossip database Network DB is a network scoped gossip database built on top of hashicorp/memberlist providing an eventually consistent state store. It limits the scope of the gossip and periodic bulk syncing for table entries to only the nodes which participate in the network to which the gossip belongs. This designs make the gossip layer scale better and only consumes resources for the network state that the node participates in. Since the complete state for a network is maintained by all nodes participating in the network, all nodes will eventually converge to the same state. NetworkDB also provides facilities for the users of the package to watch on any table (or all tables) and get notified if there are state changes of interest that happened anywhere in the cluster when that state change eventually finds it's way to the watcher's node. Signed-off-by: Jana Radhakrishnan <mrjana@docker.com>	2016-04-08 12:58:09 -07:00

11 commits